Graph data structure for MLD #3779

TheMarex · 2017-03-06T13:24:32Z

We need a data structure that makes the following operations fast:

- Scanning all boundary arcs of a node at a certain level
- Scanning all arcs to nodes of the same cell on level 1
- Identifying whether a node is a boundary node on a certain level

For this I propose the following data structure that basically just relies on a StaticGraph but enforces a node ordering that is specific to the partition.

Sort all nodes by the following criteria:
1. The highest level on which the node is a boundary node (or never)
2. The cell that contains the node on that level
3. Node ID (even better: BFS ID, but we can do that later)

Node IDs of boundary nodes at a certain level will be consecutive. That way we can easily implement a fast map: Node ID -> Boundary ID (row/column) for each cell using a vector for each level, without too much memory overhead.
We also get better cache efficiency because the search on the level 0 base graph will only touch one part of the adjacency array.
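
A minimal sketch of the ordering above, not the OSRM implementation. It assumes levels are numbered from 0, that `cells[level][node]` gives the cell ID, and that `highest[node]` is already known (`NEVER` for nodes that are never boundary nodes). Putting the highest level first and using the level 0 cell for "never" nodes are my assumptions; all names are hypothetical.

```python
NEVER = -1  # hypothetical marker: node is not a boundary node on any level


def order_nodes(highest, cells):
    """Return old node IDs in their new order (new ID = position in the list)."""
    def key(node):
        level = highest[node]
        # Assumption: nodes that are never boundary nodes are grouped by level 0 cell.
        cell = cells[level][node] if level != NEVER else cells[0][node]
        # Negate the level so boundary nodes of the highest level come first.
        return (-level, cell, node)

    return sorted(range(len(highest)), key=key)


def boundary_counts(highest, num_levels):
    """Number of boundary nodes per level.

    With nested cells, a boundary node on level l is also one on every lower
    level, so after sorting the boundary nodes of level l are exactly the
    new IDs 0 .. boundary_counts[l] - 1.
    """
    return [sum(1 for h in highest if h >= level) for level in range(num_levels)]


# Toy example: 6 nodes on a path, 2 levels.
cells = [
    [0, 0, 1, 1, 2, 2],  # level 0 cells
    [0, 0, 0, 0, 1, 1],  # level 1 cells
]
highest = [NEVER, 0, 0, 1, 1, NEVER]

order = order_nodes(highest, cells)
counts = boundary_counts(highest, len(cells))
```

Because the boundary nodes of a level form a prefix of the ID space, a per-level vector of size `counts[level]` is enough for the Node ID -> Boundary ID map.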

We can create this in the partitioner just ~~after we have determined the MultiLevelPartition and~~ before we create the CellStorage. CellStorage will then depend on the graph having a certain ID ordering to facilitate fast lookups.

EDIT: We need to do this before the MultiLevelPartition because we store data referenced by node ID.

/cc @oxidase


TheMarex · 2017-03-07T00:02:32Z

Will try to sketch this data structure out today.

TheMarex · 2017-03-07T14:29:00Z

Resorting the edges makes us dependent on #3737 since the current format of the lookup file assumes edges are always sorted in the way we emit them in the EdgeBasedGraphFactory.

TheMarex · 2017-05-10T23:21:25Z

Thinking a little bit more on how to do this, I think it can be done bypassing some of the structural problems here:

What we want to re-number are the EdgeBasedNode ids. These are referenced in multiple files after executing osrm-extract:
- .fileIndex stores a reference to the forward/backward node in the search graph.
- .ebg stores edges as (source, target, data)
- .ebg_node stores additional information indexed by node ID
We already read the .ebg file in osrm-partition anyway, so we could also renumber every edge and write it out again.
Loading the .fileIndex file in osrm-partition and renumbering it is kind of ugly, but works.
There should be no downside to a pipeline like: run osrm-extract, then osrm-partition (which modifies the .ebg file), then osrm-contract (which reads the .ebg file modified by osrm-partition). Best case, this will also improve cache locality for CH.
Multiple runs of osrm-partition are not affected by the renumbered file.
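
To make the renumbering concrete, here is a rough sketch of what applying a permutation to the edge list and to node-indexed data could look like. It assumes `permutation[old_id]` gives the new ID and that edges are plain `(source, target, data)` tuples; the function names are hypothetical and no real OSRM file format is read or written. Re-sorting by source afterwards is my assumption about what an adjacency-array graph would need.

```python
def renumber_edges(edges, permutation):
    """Rewrite (source, target, data) edges with new node IDs."""
    renumbered = [(permutation[s], permutation[t], data) for s, t, data in edges]
    # Assumption: the graph expects edges grouped by source, so sort again.
    renumbered.sort(key=lambda edge: (edge[0], edge[1]))
    return renumbered


def renumber_node_data(node_data, permutation):
    """Move per-node entries so that they are indexed by the new node ID."""
    result = [None] * len(node_data)
    for old_id, value in enumerate(node_data):
        result[permutation[old_id]] = value
    return result


# Toy example: permutation maps old ID -> new ID.
permutation = [4, 2, 3, 0, 1, 5]
edges = [(0, 1, "a"), (1, 2, "b"), (3, 4, "c")]
node_data = ["n0", "n1", "n2", "n3", "n4", "n5"]

new_edges = renumber_edges(edges, permutation)
new_node_data = renumber_node_data(node_data, permutation)
```

The same permutation has to be applied to every file that references node IDs, otherwise the files disagree about which node is which.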

Brain dump on how to do this:

1. Compute partition, don't create MultiLevelPartition yet, work on the raw vectors.
2. Scan all nodes and mark every node that is a border node.
3. Create a permutation array for every node.
4. Partition nodes first by level on which they are a border node.
5. Top-down stable sort on each level partition
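
One possible reading of the steps above as code, as a sketch only. It assumes the raw partition is a list of per-level cell ID vectors (`cells[level][node]`, level 0 lowest), that an edge makes both of its endpoints border nodes on every level where their cells differ, and that "top-down" means comparing cell IDs from the highest level down. All names are hypothetical.

```python
def highest_border_level(num_nodes, edges, cells):
    """For every node, the highest level on which it is a border node (-1 if none)."""
    highest = [-1] * num_nodes
    for u, v in edges:
        # Check levels from the top; the first level whose cells differ is the highest.
        for level in range(len(cells) - 1, -1, -1):
            if cells[level][u] != cells[level][v]:
                highest[u] = max(highest[u], level)
                highest[v] = max(highest[v], level)
                break
    return highest


def make_permutation(num_nodes, edges, cells):
    """Return permutation[old_id] = new_id."""
    highest = highest_border_level(num_nodes, edges, cells)

    def top_down_cells(node):
        return tuple(cells[level][node] for level in range(len(cells) - 1, -1, -1))

    # Sort by cell IDs top-down; Python's sort is stable, so ties keep node ID order.
    order = sorted(range(num_nodes), key=top_down_cells)
    # Stable partition by border level, highest level first, non-border nodes last.
    order.sort(key=lambda node: -highest[node])

    permutation = [0] * num_nodes
    for new_id, old_id in enumerate(order):
        permutation[old_id] = new_id
    return permutation


# Toy example: path graph 0-1-2-3-4-5 with two levels.
cells = [
    [0, 0, 1, 1, 2, 2],  # level 0
    [0, 0, 0, 0, 1, 1],  # level 1
]
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]

highest = highest_border_level(6, edges, cells)
permutation = make_permutation(6, edges, cells)
```

Doing the cell sort first and the level partition second relies on the second sort being stable, so the cell order survives inside each level group.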

TheMarex · 2017-05-23T16:59:19Z

First progress report: Got the renumbering working up until the customization step (the .ebg_nodes and .fileIndex files still need to be renumbered).
This already enabled me to measure the impact of the new numbering on the customization:

Heat maps of the access pattern on the graph during customization, one for master and one for the renumbered branch (images not included). The X-axis shows the node ID space, the Y-axis is the time of execution (axis is flipped, top is start).

As can already be guessed from these images, the renumbering reduces cache misses by more than 40%.

That said, according to the perf output this actually seems to decrease performance, which doesn't really make sense: the number of cycles is higher for this branch. My current theory is that maybe the graph is corrupted, but I can't really tell before the whole pipeline works.

Never mind, this was a local measurement error. New results confirm that this translates to a speedup, as expected:

master

 Performance counter stats for './osrm-customize ../../osrm-data/bayern-latest.osrm':

      11855.821761      task-clock:u (msec)       #    2.737 CPUs utilized          
                 0      context-switches:u        #    0.000 K/sec                  
                 0      cpu-migrations:u          #    0.000 K/sec                  
             9,461      page-faults:u             #    0.798 K/sec                  
    36,612,730,500      cycles:u                  #    3.088 GHz                    
    34,531,501,103      instructions:u            #    0.94  insn per cycle         
     7,020,323,106      branches:u                #  592.141 M/sec                  
       367,569,729      branch-misses:u           #    5.24% of all branches        

       4.331524738 seconds time elapsed

renumbered

 Performance counter stats for './osrm-customize ../../osrm-data/bayern-latest.osrm':

      11382.990306      task-clock:u (msec)       #    2.803 CPUs utilized          
                 0      context-switches:u        #    0.000 K/sec                  
                 0      cpu-migrations:u          #    0.000 K/sec                  
             9,445      page-faults:u             #    0.830 K/sec                  
    35,151,807,276      cycles:u                  #    3.088 GHz                    
    33,939,053,523      instructions:u            #    0.97  insn per cycle         
     6,866,671,197      branches:u                #  603.240 M/sec                  
       337,103,764      branch-misses:u           #    4.91% of all branches        

       4.061277007 seconds time elapsed
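
For a quick comparison of the two runs, the relative changes can be computed directly from the counters above. This is just arithmetic on the numbers in the perf output; the dictionary keys are my own labels.

```python
# Values copied from the two perf stat outputs above.
master = {
    "elapsed_s": 4.331524738,
    "cycles": 36_612_730_500,
    "instructions": 34_531_501_103,
    "branch_misses": 367_569_729,
}
renumbered = {
    "elapsed_s": 4.061277007,
    "cycles": 35_151_807_276,
    "instructions": 33_939_053_523,
    "branch_misses": 337_103_764,
}


def relative_reduction(before, after):
    """Fraction by which `after` is lower than `before`."""
    return (before - after) / before


reductions = {
    name: relative_reduction(master[name], renumbered[name]) for name in master
}

for name, value in reductions.items():
    print(f"{name}: {value:.1%} lower on the renumbered branch")
```

With these numbers that comes out to roughly 6% less elapsed time and about 4% fewer cycles for a single run on the Bavaria extract.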

The new numbering uses the partition information to sort border nodes first, which compacts storages that need access indexed by border node ID. We also get better cache performance for free, since we can also recursively sort the nodes by cell ID. This implements issue #3779.

TheMarex · 2017-06-08T14:49:58Z

This was addressed with #4089.

TheMarex added the MLD label Mar 6, 2017

TheMarex mentioned this issue Mar 7, 2017

Add graph with efficient border edge / internal edge scan #3782

Merged

3 tasks

TheMarex mentioned this issue Apr 18, 2017

Reduce file sizes #3955

Closed

danpat added this to the 5.8.0 milestone May 10, 2017

TheMarex self-assigned this May 10, 2017

TheMarex mentioned this issue May 24, 2017

Renumber graph nodes after partitioning them #4089

Merged

5 tasks

TheMarex closed this as completed Jun 8, 2017

oxidase mentioned this issue Mar 10, 2018

[Question] What is the effect of 'makePermutation' after 'bisectionToPartition' during partition #4942

Closed
