A Parallel Algorithm for the Single-Source Shortest Path Problem - PowerPoint PPT Presentation

A Parallel Algorithm for the Single-Source Shortest Path Problem Kevin Kelley, Tao Schardl May 12, 2010

Outline ◮ The Problem ◮ Dijkstra’s Algorithm ◮ Gabow’s Scaling Algorithm ◮ Optimizing Gabow ◮ Parallelizing Gabow ◮ Theoretical Performance of Gabow ◮ Empirical Performance of Gabow ◮ Future Work

The Problem Single Source Shortest Paths (SSSP) Given a graph G = ( V , E ) with non-negative edge weights and a starting vertex v 0 , find the shortest path from v 0 to every v ∈ V . Some Definitions: ◮ The weight of a path is the sum of the weights of the edges along that path. ◮ The length of a path is the number of edges along the path. ◮ A “shortest path” from v 0 is the minimum weight path from v 0 . ◮ The distance from v 0 to v , v . dist , is the sum of the weights on the minimum weight path from v 0 to v .

Dijkstra’s Algorithm Use a priority queue keyed on distance. 1. Set v 0 . dist = 0 and v . dist = ∞ for all v � = v 0 . 2. Create priority queue Q on all vertices in V . 3. While Q is not empty: 3.1 v = Extract-Min ( Q ) 3.2 For all u such that ( v , u ) ∈ E 3.2.1 u . dist = v . dist + w ( v , u ), where w ( v , u ) is the weight of edge ( v , u ). 3.2.2 Decrease-Key ( Q , u , u . dist ) Running Time: O ( E · T Decrease-Key + V · T Extract-Min ). Can we parallelize Dijkstra’s Algorithm? ◮ Priority queue is a serial bottleneck. ◮ Only definitely useful operation is to process the minimum element of priority queue at each step.

Gabow’s Scaling Algorithm Idea : Consider the edge weights one bit at a time. ◮ The weight of the minimum weight path from v 0 to v using just the most significant bit of the weight is an approximation for the weight of the minimum weight path from v 0 to v . ◮ Incrementally introduce additional bits of the weight to refine our approximation of the minimum weight paths. ◮ Once all of the bits of the weights are considered, we’re done.

Gabow’s Scaling Algorithm ◮ At each iteration, for some edge ( u , v ) we define the difference in approximate distances u . dist − v . dist to be the potential across ( u , v ). ◮ We define the cost of an edge to be its refined weight at some iteration plus the potential across it: l i ( u , v ) = w i ( u , v ) + u . dist − v . dist . ◮ Since the sum of costs along a path telescopes, these costs preserve the minimum weight paths in the graph. ◮ We guarantee that the cost of an edge is always nonnegative. ◮ = > We can repeatedly find minimum weight paths on graphs of cost values.

Optimizing Gabow We can restrict the size of the priority queue used on each step. ◮ The length of a path with p edges can increase by at most p on each subsequent iteration of Gabow. ◮ Let p i , max be the length of the longest minimum weight path after the i th iteration of Gabow. ◮ The sum of the costs on a minimum weight path during the i + 1st iteration can be no more than p i , max . ◮ The i th iteration of Gabow can find the minimum weight paths using a monotone priority queue with only p i − 1 , max bins.

Parallelizing Gabow Can we do it? ◮ The priority queue must store V items in p i , max bins. ◮ p i , max ≤ V , but we expect p i , max < V in many cases. ◮ ⇒ We expect bins to contain multiple items. ◮ We can process the contents of each bin in parallel.

Parallelizing Gabow Issues with parallelizing Gabow: ◮ Parallel threads will try to set the distance for a vertex simultaneously. We want the minimum distance to win. ◮ Parallel threads will be adding vertices to a priority queue in parallel. We want the priority queue to work properly anyway. ◮ A vertex may have many neighbors connected with zero-length edges. We need to manage these neighbors efficiently.

Parallelizing Gabow Race condition for distance value: “Double-setting” ◮ Let the race be. ◮ When removing a vertex from its minimum bin in the priority queue, ensure its distance value is correct before proceeding. ◮ At the point when a vertex is removed from its minimum bin, we know its correct distance. ◮ ⇒ The non-benign race becomes benign.

Parallelizing Gabow Parallel priority queue: ◮ Don’t use a Decrease-Key operation; just Insert . ◮ When we encounter a vertex we have evaluated already, skip it. ◮ Currently, we used a locked data structure for each bin to resolve a race for inserting into the same bin. ◮ Alternatively, use TLS for each bin to remove contention on writing to the same bin. ◮ Parallel threads can insert into the queue with no contention.

Parallelizing Gabow Zero-weight edges: ◮ Keep two buffers for each bin. Fill the second while processing the first. ◮ Once the first is done, if the second is non-empty, swap the buffers and repeat. ◮ If the second buffer gets sufficiently large, spawn off a separate thread to process it.

Theoretical Performance of Gabow Let G = ( V , E ) be a simple connected weighted directed graph. Let W be the maximum edge weight in G . Let ∆ be the maximum out-degree of a vertex v ∈ V . ◮ Work: Θ( E lg W ). ◮ Span: O ( V lg W lg ∆) worst-case. ◮ Bits of weight are processed serially in phases. Θ(lg W ) ◮ Within each phase, each bin in the priority queue is processed serially. ◮ Within a bin, the longest chain of vertices connected by zero-weight edges is processed serially. ◮ Edges on minimum weight paths from previous phase may have weight of 0 or 1. ◮ In the worst case, every vertex appears in some bin’s longest chain of zero-weight edges once. ◮ Total length of zero-weight edge chains in all bins is O ( V ) worst case. ◮ Each vertex has ∆ neighbors to explore, which requires O (lg ∆) span.

Theoretical Performance of Gabow Suppose we have random edge weights, and let D be the length of the longest minimum weight path in any phase of Gabow. ◮ Work: Θ( E lg W ). ◮ Span: O ( D lg W lg ∆ lg V / D ) ◮ Each phase must examine D bins serially. ◮ The length of the longest zero-weight edge chain in a bin is O (lg V / D ) with high probability. ◮ Total length of zero-weight edge chains in all bins is O ( D lg V / D ). ◮ Ω( E / V lg ∆) parallelism worst-case. ◮ Ω( E / ( D lg ∆ lg V / D )) parallelism with random edge weights.

Empirical Performance of Gabow We tested our parallel Gabow implementation on a few input graphs, including the New York and San Francisco Bay road networks. ◮ First, we collected metrics on the priority queue data structure during Gabow’s execution, including Bin size, Queue size, and longest zero-weight edge chain. ◮ Second, we compared Gabow’s serial and parallel performance to a simple Dijkstra implementation. The data presented here comes from running Gabow on the San Francisco Bay road network. V = 321270, E = 800172. Parallelism according to Cilkview: 4 . 76 (2 . 29 burdened)

Empirical Performance of Gabow Number of Evaluated Vertices in Each Bin (San Francisco Bay road network) 16 15 14 13 12 11 10 9 Bit 8 Median 7 Mean 6 Max 5 4 3 2 1 0 0 200 400 600 800 1000 1200 1400 1600 Number of Vertices

Empirical Performance of Gabow Number of Ignored Vertices in Each Bin (San Francisco Bay road network) 16 15 14 13 12 11 10 9 Bit 8 Mean 7 Max 6 5 4 3 2 1 0 0 500 1000 1500 2000 2500 3000 3500 Number of Vertices

Empirical Performance of Gabow Maximum Length of Zero-Weight Edge Chain (San Francisco Bay road network) 16 15 14 13 12 11 10 9 Bit 8 7 6 5 4 3 2 1 0 0 200 400 600 800 1000 1200 1400 Maximum Chain Length

Empirical Performance of Gabow Queue Size (San Francisco Bay road network): Min 523 Median 1026 Mean 20119 Max 321270 = V

Empirical Performance of Gabow Performance on San Francisco Bay road network: Dijkstra (ms) Gabow, 1 proc (ms) 791 5116 8 parallelism burdened speedup trials 7 6 5 Speedup 4 3 2 1 0 0 1 2 3 4 5 6 7 8 Processors

Future Work ◮ Remove lingering unnecessary serial code in parallel Gabow implementation. ◮ Use TLS for each bin in the priority queue, rather than a locked vector, to remove lingering contention. ◮ Investigate memory bandwidth issues. ◮ Experiment with alternative graph layouts.

Empirical Performance of Gabow as of 05-10-2010 Performance on random graph, V = 1 . 5 M , E = 4 M 8 7 6 5 Speedup 4 3 2 1 parallelism burdened speedup Parallel Gabow Dijkstra 0 0 1 2 3 4 5 6 7 8 Processors

Empirical Performance of Gabow as of 05-10-2010 Performance on road network for northeastern U.S. 8 7 6 5 Speedup 4 3 2 1 parallelism burdened speedup Parallel Gabow Dijkstra 0 0 1 2 3 4 5 6 7 8 Processors

A Parallel Algorithm for the Single-Source Shortest Path Problem - PowerPoint PPT Presentation

A Parallel Algorithm for the Single-Source Shortest Path Problem Kevin Kelley, Tao Schardl May 12, 2010 Outline The Problem Dijkstras Algorithm Gabows Scaling Algorithm Optimizing Gabow Parallelizing Gabow

= k w ( p ) w v ( , v ) i 1 i Single Single- -destination shortest

ECE 242 Data Structures Lecture 31 Shortest Path Algorithms November 30, 2009 ECE242 L31:

Graphs II - Shortest paths Single Source Shortest Paths All Sources Shortest Paths some drawings

Faster Parallel Algorithm for Approximate Shortest Path Jason Li (CMU) STOC 2020 March 2, 2020

4.4 Shortest Paths in a Graph shortest path from Princeton CS department to Einstein's house

Single-Source Shortest Paths Introduction Negative Weights and Cycles Initialize-Single-Source

Finding Shortest Paths Shortest Path Problem Shortest Path Problem We are given a graph G = ( V ,

Finding Shortest Paths Shortest Path Problem Shortest Path Problem Given a graph G = ( V , E )

Shortest Paths with Arbitrary Edge Weights Cormen et. al.24.1 Shortest Path Problem Shortest

Outline and Reading Weighted graphs (7.1) Shortest Paths Shortest path problem Shortest

CMSC 132: Object-Oriented Programming II Single Source Shortest Path Algorithm Department of

Shortest Paths Shortest Paths path between two given vertices path between two given vertices

TEDI: Efficient Shortest Path Query Answering on Graphs Fang Wei University of Freiburg SIGMOD

Mechanically checked proof on Dijkstras shortest path algorithm Qiang Zhang J Moore October

Network layer The Dijkstra Algorithm or Dijkstras Shortest Path First Algorithm Non-Adaptive

Dijkstra s Algorithm Shortest Path Problem Directed graph G = (V , E) Source s l e = length

CSE 101 Algorithm Design and Analysis Sanjoy Dasgupta, Russell Impagliazzo, Ragesh Jaiswal

Introduction Computer Science & Engineering 423/823 I Given a weighted, directed graph G = ( V

A New 3/2-Approximation Algorithm for the b-Edge Cover Problem Arif Khan Alex Pothen Computer

A Fast Max-Cut Algorithm on Planar Graphs Frauke Liers Gregor Pardella Institut fr Informatik

Exploring our data Edmund Hart Instructor DataCamp Network Analysis in R: Case Studies Bike

Spatio-temporal Analysis of Reverted Wikipedia Edits Johannes Kiesel , Martin Potthast, Matthias

COMP 2103 Programming 3 Part 4 Jim Diamond CAR 409 Jodrey School of Computer Science

Adversarial Machine Learning Daniel Lowd University of Oregon

A Parallel Algorithm for the Single-Source Shortest Path Problem - PowerPoint PPT Presentation

A Parallel Algorithm for the Single-Source Shortest Path Problem Kevin Kelley, Tao Schardl May 12, 2010 Outline The Problem Dijkstras Algorithm Gabows Scaling Algorithm Optimizing Gabow Parallelizing Gabow

= k w ( p ) w v ( , v ) i 1 i Single Single- -destination shortest

ECE 242 Data Structures Lecture 31 Shortest Path Algorithms November 30, 2009 ECE242 L31:

Graphs II - Shortest paths Single Source Shortest Paths All Sources Shortest Paths some drawings

Faster Parallel Algorithm for Approximate Shortest Path Jason Li (CMU) STOC 2020 March 2, 2020

4.4 Shortest Paths in a Graph shortest path from Princeton CS department to Einstein's house

Single-Source Shortest Paths Introduction Negative Weights and Cycles Initialize-Single-Source

Finding Shortest Paths Shortest Path Problem Shortest Path Problem We are given a graph G = ( V ,

Finding Shortest Paths Shortest Path Problem Shortest Path Problem Given a graph G = ( V , E )

Shortest Paths with Arbitrary Edge Weights Cormen et. al.24.1 Shortest Path Problem Shortest

Outline and Reading Weighted graphs (7.1) Shortest Paths Shortest path problem Shortest

CMSC 132: Object-Oriented Programming II Single Source Shortest Path Algorithm Department of

Shortest Paths Shortest Paths path between two given vertices path between two given vertices

TEDI: Efficient Shortest Path Query Answering on Graphs Fang Wei University of Freiburg SIGMOD

Mechanically checked proof on Dijkstras shortest path algorithm Qiang Zhang J Moore October

Network layer The Dijkstra Algorithm or Dijkstras Shortest Path First Algorithm Non-Adaptive

Dijkstra s Algorithm Shortest Path Problem Directed graph G = (V , E) Source s l e = length

CSE 101 Algorithm Design and Analysis Sanjoy Dasgupta, Russell Impagliazzo, Ragesh Jaiswal

Introduction Computer Science &amp; Engineering 423/823 I Given a weighted, directed graph G = ( V

A New 3/2-Approximation Algorithm for the b-Edge Cover Problem Arif Khan Alex Pothen Computer

A Fast Max-Cut Algorithm on Planar Graphs Frauke Liers Gregor Pardella Institut fr Informatik

Exploring our data Edmund Hart Instructor DataCamp Network Analysis in R: Case Studies Bike

Spatio-temporal Analysis of Reverted Wikipedia Edits Johannes Kiesel , Martin Potthast, Matthias

COMP 2103 Programming 3 Part 4 Jim Diamond CAR 409 Jodrey School of Computer Science

Adversarial Machine Learning Daniel Lowd University of Oregon

Introduction Computer Science & Engineering 423/823 I Given a weighted, directed graph G = ( V