Fast Shortest Path Distance Estimation in Large Networks Michalis - PowerPoint PPT Presentation

Fast Shortest Path Distance Estimation in Large Networks Michalis Potamias Francesco Bonchi Carlos Castillo Aristides Gionis

Context-aware Search …use shortest-path distance in wikipedia links-graph! S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 2

Social Search John searches Mary Ellie Mary B Ranking: 1. Mary A Jack 2. Mary B Jim Ron 3. Mary C John Mary C Mary A Joe Frodo …use shortest-path distance in friendship graph! S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 3

Problem and Solutions • DB: Graph G = ( V , E ) • Query: Nodes s and t in V • Goal: Compute fast shortest path d ( s , t ) • Exact Solution – BFS - Dijkstra – Bidirectional - Dijkstra with A* (aka ALT methods) • [Ikeda, 1994] [Pohl, 1971] [Goldberg and Harrelson, SODA 2005] • Heuristic Solution s t – Avoid traversals – Use Random Landmarks • [Kleinberg et al, FOCS 2004] [Vieira et al, CIKM 2007] u – Can we choose Better Landmarks ?!? S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 4

The Landmarks’ Method • Offline – Precompute distance of all nodes to a small set of nodes (landmarks) – Each node is associated with a vector with its SP-distance from each landmark (embedding) • Query-time – d ( s , t ) = ? – Combine the embeddings of s and t to get an estimate of the query S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 5

Contribution 1. Proved that covering the network with landmarks is NP-hard. 2. Devised heuristics for good landmarks. 3. Experiments with 5 large real-world networks and more than 30 heuristics. Comparison with state of the art. 4. Application to Social Search. S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 6

Algorithmic Framework • Triangle Inequality s t u • Observation: the case of equality S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 7

The Landmarks’ Method 1. Selection: Select k landmarks 2. Offline: Run k BFS/Dijkstra and store the embeddings of each node: Φ ( s ) = < d ( s, u 1 ), d ( s , u 2 ), … , d ( s, u k )> = < s 1 , s 2 , …, s k > 3. Query-time: d ( s , t ) = ? – Fetch Φ ( s ) and Φ ( t ) – Compute min i { s i + t i } (i.e. inf of UB) ... in time O ( k ) S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 8

Example query: d ( s , t ) d (_, u 1 ) d (_, u 2 ) d(_, u 3 ) d(_, u 4 ) Φ ( s ) 2 4 5 2 Φ ( t ) 3 5 1 4 UB 5 9 6 6 LB 1 1 4 2 S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 9

Coverage Using Upper Bounds • A landmark u covers a pair ( s , t ), if u lies on a shortest path from s to t • Problem Definition: find a set of k landmarks that cover as many pairs ( s , t ) in V x V as possible – NP-hard – k = 1 : node with the highest betweenness centrality – k > 1 : greedy set-cover (approximation - too expensive) …central nodes are a good start for devising heuristics! S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 0

Landmarks Selection: Basic Heuristics • Random (baseline) • Choose central nodes! – Degree – Closeness centrality • Closeness of u is the average distance of u to any vertex in G • Caveat: many central nodes may cover the same pairs: newly added landmarks should cover different pairs …spread the landmarks in the graph! S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 1

Constrained Heuristics • Remove immediate neighborhood 1. Rank all nodes according to Degree or Centrality 2. Iteratively choose the highest ranking nodes. Remove h -neighbors of each selected node from candidate set • Denote as – Degree/h – Closeness/h – Best results for h = 1 S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 2

Partitioning-based Heuristics • Use graph-partitioning to spread nodes. • Utilize any partitioning scheme and – Degree/P • Pick the node with the highest degree in each partition – Closeness/P • Pick the node with the highest closeness in each partition – Border/P • Pick the node closer to the border in each partition. Maximize the border-value that is given from the following formula: S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 3

Versus Random - error S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 5

Versus Random - triangulation random landmarks have theoretical guarantees [FOCS04] S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 6

Versus ALT - efficiency Ours (10%) 20 100 500 50 50 Operations ALT LB 60K 40K 80K 20K 2K Operations >300x >400x >160x >400x >40x ALT 7K 10K 20K 2K 2K Visited Nodes state of the art exact ALT methods [SODA05] S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 7

Social Search Task random landmarks have been used [CIKM07] S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 8

Conclusion • Novel search paradigms need distance as primitive – Approximations should be computed in milliseconds • Heuristic landmarks yield remarkable tradeoffs for SP- distance estimation in huge graphs – Hard to find the optimal landmarks – Border and Centrality heuristics: • outperform Random even by a factor of 250. • are, for a 10% error, many orders of magnitude faster than state of the art exact algorithms (ALT) • Future Work – Provide fast estimation for more graph primitives! S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 1 9

Thank you! ? S h o r t e s t P a t h s i n L a r g e N e t w o r k s @ C I K M 2 0 0 9 2 0

Fast Shortest Path Distance Estimation in Large Networks Michalis - PowerPoint PPT Presentation

Fast Shortest Path Distance Estimation in Large Networks Michalis Potamias Francesco Bonchi Carlos Castillo Aristides Gionis Context-aware Search use shortest-path distance in wikipedia links-graph! S h o r t e s t

Three Graph Algorithms Shortest Distance Paths Distance/Cost of a path in weighted graph sum of

Finding Shortest Paths Shortest Path Problem Shortest Path Problem We are given a graph G = ( V ,

ECE 242 Data Structures Lecture 31 Shortest Path Algorithms November 30, 2009 ECE242 L31:

Finding Shortest Paths Shortest Path Problem Shortest Path Problem Given a graph G = ( V , E )

4.4 Shortest Paths in a Graph shortest path from Princeton CS department to Einstein's house

Shortest Paths with Arbitrary Edge Weights Cormen et. al.24.1 Shortest Path Problem Shortest

Shortest Paths Shortest Paths path between two given vertices path between two given vertices

TEDI: Efficient Shortest Path Query Answering on Graphs Fang Wei University of Freiburg SIGMOD

Outline and Reading Weighted graphs (7.1) Shortest Paths Shortest path problem Shortest

Graphs II - Shortest paths Single Source Shortest Paths All Sources Shortest Paths some drawings

= k w ( p ) w v ( , v ) i 1 i Single Single- -destination shortest

Design of capacitated networks with unsplittable shortest path routing Andreas Bley Zuse

Shortest Paths Shortest path problem. Given a directed graph G = (V, E), with edge weights c vw ,

Distance Education Distance education used to be about the distance. 1700s 1800s 1900s 2000s

Mechanically checked proof on Dijkstras shortest path algorithm Qiang Zhang J Moore October

Network layer The Dijkstra Algorithm or Dijkstras Shortest Path First Algorithm Non-Adaptive

Landmark Landmark-based routing based routing Landmark Landmark-based routing based routing

SEIF, EnKF, EKF SLAM Pieter Abbeel UC Berkeley EECS Information Filter From an analytical point

Landmark-based Meta Best-First Search Bachelor Thesis Presentation By Samuel Hugger STRIPS

Landmark-Based and SLAM EKF localization with landmarks assume that a unicycle-like robot is

A Practical Approach to Landmark Deployment for Indoor Localization Yingying Chen, John Chen,

Network Discovery and Landmarks in Graphs Thomas Erlebach Joint work with: Zuzana Beerliova,

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image Denis Tom

Landmarks, the Universe, and Everything Julie Porteous Laura Sebastia J org Hoffmann