V-Combiner: Speeding-up Iterative Graph Processing on a - PowerPoint PPT Presentation

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging Azin Heidarshenas † , Serif Yesil † , Dimitrios Skarlatos † , Sasa Misailovic † , Adam Morrison*, Josep Torrellas † University of Illinois Urbana-Champaign † Tel-Aviv University* International Conference on Supercomputing (ICS), June 2020

Iterative graph processing 50-200 Iterations Page Rank parallel for v in vertices Community Detection for u in v.neighbors Update all vertices in parallel HITS … // update v Belief Propagation Computational complexity ∝ #Iterations Converged? yes Finish 2

Graph processing can be approximate Example : CEO of Company X wants to invest only on the most influential customers in their network Vertex Page Rank 1 0.0510103 4 3 0.0255164 hub hub 1 2 3 … 4 7.3626e-05 2000 2 5.16674e-05 2000 1000 Computing Page Ranks of Vertices 2 and 4 is useless. 3

Pruning graphs can be effective Removing useless computation Removing certain vertices / edges (pruning) Time Pre-processing Compute Build Graph Graph Algorithm Original graph Pre-processing Compute Prune Build Graph Graph Algorithm Approximate graph 4

Overview of Sparsification and K-core 4 4 1 2 3 1 2 3 Sparsification 1 K-core 2 Prunes only edges, Prunes vertices (along probabilistically from with their edges), until the dense regions remaining vertices have a degree of at least K [1] Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 2013 [2] K-core decomposition of large networks on a single PC, VLDB, 2015 5

Limitations of Sparsification and K-core Desirable speedup > 2x Accuracy is the ratio of vertices found in the top ranking. At the highest accuracy (~80%), Sparsification achieves 1.6x for Page Rank. Degree of pruning Accuracy is the ratio of vertices with correct communities. High speedup is achieved only at low Accuracy (<60%) for Community Detection. Degree of pruning 6

Addressing the Limitations 4 4 4 1 2 3 1 2 3 1 2 3 V-Combiner Sparsification 1 K-core 2 Prunes and merges certain Prunes only edges, Prunes vertices (along vertices into hubs ( in the probabilistically from with their edges), until the direction of information dense regions remaining vertices have a flow ), so that hubs stay degree of at least K connected to the rest of the graph [1] Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 2013 [2] K-core decomposition of large networks on a single PC, VLDB, 2015 7

Overview of V-Combiner Time Pre-processing Compute Build Graph Graph Algorithm Pre-processing Post-processing Compute Build Graph Graph Algorithm V-Combiner Baseline Prune + Merge Recovery More merging vs. pre-processing time vs. performance savings 8

Different Vertex Merging Scenarios Merge in-neighbors Example App. Information flow Edges Page Rank, Directed One-way Comm. Detection Merge in-neighbors Merge out-neighbors Directed Two-way HITS Merge all neighbors Belief Propagation Undirected Two-way 9

Classification of Vertices in V-Combiner Supernode : Large in-degree (but not too large) Large in-degree for supernode à More mergings per supernode Subnode : Small in- and out-degree, at least one supernode in its out- neighborhood Small in- and out-degree for subnode à Less distortion after pruning Regular : Neither a supernode nor a subnode Supernode 4 Subnode 1 2 3 Regular Regular 10

Prune + Merge in V-Combiner for e in edges //MERGE if e.dst is a subnode and e.src is NOT a subnode then // Increment in-degree of the supernode by one //PRUNE if e.src is a subnode and e.dst is NOT a subnode then // Decrement in-degree of the e.dst by one Vertex Old in-degree New in-degree 1 6 6 4 4 2 1 0 1 2 3 2 1 3 3 5 5 4 2 1 One increment and one decrement cancel out. 11

Recovery in V-Combiner No subnodes in the approximate graph Recover using the in-neighbors’ values and the graph algorithm operator • More efficient using Delta graph • As if an extra iteration of the algorithm is run, but only for the subnodes 4 4 2 1 3 1 2 3 Delta graph Approximate graph For Page Rank: Pr[2] = 0.85 Pr[1] / 2 + 0.15 12

Evaluation Setup End-to-end speedup measured. 44 Intel Xeon cores, no hyper-threading and DVFS 4 graph applications: • Page Rank (PR) • Community Detection (CD) • Hyperlink-Induced Topic Search (HITS) • Belief Propagation (BP) 5 graph inputs • Friendster social network (FS) • Twitter social network (TW) • Page-Level Domain graph (PLD) • Arabic domain network (AR) • Dbpedia network (DB) 13

Accuracy Metrics Top-K Accuracy: The ratio of vertices in the top ranking of the exact result that are also in the top ranking of the approximate result • Page Rank • HITS • Belief Propagation Classification Accuracy: The ratio of vertices that have been correctly assigned to their communities • Community Detection Accuracy threshold of 90%. 14

End-to-End Performance Algorithm Build 15

End-to-End Performance: V-Combiner Algorithm Prune/Merge Build Recovery 1.25 end-to-end speedup at mean accuracy of 91.8% 16

End-to-End Performance: Sparsification Algorithm Prune/Merge Build Recovery Sparsification fails to meet accuracy threshold in 1 benchmark 17

End-to-End Performance: K-core Algorithm Prune/Merge Build Recovery K-core fails to meet accuracy threshold in 4 benchmarks 18

More in the Paper • Details of other scenarios of the merging • Choosing the merging parameters • Algorithm performance and accuracy analysis • Analysis of connectivity • Analysis of the average length of the paths • Analysis of pruning/merging parameters • … 19

Take-away • Iterative graph processing is computationally expensive and can be approximate. • V-Combiner is a pruning + merging + recovery technique • It has the following advantages over the state-of-the-art pruning techniques: – Preserving average length of the paths – Maintaining connectivity – Improving load balancing – Modest pre-processing overhead 20

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging Azin Heidarshenas † , Serif Yesil † , Dimitrios Skarlatos † , Sasa Misailovic † , Adam Morrison*, Josep Torrellas † University of Illinois Urbana-Champaign † Tel-Aviv University*

V-Combiner: Speeding-up Iterative Graph Processing on a - PowerPoint PPT Presentation

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging Azin Heidarshenas , Serif Yesil , Dimitrios Skarlatos , Sasa Misailovic , Adam Morrison*, Josep Torrellas University of Illinois

Extension: Combiner Functions Recall earlier discussion about combiner function Pre-reduces

SLAC Project-X RF Development November 28 th 2012 Rosa Ciprian RF high power sources 325MHz &

Speeding up the Inter-Planetary File System (IPFS) Speeding up the Inter-Planetary File System

Speeding Up Your Mac A Joe ON Tech Guide Speeding Up Your Mac Basics Three factors affect

Basic Techniques II: Iterative Compression Marek Cygan Institute of Informatics University of

Chapter 12: Iterative Methods ES 240: Scientific and Engineering Computation. Iterative Methods

Development Figures are from : Agile and Iterative Development: A Manager's Guide, Craig

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Iterative methods for Image Processing Lothar Reichel Como, May 2018. Lecture 3: Block iterative

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

Transducer, Mode Splitter/Combiner X-Band OMT Design Review October 1, 2009 Gordon Coutts

Extension: Combiner Functions import org.apache.hadoop.io.IntWritable; import

Graph Covers and Iterative Decoding of Finite-Length Codes Pascal O. Vontobel (CSL, UIUC) Ralf

Batch & Stream Graph Processing with Apache Flink Vasia Kalavri vasia@apache.org @vkalavri

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Wheeler Road Virtual Community Meeting Summer 2020 Station 1 Speeding 2 Station 1

Lower Bounds on Lattice Enumeration with Extreme Pruning Yoshinori Aono Phong Nguyn Takenobu

C4.5 - pruning decision trees Quiz 1 Quiz 1 Q: Is a tree with only pure leafs always the best

Training Behavior of Sparse Neural Network Topologies Simon Alford, Ryan Robinett, Lauren

Problem with Heuristic Search YOSSI COHEN P R O F . A R I E L F E L N E R , D R . R O N I S

Parallel Test Generation and Execution with Korat Sasa Misailovic (Univ. of Belgrade) Aleksandar

Focusing on What Really Matters: Irrelevance Pruning in M&S Alvaro Torralba, Peter

A Novel Probabilistic Pruning Approach to Speed Up Similarity Queries in Uncertain Databases

More on games (Ch. 5.4-5.6) Review: Minimax Afro Deli Shuang Cheng Cheese- Fried Lo Mein

V-Combiner: Speeding-up Iterative Graph Processing on a - PowerPoint PPT Presentation

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging Azin Heidarshenas , Serif Yesil , Dimitrios Skarlatos , Sasa Misailovic , Adam Morrison*, Josep Torrellas University of Illinois

Extension: Combiner Functions Recall earlier discussion about combiner function Pre-reduces

SLAC Project-X RF Development November 28 th 2012 Rosa Ciprian RF high power sources 325MHz &amp;

Speeding up the Inter-Planetary File System (IPFS) Speeding up the Inter-Planetary File System

Speeding Up Your Mac A Joe ON Tech Guide Speeding Up Your Mac Basics Three factors affect

Basic Techniques II: Iterative Compression Marek Cygan Institute of Informatics University of

Chapter 12: Iterative Methods ES 240: Scientific and Engineering Computation. Iterative Methods

Development Figures are from : Agile and Iterative Development: A Manager's Guide, Craig

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Iterative methods for Image Processing Lothar Reichel Como, May 2018. Lecture 3: Block iterative

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

Transducer, Mode Splitter/Combiner X-Band OMT Design Review October 1, 2009 Gordon Coutts

Extension: Combiner Functions import org.apache.hadoop.io.IntWritable; import

Graph Covers and Iterative Decoding of Finite-Length Codes Pascal O. Vontobel (CSL, UIUC) Ralf

Batch &amp; Stream Graph Processing with Apache Flink Vasia Kalavri vasia@apache.org @vkalavri

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Wheeler Road Virtual Community Meeting Summer 2020 Station 1 Speeding 2 Station 1

Lower Bounds on Lattice Enumeration with Extreme Pruning Yoshinori Aono Phong Nguyn Takenobu

C4.5 - pruning decision trees Quiz 1 Quiz 1 Q: Is a tree with only pure leafs always the best

Training Behavior of Sparse Neural Network Topologies Simon Alford, Ryan Robinett, Lauren

Problem with Heuristic Search YOSSI COHEN P R O F . A R I E L F E L N E R , D R . R O N I S

Parallel Test Generation and Execution with Korat Sasa Misailovic (Univ. of Belgrade) Aleksandar

Focusing on What Really Matters: Irrelevance Pruning in M&amp;S Alvaro Torralba, Peter

A Novel Probabilistic Pruning Approach to Speed Up Similarity Queries in Uncertain Databases

More on games (Ch. 5.4-5.6) Review: Minimax Afro Deli Shuang Cheng Cheese- Fried Lo Mein

SLAC Project-X RF Development November 28 th 2012 Rosa Ciprian RF high power sources 325MHz &

Batch & Stream Graph Processing with Apache Flink Vasia Kalavri vasia@apache.org @vkalavri

Focusing on What Really Matters: Irrelevance Pruning in M&S Alvaro Torralba, Peter