SLIDE 1 PowerGraph: Distributed Graph-Parallel Computation
Gonzalez et al. Presented by James Trever
SLIDE 2 What are Graphs?
Graphs are everywhere and used to encode relationships
SLIDE 3 So what are they used for?
- Targeted ads
- Natural Language Processing
- Machine Learning
- Data Mining
Connecting people and information.
SLIDE 4
Natural Graphs
Graphs derived from real world phenomena
SLIDE 5 Challenges with Natural Graphs
Power-Law Degree Distribution
SLIDE 6
SLIDE 7 Graph-Parallel Abstraction
- A Vertex-Program, designed by the user, runs on every vertex
- Vertex-Programs interact with one another along their edges
- Multiple Vertex-Programs are run simultaneously
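The abstraction above can be sketched as a toy synchronous engine. The names here (`Graph`, `run_round`, `average_program`) are illustrative only, not any system's real API:

```python
# A minimal sketch of the graph-parallel abstraction: a user-defined
# vertex program runs on every vertex, reading neighbours' values from
# the previous round. Names are illustrative, not a real API.

class Graph:
    def __init__(self, edges):
        self.neighbours = {}
        for u, v in edges:
            self.neighbours.setdefault(u, set()).add(v)
            self.neighbours.setdefault(v, set()).add(u)

def run_round(graph, values, vertex_program):
    # Conceptually "simultaneous": every program sees the same
    # previous-round values, so evaluation order does not matter.
    return {v: vertex_program(v, values, graph.neighbours[v])
            for v in graph.neighbours}

# Example vertex program: average the neighbourhood's values.
def average_program(v, values, nbrs):
    return sum(values[u] for u in nbrs) / len(nbrs)
```

One call to `run_round` corresponds to one round of all vertex programs interacting along their edges.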
SLIDE 8 Challenges with Natural Graphs
- Power-law graphs are very difficult to partition/cut
- Partitioning them often incurs a large communication or storage overhead
SLIDE 9 Existing Systems
Pregel & GraphLab
SLIDE 10 Pregel
- Bulk Synchronous Message Passing Abstraction
- Uses messages to communicate with other vertices
- Waits until all vertex programs have finished before starting the next “superstep”
SLIDE 11 Pregel
Fan-In Fan-Out
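The bulk-synchronous model above can be sketched as follows; this is not Pregel's real API, just an illustrative driver in which each superstep every active vertex consumes its inbox, updates its value, and may message its out-neighbours, with a barrier between supersteps:

```python
# Sketch of Pregel-style bulk-synchronous message passing (names assumed).

def pregel(out_edges, values, compute, max_supersteps=30):
    inboxes = {v: [] for v in values}
    active = set(values)                  # all vertices start active
    for step in range(max_supersteps):
        if not active:                    # every vertex has voted to halt
            break
        outboxes = {v: [] for v in values}
        for v in active:
            values[v], msg = compute(step, v, values[v], inboxes[v])
            if msg is not None:
                for u in out_edges.get(v, []):
                    outboxes[u].append(msg)
        inboxes = outboxes                # barrier: swap message sets
        active = {v for v in values if inboxes[v]}
    return values

# Example vertex program: propagate the maximum value through the graph.
def max_compute(step, v, val, inbox):
    new = max([val] + inbox)
    # Send only on the first superstep or when our value changed,
    # so the computation can terminate.
    return new, (new if step == 0 or new != val else None)
```

Note that for a high-degree vertex all incoming messages land in one inbox (fan-in) and one update is broadcast to every out-neighbour (fan-out), which is exactly where power-law degree distributions hurt.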
SLIDE 12 GraphLab
- Asynchronous Distributed Shared-Memory Abstraction
- Vertex-Programs have shared access to a distributed graph, with data stored on each vertex and edge; a program can access the current vertex, adjacent edges, and adjacent vertices irrespective of edge direction
- Vertex-Programs have the ability to schedule other vertices’ execution in the future
SLIDE 13 GraphLab
GraphLab Ghosting
SLIDE 14
Challenges with Natural Graphs
SLIDE 15
PowerGraph
SLIDE 16 PowerGraph
- GAS Decomposition
- Distribute Vertex-Programs
- Parallelise high degree vertices
- Vertex Partitioning
- Distribute power-law graphs more efficiently
SLIDE 17
GAS Decomposition
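GAS splits a vertex program into Gather (collect and sum contributions from neighbours), Apply (compute the new vertex value), and Scatter (update edges / signal neighbours). A sketch using PageRank, the paper's running example, with an illustrative synchronous driver:

```python
# Sketch of Gather-Apply-Scatter for PageRank; names and the simple
# synchronous driver are illustrative, not PowerGraph's actual API.

DAMPING = 0.85

def gather(nbr_rank, nbr_out_degree):
    # Gather: each in-neighbour contributes rank / out-degree.
    # Gathers are combined with a commutative, associative sum,
    # which is what lets PowerGraph parallelise high-degree vertices.
    return nbr_rank / nbr_out_degree

def apply_(total):
    # Apply: fold the gathered sum into the new rank.
    return (1 - DAMPING) + DAMPING * total

def pagerank(in_edges, out_degree, n_iters=20):
    ranks = {v: 1.0 for v in in_edges}
    for _ in range(n_iters):
        # Scatter is implicit here: every vertex simply re-runs each
        # iteration instead of being signalled by its neighbours.
        ranks = {v: apply_(sum(gather(ranks[u], out_degree[u])
                               for u in in_edges[v]))
                 for v in in_edges}
    return ranks
```

Because the gather sum is associative and commutative, partial sums can be computed on each machine holding a piece of a high-degree vertex and then combined.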
SLIDE 18 Vertex Partitioning
Edge Cuts vs. Vertex Cuts
SLIDE 19
Vertex Partitioning
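With a vertex cut, edges are assigned to machines and a vertex is replicated on every machine that holds one of its edges; partition quality is measured by the replication factor (average number of copies per vertex). A small sketch of that measurement, with assumed names:

```python
# Sketch: given an edge-to-machine assignment (a vertex cut), compute
# the replication factor — the average number of machine copies of a
# vertex. Lower is better; names here are illustrative.

def replication_factor(assignment):
    # assignment: {(u, v): machine_id}
    machines_of = {}
    for (u, v), m in assignment.items():
        machines_of.setdefault(u, set()).add(m)
        machines_of.setdefault(v, set()).add(m)
    return sum(len(ms) for ms in machines_of.values()) / len(machines_of)
```

For example, placing edge (0, 1) on machine 0 and edge (1, 2) on machine 1 replicates vertex 1 twice, giving a factor of 4/3.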
SLIDE 20 How the vertices are partitioned
- Evenly assign edges to machines
- 3 different approaches:
  - Random edge placement
  - Greedy placement, in two variants:
    - Coordinated edge placement
    - Oblivious edge placement
SLIDE 21
Random Edge Placements
SLIDE 22 Greedy Edge Placements
- Place each edge on a machine that already holds a replica of one of its vertices
- If there are multiple options, choose the least loaded machine
SLIDE 23 Greedy Edge Placements
- Minimises the expected number of machines spanned by each vertex
- Coordinated:
- Requires coordination to place each edge
- Slower but has higher quality cuts
- Oblivious:
- Approximate greedy objective without coordination
- Faster but lower quality cuts
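The greedy rule above can be sketched roughly as follows; this is a simplified version of the heuristic (the paper's full rule has more cases), with assumed names: prefer machines already holding replicas of both endpoints, then of either endpoint, breaking ties by lowest load.

```python
# Simplified sketch of greedy edge placement for a vertex cut.

def greedy_place(edge, machines_of, load):
    u, v = edge
    mu = machines_of.setdefault(u, set())   # machines holding u replicas
    mv = machines_of.setdefault(v, set())   # machines holding v replicas
    # Prefer: both endpoints present > either present > any machine.
    candidates = (mu & mv) or (mu | mv) or set(range(len(load)))
    m = min(candidates, key=lambda c: load[c])  # least-loaded tie-break
    mu.add(m)
    mv.add(m)
    load[m] += 1
    return m
```

In the coordinated variant the `machines_of` table is shared and kept exact across machines; in the oblivious variant each machine runs this rule on its own approximate view, trading cut quality for speed.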
SLIDE 24
Experiments - Graph Partitioning
SLIDE 25
Experiments - Synthetic Work Imbalance and Communication
SLIDE 26
Experiments - Synthetic Runtime
SLIDE 27
Experiments - Machine Learning
SLIDE 28 Other Features
- 3 different execution modes:
- Bulk Synchronous
- Asynchronous
- Asynchronous Serialisable
- Delta Caching
SLIDE 29 Critical Evaluation
- Much discussion of performance, but few experiments directly comparing systems
- Delta caching is only briefly touched on
- Future work lacks detail
- Many claims are left unsubstantiated
- Greedy edge placement is not explained very clearly
- No mention of fault tolerance
SLIDE 30 Bibliography
- J. Gonzalez, Y. Low, H. Gu, D. Bickson, and C. Guestrin: PowerGraph: distributed graph-parallel computation on natural graphs. OSDI, 2012.
- Original presentation by J. Gonzalez: http://www.cs.berkeley.edu/~jegonzal/talks/powergraph_osdi12.pptx