

slide-1
SLIDE 1

Graph Processing Frameworks

Lecture 24 CSCI 4974/6971 5 Dec 2016

1 / 13
slide-2
SLIDE 2

Today’s Biz

  • 1. Reminders
  • 2. Review
  • 3. Graph Processing Frameworks
  • 4. 2D Partitioning
2 / 13
slide-3
SLIDE 3

Reminders

◮ Assignment 6: due date Dec 8th
◮ Final Project Presentation: December 8th
◮ Project Report: December 11th
  ◮ Intro, Background and Prior Work, Methodology, Experiments, Results
  ◮ Include: Report as PDF, compilable source, data if small or link if large (Google Drive, linux.cs.rpi.edu, or CCI filesystems)
◮ Office hours: Tuesday & Wednesday 14:00-16:00, Lally 317
  ◮ Or email me for other availability

3 / 13
slide-4
SLIDE 4

Today’s Biz

  • 1. Reminders
  • 2. Review
  • 3. Graph Processing Frameworks
  • 4. 2D Partitioning
4 / 13
slide-5
SLIDE 5

Quick Review

Graphs on Manycores:

◮ Manycores: Xeon Phis and GPUs

◮ Hundreds to thousands of cores, even more threads
◮ Work balance among threads is king

5 / 13
slide-6
SLIDE 6

Today’s Biz

  • 1. Reminders
  • 2. Review
  • 3. Graph Processing Frameworks
  • 4. 2D Partitioning
6 / 13
slide-7
SLIDE 7

PREGEL

A System for Large-Scale Graph Processing

slide-8
SLIDE 8

The Problem

  • Large graphs are often part of the computations required in modern systems (social networks, Web graphs, etc.).
  • There are many graph computing problems, like shortest path, clustering, PageRank, minimum cut, and connected components, but there exists no scalable general-purpose system for implementing them.

2 Pregel
slide-9
SLIDE 9

Characteristics of the algorithms

  • They often exhibit poor locality of memory access.
  • Very little computation work is required per vertex.
  • Changing degree of parallelism over the course of execution.

Refer [1, 2]

3 Pregel
slide-10
SLIDE 10

Possible solutions

  • Crafting a custom distributed framework for every new algorithm.
  • Existing distributed computing platforms like MapReduce.
    – These are sometimes used to mine large graphs [3, 4], but often give sub-optimal performance and have usability issues.
  • Single-computer graph algorithm libraries.
    – Limiting the scale of the graph is necessary.
    – BGL, LEDA, NetworkX, JDSL, Stanford GraphBase, or FGL.
  • Existing parallel graph systems, which do not handle fault tolerance and other issues.
    – The Parallel BGL [5] and CGMgraph [6].

Pregel 4
slide-11
SLIDE 11

Pregel

To overcome these challenges, Google came up with Pregel.

  • Provides scalability
  • Fault-tolerance
  • Flexibility to express arbitrary algorithms

The high level organization of Pregel programs is inspired by Valiant’s Bulk Synchronous Parallel model[7].

Pregel 5
slide-12
SLIDE 12

Message passing model

A pure message passing model has been used, omitting remote reads and other ways to emulate shared memory, because:

  • 1. The message passing model was found sufficient for all graph algorithms.
  • 2. The message passing model performs better than reading remote values, because latency can be amortized by delivering large batches of messages asynchronously.

Pregel 6
slide-13
SLIDE 13

Message passing model

Pregel 7
slide-14
SLIDE 14

Example

Find the largest value of a vertex in a strongly connected graph

8 Pregel
slide-15
SLIDE 15

[Figure: supersteps of the max-value example; the value 6 propagates along edges until every vertex holds 6 and votes to halt]

Blue Arrows are messages Blue vertices have voted to halt

9 Pregel

6 Finding the largest value in a graph

slide-16
SLIDE 16

Basic Organization

  • Computations consist of a sequence of iterations called supersteps.
  • During a superstep, the framework invokes a user-defined function for each vertex, which specifies the behavior at a single vertex V in a single superstep S. The function can:
    – Read messages sent to V in superstep S-1
    – Send messages to other vertices that will be received in superstep S+1
    – Modify the state of V and of its outgoing edges
    – Make topology changes (introduce/delete/modify edges/vertices)

10 Pregel
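To make the superstep contract above concrete, here is a minimal sketch of what such a user-defined vertex function looks like in Pregel's C++ style. The Vertex template, MessageIterator, and helper methods mirror the names used in the code shown later in these slides; this is an illustrative shape, not a complete compilable Pregel program.

    // Sketch: a vertex program that keeps a running sum of incoming messages.
    class SumVertex : public Vertex<double, void, double> {
     public:
      virtual void Compute(MessageIterator* msgs) {
        double sum = GetValue();
        for (; !msgs->Done(); msgs->Next())   // read messages from superstep S-1
          sum += msgs->Value();
        *MutableValue() = sum;                // modify the state of vertex V
        SendMessageToAllNeighbors(sum);       // delivered in superstep S+1
        VoteToHalt();                         // go inactive until a message arrives
      }
    };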
slide-17
SLIDE 17

Basic Organization - Superstep

11 Pregel
slide-18
SLIDE 18

Model Of Computation: Entities

VERTEX

  • Identified by a unique identifier.
  • Has a modifiable, user defined value.

EDGE

  • Source vertex and Target vertex identifiers.
  • Has a modifiable, user defined value.
Pregel 12
slide-19
SLIDE 19

Model Of Computation: Progress

  • In superstep 0, all vertices are active.
  • Only active vertices participate in a superstep.

– They can go inactive by voting for halt. – They can be reactivated by an external message from another vertex.

  • The algorithm terminates when all vertices

have voted for halt and there are no messages in transit.

13 Pregel
slide-20
SLIDE 20

Model Of Computation: Vertex

State machine for a vertex

14 Pregel
slide-21
SLIDE 21

Comparison with MapReduce

Graph algorithms can be implemented as a series of MapReduce invocations, but this requires passing the entire state of the graph from one stage to the next, which is not the case with Pregel. The Pregel framework also reduces programming complexity by using supersteps.

15 Pregel
slide-22
SLIDE 22

The C++ API

Creating a Pregel program typically involves subclassing the predefined Vertex class.

  • The user overrides the virtual Compute() method, which is executed for every active vertex in each superstep.
  • Compute() can get the vertex's associated value with GetValue() or modify it using MutableValue().
  • Values of edges can be inspected and modified using the out-edge iterator.

16 Pregel
slide-23
SLIDE 23

The C++ API – Message Passing

Each message consists of a value and the name of the destination vertex.

  – The type of the value is specified in the template parameter of the Vertex class.

Any number of messages can be sent in a superstep.

  – The framework guarantees delivery and non-duplication, but not in-order delivery.

A message can be sent to any vertex if its identifier is known.

17 Pregel
slide-24
SLIDE 24

The C++ API – Pregel Code

Pregel code for finding the max value:

class MaxFindVertex : public Vertex<double, void, double> {
 public:
  virtual void Compute(MessageIterator* msgs) {
    double currMax = GetValue();
    SendMessageToAllNeighbors(currMax);
    for ( ; !msgs->Done(); msgs->Next()) {
      if (msgs->Value() > currMax)
        currMax = msgs->Value();
    }
    if (currMax > GetValue())
      *MutableValue() = currMax;
    else
      VoteToHalt();
  }
};

18 Pregel
slide-25
SLIDE 25

The C++ API – Combiners

Sending a message to a vertex that exists on a different machine has some overhead. However, if the algorithm doesn't need each message individually but only a function of them (for example, their sum), then combiners can be used. This is done by overriding the Combine() method.

  • It can be used only for associative and commutative operations.

19 Pregel
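A hedged sketch of a Max combiner for the earlier MaxValue example: the Combiner template, MessageIterator, and an Output() call that emits the combined message are assumed to follow the combiner interface described in the Pregel paper, which is not reproduced on these slides.

    // Sketch: collapse many max-value messages headed to one vertex into a single message.
    class MaxCombiner : public Combiner<double> {
     public:
      virtual void Combine(MessageIterator* msgs) {
        double max_val = msgs->Value();
        for (msgs->Next(); !msgs->Done(); msgs->Next())
          if (msgs->Value() > max_val) max_val = msgs->Value();  // max is associative and commutative
        Output("combined_max", max_val);   // one message replaces the whole batch
      }
    };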
slide-26
SLIDE 26

The C++ API – Combiners Example:

Say we want to count the number of incoming links to all the pages in a set of interconnected pages. In the first iteration, for each link from a vertex (page) we send a message to the destination page. Here, a count function over the incoming messages can be used as a combiner to optimize performance. In the MaxValue example, a Max combiner would reduce the communication load.

20 Pregel
slide-27
SLIDE 27

The C++ API – Combiners

21 Pregel
slide-28
SLIDE 28

The C++ API – Aggregators

Aggregators are used for global communication, monitoring, and data.

Each vertex can produce a value in a superstep S for the aggregator to use. The aggregated value is available to all vertices in superstep S+1.

Aggregators can be used for statistics and for global communication. They are implemented by subclassing the Aggregator class.

Commutativity and associativity are required.

22 Pregel
slide-29
SLIDE 29

The C++ API – Aggregators

Example:

A Sum operator applied to the out-edge count of each vertex can be used to generate the total number of edges in the graph and communicate it to all the vertices.

More complex reduction operators can even generate histograms. In the MaxValue example, we can finish the entire program in a single superstep by using a Max aggregator.

23 Pregel
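As an illustration of the aggregator idea, here is a sketch of a sum aggregator that could total the out-edge counts contributed by each vertex. The method names (Init/Aggregate/GetValue) are assumptions made for illustration, not the documented Pregel interface.

    // Sketch: a commutative, associative sum aggregator.
    class EdgeCountAggregator : public Aggregator<int64> {
     public:
      virtual void Init() { sum_ = 0; }
      virtual void Aggregate(int64 out_degree) { sum_ += out_degree; }  // called once per contributing vertex
      virtual int64 GetValue() const { return sum_; }  // visible to all vertices in superstep S+1
     private:
      int64 sum_;
    };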
slide-30
SLIDE 30

The C++ API – Topology Mutations

The Compute() function can also be used to modify the structure of the graph.

Example: Hierarchical Clustering

Mutations take effect in the superstep after the requests were issued. Ordering of mutations, with

  – deletions taking place before additions,
  – deletion of edges before vertices, and
  – addition of vertices before edges

resolves most of the conflicts. Rest are handled by user-defined handlers.

24 Pregel
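As a purely hypothetical sketch of how Compute() might request mutations during hierarchical clustering: the request-style call names and helper functions below are illustrative assumptions; the actual Pregel mutation API is not shown on these slides.

    // Hypothetical sketch of issuing mutation requests from Compute().
    virtual void Compute(MessageIterator* msgs) {
      if (ShouldCollapseIntoCenter()) {                    // hypothetical clustering decision
        RemoveEdgeRequest(vertex_id(), old_neighbor_id_);  // hypothetical call; deletions are applied first
        AddEdgeRequest(vertex_id(), center_id_, 1.0);      // hypothetical call; additions are applied after
      }
      VoteToHalt();                                        // mutations take effect in the next superstep
    }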
slide-31
SLIDE 31

Implementation

Pregel is designed for the Google cluster architecture. The architecture schedules jobs to optimize resource allocation, involving killing instances or moving them to different locations. Persistent data is stored as files on a distributed storage system like GFS[8] or BigTable.

25 Pregel
slide-32
SLIDE 32

Basic Architecture

The Pregel library divides a graph into partitions based on the vertex ID, each consisting of a set of vertices and all of those vertices' outgoing edges. The default partition function is hash(ID) mod N, where N is the number of partitions. The next few slides describe the several stages of the execution of a Pregel program.
26 Pregel
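A minimal standalone sketch of the default assignment just described, with std::hash standing in for whatever hash function Pregel actually uses:

    #include <cstdint>
    #include <functional>

    // Default-style partitioning: partition = hash(ID) mod N.
    uint64_t PartitionOf(uint64_t vertex_id, uint64_t num_partitions) {
      return std::hash<uint64_t>{}(vertex_id) % num_partitions;
    }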
slide-33
SLIDE 33

Pregel Execution

  • 1. Many copies of the user program begin

executing on a cluster of machines. One of these copies acts as the master. The master is not assigned any portion of the graph, but is responsible for coordinating worker activity.

27 Pregel
slide-34
SLIDE 34

Pregel Execution

  • 2. The master determines how many partitions

the graph will have and assigns one or more partitions to each worker machine. Each worker is responsible for maintaining the state of its section of the graph, executing the user’s Compute() method on its vertices, and managing messages to and from other workers.

28 Pregel
slide-35
SLIDE 35

Pregel Execution

29 Pregel


slide-36
SLIDE 36

Pregel Execution

  • 3. The master assigns a portion of the user's input to each worker. The input is treated as a set of records, each of which contains an arbitrary number of vertices and edges. After the input has finished loading, all vertices are marked as active.

30 Pregel
slide-37
SLIDE 37

Pregel Execution

  • 4. The master instructs each worker to perform a superstep. The worker loops through its active vertices and calls Compute() for each active vertex. It also delivers messages that were sent in the previous superstep. When the worker finishes, it responds to the master with the number of vertices that will be active in the next superstep.

31 Pregel
slide-38
SLIDE 38

Pregel Execution

32 Pregel
slide-39
SLIDE 39

Pregel Execution

33 Pregel
slide-40
SLIDE 40

Fault Tolerance

  • Checkpointing is used to implement fault

tolerance.

  – At the start of every superstep, the master may instruct the workers to save the state of their partitions to stable storage.
  – This includes vertex values, edge values, and incoming messages.

  • Master uses “ping“ messages to detect worker

failures.

34 Pregel
slide-41
SLIDE 41

Fault Tolerance

  • When one or more workers fail, their

associated partitions’ current state is lost.

  • Master reassigns these partitions to available

set of workers.

– They reload their partition state from the most recent available checkpoint. This can be many steps old. – The entire system is restarted from this superstep.

  • Confined recovery can be used to reduce this

load

35 Pregel
slide-42
SLIDE 42

Applications

PageRank

36 Pregel
slide-43
SLIDE 43

PageRank

PageRank is a link analysis algorithm that is used to determine the importance of a document based on the number of references to it and the importance of the source documents themselves. [This was named after Larry Page (and not after rank of a webpage)]

37 Pregel
slide-44
SLIDE 44

PageRank

A = a given page
T1 ... Tn = pages that point to page A (citations)
d = damping factor between 0 and 1 (usually kept at 0.85)
C(T) = number of links going out of T
PR(A) = the PageRank of page A

PR(A) = (1 - d) + d * ( PR(T1)/C(T1) + PR(T2)/C(T2) + ... + PR(Tn)/C(Tn) )

38 Pregel
slide-45
SLIDE 45

PageRank

Courtesy: Wikipedia

39 Pregel
slide-46
SLIDE 46

PageRank

40 Pregel

PageRank can be solved in 2 ways:

  • A system of linear equations
  • An iterative loop till convergence

We look at the pseudo code of iterative version

Initial value of PageRank of all pages = 1.0;
while (sum of PageRank of all pages - numPages > epsilon) {
  for each page Pi in list {
    PageRank(Pi) = (1 - d);
    for each page Pj linking to page Pi {
      PageRank(Pi) += d * (PageRank(Pj) / numOutLinks(Pj));
    }
  }
}

slide-47
SLIDE 47

PageRank in MapReduce – Phase I

Parsing HTML

  • Map task takes (URL, page content) pairs and

maps them to (URL, (PRinit, list-of-urls))

– PRinit is the “seed” PageRank for URL – list-of-urls contains all pages pointed to by URL

  • Reduce task is just the identity function
41 Pregel
slide-48
SLIDE 48

PageRank in MapReduce – Phase 2

PageRank Distribution

  • Map task takes (URL, (cur_rank, url_list))

– For each u in url_list, emit (u, cur_rank/|url_list|) – Emit (URL, url_list) to carry the points-to list along through iterations

  • Reduce task gets (URL, url_list) and many

(URL, val) values

– Sum vals and fix up with d – Emit (URL, (new_rank, url_list))

42 Pregel
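To make the Phase 2 map and reduce concrete, here is a small single-process C++ sketch that simulates one iteration: "emit" is modeled by writing into in-memory maps, the type and function names are illustrative, and d = 0.85.

    #include <map>
    #include <string>
    #include <vector>

    struct PageState { double rank; std::vector<std::string> out_links; };

    // One Phase-2 iteration over all pages (map and reduce collapsed into one pass each).
    std::map<std::string, PageState> PhaseTwo(const std::map<std::string, PageState>& pages,
                                              double d = 0.85) {
      std::map<std::string, double> partial;                      // reduce input: summed (URL, val) pairs
      std::map<std::string, std::vector<std::string>> structure;  // carried points-to lists
      // Map: for each u in url_list, emit (u, cur_rank / |url_list|); also emit (URL, url_list).
      for (const auto& [url, st] : pages) {
        structure[url] = st.out_links;
        for (const auto& u : st.out_links)
          partial[u] += st.rank / st.out_links.size();
      }
      // Reduce: sum the vals, fix up with d, emit (URL, (new_rank, url_list)).
      std::map<std::string, PageState> next;
      for (const auto& [url, links] : structure)
        next[url] = PageState{(1.0 - d) + d * partial[url], links};
      return next;
    }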
slide-49
SLIDE 49

PageRank in MapReduce - Finalize

  • A non-parallelizable component

determines whether convergence has been achieved

  • If so, write out the PageRank lists -

done

  • Otherwise, feed output of Phase 2

into another Phase 2 iteration

43 Pregel
slide-50
SLIDE 50

PageRank in Pregel

class PageRankVertex : public Vertex<double, void, double> {
 public:
  virtual void Compute(MessageIterator* msgs) {
    if (superstep() >= 1) {
      double sum = 0;
      for (; !msgs->Done(); msgs->Next())
        sum += msgs->Value();
      *MutableValue() = 0.15 + 0.85 * sum;
    }
    if (superstep() < 30) {
      const int64 n = GetOutEdgeIterator().size();
      SendMessageToAllNeighbors(GetValue() / n);
    } else {
      VoteToHalt();
    }
  }
};

44 Pregel
slide-51
SLIDE 51

PageRank in Pregel

The Pregel implementation contains the PageRankVertex class, which inherits from the Vertex class. The class uses the vertex value type double to store the tentative PageRank and the message type double to carry PageRank fractions. The graph is initialized so that in superstep 0, the value of each vertex is 1.0.

45 Pregel
slide-52
SLIDE 52

PageRank in Pregel

In each superstep, each vertex sends out along each outgoing edge its tentative PageRank divided by the number of outgoing edges. Each vertex also sums up the values arriving on messages into sum and sets its own tentative PageRank to 0.15 + 0.85 × sum. For convergence, either there is a limit on the number of supersteps or aggregators are used to detect convergence.

46 Pregel


slide-53
SLIDE 53

Apache Giraph

Large-scale Graph Processing on Hadoop Claudio Martella <claudio@apache.org> @claudiomartella Hadoop Summit @ Amsterdam - 3 April 2014

slide-54
SLIDE 54

2

slide-55
SLIDE 55

Graphs are simple

3

slide-56
SLIDE 56

A computer network

4

slide-57
SLIDE 57

A social network

5

slide-58
SLIDE 58

A semantic network

6

slide-59
SLIDE 59

A map

7

slide-60
SLIDE 60

Graphs are huge

  • Google's index contains 50B pages
  • Facebook has around 1.1B users
  • Google+ has around 570M users
  • Twitter has around 530M users

VERY rough estimates!

8

slide-61
SLIDE 61

9

slide-62
SLIDE 62

Graphs aren’t easy

10

slide-63
SLIDE 63

Graphs are nasty.

11

slide-64
SLIDE 64

Each vertex depends on its neighbours, recursively.

12

slide-65
SLIDE 65

Recursive problems are nicely solved iteratively.

13

slide-66
SLIDE 66

PageRank in MapReduce

  • Record: < v_i, pr, [ v_j, ..., v_k ] >
  • Mapper: emits < v_j, pr / #neighbours >
  • Reducer: sums the partial values

14

slide-67
SLIDE 67

MapReduce dataflow

15

slide-68
SLIDE 68

Drawbacks

  • Each job is executed N times
  • Job bootstrap
  • Mappers send PR values and structure
  • Extensive IO at input, shuffle & sort, output

16

slide-69
SLIDE 69

17

slide-70
SLIDE 70

Timeline

  • Inspired by Google Pregel (2010)
  • Donated to ASF by Yahoo! in 2011
  • Top-level project in 2012
  • 1.0 release in January 2013
  • 1.1 release expected within days (as of this talk, 2014)

18

slide-71
SLIDE 71

Plays well with Hadoop

19

slide-72
SLIDE 72

Vertex-centric API

20

slide-73
SLIDE 73

BSP machine

21

slide-74
SLIDE 74

BSP & Giraph

22

slide-75
SLIDE 75

Advantages

  • No locks: message-based

communication

  • No semaphores: global

synchronization

  • Iteration isolation: massively

parallelizable

23

slide-76
SLIDE 76

Architecture

24

slide-77
SLIDE 77

Giraph job lifetime

25

slide-78
SLIDE 78

Designed for iterations

  • Stateful (in-memory)
  • Only intermediate values

(messages) sent

  • Hits the disk at input, output,

checkpoint

  • Can go out-of-core

26

slide-79
SLIDE 79

A bunch of other things

  • Combiners (minimises messages)
  • Aggregators (global aggregations)
  • MasterCompute (executed on

master)

  • WorkerContext (executed per worker)
  • PartitionContext (executed per

partition)

27

slide-80
SLIDE 80

Shortest Paths

28

slide-81
SLIDE 81

Shortest Paths

29

slide-82
SLIDE 82

Shortest Paths

30

slide-83
SLIDE 83

Shortest Paths

31

slide-84
SLIDE 84

Shortest Paths

32

slide-85
SLIDE 85

Composable API

33

slide-86
SLIDE 86

Checkpointing

34

slide-87
SLIDE 87

No SPoFs

35

slide-88
SLIDE 88

Giraph scales

36

ref: https://www.facebook.com/notes/facebook-engineering/scaling-apache-giraph-to-a-trillion-e dges/10151617006153920
slide-89
SLIDE 89

Giraph is fast

  • 100x over MR (PageRank)
  • jobs run within minutes
  • given you have the resources ;-)

37

slide-90
SLIDE 90

Serialised objects

38

slide-91
SLIDE 91

Primitive types

  • Autoboxing is expensive
  • Object overhead (JVM)
  • Use primitive types on your own
  • Use primitive-type-based libs (e.g. fastutil)

39

slide-92
SLIDE 92

Sharded aggregators

40

slide-93
SLIDE 93

Many stores with Gora

41

slide-94
SLIDE 94

And graph databases

42

slide-95
SLIDE 95

Current and next steps

  • Out-of-core graph and messages
  • Jython interface
  • Remove Writable from < I V E M >
  • Partitioned supernodes
  • More documentation

43

slide-96
SLIDE 96

GraphLab: A New Framework for Parallel Machine Learning

Yucheng Low, Aapo Kyrola, Carlos Guestrin, Joseph Gonzalez, Danny Bickson, Joe Hellerstein Presented by Guozhang Wang DB Lunch, Nov.8, 2010

slide-97
SLIDE 97

Overview

 Programming ML Algorithms in Parallel

  • Common Parallelism and MapReduce
  • Global Synchronization Barriers

 GraphLab

  • Data Dependency as a Graph
  • Synchronization as Fold/Reduce

 Implementation and Experiments
 From Multicore to Distributed Environment

slide-98
SLIDE 98

Parallel Processing for ML

 Parallel ML is a Necessity

  • 13 Million Wikipedia Pages
  • 3.6 Billion photos on Flickr
  • etc

 Parallel ML is Hard to Program

  • Concurrency v.s. Deadlock
  • Load Balancing
  • Debug
  • etc
slide-99
SLIDE 99

MapReduce is the Solution?

 High-level abstraction: Statistical Query

Model [Chu et al, 2006]

Weighted Linear Regression: only sufficient statistics are needed:
θ = A⁻¹ b,  where A = Σᵢ wᵢ (xᵢ xᵢᵀ) and b = Σᵢ wᵢ (xᵢ yᵢ)

slide-100
SLIDE 100

MapReduce is the Solution?

 High-level abstraction: Statistical Query

Model [Chu et al, 2006]

K-Means: only data assignments are needed; each class mean = avg(xᵢ) over the xᵢ in that class.
Embarrassingly parallel: independent computation, no communication needed.

slide-101
SLIDE 101

ML in MapReduce

Multiple Mapper Single Reducer

 Iterative MapReduce needs global

synchronization at the single reducer

  • K-means
  • EM for graphical models
  • gradient descent algorithms, etc
slide-102
SLIDE 102

Not always Embarrassingly Parallel

 Data Dependency: not MapReducable

  • Gibbs Sampling
  • Belief Propagation
  • SVM
  • etc

 Capture Dependency as a Graph!

slide-103
SLIDE 103

Overview

 Programming ML Algorithms in Parallel

  • Common Parallelism and MapReduce
  • Global Synchronization Barriers

 GraphLab

  • Data Dependency as a Graph
  • Synchronization as Fold/Reduce

 Implementation and Experiments
 From Multicore to Distributed Environment

slide-104
SLIDE 104

Key Idea of GraphLab

 Sparse Data Dependencies
 Local Computations


slide-105
SLIDE 105

GraphLab for ML

 High-level Abstract

  • Express data dependencies
  • Iterative

 Automatic Multicore Parallelism

  • Data Synchronization
  • Consistency
  • Scheduling
slide-106
SLIDE 106

Main Components of GraphLab

  • Data Graph
  • Shared Data Table
  • Scheduling
  • Update Functions and Scopes

slide-107
SLIDE 107

Data Graph

 A Graph with data associated with every

vertex and edge.

[Figure annotations: x3 = sample value; C(X3) = sample counts; Φ(X6, X9) = binary potential]


slide-108
SLIDE 108

Update Functions

 Operations applied on a vertex that

transform data in the scope of the vertex

Gibbs Update:

  • Read samples on adjacent

vertices

  • Read edge potentials
  • Compute a new sample for

the current vertex
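GraphLab itself is a C++ framework; the following is only a rough sketch of the shape of such an update function operating on a vertex's scope. The Scope/Scheduler types and accessor names are illustrative stand-ins rather than the real GraphLab API.

    // Rough shape of an Ising-style Gibbs update written against an assumed scope interface.
    void gibbs_update(Scope& scope, Scheduler& sched) {
      double field = 0.0;
      // Read samples on adjacent vertices and the edge potentials (needs vertex-read scope).
      for (EdgeId e : scope.in_edge_ids())
        field += scope.edge_data(e).potential * scope.neighbor_data(e).sample;
      // Compute a new sample for the current vertex (simplified to a deterministic threshold).
      scope.vertex_data().sample = (field >= 0.0) ? 1.0 : -1.0;
      // Reschedule neighbors whose conditional distribution has changed.
      for (EdgeId e : scope.out_edge_ids())
        sched.add_task(scope.neighbor_id(e), gibbs_update);
    }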

slide-109
SLIDE 109

Scope Rules

 Consistency v.s. Parallelism

  • Belief Propagation: Only uses edge data
  • Gibbs Sampling: Needs to read adjacent

vertices

slide-110
SLIDE 110

Scheduling

 Scheduler determines the order of

Update Function evaluations

 Static Scheduling

  • Round Robin, etc

 Dynamic Scheduling

  • FIFO, Priority Queue, etc
slide-111
SLIDE 111

Dynamic Scheduling

[Figure: a shared queue of vertex update tasks (a, b, ..., k) consumed in parallel by CPU 1 and CPU 2]

slide-112
SLIDE 112

Global Information

 Shared Data Table in Shared Memory

  • Model parameters (updatable)
  • Sufficient statistics (updatable)
  • Constants, etc (fixed)

 Sync Functions for Updatable Shared Data

  • Accumulate performs an aggregation over

vertices

  • Apply makes a final modification to the

accumulated data

slide-113
SLIDE 113

Sync Functions

 Much like Fold/Reduce

  • Execute Aggregate over every vertex in turn
  • Execute Apply once at the end

 Can be called

  • Periodically when update functions are active

(asynchronous) or

  • By the update function or user code

(synchronous)
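A small sketch of the Accumulate/Apply pair in the fold/reduce style described above; the type names (VertexData, SharedDataTable) are illustrative assumptions, not the actual GraphLab types.

    // Sketch: a sync that totals a per-vertex counter into the Shared Data Table.
    struct SampleCountSync {
      long total = 0;
      void Accumulate(const VertexData& v) { total += v.sample_count; }        // run over every vertex in turn
      void Apply(SharedDataTable& sdt)     { sdt.set("total_samples", total); } // run once at the end
    };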

slide-114
SLIDE 114

GraphLab

Model: Data Graph, Shared Data Table, Scheduling, Update Functions and Scopes

slide-115
SLIDE 115

Overview

 Programming ML Algorithms in Parallel

  • Common Parallelism and MapReduce
  • Global Synchronization Barriers

 GraphLab

  • Data Dependency as a Graph
  • Synchronization as Fold/Reduce

 Implementation and Experiments
 From Multicore to Distributed Environment

slide-116
SLIDE 116

Implementation and Experiments

 Shared-memory implementation in C++ using Pthreads

 Applications:

  • Belief Propagation
  • Gibbs Sampling
  • CoEM
  • Lasso
  • etc (more on the project page)
slide-117
SLIDE 117

Parallel Performance

[Plot: parallel speedup vs. number of CPUs (2-16), comparing a round-robin schedule and a colored schedule against optimal speedup]

slide-118
SLIDE 118

From Multicore to Distributed Environment

 MapReduce and GraphLab work well for

Multicores

  • Simple High-level Abstract
  • Local computation + global synchronization

 When Migrating to Clusters

  • Rethink Scope → synchronization
  • Rethink Shared Data → single "reducer"
  • Think about Load Balancing
  • Maybe rethink the abstract model?
slide-119
SLIDE 119

22.06.2015 DIMA – TU Berlin 1

Fachgebiet Datenbanksysteme und Informationsmanagement Technische Universität Berlin http://www.dima.tu-berlin.de/

Hot Topics in Information Management PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs

Igor Shevchenko Mentor: Sebastian Schelter

slide-120
SLIDE 120

22.06.2015 DIMA – TU Berlin 2

Agenda

  • 1. Natural Graphs: Properties and Problems;
  • 2. PowerGraph: Vertex Cut and Vertex Programs;
  • 3. GAS Decomposition;
  • 4. Vertex Cut Partitioning;
  • 5. Delta Caching;
  • 6. Applications and Evaluation;

Paper: Gonzalez et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs.

slide-121
SLIDE 121

22.06.2015 DIMA – TU Berlin 3

■ Natural graphs are graphs derived from real-world or natural phenomena;

■ Graphs are big: billions of vertices and edges and rich metadata;

■ Natural graphs have a Power-Law Degree Distribution;

Natural Graphs

slide-122
SLIDE 122

22.06.2015 DIMA – TU Berlin 4

Power-Law Degree Distribution

(Andrei Broder et al. Graph structure in the web)

slide-123
SLIDE 123

22.06.2015 DIMA – TU Berlin 5

■ We want to analyze natural graphs;
■ Essential for Data Mining and Machine Learning;

Goal

Identify influential people and information; identify special nodes and communities; model complex data dependencies; target ads and products; find communities; flow scheduling;

slide-124
SLIDE 124

22.06.2015 DIMA – TU Berlin 6

■ Existing distributed graph computation systems perform poorly on natural graphs (Gonzalez et al. OSDI ’12); ■ The reason is presence of high degree vertices; Problem High Degree Vertices: Star-like motif

slide-125
SLIDE 125

22.06.2015 DIMA – TU Berlin 7

Possible problems with high degree vertices: ■ Limited single-machine resources; ■ Work imbalance; ■ Sequential computation; ■ Communication costs; ■ Graph partitioning; Applicable to: ■ Hadoop; GraphLab; Pregel (Piccolo); Problem Continued

slide-126
SLIDE 126

22.06.2015 DIMA – TU Berlin 8

■ High degree vertices can exceed the memory capacity of a single machine; ■ Store edge meta-data and adjacency information; Problem: Limited Single-Machine Resources

slide-127
SLIDE 127

22.06.2015 DIMA – TU Berlin 9

■ The power-law degree distribution can lead to significant work imbalance and frequent barriers; ■ For ex. with synchronous execution (Pregel): Problem: Work Imbalance

slide-128
SLIDE 128

22.06.2015 DIMA – TU Berlin 10

■ No parallelization of individual vertex-programs; ■ Edges are processed sequentially; ■ Locking does not scale well to high degree vertices (for ex. in GraphLab); Problem: Sequential Computation

Sequentially process edges Asynchronous execution requires heavy locking

slide-129
SLIDE 129

22.06.2015 DIMA – TU Berlin 11

■ Generate and send large amount of identical messages (for ex. in Pregel); ■ This results in communication asymmetry; Problem: Communication Costs

slide-130
SLIDE 130

22.06.2015 DIMA – TU Berlin 12

■ Natural graphs are difficult to partition; ■ Pregel and GraphLab use random (hashed) partitioning on natural graphs thus maximizing the network communication; Problem: Graph Partitioning

slide-131
SLIDE 131

22.06.2015 DIMA – TU Berlin 13

■ Natural graphs are difficult to partition;
■ Pregel and GraphLab use random (hashed) partitioning on natural graphs, thus maximizing the network communication;

Expected fraction of edges that are cut: 1 - 1/p, where p = number of machines.

Examples:
■ 10 machines: 90% of edges cut;
■ 100 machines: 99% of edges cut;

Problem: Graph Partitioning Continued

slide-132
SLIDE 132

22.06.2015 DIMA – TU Berlin 14

■ GraphLab and Pregel are not well suited for computations on natural graphs; Reasons: ■ Challenges of high-degree vertices; ■ Low quality partitioning; Solution: ■ PowerGraph new abstraction; In Summary

slide-133
SLIDE 133

22.06.2015 DIMA – TU Berlin 15

PowerGraph

slide-134
SLIDE 134

22.06.2015 DIMA – TU Berlin 16

Two approaches for partitioning the graph in a distributed environment:

■ Edge Cut; ■ Vertex Cut;

Partition Techniques

slide-135
SLIDE 135

22.06.2015 DIMA – TU Berlin 17

■ Used by Pregel and GraphLab abstractions; ■ Evenly assign vertices to machines; Edge Cut

slide-136
SLIDE 136

22.06.2015 DIMA – TU Berlin 18

■ Used by PowerGraph abstraction; ■ Evenly assign edges to machines; Vertex Cut

The strong point of the paper. [Figure: a high-degree vertex cut so that its edges are split across machines, 4 edges each]

slide-137
SLIDE 137

22.06.2015 DIMA – TU Berlin 19

Think like a Vertex

[Malewicz et al. SIGMOD’10]

User-defined Vertex-Program:

  • 1. Runs on each vertex;
  • 2. Interactions are constrained by graph structure;

Pregel and GraphLab also use this concept, where parallelism is achieved by running multiple vertex programs simultaneously; Vertex Programs

slide-138
SLIDE 138

22.06.2015 DIMA – TU Berlin 20

■ Vertex cut distributes a single vertex-program across several machines; ■ Allows to parallelize high-degree vertices; GAS Decomposition The strong point of the paper

slide-139
SLIDE 139

22.06.2015 DIMA – TU Berlin 21

Generalize the vertex-program into three phases:

  • 1. Gather
  • Accumulate information about neighborhood;
  • 2. Apply
  • Apply accumulated value to center vertex;
  • 3. Scatter
  • Update adjacent edges and vertices;

GAS Decomposition Gather, Apply and Scatter are user-defined functions;

The strong point of the paper
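A sketch of PageRank written in the gather-apply-scatter shape just described. The struct layout, argument types, and the engine.signal() call are illustrative assumptions rather than the exact PowerGraph API.

    // Gather / Apply / Scatter for PageRank (sketch).
    struct PageRankProgram {
      // Gather: executed on the in-edges in parallel; accumulate neighborhood information.
      double gather(const VertexData& u, const EdgeData& e, const VertexData& nbr) const {
        return nbr.rank / nbr.num_out_edges;
      }
      // Apply: executed once on the central vertex with the summed gather results.
      void apply(VertexData& u, double sum) const {
        double new_rank = 0.15 + 0.85 * sum;
        u.changed = (new_rank - u.rank > 1e-4) || (u.rank - new_rank > 1e-4);
        u.rank = new_rank;
      }
      // Scatter: executed on the out-edges in parallel; update/activate adjacent vertices.
      void scatter(Engine& engine, const VertexData& u, const EdgeData& e, VertexData& nbr) const {
        if (u.changed) engine.signal(nbr);   // reschedule only if this vertex moved noticeably
      }
    };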

slide-140
SLIDE 140

22.06.2015 DIMA – TU Berlin 22

■ Executed on the edges in parallel; ■ Accumulate information about neighborhood; Gather Phase

slide-141
SLIDE 141

22.06.2015 DIMA – TU Berlin 23

■ Executed on the central vertex; ■ Apply accumulated value to center vertex; Apply Phase

slide-142
SLIDE 142

Today’s Biz

  • 1. Reminders
  • 2. Review
  • 3. Graph Processing Frameworks
  • 4. 2D Partitioning
7 / 13
slide-143
SLIDE 143

2D Partitioning Aydin Buluc and Kamesh Madduri

8 / 13
slide-144
SLIDE 144

Graph Partitioning for Scalable Distributed Graph Computations

Aydın Buluç Kamesh Madduri

ABuluc@lbl.gov madduri@cse.psu.edu

10th DIMACS Implementation Challenge, Graph Partitioning and Graph Clustering February 13-14, 2012 Atlanta, GA

slide-145
SLIDE 145

Overview of our study

  • We assess the impact of graph partitioning for

computations on ‘low diameter’ graphs

  • Does minimizing edge cut lead to lower

execution time?

  • We choose parallel Breadth-First Search as a

representative distributed graph computation

  • Performance analysis on DIMACS Challenge

instances

2
slide-146
SLIDE 146

Key Observations for Parallel BFS

  • Well-balanced vertex and edge partitions do not

guarantee load-balanced execution, particularly for real-world graphs

– Range of relative speedups (8.8-50X, 256-way parallel concurrency) for low-diameter DIMACS graph instances.

  • Graph partitioning methods reduce overall edge cut

and communication volume, but lead to increased computational load imbalance

  • Inter-node communication time is not the dominant

cost in our tuned bulk-synchronous parallel BFS implementation

3
slide-147
SLIDE 147

Talk Outline

  • Level-synchronous parallel BFS on distributed-

memory systems

– Analysis of communication costs

  • Machine-independent counts for inter-node

communication cost

  • Parallel BFS performance results for several

large-scale DIMACS graph instances

4
slide-148
SLIDE 148

Parallel BFS strategies

5
  • 1. Expand current frontier (level-synchronous approach, suited for low-diameter graphs)
    – O(D) parallel steps
    – Adjacencies of all vertices in the current frontier are visited in parallel
  • 2. Stitch multiple concurrent traversals (Ullman-Yannakakis, for high-diameter graphs)
    – Path-limited searches from "super vertices"
    – APSP between "super vertices"

slide-149
SLIDE 149
  • Consider a logical 2D processor grid (pr * pc = p) and

the dense matrix representation of the graph

  • Assign each processor a sub-matrix (i.e., the edges within the sub-matrix)

“2D” graph distribution

[Figure: 9 vertices, 9 processors, 3x3 processor grid; the adjacency matrix is split into sub-matrices, and each processor's sub-matrix is flattened into a local sparse-matrix (per-processor local graph) representation]
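A small self-contained sketch of one reasonable checkerboard assignment of edges to a pr × pc processor grid, in the spirit of the figure above; it illustrates the idea and is not necessarily the exact scheme used in the study.

    #include <cstdint>

    struct Grid2D { uint64_t n, pr, pc; };   // n vertices, pr x pc = p processors

    // Edge (i, j) goes to the processor whose row block contains i and whose column block contains j.
    uint64_t OwnerOfEdge(const Grid2D& g, uint64_t i, uint64_t j) {
      uint64_t rows_per_block = (g.n + g.pr - 1) / g.pr;   // ceil(n / pr)
      uint64_t cols_per_block = (g.n + g.pc - 1) / g.pc;   // ceil(n / pc)
      return (i / rows_per_block) * g.pc + (j / cols_per_block);  // row-major rank in the grid
    }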

slide-150
SLIDE 150

BFS with a 1D-partitioned graph

Steps: 1. Local discovery: Explore adjacencies of vertices in current frontier. 2. Fold: All-to-all exchange of adjacencies. 3. Local update: Update distances/parents for unvisited vertices.

[Example: a small graph whose vertices are distributed over processors P0-P3; each processor stores the adjacency pairs of the vertices it owns]

Consider an undirected graph with n vertices and m edges. Each processor 'owns' n/p vertices and stores their adjacencies (~2m/p per processor, assuming balanced partitions).
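The three steps above can be sketched as a small serial simulation in which the fold (all-to-all) step is modeled by writing adjacencies directly into per-processor inboxes; a real distributed implementation would replace the inboxes with an MPI all-to-all exchange.

    #include <cstdint>
    #include <vector>
    #include <utility>

    using Pair = std::pair<int64_t, int64_t>;   // (discovered vertex, parent)

    // Owner of vertex v when each of p processors holds a contiguous block of ~n/p vertices.
    int Owner(int64_t v, int64_t n, int p) { return static_cast<int>(v / ((n + p - 1) / p)); }

    // One BFS level with a 1D-partitioned graph (serial simulation of p processors).
    void BfsLevel(const std::vector<std::vector<int64_t>>& adj,  // adjacency lists
                  std::vector<int64_t>& parent,                  // -1 means unvisited
                  std::vector<int64_t>& frontier, int p) {
      const int64_t n = static_cast<int64_t>(adj.size());
      std::vector<std::vector<Pair>> inbox(p);
      // 1. Local discovery: explore adjacencies of current-frontier vertices.
      for (int64_t u : frontier)
        for (int64_t v : adj[u])
          inbox[Owner(v, n, p)].push_back({v, u});   // 2. Fold: (v, u) is sent to v's owner.
      // 3. Local update: each owner updates distances/parents of its unvisited vertices.
      std::vector<int64_t> next;
      for (int proc = 0; proc < p; ++proc)
        for (const auto& [v, u] : inbox[proc])
          if (parent[v] == -1) { parent[v] = u; next.push_back(v); }
      frontier.swap(next);
    }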

slide-151
SLIDE 151

BFS with a 1D-partitioned graph

Steps: 1. Local discovery: Explore adjacencies of vertices in current frontier. 2. Fold: All-to-all exchange of adjacencies. 3. Local update: Update distances/parents for unvisited vertices.

1 2 3 6 5 4

Current frontier: vertices 1 (partition Blue) and 6 (partition Green)

  • 1. Local discovery:

[1,0] [6,2] [6, 3] [1,4] [1,6] [6,1] [6,4] P0 P3 P1 P2

No work No work

slide-152
SLIDE 152

BFS with a 1D-partitioned graph

Steps: 1. Local discovery: Explore adjacencies of vertices in current frontier. 2. Fold: All-to-all exchange of adjacencies. 3. Local update: Update distances/parents for unvisited vertices.

1 2 3 6 5 4

Current frontier: vertices 1 (partition Blue) and 6 (partition Green)

  • 2. All-to-all exchange:

[1,0] [6,2] [6, 3] [1,4] [1,6] [6,1] [6,4] P0 P3 P1 P2

No work No work

slide-153
SLIDE 153

BFS with a 1D-partitioned graph

Steps: 1. Local discovery: Explore adjacencies of vertices in current frontier. 2. Fold: All-to-all exchange of adjacencies. 3. Local update: Update distances/parents for unvisited vertices.

1 2 3 6 5 4

Current frontier: vertices 1 (partition Blue) and 6 (partition Green)

  • 2. All-to-all exchange:

[1,0] [6,2] [6, 3] [1,4] [1,6] [6,1] [6,4] P0 P3 P1 P2

slide-154
SLIDE 154

BFS with a 1D-partitioned graph

Steps: 1. Local discovery: Explore adjacencies of vertices in current frontier. 2. Fold: All-to-all exchange of adjacencies. 3. Local update: Update distances/parents for unvisited vertices.

1 2 3 6 5 4

Current frontier: vertices 1 (partition Blue) and 6 (partition Green)

  • 3. Local update:

[1,0] [6,2] [6, 3] [1,4] [1,6] [6,1] [6,4] P0 P3 P1 P2 2, 3 4 Frontier for next iteration

slide-155
SLIDE 155

Modeling parallel execution time

  • Time dominated by local memory references and inter-node

communication

  • Assuming perfectly balanced computation and

communication, we have

12

Local memory references:

  α_{L,n/p} · (n/p) + β_L · (m/p)

  where α_{L,n/p} is the local latency on a working set of size n/p and β_L is the inverse local RAM bandwidth.

Inter-node communication:

  β_{N,a2a(p)} · (edgecut(p) / p)

  where β_{N,a2a(p)} is the inverse all-to-all remote bandwidth with p participating processors.

slide-156
SLIDE 156

BFS with a 2D-partitioned graph

  • Avoid expensive p-way All-to-all

communication step

  • Each process collectively ‘owns’

n/pr vertices

  • Additional ‘Allgather’

communication step for processes in a row

13

Local memory references:

p m p n p m

r c

p n L p n L L , ,

    

Inter-node communication:

c N c r c gather N r N r a a N

p p n p p p p edgecut p                 1 1 ) ( ) (

, 2 ,

slide-157
SLIDE 157

Temporal effects and communication-minimizing tuning prevent us from obtaining tighter bounds

  • The volume of communication can be further reduced by

maintaining state of non-local visited vertices

14

[Figure: local pruning of duplicate adjacencies on each processor prior to the All-to-all step reduces the number of (frontier, adjacency) pairs communicated]

slide-158
SLIDE 158

Predictable BFS execution time for synthetic small-world graphs

  • Randomly permuting vertex IDs ensures load balance on

R-MAT graphs (used in the Graph 500 benchmark).

  • Our tuned parallel implementation for the NERSC Hopper

system (Cray XE6) is ranked #2 on the current Graph 500 list.

15

Buluc & Madduri, Parallel BFS on distributed memory systems, Proc. SC’11, 2011. Execution time is dominated by work performed in a few parallel phases

slide-159
SLIDE 159

Modeling BFS execution time for real-world graphs

  • Can we further reduce communication time

utilizing existing partitioning methods?

  • Does the model predict execution time for

arbitrary low-diameter graphs?

  • We try out various partitioning and graph

distribution schemes on the DIMACS Challenge graph instances

– Natural ordering, Random, Metis, PaToH

16
slide-160
SLIDE 160

Experimental Study

  • The (weak) upper bound on aggregate data volume

communication can be statically computed (based on partitioning of the graph)

  • We determine runtime estimates of

  – Total aggregate communication volume
  – Sum of max. communication volume during each BFS iteration
  – Intra-node computational work balance
  – Communication volume reduction with 2D partitioning

  • We obtain and analyze execution times (at several

different parallel concurrencies) on a Cray XE6 system (Hopper, NERSC)

17
slide-161
SLIDE 161

Orderings for the CoPapersCiteseer graph

18

Natural Random PaToH checkerboard PaToH Metis

slide-162
SLIDE 162

BFS All-to-all phase total communication volume normalized to # of edges (m)

[Chart: communication volume as a % of m vs. # of partitions, per graph, comparing Natural, Random, and PaToH orderings]

19
slide-163
SLIDE 163

Ratio of max. communication volume across iterations to total communication volume

[Chart: ratio of max. per-iteration volume to total volume vs. # of partitions, per graph, comparing Natural, Random, and PaToH orderings]

20
slide-164
SLIDE 164

Reduction in total All-to-all communication volume with 2D partitioning

21

[Chart: All-to-all volume ratio compared to 1D vs. # of partitions, per graph, comparing Natural, Random, and PaToH orderings]

slide-165
SLIDE 165

Edge count balance with 2D partitioning

[Chart: max/avg edge-count ratio vs. # of partitions, per graph, comparing Natural, Random, and PaToH orderings]

slide-166
SLIDE 166

Parallel speedup on Hopper with 16-way partitioning

23
slide-167
SLIDE 167

Execution time breakdown

24

[Charts: BFS time (ms) and communication time (ms) for the eu-2005 and kron-simple-logn18 graphs, broken into Computation, Fold, and Expand components, for the Random-1D, Random-2D, Metis-1D, and PaToH-1D partitioning strategies]

slide-168
SLIDE 168

Imbalance in parallel execution

25

eu-2005, 16 processes* PaToH Random

* Timeline of 4 processes shown in figures. PaToH-partitioned graph suffers from severe load imbalance in computational phases.

slide-169
SLIDE 169

Conclusions

  • Randomly permuting vertex identifiers improves

computational and communication load balance, particularly at higher process concurrencies

  • Partitioning methods reduce overall communication

volume, but introduce significant load imbalance

  • Substantially lower parallel speedup with real-world

graphs compared to synthetic graphs (8.8X vs 50X at 256- way parallel concurrency)

– Points to the need for dynamic load balancing

26
slide-170
SLIDE 170

Today: In class work

◮ Develop 2D partitioning strategy
◮ Implement BFS

Blank code and data available on website (Lecture 24): www.cs.rpi.edu/~slotag/classes/FA16/index.html

9 / 13