Pregelix: Big(ger) Graph Analytics on A Dataflow Engine


SLIDE 1

Pregelix: Big(ger) Graph Analytics on A Dataflow Engine

Yingyi Bu (UC Irvine) Joint work with: Vinayak Borkar (UC Irvine), Michael J. Carey (UC Irvine), Tyson Condie (UCLA), Jianfeng Jia (UC Irvine)

SLIDE 2

Outline

Introduction Pregel Semantics The Pregel Logical Plan The Pregelix System Experimental Results Related Work Conclusions

SLIDE 3

Introduction

Big Graphs are becoming common

○ web graph ○ social network ○ ......

SLIDE 4

Introduction

  • How Big are Big Graphs?

○ Web: 8.53 Billion pages in 2012 ○ Facebook active users: 1.01 Billion ○ de Bruijn graph: 3 Billion nodes ○ ......

  • Weapons for mining Big Graphs

○ Pregel (Google) ○ Giraph (Facebook, LinkedIn, Twitter, etc.) ○ Distributed GraphLab (CMU) ○ GraphX (Berkeley)

SLIDE 5

Programming Model

  • Think like a vertex

○ receive messages ○ update states ○ send messages

SLIDE 6

Programming Model

public abstract class Vertex<I extends WritableComparable, V extends Writable,
                             E extends Writable, M extends Writable>
        implements Writable {
    public abstract void compute(Iterator<M> incomingMessages);
    .......
}

  • Vertex
  • Helper methods (used in the sketch below)

○ sendMsg(I vertexId, M msg) ○ voteToHalt() ○ getSuperstep()
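
As a concrete example, here is a minimal single-source shortest-paths vertex written against this API. Only compute, sendMsg, and voteToHalt come from the slide; the id/value/edge accessors (getVertexId, getVertexValue, setVertexValue, getEdges), the Edge type, and the source-vertex convention are assumptions for illustration.

import java.util.Iterator;
import org.apache.hadoop.io.DoubleWritable;
import org.apache.hadoop.io.LongWritable;

// Single-source shortest paths, sketched against the abstract Vertex above.
public class ShortestPathsVertex
        extends Vertex<LongWritable, DoubleWritable, DoubleWritable, DoubleWritable> {

    private static final long SOURCE_ID = 0L;  // assumption: vertex 0 is the source

    @Override
    public void compute(Iterator<DoubleWritable> incomingMessages) {
        // In superstep 0, only the source starts with a finite distance.
        double min = (getVertexId().get() == SOURCE_ID) ? 0.0 : Double.MAX_VALUE;
        while (incomingMessages.hasNext()) {
            min = Math.min(min, incomingMessages.next().get());
        }
        if (min < getVertexValue().get()) {
            // Shorter path found: update local state, then relax every out-edge.
            setVertexValue(new DoubleWritable(min));
            for (Edge<LongWritable, DoubleWritable> e : getEdges()) {
                sendMsg(e.getDestVertexId(),
                        new DoubleWritable(min + e.getEdgeValue().get()));
            }
        }
        // Halt; a message arriving in a later superstep re-activates this vertex.
        voteToHalt();
    }
}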

SLIDE 7

More APIs

  • Message Combiner

○ Combine messages ○ Reduce network traffic (see the sketch after this list)

  • Global Aggregator

○ Aggregate statistics over all live vertices ○ Done for each iteration

  • Graph Mutations

○ Add vertex ○ Delete vertex ○ A conflict resolution function
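
To make the combiner concrete, here is a min-combiner matching the shortest-paths sketch above: of all messages bound for one vertex, only the smallest distance can matter, so combining before sending cuts network traffic. The init/step/finish shape is an illustrative guess, not Pregelix's actual combiner interface.

import org.apache.hadoop.io.DoubleWritable;

// Hypothetical min-combiner for shortest-paths messages.
public class MinDistanceCombiner {
    private double min;

    public void init() {
        min = Double.MAX_VALUE;          // identity element for min
    }

    public void step(DoubleWritable msg) {
        min = Math.min(min, msg.get());  // fold in one message
    }

    public DoubleWritable finish() {
        return new DoubleWritable(min);  // one combined message per destination
    }
}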

SLIDE 8

Pregel Semantics

  • Bulk-synchronous

○ A global barrier between iterations

  • Compute invocation

○ Once per active vertex in each superstep ○ A halted vertex is activated when receiving messages

  • Global halting

○ Each vertex is halted ○ No messages are in flight (both rules are restated as predicates after this list)

  • Graph mutations

○ Partial ordering of operations ○ User-defined resolve function
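
The compute-invocation and global-halting rules above can be stated as two predicates. A minimal sketch, with all names invented for illustration; the engine evaluates the equivalents internally:

// The activation and global-halting rules as predicates.
final class HaltingRules {
    /** compute() is invoked on a vertex in a superstep iff this holds. */
    static boolean isActive(boolean halted, int pendingMessages) {
        return !halted || pendingMessages > 0;  // messages re-activate a halted vertex
    }

    /** The whole job terminates iff this holds at a superstep barrier. */
    static boolean globalHalt(boolean allVerticesHalted, long messagesInFlight) {
        return allVerticesHalted && messagesInFlight == 0;
    }
}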

SLIDE 9

Process-centric runtime

[Figure: a master coordinates worker-1 and worker-2 via control signals; each worker holds its partition of vertices, e.g. Vertex{id: 1, halt: false, value: 3.0, edges: (3,1.0), (4,1.0)}, and workers exchange messages <id, payload> such as <4, 3.0> and <5, 1.0>. Current global state: superstep 3, halt: false.]

SLIDE 10

Issues and Opportunities

  • Out-of-core support

26 similar threads on the Giraph-users mailing list during the past year!

“I’m trying to run the sample connected components algorithm on a large data set on a cluster, but I get a “java.lang.OutOfMemoryError: Java heap space” error.”

SLIDE 11

Issues and Opportunities

  • Physical flexibility

○ PageRank, SSSP, CC, Triangle Counting ○ Web graph, social network, RDF graph ○ An 8-machine school cluster vs. a 200-machine Facebook data center -- does one size fit all?

SLIDE 12

Issues and Opportunities

  • Software simplicity

[Figure: Pregel, GraphLab, Giraph, Hama, ...... each re-implement the same stack: network management, message delivery, memory management, task scheduling, and vertex/map/msg data structures.]

SLIDE 13

The Pregelix Approach

[Figure: the running example's vertices and messages recast as tuples, with the Vertex and Msg relations joined on vid.]

Relation schemas:
  Vertex (vid, halt, value, edges)
  Msg    (vid, payload)
  GS     (halt, aggregate, superstep)

SLIDE 14

Pregel UDFs

  • compute

○ Executed at each active vertex in each superstep

  • combine

○ Aggregation function for messages

  • aggregate

○ Aggregate function for the global states

  • resolve

○ Used to resolve graph mutations

SLIDE 15

Logical Plan

[Figure: the per-superstep logical plan. Vertexi(V) is joined with Msgi(M) on M.vid = V.vid; tuples satisfying (V.halt = false || M.payload != NULL) feed the UDF call compute, which emits Vertexi+1 plus outgoing messages; a group-by on vid with combine turns those messages into Msgi+1. Flows D4, D5, D6 branch off to the global-state plan on the next slide.]

Flow data:
  D2  Vertex tuples
  D3  Msg tuples
  D7  Msg tuples after combination
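
One way to read this plan as relational algebra (the notation is mine, not the paper's):

\begin{align*}
T_i &= \sigma_{V.halt=\mathrm{false}\,\lor\,M.payload\neq\mathrm{NULL}}
       \left(\mathrm{Vertex}_i(V)\ \bowtie^{\mathrm{full}}_{V.vid=M.vid}\ \mathrm{Msg}_i(M)\right)\\
\mathrm{Vertex}_{i+1} &= \pi_{vertex}\left(\mathrm{compute}(T_i)\right)\\
\mathrm{Msg}_{i+1} &= {}_{vid}\Gamma_{\mathrm{combine}}\left(\pi_{msg}\left(\mathrm{compute}(T_i)\right)\right)
\end{align*}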

SLIDE 16

Logical Plan

[Figure: the global-state and mutation plan. Agg(bool-and) over the per-vertex halting contributions (D4) yields the global halt state (D8); Agg(aggregate) over the contributed values (D5) yields the global aggregate (D9); superstep = G.superstep + 1 advances GSi(G) to GSi+1 (D10). Vertex insertions and deletions (D6) pass through a group-by on vid with resolve and a further compute call to form Vertexi+1.]

Flow data:
  D4   The global halting-state contribution
  D5   Values for aggregate
  D6   Vertex tuples for deletions and insertions
  D8   The global halt state
  D9   The global aggregate value
  D10  The increased superstep
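
In the same informal notation, the global-state transition amounts to:

\begin{align*}
\mathrm{GS}_{i+1}.halt      &= \mathrm{bool\text{-}and}(D_4)\\
\mathrm{GS}_{i+1}.aggregate &= \mathrm{aggregate}(D_5)\\
\mathrm{GS}_{i+1}.superstep &= \mathrm{GS}_i.superstep + 1
\end{align*}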

SLIDE 17

The Pregelix System

[Figure: Pregelix maps the Pregel-specific stack (network management, message delivery, memory management, task scheduling, vertex/map/msg data structures) onto a general-purpose parallel dataflow engine that already provides connection management, data exchange, buffer management, task scheduling, and record/index management, along with its operators and access methods. Pregel programs become physical plans on this engine.]

SLIDE 18

The Runtime

  • The Hyracks data-parallel execution engine

○ Out-of-core operators ○ Connectors ○ Access methods ○ User-configurable task scheduling ○ Extensibility

  • Runtime Choice?

○ Hyracks vs. Hadoop -- Pregelix builds on Hyracks

SLIDE 19

Parallelism

[Figure: the superstep plan running in parallel on Worker-1 and Worker-2. Each worker holds a vid-partition of the Vertex relation (Vertex-1, Vertex-2) and of the incoming messages (Msg-1, Msg-2), joins them on vid, runs compute, and repartitions the emitted messages (Output-Msg-1, Output-Msg-2) by vid to the workers that own the destination vertices.]

SLIDE 20

Physical Choices

  • Vertex storage

○ B-tree ○ LSM B-tree

  • Group-by

○ Pre-clustered group-by ○ Sort-based group-by ○ HashSort group-by

  • Data redistribution

○ m-to-n merging partitioning connector ○ m-to-n partitioning connector

  • Join

○ Index Full outer join ○ Index Left outer join

SLIDE 21

Data Storage

  • Vertex

○ Partitioned B-tree or LSM B-tree

  • Msg

○ Partitioned local files, sorted

  • GS

○ Stored on HDFS ○ Cached in each worker

SLIDE 22

Physical Plan: Message Combination

[Figure: four physical strategies for message combination, pairing local vid-combine group-bys (sort-based, HashSort, or pre-clustered) with a data-redistribution connector:
  Sort-Groupby-M-to-N-Partitioning
  HashSort-Groupby-M-to-N-Partitioning
  Sort-Groupby-M-to-N-Merge-Partitioning
  HashSort-Groupby-M-to-N-Merge-Partitioning
The first two use the M-to-N partitioning connector; the latter two use the M-to-N partitioning merging connector.]

SLIDE 23

Physical Plan: Message Delivery

[Figure: two physical join strategies for delivering Msgi(M) to Vertexi(V) on M.vid = V.vid, both filtering on (V.halt = false || M.payload != NULL) before the compute UDF call. The index full outer join scans the whole vertex relation each superstep. The index left outer join probes only the vertices that receive messages; to reach vertices that are active but message-less, a Vidi(I) relation of live vertex ids feeds a Function Call (NullMsg), whose empty messages are merged (choose()) with the real messages on M.vid = I.vid, and Vidi+1 retains the ids with halt = false for the next superstep (flows D11, D12, plus D1 and D2 -- D6 as before).]

SLIDE 24

Caching

  • Iteration-aware (sticky) scheduling?

○ 1 LoC: location constraints keep each partition on the same worker across supersteps

  • Caching of invariant data?

○ B-tree buffer pool -- customized flushing policy: never flush dirty pages ○ File system cache -- comes for free

(Pregel, Giraph, and GraphLab all maintain custom caches for this kind of iterative job; here the buffer pool and file system cache play that role.)

SLIDE 25

Experimental Results

  • Setup

○ Machines: a UCI cluster of 32 machines, each with 4 cores, 8GB memory, and 2 disk drives ○ Datasets ■ Yahoo! webmap (1,413,511,393 vertices, adjacency list, ~70GB) and its samples ■ The Billions of Tuples Challenge dataset (172,655,479 vertices, adjacency list, ~17GB), its samples, and its scale-ups ○ Giraph ■ Latest trunk (revision 770) ■ 4 vertex computation threads, 8GB JVM heap

SLIDE 26

Execution Time

[Chart: execution time, in-memory and out-of-core configurations.]

SLIDE 27

Execution Time

[Chart: execution time, in-memory and out-of-core configurations.]

SLIDE 28

Execution Time

[Chart: execution time, in-memory and out-of-core configurations.]

SLIDE 29

Parallel Speedup

SLIDE 30

Parallel Scale-up

SLIDE 31

Throughput

SLIDE 32

Plan Flexibility

[Chart: execution time under different physical plans, in-memory and out-of-core, showing up to a 15x gap between plans.]

SLIDE 33

Software Simplicity

  • Lines-of-Code

○ Giraph: 32,197 ○ Pregelix: 8,514

SLIDE 34

More Systems

SLIDE 35

More Systems

SLIDE 36

Related Work

  • Parallel Data Management

○ Gamma, GRACE, Teradata ○ Stratosphere (TU Berlin) ○ REX (UPenn) ○ AsterixDB (UCI)

  • Big Graph Processing Systems

○ Pregel (Google) ○ Giraph (Facebook, LinkedIn, Twitter, etc.)

○ Distributed GraphLab (CMU) ○ GraphX (Berkeley) ○ Hama (Sogou, etc.) --- Too slow!

SLIDE 37

Conclusions

  • Pregelix offers:

○ Transparent out-of-core support ○ Physical flexibility ○ Software simplicity

  • We target Pregelix to be an open-source

production system, rather than just a research prototype:

○ http://pregelix.ics.uci.edu

SLIDE 38

Q & A