GraBi : Communication-Efficient and Workload-Balanced Partitioning - PowerPoint PPT Presentation

The 49th International Conference on Parallel Processing (ICPP’20) GraBi : Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs 1 Feng Sheng, 1 Qiang Cao, 2 Hong Jiang, and 1 Jie Yao 1 Huazhong University of Science and Technology 2 University of Texas at Arlington 17-20 August 2020, Edmonton, AB, Canada

GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs Outline   Background  Motivation  Design of GraBi ➢ Vertical Partitioning: Vertex-vector Chunking ➢ Horizontal Partitioning: Vertex-chunk Assignment  Evaluation  Conclusion Feng Sheng, Qiang Cao, Hong Jiang, and Jie Yao

Graph Partitioning Graph partitioning distributes vertices and edges over computing nodes. Background · Motivation · Design · Evaluation · Conclusion

Graph Partitioning Graph partitioning distributes vertices and edges over computing nodes. Vertex Master Vertex Replica Node 1 Node 2 1 2 Node 1 1 2 1 2 1 2 1 2 4 4 2 Node 2 4 3 4 3 4 3 4 3 Node 3 4 3 Node 3 (a) Edge-cut (b) Vertex-cut ➢ Edge-cut equally distributes vertices among nodes. ➢ Vertex-cut equally distributes edges among nodes. Background · Motivation · Design · Evaluation · Conclusion

Graph Partitioning Graph partitioning distributes vertices and edges over computing nodes. Vertex Master Vertex Replica Node 1 Node 2 1 2 Node 1 1 2 1 2 1 2 1 2 4 4 2 Node 2 4 3 4 3 4 3 4 3 Node 3 4 3 Node 3 (a) Edge-cut (b) Vertex-cut ➢ Edge-cut equally distributes vertices among nodes. ➢ Vertex-cut equally distributes edges among nodes. ➢ replication factor ( 𝜇 ): the average number of replicas per vertex. Background · Motivation · Design · Evaluation · Conclusion

Bipartite graphs & MLDM algorithms • Bipartite graphs ➢ Vertices are separated into two disjoint subsets. ➢ Every edge connects one vertex each from the two subsets. Background · Motivation · Design · Evaluation · Conclusion

Bipartite graphs & MLDM algorithms • Bipartite graphs ➢ Vertices are separated into two disjoint subsets. ➢ Every edge connects one vertex each from the two subsets. • Machine Learning and Data Mining (MLDM) algorithms ➢ Bipartite graphs have been widely used in MLDM applications . Background · Motivation · Design · Evaluation · Conclusion

Bipartite graphs & MLDM algorithms • Bipartite graphs ➢ Vertices are separated into two disjoint subsets. ➢ Every edge connects one vertex each from the two subsets. • Machine Learning and Data Mining (MLDM) algorithms ➢ Bipartite graphs have been widely used in MLDM applications. Y # of items D R(u,v) p 1 1 1 q 1 Q v X # of users p 2 2 2 q 2 Q 𝑈 ≈ P D R x ... ... P u R(u,v) X Y p X q Y (b) View of Graph (a) View of Matrix Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. ➢ The authors of 𝐷𝑉𝐶𝐹 [1] associate each vertex with a vector of up to 128 elements. ➢ The users of 𝑄𝑝𝑥𝑓𝑠𝐻𝑠𝑏𝑞ℎ [2] can configure each vertex value as a vector of thousands of elements [1] M. Zhang, Y. Wu, K. Chen, et al. Exploring the Hidden Dimension in Graph Processing. In OSDI 2016. [2] J. E. Gonzalez, Y. Low, H. Gu, et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs. In OSDI 2012. Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. ➢ The authors of 𝐷𝑉𝐶𝐹 [1] associate each vertex with a vector of up to 128 elements. ➢ The users of 𝑄𝑝𝑥𝑓𝑠𝐻𝑠𝑏𝑞ℎ [2] can configure each vertex value as a vector of thousands of elements • Observation 2 : The sizes of two vertex-subsets in a bipartite graph can be highly lopsided. [1] M. Zhang, Y. Wu, K. Chen, et al. Exploring the Hidden Dimension in Graph Processing. In OSDI 2016. [2] J. E. Gonzalez, Y. Low, H. Gu, et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs. In OSDI 2012. Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. ➢ The authors of 𝐷𝑉𝐶𝐹 [1] associate each vertex with a vector of up to 128 elements. ➢ The users of 𝑄𝑝𝑥𝑓𝑠𝐻𝑠𝑏𝑞ℎ [2] can configure each vertex value as a vector of thousands of elements • Observation 2 : The sizes of two vertex-subsets in a bipartite graph can be highly lopsided. ➢ In 𝑂𝑓𝑢𝑔𝑚𝑗𝑦 [3] , the number of users is about 27x that of movies. ➢ In 𝐹𝑜𝑕𝑚𝑗𝑡ℎ 𝑋𝑗𝑙𝑗𝑞𝑓𝑒𝑗𝑏 [4] , the number of articles is about 98x that of words. [1] M. Zhang, Y. Wu, K. Chen, et al. Exploring the Hidden Dimension in Graph Processing. In OSDI 2016. [2] J. E. Gonzalez, Y. Low, H. Gu, et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs. In OSDI 2012. [3] http://www.netflixprize.com/community/viewtopic.php?pid=9857 [4] https://dumps.wikimedia.org/ Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 3: Within a vertex-subset, the vertices usually exhibit power-law degree distribution Background · Motivation · Design · Evaluation · Conclusion

Observations • Observation 3: Within a vertex-subset, the vertices usually exhibit power-law degree distribution ➢ Both the two vertex-subsets in 𝐸𝐶𝑀𝑄 [1] exhibit power-law degree distribution. (a) Author Degree Distribution (b) Publication Degree Distribution [1] https://dumps.wikimedia.org/ Background · Motivation · Design · Evaluation · Conclusion

Opportunities • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. Each vertex vector can be divided into multiple sub-vectors. Background · Motivation · Design · Evaluation · Conclusion

Opportunities • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. Each vertex vector can be divided into multiple sub-vectors. • Observation 2 : The sizes of two vertex-subsets in a bipartite graph can be highly lopsided The two vertex-subsets can be processed with different priorities. Background · Motivation · Design · Evaluation · Conclusion

Opportunities • Observation 1 : The vertex value in MLDM algorithms is a multi-element vector. Each vertex vector can be divided into multiple sub-vectors. • Observation 2 : The sizes of two vertex-subsets in a bipartite graph can be highly lopsided The two vertex-subsets can be processed with different priorities. • Observation 3 : Within a vertex-subset, the vertices usually exhibit power-law degree distribution The vertices of different degrees should be distinguished. Background · Motivation · Design · Evaluation · Conclusion

Overview of GraBi ➢ GraBi is a communication-efficient and workload-balanced partitioning framework for bipartite graphs. Background · Motivation · Design · Evaluation · Conclusion

Overview of GraBi ➢ GraBi is a communication-efficient and workload-balanced partitioning framework for bipartite graphs. ➢ GraBi comprehensively exploits the above three observations of bipartite graphs and MLDM algorithms. Background · Motivation · Design · Evaluation · Conclusion

Overview of GraBi ➢ GraBi is a communication-efficient and workload-balanced partitioning framework for bipartite graphs. ➢ GraBi comprehensively exploits the above three features of bipartite graphs and MLDM algorithms. ➢ GraBi partitions a bipartite graph first vertically, and then horizontally, to realize high-quality partitioning. Background · Motivation · Design · Evaluation · Conclusion

Vertical Partitioning: Vertex-vector Chunking Replica 1 Vertex 2 Vertex 2 Replica 3 Vertex 1 Vertex 1 Node 1 Horizontal Replica 2 Vertex 3 Vertex 3 Partitioning Node 2 Node 3 Vertex Master Vertex Replica inter-vertex Comm. Intra-vertex Comm. Background · Motivation · Design · Evaluation · Conclusion

Vertical Partitioning: Vertex-vector Chunking Replica 1 Vertex 2 Vertex 2 Replica 3 Vertex 1 Vertex 1 Node 1 Horizontal Replica 2 Vertex 3 Vertex 3 Partitioning Node 2 Node 3 Vertex Master Vertex Replica inter-vertex Comm. Intra-vertex Comm. The whole vector of a vertex is assigned to a computing node. Background · Motivation · Design · Evaluation · Conclusion

Vertical Partitioning: Vertex-vector Chunking Replica 1 Vertex 2 Vertex 2 Replica 3 Vertex 1 Vertex 1 Node 1 Horizontal Replica 2 Vertex 3 Vertex 3 Partitioning Node 2 Node 3 Vertex Master Vertex Replica inter-vertex Comm. Intra-vertex Comm. • Inter-vertex Communication happens between computing nodes The whole vector of a vertex • is assigned to a computing node. Intra-vertex Communication happens within a computing node Background · Motivation · Design · Evaluation · Conclusion

GraBi : Communication-Efficient and Workload-Balanced Partitioning - PowerPoint PPT Presentation

The 49th International Conference on Parallel Processing (ICPP20) GraBi : Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs 1 Feng Sheng, 1 Qiang Cao, 2 Hong Jiang, and 1 Jie Yao 1 Huazhong University of Science

GPrioSwap: Towards a Swapping Policy for GPUs Jens Kehne , Jonathan Metter, Martin Merkel, Marius

Preference Formation in School Choice COMSOC Summer School on Matching Problems, Markets and

Cadet-Branching at U.S. Army Programs Tayfun S onmez, BC 1/68 Based on: Matching with

Programmable Packet Scheduling at Line Rate Anirudh Sivaraman , Suvinay Subramanian, Mohammad

Parallel & Distributed Real-Time Systems Lecture #5 Risat Pathan Department of Computer

Notice of Intent (NOI) The Notice of Intent (NOI) is not a requirement for submitting a

Real-Time Embedded Systems Wan Fokkink Vrije Universiteit Amsterdam & CWI Jane W.S. Liu

Static Scheduling in Clouds Thomas A. Henzinger Anmol V. Singh Vasu Singh Thomas Wies Damien

Welcome President, CHCI Event Facilitator 2015 HUMAN CAPITAL CONGRESS Dr. Robert Goldenkoff

The Well Visit Planner Orientation Webinar December 3, 2012 Engaging Parents as Partners to

To jump the queue or wait my turn? D. Manjunath IIT Bombay January 14, 2014 D. Manjunath (IIT

scheduling for real-time systems Lukas Pirl Operating Systems and Middleware Group Hasso

A Simple Algorithm for Solving Qualitative Probabilistic Parity Games Sven Schewe University of

State Strategies for Integrating Health Care and Housing for Homeless Individuals and Families

CSI5126 . Algorithms in bioinformatics Hidden Markov Models Marcel Turcotte School of Electrical

Logic Programming with PROFETA Corrado Santoro ARSLAB - Autonomous and Robotic Systems Laboratory

Transient Solidarities Charles Heckscher January, 2014 Shared sense of obligation Solidarity

Arya Samaj Principle: 9 th Principle By: Vipashchit Nanda The Ten Principles Formulated by

Functors and natural transformations functors category morphisms natural transformations

A model-independent theory of -categories joint with Dominic Verity Joint International

Complete positivity for Mixed Unitary Categories Robin Cockett, and Priyaa V. Srinivasan FMCS

Braided notions of dialogue categories Paul-Andr Mellis CNRS & Universit Paris Denis

The Internal Structure of Coordinate Categories Roger Levy UC San Diego Department of

Learning Objectives At the end of the class you should be able to: Explain the components and the

GraBi : Communication-Efficient and Workload-Balanced Partitioning - PowerPoint PPT Presentation

The 49th International Conference on Parallel Processing (ICPP20) GraBi : Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs 1 Feng Sheng, 1 Qiang Cao, 2 Hong Jiang, and 1 Jie Yao 1 Huazhong University of Science

GPrioSwap: Towards a Swapping Policy for GPUs Jens Kehne , Jonathan Metter, Martin Merkel, Marius

Preference Formation in School Choice COMSOC Summer School on Matching Problems, Markets and

Cadet-Branching at U.S. Army Programs Tayfun S onmez, BC 1/68 Based on: Matching with

Programmable Packet Scheduling at Line Rate Anirudh Sivaraman , Suvinay Subramanian, Mohammad

Parallel &amp; Distributed Real-Time Systems Lecture #5 Risat Pathan Department of Computer

Notice of Intent (NOI) The Notice of Intent (NOI) is not a requirement for submitting a

Real-Time Embedded Systems Wan Fokkink Vrije Universiteit Amsterdam &amp; CWI Jane W.S. Liu

Static Scheduling in Clouds Thomas A. Henzinger Anmol V. Singh Vasu Singh Thomas Wies Damien

Welcome President, CHCI Event Facilitator 2015 HUMAN CAPITAL CONGRESS Dr. Robert Goldenkoff

The Well Visit Planner Orientation Webinar December 3, 2012 Engaging Parents as Partners to

To jump the queue or wait my turn? D. Manjunath IIT Bombay January 14, 2014 D. Manjunath (IIT

scheduling for real-time systems Lukas Pirl Operating Systems and Middleware Group Hasso

A Simple Algorithm for Solving Qualitative Probabilistic Parity Games Sven Schewe University of

State Strategies for Integrating Health Care and Housing for Homeless Individuals and Families

CSI5126 . Algorithms in bioinformatics Hidden Markov Models Marcel Turcotte School of Electrical

Logic Programming with PROFETA Corrado Santoro ARSLAB - Autonomous and Robotic Systems Laboratory

Transient Solidarities Charles Heckscher January, 2014 Shared sense of obligation Solidarity

Arya Samaj Principle: 9 th Principle By: Vipashchit Nanda The Ten Principles Formulated by

Functors and natural transformations functors category morphisms natural transformations

A model-independent theory of -categories joint with Dominic Verity Joint International

Complete positivity for Mixed Unitary Categories Robin Cockett, and Priyaa V. Srinivasan FMCS

Braided notions of dialogue categories Paul-Andr Mellis CNRS &amp; Universit Paris Denis

The Internal Structure of Coordinate Categories Roger Levy UC San Diego Department of

Learning Objectives At the end of the class you should be able to: Explain the components and the

Parallel & Distributed Real-Time Systems Lecture #5 Risat Pathan Department of Computer

Real-Time Embedded Systems Wan Fokkink Vrije Universiteit Amsterdam & CWI Jane W.S. Liu

Braided notions of dialogue categories Paul-Andr Mellis CNRS & Universit Paris Denis