Planar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning
Angen Zheng, Alexandros Labrinidis, and Panos K. Chrysanthis
University of Pittsburgh
1
Planar: Parallel Lightweight Architecture-Aware Adaptive Graph - - PowerPoint PPT Presentation
Planar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning Angen Zheng , Alexandros Labrinidis, and Panos K. Chrysanthis University of Pittsburgh 1 Graph Partitioning Applications of Graph Partitioning Scientific
1
2
3
4
5
Figure 1. Pair-Wise Network Bandwidth (J. Xue, BigData’15) STD DEV: 416.82Mb/s STD DEV: 358.34Mb/s STD DEV: 269 . 71Mb/s
Balanced Graph (Re)Partitioning
Partitioners (static graphs) Repartitioners (dynamic graphs)
Offline Methods (High Quality)
(Poor Scalability)
Online Methods
(Moderate Quality) (High Scalability)
Offline Methods
(High Quality) (Poor Scalability)
Online Methods
(Moderate~High Quality) (High Scalability)
Architecture-Aware Architecture-Aware
6
7
Introduction Planar Evaluation Conclusions
8
Network Cost
Phase-1: Logical Vertex Migration Phase-2: Physical Vertex Migration Phase-3: Convergence Check
★ Migration Planning ○ What vertices to move? ○ Where to move? ★ Still beneficial? ★ Perform the Migration Plan Sk Sk+1 Sk+2 Sk+4 Sk+5
Planar Planar Planar Planar Planar
○ Phase-1a: Minimizing Comm Cost ○ Phase-1b: Ensuring Balanced Partitions
9
N1 N2 N3 N1
6 1
N2
6 1
N3
1 1
N1 N2 N3
6 6 1 1
10
■ make a migration decision on my own ■ Probabilistic vertex migration N1 N2 N3
6 6 1 1
11
N1 N2 N3
6 6 1 1
12
■ make a migration decision on my own ■ Probabilistic vertex migration
N1 N2 N3
6 6 1 1
13
■ make a migration decision on my own ■ Probabilistic vertex migration
Max Gain: 0 Optimal Dest: N1
N1 N2 N3
14
■ make a migration decision on my own ■ Probabilistic vertex migration
Max Gain: 0 Optimal Dest: N1
6 6 1 1
N1 N2 N3
6 1 1
N2 N3
6 6 1
N1
15
■ make a migration decision on my own ■ Probabilistic vertex migration
Max Gain: 9 Optimal Dest: N3
1
N1 N2 N3
1 1 1 1
Partition
N1
Boundary Vtx
a
Migration Dest N3 Gain
9
N2
b d
N3 N3
2 3 Migration Planning Probability
9/9
2/3 3/3 Max Gain
9
3
16
N1 N2 N3
6 6 1 1
N3
e g
N3 N3
17
Phase-1: Logical Vertex Migration Phase-2: Physical Vertex Migration Phase-3: Convergence Check
★ Migration Planning ○ What vertices to move? ○ Where to move? ★ Still beneficial? ★ Perform the Migration Plan Sk Sk+1 Sk+2 Sk+4 Sk+5
Planar Planar Planar Planar Planar
○ Phase-1a: Minimizing Comm Cost ○ Phase-1b: Ensuring Balanced Partitions
18
Phase-1: Logical Vertex Migration Phase-2: Physical Vertex Migration Phase-3: Convergence Check
★ Migration Planning ○ What vertices to move? ○ Where to move? ★ Still beneficial? ★ Perform the Migration Plan Sk Sk+1 Sk+2 Sk+4 Sk+5
Planar Planar Planar Planar Planar
○ Phase-1a: Minimizing Comm Cost ○ Phase-1b: Ensuring Balanced Partitions
19
Sk Sk+1 Sk+2 Sk+4 Sk+5
Converge Enough changes (structure/load)
Planar Planar Planar Planar Planar
20
21
22
wave 156.317 2,118,662 FEM auto 448,695 6,629,222 333SP 3,712,815 22,217,266 CA-CondMat 108,300 373, 756 Collaboration Network DBLP 317,080 1,049,866 Email-Eron 36,692 183,831 as-skitter 1,696,415 22,190,596 Internet Topology Amazon 334,863 925,872 Product Network USA-roadNet 23,947,347 58,333,344 Road Network roadNet-PA 1,090,919 6,167,592 YouTube 3,223,589 24,447,548 Social Network Com-LiveJournal 4,036,537 69,362,378 Friendster 124,836,180 3,612,134,270
Improv. Max Avg. HP 68% 53% DG 46% 24% LDG 69% 48%
24
25
Cluster Configuration PittMPICluster (FDR Infiniband) Gordon
(QDR Infiniband)
# of Nodes 32 1024 Network Topology Single Switch (32 nodes / switch) 4*4*4 3D Torus of Switches (16 nodes / switch) Network Bandwidth 56Gbps 8Gbps Node Configuration PittMPICluster (Intel Haswell) Gordon (Intel Sandy Bridge) # of Sockets 2 (10 cores / socket) 2 (8 cores / socket) L3 Cache 25MB 20MB Memory Bandwidth 65GB/s 85GB/s
26
Intra-Node Network Comm Cost Maximal Inter-Node Network Comm Cost Degree of Contention
(A. Zheng EDBT’16)
27
Balanced Graph (Re)Partitioning
Partitioners (static graphs) Repartitioners (dynamic graphs)
Offline Methods (High Quality)
(Poor Scalability)
Online Methods
(Moderate Quality) (High Scalability)
Offline Methods
(High Quality) (Poor Scalability)
Online Methods
(Moderate~High Quality) (High Scalability)
uniPlanar Initial Partitioner: DG
28
9x 7.5x 5.8x 4.1x 1.48x 1.37x 1x
29
★ as-skitter: |V|=1.6M, |E| = 22M ★ 60 Partitions: three 20-core machines
★ as-skitter: |V|=1.6M, |E| = 22M ★ 60 Partitions: three 20-core machines
Reduction Intra-Socket Inter-Socket
DG
51% 38%
METIS
51% 36%
PARMETIS
47% 34%
uniPLANAR
44% 28%
ARAGON
4.3% 0.8%
PARAGON
5.2% 2.6%
30
3.2x 1.05x 1.16x 1.21x
1x
31
★ as-skitter: |V|=1.6M, |E| = 22M ★ 48 Partitions: three 16-core machines
51% 25% 11% 0.1%
32
★ as-skitter: |V|=1.6M, |E| = 22M ★ 48 Partitions: three 16-core machines
Acknowledgments:
Funding:
33
34
35
Initial Partitioner: DG (Deterministic Greedy) # of Parts: 40 (two 20-core nodes)
36
★ friendster: |V| = 124M, |E|=3.6B ★ 60 of Partitions: three 20-core machines
Speedup (60 cores) DG 1.55x uniPLANAR 1.32x PARAGON 1.08x
37
38
★ friendster: |V| = 124M, |E|=3.6B ★ 60 of Partitions: three 20-core machines
Speedup (120 cores) DG 2.9x uniPLANAR 1.30x PARAGON 1.15x
★ friendster: |V| = 124M, |E|=3.6B ★ 60~120 of Partitions: three to six 20-core machines)
39
40
★ friendster: |V| = 124M, |E|=3.6B ★ 60~120 of Partitions: three to six 20-core machines)
41
Cached Send/Shared/Receive Buffer
42
Cached Send/Shared Buffer Cached Receive/Shared Buffer
43
Network Node#1 Node#2 RDMA-enabled
44
[1]. C. Binnig et. al. The End of Slow Networks: It’sTime for a Redesign. CoRR, 2015
★ Dual-socket Xeon E5v2 server with
○ DDR3-1600 ○ 2 FDR 4x NICs per socket
★ Infiniband: 1.7GB/s~37.5GB/s ★ DDR3: 6.25GB/s~16.6GB/s
45
Send Buffer Sending Core
Node#1
IB HCA Receive Buffer Sending Core
Node#2
IB HCA
46