DNA Interaction Follow Network Network User-Product Network - - PowerPoint PPT Presentation

dna interaction
SMART_READER_LITE
LIVE PREVIEW

DNA Interaction Follow Network Network User-Product Network - - PowerPoint PPT Presentation

Social Network Social Network Web Network DNA Interaction Follow Network Network User-Product Network Nonuniform network comm costs Nonuniform comp requirement Contentiousness of the memory Nonuniform comm requirement


slide-1
SLIDE 1
slide-2
SLIDE 2

Web Network Social Network Social Network User-Product Network Follow Network

DNA Interaction Network

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5

✓ Nonuniform network comm costs ✓ Contentiousness of the memory subsystems ✓ Nonuniform comp requirement ✓ Nonuniform comm requirement ✓ Time-varying skewness

slide-6
SLIDE 6

Architecture- and Workload-Aware Graph (Re)Partitioning

Aragon [BigGraphs’14]

(small dynamic graphs)

Paragon [EDBT’16]

(median-size dynamic graphs)

Planar [ICDE’16]

(large dynamic graphs)

Planar+ [To submit’17]

(large dynamic graphs)

Argo [BigData’16]

(static graphs)

Sargon [ICDE’17]

(skew-resistant)

slide-7
SLIDE 7

Architecture- and Workload-Aware Graph (Re)Partitioning

Aragon [BigGraphs’14]

(small dynamic graphs)

Paragon [EDBT’16]

(median-size dynamic graphs)

Planar [ICDE’16]

(large dynamic graphs)

Planar+ [To submit’17]

(large dynamic graphs)

Argo [BigData’16]

(static graphs)

Sargon [ICDE’17]

(skew-resistant)

slide-8
SLIDE 8

➢ ➢ ➢ ➢

slide-9
SLIDE 9

★ Migration Planning ○ What vertices to move? ○ Where to move? ★ Still beneficial? ★ Perform the Migration Plan Sk Sk+1 Sk+2 Sk+4 Sk+5

Planar Planar Planar Planar Planar

Phase-1: Logical Vertex Migration Phase-2: Physical Vertex Migration Phase-3: Convergence Check

○ Phase-1a: Minimizing Comm Cost ○ Phase-1b: Ensuring Balanced Partitions

slide-10
SLIDE 10

Phase-3: Convergence Check

★ Migration Planning ○ What vertices to move? ○ Where to move? ★ Still beneficial?

Phase-2: Physical Vertex Migration

★ Perform the Migration Plan Sk Sk+1 Sk+2 Sk+4 Sk+5

Planar Planar Planar Planar Planar

Phase-1: Logical Vertex Migration

○ Phase-1a: Minimizing Comm Cost ○ Phase-1b: Ensuring Balanced Partitions

Phase-2: Vertex Location Update

★ Each vertex has up-to-date locations of their neighbors

slide-11
SLIDE 11

Sk Sk+1 Sk+2 Sk+4 Sk+5

Converge Starts Repartitioning

Planar Planar Planar Planar Planar

Physical Vertex Migration

slide-12
SLIDE 12
slide-13
SLIDE 13

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 0 L1 L1

… …

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 1 L1 L1

… …

Machine 0

QPI/ HT

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 1 L1 L1

… …

Machine 1

QPI/ HT

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 0 L1 L1

… …

slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16
slide-17
SLIDE 17

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 0 L1 L1

… …

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 1 L1 L1

… …

Machine 0

QPI/ HT

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 1 L1 L1

… …

Machine 1

QPI/ HT

Memory

core core

… … …

L2 L2 L3

Memory Controller Inter-socket Link Controller

Socket 0 L1 L1

… …

slide-18
SLIDE 18

★ ★ ★ ★

1x 1.5x 1.7x 1.18x 2.8x Hours CPU Time Saving PARAGON

25h

PLANAR

27h

PLANAR+

43h

uniPLANAR+

10h

λ

slide-19
SLIDE 19

λ

○ ○

★ ★ ★ ★

1x 1.5x 1.7x 1.18x 2.8x Hours CPU Time Saving PARAGON

25h

PLANAR

27h

PLANAR+

43h

uniPLANAR+

10h

slide-20
SLIDE 20

○ ■ ■ ○ ○ ■ ■

slide-21
SLIDE 21

Architecture- and Workload-Aware Graph (Re)Partitioning

Aragon [BigGraphs’14]

(small dynamic graphs)

Paragon [EDBT’16]

(median-size dynamic graphs)

Planar [ICDE’16]

(large dynamic graphs)

Planar+ [To submit’17]

(large dynamic graphs)

Argo [BigData’16]

(static graphs)

Sargon [ICDE’17]

(skew-resistant)

slide-22
SLIDE 22

■ ■

slide-23
SLIDE 23
slide-24
SLIDE 24

Partitioner ... ... Vertex Stream

slide-25
SLIDE 25

○ ○

slide-26
SLIDE 26

○ ○

slide-27
SLIDE 27

∈ ✓

○ ○

Bottleneck Network Memory

slide-28
SLIDE 28

★ ★ ★

○ ○

slide-29
SLIDE 29

50x 38x 9x 4x 3x 6x 1x 1x 1x 9x 1.2x 12x

✓ ✓

★ ★ ★

slide-30
SLIDE 30

✓ ✓

★ ★ ★

slide-31
SLIDE 31
slide-32
SLIDE 32

m:s:c SSSP Execution Time (s) METIS LDG 1:2:8 633 2,632 2:2:4 654 2,565 4:2:2 521 631 8:2:1 222 280

9x

slide-33
SLIDE 33

m:s:c SSSP Execution Time (s) METIS LDG 1:2:8 633 2,632 2:2:4 654 2,565 4:2:2 521 631 8:2:1 222 280 m:s:c SSSP LLC Misses (in Millions) METIS LDG 1:2:8 10,292 44,117 2:2:4 10,626 44,689 4:2:2 2,541 1,061 8:2:1 96 187

9x 235x

slide-34
SLIDE 34

m:s:c SSSP LLC Misses (in Millions) METIS LDG 1:2:8 10,292 44,117 2:2:4 10,626 44,689 4:2:2 2,541 1,061 8:2:1 96 187 m:s:c SSSP Execution Time (s) METIS LDG 1:2:8 633 2,632 2:2:4 654 2,565 4:2:2 521 631 8:2:1 222 280

9x 235x

slide-35
SLIDE 35

m:s:c SSSP Execution Time (s) METIS LDG 1:2:8 633 2,632 2:2:4 654 2,565 4:2:2 521 631 8:2:1 222 280 m:s:c SSSP LLC Misses (in Millions) METIS LDG 1:2:8 10,292 44,117 2:2:4 10,626 44,689 4:2:2 2,541 1,061 8:2:1 96 187

9x 235x

✓ METIS had lower execution time and LLC misses than LDG.

Edge-cut matters.

Higher edge-cut-->higher comm-->higher contention

slide-36
SLIDE 36

○ ○

○ ○ ○

■ ■

slide-37
SLIDE 37

Architecture- and Workload-Aware Graph (Re)Partitioning

Aragon [BigGraphs’14]

(small dynamic graphs)

Paragon [EDBT’16]

(median-size dynamic graphs)

Planar [ICDE’16]

(large dynamic graphs)

Planar+ [To submit’17]

(large dynamic graphs)

Argo [BigData’16]

(static graphs)

Sargon [ICDE’17]

(skew-resistant)

slide-38
SLIDE 38
slide-39
SLIDE 39
slide-40
SLIDE 40
slide-41
SLIDE 41
slide-42
SLIDE 42
slide-43
SLIDE 43
  • Assign a label vector to each vertex to indicate:

○ the time periods the vertex is active in ○ whether it is a high- or low-degree vertex ○ the hotness of the vertex

slide-44
SLIDE 44

Partitioner ... ... Vertex Stream

slide-45
SLIDE 45
slide-46
SLIDE 46

Workloads BFS and SSSP (one randomly selected source vertex) Dataset Orkut (|V|=3M, |E|=234M) # of Traces Collected 5 Similarity Percentage of the vertices overlapped in the peak superstep

Workloads

  • Avg. Similarity
  • Std. Deviation

BFS 60.80% 8.43% SSSP 64.73% 10.63%

slide-47
SLIDE 47

★ ★ ★

1.68x 2x 1.57x 1x

✓ Up to 2x speedups (hours CPU time saving).

slide-48
SLIDE 48

○ ○ ○

○ ○

slide-49
SLIDE 49

Architecture- and Workload-Aware Graph (Re)Partitioning

Aragon [BigGraphs’14]

(small dynamic graphs)

Paragon [EDBT’16]

(median-size dynamic graphs)

Planar [ICDE’16]

(large dynamic graphs)

Planar+ [To submit’17]

(large dynamic graphs)

Argo [BigData’16]

(static graphs)

Sargon [ICDE’17]

(skew-resistant)

▪ ▪ ▪ ▪ ▪

Thanks!

slide-50
SLIDE 50