GraFBoost:
Using accelerated flash storage for external graph analytics
Sang-Woo Jun, Andy Wright, Sizhuo Zhang, Shuotao Xu and Arvind
MIT CSAIL
1
GraFBoost: Using accelerated flash storage for external graph - - PowerPoint PPT Presentation
GraFBoost: Using accelerated flash storage for external graph analytics Sang-Woo Jun, Andy Wright, Sizhuo Zhang, Shuotao Xu and Arvind MIT CSAIL Funded by: 1 Large Graphs are Found Everywhere in Nature Human neural Structure of Social
Sang-Woo Jun, Andy Wright, Sizhuo Zhang, Shuotao Xu and Arvind
1
2
1) Connectomics graph of the brain - Source: “Connectomics and Graph Theory” Jon Clayden, UCL 2) (Part of) the internet - Source: Criteo Engineering Blog 3) The Graph of a Social Network – Source: Griff’s Graphs
3
4
5
6
7
8
9
Sort
10
11
12
13
Edge Property Vertex Value
Update Log (xs)
14
“Distributed GraphLab: a framework for machine learning and data mining in the cloud,” VLDB 2012 “FlashGraph: Processing billion-node graphs on an array of commodity SSDs,” FAST 2015 “X-Stream: edge-centric graph processing using streaming partitions,“ SOSP 2013 “GraphChi: Large-scale graph computation on just a PC,“ USENIX 2012
15
16
SE1 SE2 GraFBoost GraFBoost2
17
SE1 SE2 GraFBoost GraFBoost2
18
SE1 SE2 GraFBoost GraFBoost2
19
20
Normalized Performance IN SE1 SE2 EX GraFBoost GraFBoost2
Fastest! Slowest Slowest Slowest GraFSoft
21
Normalized Performance 5xIN SE1 SE2 EX GraFBoost
GraFSoft
22
20 40 60 80 100
Conventional GraFBoost
5 10 15 20 25 30 35
Conventional GraFBoost
200 400 600 800
Conventional GraFBoost
23
24
25
26 Host Server (24-Core) FPGA (VC707) minFlash minFlash Host Server (24-Core) FPGA (VC707) minFlash minFlash Host Server (24-Core) FPGA (VC707) minFlash minFlash
PCIe 4GB/s FMC Ethernet 10Gbps network ×8 1 TB
27