Parallel Triangle Counting in MPI
Jason Li and David Wise
Parallel Triangle Counting in MPI Jason Li and David Wise - - PowerPoint PPT Presentation
Parallel Triangle Counting in MPI Jason Li and David Wise Background A triangle in a undirected graph is a collection of 3 vertices such that all 3 pairs of vertices are connected by an edge. Triangle counting has emerged as an
Jason Li and David Wise
graph is a collection of 3 vertices such that all 3 pairs of vertices are connected by an edge.
emerged as an important building block in the study of social networks, identifying thematic structures of networks, spam and fraud detection, link classification and recommendation, and more” [1]
graph is a collection of 3 vertices such that all 3 pairs of vertices are connected by an edge.
emerged as an important building block in the study of social networks, identifying thematic structures of networks, spam and fraud detection, link classification and recommendation, and more” [1]
This graph has 2 triangles:
Similarly, sort the adjacency lists according to the same ordering.
respectively.
store the graph
each edge is assigned to its smaller neighbor, it can be shown that the sequential time complexity is O(m3/2).
this algorithm
assigned to a processor
needs to know the neighbors of each vertex on the edge
processor needs a copy of the whole graph
as few vertices as possible
vertices
v1, …, vr and assign each processor a pair (vi, vj)
for all edges going from a vertex in vi to vj.
each processor gets 1/P edges: we expect near perfect speedup
the adjacency lists
which on average has total size 2m/√P
4 processors 2 groups of vertices P1 P2 P3 P4
The thick arrows represent groups of edges.
Time (s) Number of Processors 1 4 9 16 25 36 49 64
gplus k5000 live skitter Speedup 17.23 12.72 1.95 3.98
Tangwongsan, Srikanta Tirthapura, Kun-Lung Wu, Proceedings of the VLDB Endowment VLDB Endowment Hompage archive, Volume 6 Issue 14, September 201, Pages 1870-1881.
cs/user/shuhaoy/www/Final_Project.pdf.