Darwini: Generating realistic large-scale social graphs
Dionysios Logothetis (Facebook)
Cheng Wang (University of Houston)
Sergey Edunov (Facebook)
Avery Ching (Facebook)
Maja Kabiljo (Facebook)
Benchmark Graphs
[Figure: edge and vertex counts of the benchmark graphs Clueweb 09, Twitter research, Friendster, and Yahoo! web.]
Benchmark to Social Graphs
[Figure: edge and vertex counts of the same benchmark graphs next to the approximate 2015 Twitter and 2015 Facebook graphs.]
[Figure: run time difference (%) for Page Rank, CC, EIG, and BP on graphs generated by BTER and Kronecker.]
Original Graph → Measure → Darwini → Generated Graph
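The slides do not spell out what the measurement stage computes. Assuming it collects each vertex's degree and local clustering coefficient (the two quantities the generator later assigns), a minimal sketch could look like the following. The adjacency-dictionary input format and the function name `measure` are illustrative assumptions, not part of the original talk.

```python
from itertools import combinations

def measure(adj):
    """Return {vertex: (degree, local clustering coefficient)}.

    adj is a hypothetical input format: {vertex: set of neighbours}
    for an undirected graph without self-loops.
    """
    stats = {}
    for v, nbrs in adj.items():
        d = len(nbrs)
        # Triangles through v = neighbour pairs that are themselves connected.
        triangles = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        possible = d * (d - 1) // 2
        cc = triangles / possible if possible else 0.0
        stats[v] = (d, cc)
    return stats

# Small example: a triangle 0-1-2 with a pendant vertex 3 attached to 2.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
stats = measure(adj)
```

This brute-force version inspects every neighbour pair, so it only suits small graphs; at social-graph scale the measurement would need a distributed triangle count.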
Create vertices.
Assign an expected degree and clustering coefficient to each vertex.
Group together vertices that expect the same number of triangles.
Create random edges within each group.
Create random edges between groups.
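The grouping step relies on the number of triangles each vertex expects. By the definition of the local clustering coefficient, a vertex with degree d and coefficient c takes part in c · d(d − 1)/2 triangles. The sketch below applies that identity and groups vertices by the result; rounding to the nearest integer and the names `expected_triangles` and `group_by_triangles` are assumptions made for illustration.

```python
from collections import defaultdict

def expected_triangles(degree, cc):
    """Triangles implied by a degree and a local clustering coefficient."""
    # cc = triangles / (d choose 2), so triangles = cc * d * (d - 1) / 2.
    return round(cc * degree * (degree - 1) / 2)

def group_by_triangles(targets):
    """targets: {vertex: (degree, cc)} -> {triangle count: [vertices]}."""
    groups = defaultdict(list)
    for v, (d, cc) in targets.items():
        groups[expected_triangles(d, cc)].append(v)
    return dict(groups)

# Hypothetical per-vertex targets: (expected degree, clustering coefficient).
targets = {"a": (4, 0.5), "b": (3, 1.0), "c": (4, 0.5), "d": (2, 0.0)}
groups = group_by_triangles(targets)
```

Here vertices "a", "b", and "c" all expect three triangles and land in one group, while "d" expects none.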
Group vertices that are expected to participate in the same number of triangles together.
Limit the size of each bucket so that we don’t exceed the expected degree.
After this step, we will have enough triangles to get the right clustering coefficient.
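As a rough illustration of the bucket size limit: a vertex in a bucket of n vertices can gain at most n − 1 edges there, so capping the bucket at (smallest expected degree + 1) vertices keeps every member within its expected degree. The sketch below applies that cap and then creates random edges inside a bucket. The sort order, the function names, and the edge probability `p` are assumptions; the slides do not give the probability, so the caller supplies it.

```python
import random
from itertools import combinations

def split_into_buckets(vertices, degree):
    """Split one group into buckets that respect the degree cap."""
    buckets, current = [], []
    # Sorting by degree puts the smallest degree first in each bucket (an assumption).
    for v in sorted(vertices, key=lambda u: degree[u]):
        # current[0] has the smallest degree, so it sets the size limit.
        if current and len(current) > degree[current[0]]:
            buckets.append(current)
            current = []
        current.append(v)
    if current:
        buckets.append(current)
    return buckets

def connect_within(bucket, p, rng):
    """Create each possible edge inside the bucket with probability p."""
    return [(a, b) for a, b in combinations(bucket, 2) if rng.random() < p]

# Seven vertices that each expect degree 2: buckets hold at most 3 vertices.
degree = {v: 2 for v in range(7)}
buckets = split_into_buckets(range(7), degree)
rng = random.Random(0)
edges = connect_within(buckets[0], 1.0, rng)  # p = 1.0 gives a full triangle
```

With `p = 1.0` the first bucket becomes a complete triangle, the densest case the cap allows; smaller values of `p` leave room for edges added in later steps.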
For each vertex that doesn’t have enough edges yet, pick a random vertex and create an edge if that other vertex doesn’t have enough edges either.
Hard to find counterparts for high-degree vertices
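The random matching described here can be sketched as follows. This is a single-machine illustration under stated assumptions, not the authors' implementation: the function name `fill_remaining` and the `max_attempts` cut-off are hypothetical, the latter added only so the loop terminates when no suitable partner is found.

```python
import random

def fill_remaining(target_degree, edges, rng, max_attempts=100):
    """Add random edges between vertices that are still below their target degree."""
    adj = {v: set() for v in target_degree}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    vertices = list(target_degree)
    new_edges = []
    for v in vertices:
        attempts = 0
        while len(adj[v]) < target_degree[v] and attempts < max_attempts:
            attempts += 1
            u = rng.choice(vertices)
            if u == v or u in adj[v]:
                continue  # skip self-loops and duplicate edges
            if len(adj[u]) < target_degree[u]:
                # Both endpoints still need edges, so connect them.
                adj[v].add(u)
                adj[u].add(v)
                new_edges.append((v, u))
    return new_edges

rng = random.Random(0)
target = {0: 2, 1: 2, 2: 1, 3: 1}
existing = [(0, 1)]
added = fill_remaining(target, existing, rng)
```

Because a partner is accepted only if it also lacks edges, no vertex ever exceeds its target degree, but a vertex can exhaust its attempts and stay below target.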
groups.
group, connect them with probability
[Figure: the original graph compared with graphs generated by Darwini, BTER, and Kronecker.]
[Figure: run time difference (%) for Page Rank, CC, EIG, and BP on graphs generated by Darwini, BTER, and Kronecker.]