regularities and dynamics in bisimulation reductions of
play

Regularities and dynamics in bisimulation reductions of big graphs - PowerPoint PPT Presentation

Regularities and dynamics in bisimulation reductions of big graphs Yongming Luo , George Fletcher, Jan Hidders, Paul De Bra and Yuqing Wu GRADES 2013 SIGMOD/PODS 2013 The research of YL, GF, JH and PD is supported by the Nether- lands


  1. Regularities and dynamics in bisimulation reductions of big graphs Yongming Luo , George Fletcher, Jan Hidders, Paul De Bra and Yuqing Wu GRADES 2013 • SIGMOD/PODS 2013 The research of YL, GF, JH and PD is supported by the Nether- lands Organisation for Scientific Research. The research of YW is supported by Research Foundation Flan- ders during her sabbatical visit to Hasselt University, Bel- gium. Where innovation starts

  2. Outline 2/11 Motivation 1 6 P1 Experimental setup 2 4 7 P3 Results 3 5 8 P4 P2 An example of Insights bisimulation reduction department of mathematics and computer science

  3. Bisimulation reduction 3/11 ◮ Bisimulation partitioning is an important concept in many fields (computer science, modal logic, etc.), in DB research as well (structural index, graph reduction) ◮ It can be seen as a way of clustering nodes 1 6 P1 Figure: Bisimulation partition example, partition block graph 2 4 7 P3 (reduction graph) 3 5 8 P4 { P 2 ↔ P 1 → P 3 → P 4 } P2 department of mathematics and computer science

  4. Bisimulation reduction 3/11 ◮ Bisimulation partitioning is an important concept in many fields (computer science, modal logic, etc.), in DB research as well (structural index, graph reduction) ◮ It can be seen as a way of clustering nodes 1 6 P1 Figure: Bisimulation partition example, partition block graph 2 4 7 P3 (reduction graph) 3 5 8 P4 { P 2 ↔ P 1 → P 3 → P 4 } P2 ◮ Reduce graph size while preserving structural properties (e.g., reachability) ◮ Result can be seen as a graph ◮ Many algorithms, no work on analyzing the results department of mathematics and computer science

  5. Questions 4/11 Regularities, such as power-law distribution exists in real graphs. department of mathematics and computer science

  6. Questions 4/11 Regularities, such as power-law distribution exists in real graphs. ◮ Do graphs under bisimulation reduction also have such properties? department of mathematics and computer science

  7. Questions 4/11 Regularities, such as power-law distribution exists in real graphs. ◮ Do graphs under bisimulation reduction also have such properties? ◮ How would that knowledge help us? department of mathematics and computer science

  8. Experimental setup for investigation 5/11 ◮ Big graphs, from 1 Million to 1.4 Billion edges (Twitter, DBPedia, etc.) ◮ One dynamic social graph, from 17 Million to 33 Million edges (Flickr-grow) ◮ State-of-the-art I/O efficient algorithm for computing bisimulation reductions (k-bisim, k = 10) ◮ We use cumulative distribution function (CDF) to present distributions department of mathematics and computer science

  9. Regularities - bisimulation result 6/11 Power-law also exists in many attributes for bisimulation partition results for real graphs . But this is not the case for synthetic graphs . department of mathematics and computer science

  10. Regularities - bisimulation result 7/11 Partition block size distribution real graphs synthetic graphs cumulative % of PB with ≥ x 10 0 10 0 10 − 1 10 − 1 10 − 2 10 − 2 10 − 3 10 − 3 10 − 4 10 − 4 10 − 5 10 − 5 10 − 6 10 − 6 10 − 7 10 − 7 10 − 8 10 0 10 2 10 4 10 6 10 0 10 2 10 4 10 6 10 8 x (# of nodes per PB) x (# of nodes per PB) Jamendo LinkedMDB DBLP DBPedia WikiLinks Twitter Flickr-Grow BSBM SP2B Power Random department of mathematics and computer science

  11. Regularities - bisimulation result 8/11 Bisimulation graph in/out-degree distribution real graphs synthetic graphs cumulative % of N k with ≥ x 10 0 10 0 10 − 1 10 − 1 10 − 2 10 − 2 10 − 3 10 − 3 10 − 4 10 − 4 10 − 5 10 − 5 10 − 6 10 − 6 10 − 7 10 − 7 10 − 8 10 0 10 2 10 4 10 6 10 8 10 0 10 2 10 4 10 6 10 8 x (in-degree) x (in-degree) Jamendo LinkedMDB DBLP DBPedia WikiLinks Twitter Flickr-Grow BSBM SP2B Power Random department of mathematics and computer science

  12. Dynamics - a real growing social graph 9/11 ◮ Does the bisimulation result grow when the original graph grows? department of mathematics and computer science

  13. Dynamics - a real growing social graph 9/11 ◮ Does the bisimulation result grow when the original graph grows? • Yes. department of mathematics and computer science

  14. Dynamics - a real growing social graph 9/11 ◮ Does the bisimulation result grow when the original graph grows? • Yes. ◮ How fast does it grow? department of mathematics and computer science

  15. Dynamics - a real growing social graph 9/11 ◮ Does the bisimulation result grow when the original graph grows? • Yes. ◮ How fast does it grow? • Linearly with respect to the original graph. | N k | | E k | | N | | E | department of mathematics and computer science

  16. Insights 10/11 ◮ Power-law distributions in bisimulation results ⇒ skew expected in applications (indexes, data partitioned among machines, . . . ) department of mathematics and computer science

  17. Insights 10/11 ◮ Power-law distributions in bisimulation results ⇒ skew expected in applications (indexes, data partitioned among machines, . . . ) ◮ Behaviors of graph generators ⇒ some more work needs to be done for graph generators department of mathematics and computer science

  18. Insights 10/11 ◮ Power-law distributions in bisimulation results ⇒ skew expected in applications (indexes, data partitioned among machines, . . . ) ◮ Behaviors of graph generators ⇒ some more work needs to be done for graph generators ◮ Bisimulation result/graph grows ⇒ lower k or other adaptations (e.g., choose different k for different parts of the graph, different node/edge labeling) department of mathematics and computer science

  19. 11/11 Thank you! Q&A For more information, just google seeqr project or visit: bit.ly/seeqr department of mathematics and computer science

  20. Definition of k - bisimilar 11/11 Definition Let k be a non-negative integer and G = � N , E , λ N , λ E � be a graph. Nodes u , v ∈ N are called k - bisimilar (denoted as u ≈ k v ), iff the following holds: 1. λ N ( u ) = λ N ( v ) , 2. if k > 0, then for any edge ( u , u ′ ) ∈ E , there exists an edge ( v , v ′ ) ∈ E , such that u ′ ≈ k − 1 v ′ and λ E ( u , u ′ ) = λ E ( v , v ′ ) , and 3. if k > 0, then for any edge ( v , v ′ ) ∈ E , there exists an edge ( u , u ′ ) ∈ E , such that v ′ ≈ k − 1 u ′ and λ E ( v , v ′ ) = λ E ( u , u ′ ) . department of mathematics and computer science

  21. Definition of k - bisimilar 11/11 Definition Let k be a non-negative integer and G = � N , E , λ N , λ E � be a graph. Nodes u , v ∈ N are called k - bisimilar (denoted as u ≈ k v ), iff the following holds: 1. λ N ( u ) = λ N ( v ) , 2. if k > 0, then for any edge ( u , u ′ ) ∈ E , there exists an edge ( v , v ′ ) ∈ E , such that u ′ ≈ k − 1 v ′ and λ E ( u , u ′ ) = λ E ( v , v ′ ) , and 3. if k > 0, then for any edge ( v , v ′ ) ∈ E , there exists an edge ( u , u ′ ) ∈ E , such that v ′ ≈ k − 1 u ′ and λ E ( v , v ′ ) = λ E ( u , u ′ ) . M M w 1 2 In this example graph, nodes w l l 1 and 2 are 0- and 1- bisimilar l l but not 2-bisimilar. 4 5 3 6 l P P P P department of mathematics and computer science

  22. Regularities - original graphs 11/11 Power-law exists in in/out-degree distribution for most of the examined graphs. cumulative % of nodes with ≥ x real graphs synthetic graphs 10 0 10 0 10 − 1 10 − 1 10 − 2 10 − 2 10 − 3 10 − 3 10 − 4 10 − 4 10 − 5 10 − 5 10 − 6 10 − 6 10 − 7 10 − 7 10 − 8 10 − 8 10 0 10 2 10 4 10 6 10 0 10 2 10 4 10 6 10 8 x (in-degree) x (in-degree) Jamendo LinkedMDB DBLP DBPedia WikiLinks Twitter Flickr-Grow BSBM SP2B Power Random department of mathematics and computer science

  23. Signature length 11/11 cumulative % of nodes with ≥ x real graphs synthetic graphs 10 0 10 0 10 − 1 10 − 1 10 − 2 10 − 2 10 − 3 10 − 3 10 − 4 10 − 4 10 − 5 10 − 5 10 − 6 10 − 6 10 − 7 10 − 7 10 − 8 10 0 10 2 10 4 10 6 10 0 10 1 10 2 10 3 10 4 x (signature length) x (signature length) Jamendo LinkedMDB DBLP DBPedia WikiLinks Twitter Flickr-Grow BSBM SP2B Power Random department of mathematics and computer science

  24. Out-degree 11/11 real graphs synthetic graphs cumulative % of N k with ≥ x 10 0 10 0 10 − 1 10 − 1 10 − 2 10 − 2 10 − 3 10 − 3 10 − 4 10 − 4 10 − 5 10 − 5 10 − 6 10 − 6 10 − 7 10 − 7 10 − 8 10 0 10 2 10 4 10 6 10 0 10 2 10 4 10 6 10 8 x (out-degree) x (out-degree) Jamendo LinkedMDB DBLP DBPedia WikiLinks Twitter Flickr-Grow BSBM SP2B Power Random department of mathematics and computer science

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend