Graph Sampling and Sparsification
Lecture 19 CSCI 4974/6971 7 Nov 2016
1 / 10
Graph Sampling and Sparsification Lecture 19 CSCI 4974/6971 7 Nov - - PowerPoint PPT Presentation
Graph Sampling and Sparsification Lecture 19 CSCI 4974/6971 7 Nov 2016 1 / 10 Todays Biz 1. Reminders 2. Review 3. Graph Sampling/Sparsification 2 / 10 Reminders Assignment 4: due date November 10th Setting up and running on CCI
1 / 10
2 / 10
◮ Assignment 4: due date November 10th
◮ Setting up and running on CCI clusters
◮ Assignment 5: due date TBD (before Thanksgiving
◮ Assignment 6: due date TBD (early December) ◮ Tentative: No class November 14 and/or 17 ◮ Final Project Presentation: December 8th ◮ Project Report: December 11th ◮ Office hours: Tuesday & Wednesday 14:00-16:00 Lally
◮ Or email me for other availability 3 / 10
4 / 10
◮
5 / 10
6 / 10
7 / 10
* Computer Science and Information Engineering, National Taiwan University
# Institute of Information Science, Academic Sinica
sdlin@csie.ntu.edu.tw, miyen@iis.sinica.edu.tw, d98944005@csie.ntu.edu.tw Tutorial slides can be downloaded here: http://mslab.csie.ntu.edu.tw/tut‐pakdd13/
13/05/02 Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 2
by Paul Butler
3
What can be mined from this picture?
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 4
It costs >1TB memory to simply save the raw graph data (without attributes, labels nor content)
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 5
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 6
Link Types Friend Family Love Link Types Friend
13/05/02 7 Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 9
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 10
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 11
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 12
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 13
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 14
Seeds (i.e., ego)
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 16
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 17
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 18
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 19
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 20
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 21
S: the set of sampled nodes, N(S): the 1st neighbor set of S
∈
E G H F A B C D |N({A})|=4 |N({E}) – N({A}) ∪{A}|=|{F,G,H}|=3 |N({D}) – N({A}) ∪{A}|=|{F}|=1
qk ‐ sampled
node degree distribution
pk ‐ real node
degree distribution
13/05/02 23
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 24
1: : ≔ 1: : ≔ with probability : ≔ with probability 1
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 26
13/05/02 Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 27
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 28
Sampled Network Original Network
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 29
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 30
31
respondents
limited coupon c limited coupon c limited coupon c
S11 S12 S13 S21 S22 S23 S31 S32 S33
N‐step transition
P1 P2 P3 Transition Matrix
steady‐state vector
13/05/02 Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 32
Similarity of node type‐distribution Similarity of Intra‐link distribution
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 33
– Capture the dependency between each Node Type(NT) and Edge Type(ET) of a directed Heterogeneous Network – Consists of 4 Relational Matrices
NT ET NT Transition Matrix Transition Matrix ET Transition Matrix Transition Matrix
paper cites cites journal_of authored author
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 34
P A C J c p a P 0.44 0.22 0.22 0.11 0.44 0.33 0.22 A 1 1 C 1 1 J 1 1 c 1 0.22 0.44 0.33 p 0.5 0.33 0.17 0.66 0.33 a 0.5 0.5 0.6 0.4 P A C J c p a P 0.182 0.364 0.091 0.273 0.182 0.364 0.364 A 1 1 C 1 1 J 1 1 c 1 0.5 0.5 p 0.5 0.125 0.375 0.17 0.5 0.33 a 0.5 0.5 0.22 0.33 0.44 Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 35
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 36
D(v, Gs) = estimated change of RP given sampling v on the current graph Gs =E[ΔP(Gs, Gs+v)|Gs] , where ΔP = RMSERP
Exploiting the existing RP, P(type(v)=t|Gs) can be
v
which can be calculated as
v RP(type |type ) RP(type |type ) RP(type |type ) RP(type |type )
P(type|type) can be obtained from the existing RP
Gs
民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 1 5 9 13 17 21 25 29 33 37 41 45 49 Kendall‐Tau # Nodes Sampled (in 10s)
RW HDS RPS
民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 1 5 9 13 17 21 25 29 33 37 41 45 49 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 1 6 11 16 21 26 31 36 41 46
Hep Aca Movie Type dependency preservation Preserving relative node weights propagated throughout entire network
民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 民國前/通用格式 A c c u r a c y number of sampled nodes highDeg RandWalk RPS
Node Type Prediction Missing Relation Prediction
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 41
Lin et al., Sampling and Summarization for Social Networks, PAKDD 2013 tutorial 13/05/02 42
[Leskovec’06] [Adamic’01] [Ahmed’12][Ribeiro’10] [Kurant’12]
[Krishnamurthy’05] [Leskovec’06][Hubler’08] [Gjoka’10][Ribeiro’10] [Maiya’11][Kurant’11] [Gjoka’11][Li’11][Kurant’12] [Yang’13]
[Maiya’10][Satuluri’11][Mathioudakis’11] [Vattani’11][Ahmed’12]
a smart sampling strategy is needed)
8 / 10
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder
Department of Computer Science and Engineering, Heritage Institute of Technology, Kolkata, India
September 5, 2016
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Figure: The tendency of people to live in racially homogeneous neighborhoods[1]. In yellow and
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V.
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Figure: Zachary’s Karate Club Network
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Figure: Zachary’s Karate Club Network
C = {C1, C2, C3}, C1 = yellow nodes, C2 = green, C3 = blue is a disjoint cover
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Figure: Zachary’s Karate Club Network
C = {C1, C2, C3}, C1 = yellow nodes, C2 = green, C3 = blue is a disjoint cover However, ¯ C = { ¯ C1, ¯ C2}, ¯ C1 = yellow & green nodes and ¯ C2 = blue & green nodes is an overlapping cover
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
For a given graph G(V, E), find a cover C = {C1, C2, ..., Ck} such that
i
Ci = V. For disjoint communities, ∀i, j we have Ci Cj = ∅ For overlapping communities, ∃i, j where Ci Cj = ∅
Figure: Zachary’s Karate Club Network
C = {C1, C2, C3}, C1 = yellow nodes, C2 = green, C3 = blue is a disjoint cover However, ¯ C = { ¯ C1, ¯ C2}, ¯ C1 = yellow & green nodes and ¯ C2 = blue & green nodes is an overlapping cover For our problem, we concentrate on disjoint community detection
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Top 6 edges Edge cB(e) Type (10, 13) 0.3 inter (3, 5) 0.23333 inter (7, 15) 0.2079 inter (1, 8) 0.1873 inter (13, 15) 0.1746 intra (5, 7) 0.1476 intra
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Top 6 edges Edge cB(e) Type (10, 13) 0.3 inter (3, 5) 0.23333 inter (7, 15) 0.2079 inter (1, 8) 0.1873 inter (13, 15) 0.1746 intra (5, 7) 0.1476 intra Bottom 6 edges Edge cB(e) Type (8, 11) 0.022 intra (1, 2) 0.0269 intra (9, 11) 0.031 intra (8, 9) 0.0412 intra (12, 15) 0.052 intra (3, 4) 0.060 intra
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality The algorithm
1 Compute centrality for all edges Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality The algorithm
1 Compute centrality for all edges 2 Remove edge with largest centrality; ties can be broken randomly Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality The algorithm
1 Compute centrality for all edges 2 Remove edge with largest centrality; ties can be broken randomly 3 Recalculate the centralities on the running graph Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Michelle Girvan and Mark Newman[2] in 2002 The Key Ideas Based on reachability of nodes - shortest paths Edges are selected on the basis of the edge betweenness centrality The algorithm
1 Compute centrality for all edges 2 Remove edge with largest centrality; ties can be broken randomly 3 Recalculate the centralities on the running graph 4 Iterate from step 2, stop when you get clusters of desirable quality Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
(a) Best edge: (10, 13) (f) Final graph (b) Best edge: (3, 5) (e) Best edge: (2, 11) (c) Best edge: (7, 15) (d) Best edge: (1, 8)
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Blondel et al[3] in 2008
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Blondel et al[3] in 2008 Takes the greedy maximization approach
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Blondel et al[3] in 2008 Takes the greedy maximization approach Very fast in practice, it’s the current state-of-the-art in disjoint community detection
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Blondel et al[3] in 2008 Takes the greedy maximization approach Very fast in practice, it’s the current state-of-the-art in disjoint community detection Performs hierarchical partitioning, stopping when there cannot be any further improvement in modularity
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Proposed by Blondel et al[3] in 2008 Takes the greedy maximization approach Very fast in practice, it’s the current state-of-the-art in disjoint community detection Performs hierarchical partitioning, stopping when there cannot be any further improvement in modularity Contracts the graph in each iteration thereby speeding up the process
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
1
Building Community Preserving Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
2
Fast Detection of Communities from the Sparsified Network Methodology and Visualizations Experimental Results
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Input: An unweighted network G(V, E) Output: A disjoint cover C
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Input: An unweighted network G(V, E) Output: A disjoint cover C
1 Use Jaccard coefficient to turn G into a weighted network G(V, E, W) Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Input: An unweighted network G(V, E) Output: A disjoint cover C
1 Use Jaccard coefficient to turn G into a weighted network G(V, E, W) 2 Construct an t-spanner of G(V, E, W). Take the complement of GS, call it Gcomm Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Input: An unweighted network G(V, E) Output: A disjoint cover C
1 Use Jaccard coefficient to turn G into a weighted network G(V, E, W) 2 Construct an t-spanner of G(V, E, W). Take the complement of GS, call it Gcomm 3 Use LINCOM to break Gcomm into small but pure fragments Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Input: An unweighted network G(V, E) Output: A disjoint cover C
1 Use Jaccard coefficient to turn G into a weighted network G(V, E, W) 2 Construct an t-spanner of G(V, E, W). Take the complement of GS, call it Gcomm 3 Use LINCOM to break Gcomm into small but pure fragments 4 Use the second phase of Louvain Method to piece all the small bits and pieces
together to get C
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Definition
where Γ(vi) is the neighborhood of the node vi ∴ wJ ∈ [0, 1] Jaccard works well in domains where local influence is important[4][5][6]
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Definition
where Γ(vi) is the neighborhood of the node vi ∴ wJ ∈ [0, 1] Jaccard works well in domains where local influence is important[4][5][6] The computation takes O(m) time
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Table: Jaccard weight statistics for top 10% edges in terms of wJ.
Network |E| intra-cluster top 10% edges in terms of wJ edge count Total edges Intra-edge Fraction Karate 78 21 7 7 1 Dolphin 159 39 15 15 1 Football 613 179 61 61 1 Les-Mis 254 56 25 25 1 Enron 180,811 48,498 18,383 18,220 0.99113 Epinions 405,739 146,417 40,573 36,589 0.90180 Amazon 925,872 54,403 92,587 92,584 0.99996 DBLP 1,049,866 164,268 104,986 104,986 1
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
A (α, β)-spanner of a graph G = (V, E, W) is a subgraph GS = (V, ES, WS), such that,
Authors Size Running Time Alth¨
O(n1+ 1
k )
O(m(n1+ 1
k + nlogn))
Alth¨
1 2n1+ 1
k
O(mn1+ 1
k )
Roddity et al. [2004] [8]
1 2n1+ 1
k
O(kn2+ 1
k )
Roddity et al. [2005] [9] O(kn1+ 1
k )
O(km) (det.) Baswana and Sen [2007] [10] O(kn1+ 1
k )
O(km) (rand.)
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
A (α, β)-spanner of a graph G = (V, E, W) is a subgraph GS = (V, ES, WS), such that,
A t-spanner is a special case of (α, β) spanner where α = t and β = 0 Authors Size Running Time Alth¨
O(n1+ 1
k )
O(m(n1+ 1
k + nlogn))
Alth¨
1 2n1+ 1
k
O(mn1+ 1
k )
Roddity et al. [2004] [8]
1 2n1+ 1
k
O(kn2+ 1
k )
Roddity et al. [2005] [9] O(kn1+ 1
k )
O(km) (det.) Baswana and Sen [2007] [10] O(kn1+ 1
k )
O(km) (rand.)
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: Original network n = 11, m = 18 δ(1, 5) = 5 Figure: A 3-spanner of the network n = 11, m = 11 δs(1, 5) = 12
Since δs(1, 5) < t . δ(1, 5), the edge (1, 5) is discarded The other edges are discarded in a similar fashion.
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: Dolphin network. n = 62, m = 159
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: 3-spanner. n = 62, m = 150
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: 5-spanner. n = 62, m = 148
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: 7-spanner. n = 62, m = 144
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: 9-spanner. n = 62, m = 138
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Name n Spanner #intra-community #inter-community Karate 34 Original 59 19 3 57 19 5 53 19 7 51 18 9 48 19 Dolphin 59 Original 120 39 3 117 38 5 102 38 7 100 38 9 90 38 Football 115 Original 447 163 3 385 166 5 376 166 7 293 166 9 286 165
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
Building Community Preserving Sparsified Network Fast Detection of Communities from the Sparsified Network Assigning Meaningful Weights to Edges Sparsification using t-spanner
Figure: Original US Football network Figure: Sparsified network Gcomm Figure: Final network with communities marked as separate components
Partha Basuchowdhuri, Satyaki Sikdar, Sonu Shreshtha, Subhasis Majumder Detecting Community Structures in Social Networks by Graph Sparsification
◮ Implement node and edge sampling methods ◮ Compare their efficacy on various networks
9 / 10
10 / 10