Local Algorithms and Large Scale Graph Mining
Silvio Lattanzi (Google Research NY) Charles River Workshop on Private Analysis of Social Networks
Local Algorithms and Large Scale Graph Mining Silvio Lattanzi - - PowerPoint PPT Presentation
Local Algorithms and Large Scale Graph Mining Silvio Lattanzi (Google Research NY) Charles River Workshop on Private Analysis of Social Networks Outline Problem and challenges Graph clustering, computation limitations. Local random walk and
Silvio Lattanzi (Google Research NY) Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
adjacent edges.
?
Charles River Workshop on Private Analysis of Social Networks
adjacent edges.
?
Charles River Workshop on Private Analysis of Social Networks
adjacent edges.
?
Charles River Workshop on Private Analysis of Social Networks
adjacent edges.
?
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
adjacent edges.
?
Joint work with: Alessandro Epasto (Sapienza University) Jon Feldman (Google Research NY) Stefano Leonardi (Sapienza University) Vahab Mirrokni (Google Research NY) WWW 2014
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Common neighbors: 2
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Common neighbors: 2 Jaccard similarity:
1 2
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Common neighbors: 2 Jaccard similarity:
1 2
Number of paths: 2
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Common neighbors: 2 Jaccard similarity:
1 2
Number of paths: 2 Short random walk(PPR)
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
– With probability , go back to v. – With probability , go to a neighbor uniformly at random.
v
1 2α 1 2(1 − α)
V
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
Millions of campaigns and hundreds of millions
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v u v w z w z
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v u v w z w z
w(u, v) = X
q∈N(u)∩N(v)
w(u, q)w(q, v) d(q)
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries u v u v w z w z
w(u, v) = X
q∈N(u)∩N(v)
w(u, q)w(q, v) d(q)
PPR(u, v)B = 1 2 − αPPR(u, v)G
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
New York and sport
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
New York and pizza
Charles River Workshop on Private Analysis of Social Networks
Campaigns Queries
Also in this setting by using some pre-computation we can compute the PPR efficiently.
Joint work with: Raimondas Kiveris (Google Research NY) Vahab Mirrokni (Google Research NY)
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
* On public graphs with 8M nodes
* On YouTube co-watch Graph with 100M nodes with 100s of machines
* For sybil detection in social networks
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
v
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
1 17
Charles River Workshop on Private Analysis of Social Networks
Spectral algorithms [Jerrum&Sinclair’89] [Leighten-Rao’99] [Arora-Rao-Vazirani’04]
φ(S) = O( p log n)φ
φ(S) = O(log n)φ
φ(S) = O( p φ)
Charles River Workshop on Private Analysis of Social Networks
Spectral algorithms [Jerrum&Sinclair’89] [Leighten-Rao’99] [Arora-Rao-Vazirani’04]
φ(S) = O( p log n)φ
φ(S) = O(log n)φ
φ(S) = O( p φ)
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Approximation guarantee Running time Truncated random walk [Spielman-Teng’04] Truncated random walk [Spielman-Teng’08] PageRank random walk [Andersen-Chung-Lang’06] Evolving Set [Andersen-Peres’08] Evolving Set [Gharan-Trevisan’12]
φ
1 3 log 2 3 n
p φ log
3 2 n
p φ log n p φ log n
r
˜ O ✓V ol(S) φ5/3 ◆ ˜ O ✓V ol(S) φ2 ◆ ˜ O ✓V ol(S) φ ◆ ˜ O ✓V ol(S) √φ ◆
˜ O ✓V ol(S)1+✏ √φ ◆
Charles River Workshop on Private Analysis of Social Networks
Approximation guarantee Running time Truncated random walk [Spielman-Teng’04] Truncated random walk [Spielman-Teng’08] PageRank random walk [Andersen-Chung-Lang’06] Evolving Set [Andersen-Peres’08] Evolving Set [Gharan-Trevisan’12]
φ
1 3 log 2 3 n
p φ log
3 2 n
p φ log n p φ log n
r
˜ O ✓V ol(S) φ5/3 ◆ ˜ O ✓V ol(S) φ2 ◆ ˜ O ✓V ol(S) φ ◆ ˜ O ✓V ol(S) √φ ◆
˜ O ✓V ol(S)1+✏ √φ ◆
Cheeger’s inequality barrier
Charles River Workshop on Private Analysis of Social Networks
Approximation guarantee Running time Truncated random walk [Spielman-Teng’04] Truncated random walk [Spielman-Teng’08] PageRank random walk [Andersen-Chung-Lang’06] Evolving Set [Andersen-Peres’08] Evolving Set [Gharan-Trevisan’12]
φ
1 3 log 2 3 n
p φ log
3 2 n
p φ log n p φ log n
r
˜ O ✓V ol(S) φ5/3 ◆ ˜ O ✓V ol(S) φ2 ◆ ˜ O ✓V ol(S) φ ◆ ˜ O ✓V ol(S) √φ ◆
˜ O ✓V ol(S)1+✏ √φ ◆
Running time depends
Cheeger’s inequality barrier
S
φ
Charles River Workshop on Private Analysis of Social Networks
Approximation guarantee Running time Truncated random walk [Spielman-Teng’04] Truncated random walk [Spielman-Teng’08] PageRank random walk [Andersen-Chung-Lang’06] Evolving Set [Andersen-Peres’08] Evolving Set [Gharan-Trevisan’12]
φ
1 3 log 2 3 n
p φ log
3 2 n
p φ log n p φ log n
r
˜ O ✓V ol(S) φ5/3 ◆ ˜ O ✓V ol(S) φ2 ◆ ˜ O ✓V ol(S) φ ◆ ˜ O ✓V ol(S) √φ ◆
˜ O ✓V ol(S)1+✏ √φ ◆
Running time depends
Cheeger’s inequality barrier
S
φ
Charles River Workshop on Private Analysis of Social Networks
v 0.09 0.07 0.08 0.09 0.09 0.06 0.01 0.002 0.03
Charles River Workshop on Private Analysis of Social Networks
v
ppr(v, u) d(u)
0.03 0.035 0.04 0.03 0.0225 0.02 0.005 0.001 0.01
Charles River Workshop on Private Analysis of Social Networks
v 0.03 0.035 0.04 0.03 0.0225 0.02 0.005 0.001 0.01
Charles River Workshop on Private Analysis of Social Networks
Joint work with: Vahab Mirrokni (Google Research NY) Zeyaun Allen Zhu (MIT) ICML 2013
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
S⊆C
Charles River Workshop on Private Analysis of Social Networks
S⊆C
Charles River Workshop on Private Analysis of Social Networks
Approximation guarantee Running time Truncated random walk [Spielman-Teng’04] Truncated random walk [Spielman-Teng’08] PageRank random walk [Andersen-Chung-Lang’06] Evolving Set [Andersen-Peres’08] Evolving Set [Gharan-Trevisan’12]
φ
1 3 log 2 3 n
p φ log
3 2 n
p φ log n p φ log n
r
˜ O ✓V ol(S) φ5/3 ◆ ˜ O ✓V ol(S) φ2 ◆ ˜ O ✓V ol(S) φ ◆ ˜ O ✓V ol(S) √φ ◆
˜ O ✓V ol(S)1+✏ √φ ◆
Running time depends
Cheeger’s inequality barrier
S
φ
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
φ λ1 < C
Charles River Workshop on Private Analysis of Social Networks
˜ O ✓ φ ψ ◆
Charles River Workshop on Private Analysis of Social Networks
V
Charles River Workshop on Private Analysis of Social Networks
V
Charles River Workshop on Private Analysis of Social Networks
V
1 α φ α
Charles River Workshop on Private Analysis of Social Networks
X
u/ ∈S
pr(u) < 2φ α
V
Charles River Workshop on Private Analysis of Social Networks
1 ψ2
Charles River Workshop on Private Analysis of Social Networks
1 ψ2 1 ψ2
d(u) V ol(S)
Charles River Workshop on Private Analysis of Social Networks
1 ψ2
pr(v) ≥ ˜ pr(v) − prl(v)
Charles River Workshop on Private Analysis of Social Networks
1 ψ2
pr(v) ≥ ˜ pr(v) − prl(v)
X
v∈S
prl(v) = X
z / ∈S
ppr(z) ≤ 2 φ ψ2 < O ✓ 1 log n ◆
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Ω ✓ φ ψ ◆
Charles River Workshop on Private Analysis of Social Networks
˜ O ✓ φ ψ ◆
Ω ✓ φ ψ ◆
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks
Charles River Workshop on Private Analysis of Social Networks