Triangle counting in dynamic graph streams
Konstantin Kutzkov and Rasmus Pagh Work supported by:
1
Triangle counting in dynamic graph streams Konstantin Kutzkov and - - PowerPoint PPT Presentation
Triangle counting in dynamic graph streams Konstantin Kutzkov and Rasmus Pagh Work supported by: 1 Agenda Problem description and known results. Sampling-based approaches: - 2-path sampling - Edge sampling (Doulion, colorful sampling)
Konstantin Kutzkov and Rasmus Pagh Work supported by:
1
2
what is the number of triangles T3?
time ~ m2ω/(ω+1) = O(m1.41) where ω ≤ 2.3727 is the matrix multiplication exponent.
3
m edges, n nodes
what is the number of triangles T3?
time ~ m2ω/(ω+1) = O(m1.41) where ω ≤ 2.3727 is the matrix multiplication exponent.
running time O(m1.5) are fastest.
3
m edges, n nodes
4 Le Gall
next speaker!
too large to be loaded in memory.
vertex arrive in succession (each one twice).
5
give an O(1)-approximation of T3 with prob. 2/3.
6
give an O(1)-approximation of T3 with prob. 2/3.
6
give an O(1)-approximation of T3 with prob. 2/3.
http://arxiv.org/pdf/1404.4696v3.pdf
6
give an O(1)-approximation of T3 with prob. 2/3.
http://arxiv.org/pdf/1404.4696v3.pdf
6
give an O(1)-approximation of T3 with prob. 2/3.
http://arxiv.org/pdf/1404.4696v3.pdf
6
Both results: Very high update time.
be completed to a triangle later in the stream.
7
[Buriol et al. ’06]
T3 triangles, P2 2-paths
be completed to a triangle later in the stream.
incidence list streams: can compute P2 exactly.
7
[Buriol et al. ’06]
T3 = P2 𝛽(G)/3 T3 triangles, P2 2-paths
8
First sampled 2-path is not part of a triangle
9
Second sampled 2-path is part of a triangle
9
Second sampled 2-path is part of a triangle
P2 = 14 𝛽(G) ≈ 1/2 T3 = P2 𝛽(G)/3 ≈ 7/3
allow the deletion of edges.
assume insert-only streams.
arbitrary order and can be deleted.
10
probability p; multiply number of triangles by p-3.
11
[Tsourakakis et al. ’09, P.-Tsourakakis ’12]
probability p; multiply number of triangles by p-3.
endpoints have the same color; multiply number
11
[Tsourakakis et al. ’09, P.-Tsourakakis ’12]
probability p; multiply number of triangles by p-3.
endpoints have the same color; multiply number
have also sampled any triangle it is part of.
11
[Tsourakakis et al. ’09, P.-Tsourakakis ’12]
12
12
No triangle sampled
12
No triangle sampled
Estimate T3 ≈ 0
and check whether it is part of a triangle.
estimate transitivity coefficient of G.
to estimate the number of 2-paths in G.
13
and check whether it is part of a triangle.
estimate transitivity coefficient of G.
to estimate the number of 2-paths in G.
13
Central technical contribution: Show that correlations among sampled 2-paths do not matter (too much) so we do get an estimate of 𝛽(G).
14
14
15
15
no isolated edge it is possible to find b=Ω(m) 2- paths that overlap (pairwise) in at most 1 vertex.
Showing conjecture will improve our space.
16
17