Multiscale Processing on Networks and Community Mining Part 2 - - - PowerPoint PPT Presentation

multiscale processing on networks and community mining
SMART_READER_LITE
LIVE PREVIEW

Multiscale Processing on Networks and Community Mining Part 2 - - - PowerPoint PPT Presentation

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion Multiscale Processing on Networks and Community Mining Part 2 - Spectral Graph Wavelets and Multiscale Community Detection


slide-1
SLIDE 1

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multiscale Processing on Networks and Community Mining Part 2 - Spectral Graph Wavelets and Multiscale Community Detection

Pierre Borgnat

CR1 CNRS – Laboratoire de Physique, ENS de Lyon, Université de Lyon Équipe SISYPHE : Signaux, Systèmes et Physique

05/2014

  • p. 1
slide-2
SLIDE 2

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Overview of the lecture

  • General objective: revisit the classical question of finding

communities in networks using multiscale processing methods on graphs.

  • The things we will discuss:
  • Recall the notion of community in networks and brief survey
  • f some aspects of community detection
  • Introduce you to the emerging field of graph signal

processing

  • Show a connexion between the two: detection of

communities with graph signal processing

  • Organization:
  • 1. A (short) lecture about communities in networks
  • 2. Signal processing on networks; Spectral graph wavelets
  • 3. Multiscale community mining with wavelets
  • p. 2
slide-3
SLIDE 3

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Graph Wavelets

  • Fourier is a global analysis. Fourier modes (eigenvectors of

the laplacian) are used in classical spectral clustering, but do not enable a jointly local and scale dependent analysis.

  • For that classical signal processing (or harmonic analysis)

teach us that we need wavelets.

  • Wavelets : local functions that act as well as a filter around

a chosen scale. A wavelet: – Translated: – Scaled

by analogy

  • p. 3
slide-4
SLIDE 4

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The classical wavelets

Each wavelet ψs,a is derived by translating and scaling a mother wavelet ψ: ψs,a(x) = 1 s ψ x − a s

  • Equivalently, in the Fourier domain:

ˆ ψs,a(ω) = ∞

−∞

1 s ψ x − a s

  • exp−iωx dx

= exp−iωa ∞

−∞

1 s ψ X s

  • exp−iωX dX

= exp−iωa ∞

−∞

ψ

  • X ′

exp−iωX ′ dX ′ = ˆ δa(ω) ˆ ψ(sω) where δa = δ(x − a) One possible definition: ψs,a(x) = ∞

−∞ ˆ

δa(ω) ˆ ψ(sω) expiωx dω

  • p. 4
slide-5
SLIDE 5

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The classical wavelets

ψs,a(x) = ∞

−∞ ˆ

δa(ω) ˆ ψ(sω) expiωx dω

  • In this definition, ˆ

ψ(sω) acts as a filter bank defined by scaling by a factor s a filter kernel function defined in the Fourier space: ˆ ψ(ω)

  • The filter kernel function ˆ

ψ(ω) is necessarily a bandpass filter with:

  • ˆ

ψ(0) = 0 : the mean of ψ is by definition null

  • lim

ω→+∞

ˆ ψ(ω) = 0 : the norm of ψ is by definition finite

(Note: the actual condition is the admissibility property)

  • p. 5
slide-6
SLIDE 6

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Classical wavelets

by analogy

− − − − − − → Graph wavelets

[Hammond et al. ACHA ’11]

Classical (continuous) world Graph world Real domain x node a Fourier domain ω eigenvalues λi Filter kernel ˆ ψ(ω) g(λi) ⇔ ˆ G Filter bank ˆ ψ(sω) g(sλi) ⇔ ˆ Gs Fourier modes exp−iωx eigenvectors χi Fourier transf. of f ˆ f(ω) = ∞

−∞ f(x) exp−iωx dx

ˆ f = χ⊤ f The wavelet at scale s centered around node a is given by: ψs,a(x) = ∞

−∞

ˆ δa(ω) ˆ ψ(sω) expiωx dω − − → ψs,a = χ ˆ Gs ˆ δa = χ ˆ Gs χ⊤ δa

  • p. 6
slide-7
SLIDE 7

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Examples of graph wavelets

A WAVELET: TRANSLATING: SCALING:

  • p. 7
slide-8
SLIDE 8

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Examples of wavelets: they encode the local topology

ψs=1,a ψs=35,a ψs=25,a ψs=50,a

  • p. 8
slide-9
SLIDE 9

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Complement: the graph scaling functions

  • Consider the following lowpass filter kernel h:

h(ω) = ∞

ω

|g(ω′)|2 ω′ dω′ 1/2 Classically, if g corresponds to a wavelet filter kernel, this equation defines the associated scaling function filter kernel.

  • With the same arguments as previously, we define the

graph scaling function at scale s centered around a as: φs,a = χ ˆ Hs ˆ δa = χ ˆ Hs χ⊤ δa

  • p. 9
slide-10
SLIDE 10

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Examples of scaling functions

φs=1,a φs=35,a φs=25,a φs=50,a

  • p. 10
slide-11
SLIDE 11

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Example of filters

For each graph under study, we automatically find the good filter parameters for g by imposing:

  • The coarsest scale needs to be focused on the eigenvector

χ1 (Fiedler vector).

  • All scales need at least to keep some information from χ1.
  • The finest scale needs to use the information from all

eigenvectors (i.e., Fourier modes).

10 20 2 4 6 8 λ g(s λ) 10 20 2 4 6 8 λ h(s λ)

  • p. 11
slide-12
SLIDE 12

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Example of wavelet filters

  • More precisely, we will use the following kernel:

g(x; α, β, x1, x2) =      x−α

1

xα for x < x1 p(x) for x1 ≤ x ≤ x2 xβ

2 x−β

for x > x2.

  • The parameters will be:

smin = 1 λ2 , x2 = 1 λ2 , smax = 1 λ2

2

, x1 = 1, β = 1/log10 λ3 λ2

  • This leads to:

(α = 2)

1 2 2 4 6 8 λ g(sλ) s=7 s=13 s=25 s=47 2 4 6 8 10 x1 x2 x g(x) α=1 α=2 α=10 α=50

  • p. 12
slide-13
SLIDE 13

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Nota Bene

In the following, we will not actually focus on the Wavelet Transform of a signal. We will rather focus on the wavelets ψi themselves We take advantage of the local topological information at their scale encoded in them.

  • p. 13
slide-14
SLIDE 14

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Purpose of the last part of the lecture

Develop a scale dependent community mining tool using concepts from graph signal processing. Why ? For joint processing of graph signals and networks.

General Ideas

  • Take advantage of local topological information encoded in

Graph Wavelets. Wavelet = ego-centered vision from a node

  • Group together nodes whose local environments are

similar at the description scale

  • This will naturally offer a multiscale vision of communities
  • p. 14
slide-15
SLIDE 15

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Let us recall: objective of community detection

Three examples of community detections:

  • (A) A complex sensor network (non-uniform swiss roll

topology)

  • (B) A contact network in a primary school [Stehle ’11]
  • (C) A hierarchical graph benchmark [Sales-Pardo ’07]

A B C

  • p. 15
slide-16
SLIDE 16

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

  • r multiscale community detection ?
  • p. 16
slide-17
SLIDE 17

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community structure in a graph

Classical community detection algorithm based (for instance)

  • n modularity optimisation only finds one solution:

A B C Where the modularity function reads: Q =

1 2N

  • ij
  • Aij − didj

2N

  • δ(ci, cj)
  • p. 17
slide-18
SLIDE 18

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

A new method for multiscale community detection

[N. Tremblay, P . Borgnat, 2013]

The problem of community mining is considered as a problem

  • f clustering. We then need to decide upon:
  • 1. feature vectors for each node
  • 2. a distance to measure two given vectors’ closeness
  • 3. a clustering algorithm to separate nodes in clusters

The method uses:

  • 1. wavelets (resp. scaling functions) as feature vectors
  • 2. the correlation distance
  • 3. the complete linkage clustering algorithm
  • p. 18
slide-19
SLIDE 19

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

1) Wavelets as features

Each node a has feature vector ψs,a. Globally, one will need Ψs, all wavelets at a given scale s, i.e. Ψs =

  • ψs,1|ψs,2| . . . |ψs,N
  • = χGsχ⊤.

NODE A: NODE B: AT SMALL SCALE: AT LARGE SCALE:

  • p. 19
slide-20
SLIDE 20

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

2) Correlation distances

Ds(a, b) = 1 − ψ⊤

s,aψs,b

||ψs,a||2||ψs,b||2 . NODE A: NODE B: CORR. COEF.: RESULT:

  • 0.50

Far appart in the dendrogram 0.97 Close to each other in the dendrogram

  • p. 20
slide-21
SLIDE 21

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

3) Complete linkage clustering and dendrogram

  • It is a bottom to top hierarchical algorithm: it starts with as

many clusters as nodes and works its way up to fewer clusters (by linking subclusters together) until it reaches

  • ne global cluster.
  • To compute the distance between two subclusters under

examination : all possible pairs of nodes, taking one from each cluster, are considered. The maximum possible node-to-node distance is declared to be the cluster-to-cluster closeness.

  • Outputs a dendrogram (from Greek dendron "tree" and

gramma "drawing").

  • p. 21
slide-22
SLIDE 22

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Example of a dendrogram at a given scale s

0.5 1 1.5 2 correlation distance The big question: where should we cut the dendrogram?

  • p. 22
slide-23
SLIDE 23

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

A toy graph for introducing the method

smallest scale (16 com.): small scale (8 com.): medium scale (4 com.): large scale (2 com.):

  • p. 23
slide-24
SLIDE 24

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledge

Let us cheat by using prior knowledge on the number of communities we are looking for. If we cut each dendrogram in two clusters

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets as features Conclusion: the dendrograms at different scales contain the community structure at various scales.

  • p. 24
slide-25
SLIDE 25

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledge

Let us cheat by using prior knowledge on the number of communities we are looking for. If we cut each dendrogram in four clusters

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets as features Conclusion: the dendrograms at different scales contain the community structure at various scales.

  • p. 24
slide-26
SLIDE 26

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledge

Let us cheat by using prior knowledge on the number of communities we are looking for. If we cut each dendrogram in eight clusters

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets as features Conclusion: the dendrograms at different scales contain the community structure at various scales.

  • p. 24
slide-27
SLIDE 27

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledge

Let us cheat by using prior knowledge on the number of communities we are looking for. If we cut each dendrogram in sixteen clusters

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets as features Conclusion: the dendrograms at different scales contain the community structure at various scales.

  • p. 24
slide-28
SLIDE 28

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledge

Let us cheat by using prior knowledge on the number of communities we are looking for. The four levels of communities.

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets as features Conclusion: the dendrograms at different scales contain the community structure at various scales.

  • p. 24
slide-29
SLIDE 29

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Recall: The Adjusted Rand Index

Let:

  • C and C′ be two partitions we want to compare.
  • a be the # of pairs of nodes that are in the same

community in C and in the same community in C′

  • b be the # of pairs of nodes that are in different

communities in C and in different communities in C′

  • c be the # of pairs of nodes that are in the same

community in C and in different communities in C′

  • d be the # of pairs of nodes that are in different

communities in C and in the same community in C′ a + b is the number of “agreements“ between C and C′. c + d is the number of “disagreements“ between C and C′.

  • p. 25
slide-30
SLIDE 30

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The Adjusted Rand Index

The Rand index, R, is: R = a + b a + b + c + d = a + b n

2

  • The Adjusted Rand index AR is the corrected-for-chance

version of the Rand index: AR = R − ExpectedIndex MaxIndex − ExpectedIndex

  • p. 26
slide-31
SLIDE 31

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with classical modularity

Recall that the classical modularity matrix reads: B(A) =

1 2m(A + dd⊤ 2m )

where d is the strength vector and 2m = d(i) Classical modularity is Q = Tr(S⊤BS)

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Solution not good at large scale.

  • p. 27
slide-32
SLIDE 32

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with filtered modularity

We define the filtered adjacency matrices at scale s:

  • recall that A = D

1 2 χ(I − Λ)χ⊤D 1 2

  • Ag

s = A + D

1 2 χ ˆ

Gsχ⊤D− 1

2 A

We define the filtered modularity matrices at scale s:

Bg

s = B(Ag s)

  • p. 28
slide-33
SLIDE 33

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity

Filtered Modu Opt. Classical Modu Opt.

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

  • p. 29
slide-34
SLIDE 34

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularity

Ag

s = A + D

1 2 χ ˆ

Gsχ⊤D− 1

2 A

Consider d the vector of strengths of A and 2m the sum of the

  • strengths. The classical modularity reads:

B = A 2m − dd⊤ (2m)2 Consider d′ the vector of strengths of Ag

s and 2m′ the sum of

the strengths. We can show that: dd⊤ (2m)2 = d′d′⊤ (2m′)2 Moreover, if gs(1) = 0 (which is the case), the filtered modularity reads: Bg

s = A + D

1 2 χ ˆ

Gsχ⊤D− 1

2 A

2m − dd⊤ (2m)2

  • p. 30
slide-35
SLIDE 35

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularity

Bg

s = A + D

1 2 χ ˆ

Gsχ⊤D− 1

2 A

2m − dd⊤ (2m)2 Recall that modularity compares the actual normalised weight

Aij 2m to the expected weight if the graph was a random Chung-Lu

graph:

didj (2m)2 .

The filtered modularity does not change the expected weight but rather changes the actual normalised weigth: it adds or retrieve value to Aij

2m . At small scale, it will increase the weights

important for small scale structures and decrease the weights important for superstructures.

  • p. 31
slide-36
SLIDE 36

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularity

It can be written: Bg

s =

1 2m

N

  • i=2

(1 + gs(i))(1 − λi)D

1 2 χiχ⊤

i D

1 2

To compare to Schaub-Delvenne’s filtered modularity: Bt = 1 2m

N

  • i=2

(1 − λi)tD

1 2 χiχ⊤

i D

1 2

And Arenas’ version: (here for regular networks) Bα = 1 2m

N

  • i=2

(1 − λi α )D

1 2 χiχ⊤

i D

1 2

  • p. 32
slide-37
SLIDE 37

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

2 10 40 160 0.5 1 # of clusters Q/max(Q) 2 10 40 160 0.5 1 # of clusters Qh

s/max(Qh s)

  • p. 33
slide-38
SLIDE 38

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

1 2 4 0.5 1 s nmi

fine interm. coarse

  • p. 34
slide-39
SLIDE 39

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

0.4 0.8 1.7 3.6 1 10 100 s <# of com.> 0.4 0.8 1.7 3.6 10 100 1000 s <Com. size>

  • p. 35
slide-40
SLIDE 40

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Intermediate bilan

  • As expected, the method works week with filtered

modularity

  • Note: it works similarly with scaling functions
  • Fundamental reason: it is related to Arenas or

Schaub-Delvenne modified modularity to take into account a scale

  • However: the dendrogram has already in itself the good

solutions, with no need of the step of (filtered) modularity

  • ptimization.
  • For that: look at the gaps !
  • p. 36
slide-41
SLIDE 41

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

To avoid the cumbersome multiscale modularity optimization, we can simply cut the dendrogram at its maximal gap. At small scale:

0.5 1 1.5 2 correlation distance

At large scale:

0.5 1 1.5 2 correlation distance

  • p. 37
slide-42
SLIDE 42

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

To avoid the cumbersome multiscale modularity optimization, we can simply cut the dendrogram at its maximal gap. At small scale:

0.5 1 1.5 2 correlation distance

At large scale:

0.5 1 1.5 2 correlation distance

  • p. 37
slide-43
SLIDE 43

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using wavelets

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

Using scaling functions

  • p. 38
slide-44
SLIDE 44

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

scale number nodes 10 20 30 40 20 40 60 80 100 120

Using wavelets

scale number nodes 10 20 30 40 50 20 40 60 80 100 120

Using scaling functions

  • p. 39
slide-45
SLIDE 45

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap: non robust to outliers

0.5 1 1.5 correlation distance nodes

  • p. 40
slide-46
SLIDE 46

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

0.2 0.4 0.6 0.8 1 0.5 1 correlation distance Γa

Γ = 1 Nmax(corr. dist.)

  • a∈V

Γa At small scale

0.2 0.4 0.6 0.8 1 0.5 1 correlation distance Γ 0.5 1 correlation distance nodes

  • p. 41
slide-47
SLIDE 47

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

0.2 0.4 0.6 0.8 1 0.5 1 correlation distance Γa

Γ = 1 Nmax(corr. dist.)

  • a∈V

Γa At larger scale

0.5 1 1.5 0.5 1 correlation distance Γ 0.5 1 1.5 correlation distance nodes

  • p. 42
slide-48
SLIDE 48

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

For the previous graph:

0.5 1 0.5 1 correlation distance Γ

  • p. 43
slide-49
SLIDE 49

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

  • p. 44
slide-50
SLIDE 50

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

  • p. 44
slide-51
SLIDE 51

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient 10 20 30 40 50 0.2 0.4 0.6 0.8 1 scale number AR coefficient

  • p. 44
slide-52
SLIDE 52

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community detection on a simple network

scale number nodes 10 20 30 20 40 60 80 100 120

Another toy graph Using wavelets

  • p. 45
slide-53
SLIDE 53

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community detection on more elaborate networks

  • p. 46
slide-54
SLIDE 54

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The Sales-Pardo benchmark

  • Three community structures nested in one another
  • Parameters:
  • sizes of the communities (N = 640)
  • ρ tunes how well separated the different scales are
  • ¯

k is the average degree; the sparser is the graph, the harder it is to recover the communities.

  • p. 47
slide-55
SLIDE 55

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

10 15.8 25.1 39.8 0.5 1 scale s

  • Adj. Rand index

Large Scale Medium Scale Small Scale

2 3 1

  • p. 48
slide-56
SLIDE 56

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

11 13 15 17 0.9 0.95 1 <L. sc. Recall>

k

  • Gr. wav. correlation

Schaub Arenas

11 13 15 17 0.97 0.98 0.99 1 <M. sc. Recall>

k

  • Gr. wav. correlation

Schaub Arenas

11 13 15 17 0.92 0.94 0.96 0.98 1 <S. sc. Recall>

k

  • Gr. wav. correlation

Schaub Arenas

  • p. 49
slide-57
SLIDE 57

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The case of larger networks

  • Limit of the method: computation of the N × N matrix of the

wavelets Ψs.

  • Improvement: use of random features.
  • Let r ∈ RN be a random vector on the nodes of the graph,

composed of N independent normal random variables of zero mean and finite variance σ2.

  • Define the feature fs,a ∈ R at scale s associated to node a

as fs,a = ψ⊤

s,ar = N

  • k=1

ψs,a(k)r(k).

  • p. 50
slide-58
SLIDE 58

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The case of larger networks

  • Let us define the correlation between features

Cor(fs,a, fs,b)= E((fs,a − E(fs,a))(fs,b − E(fs,b)))

  • Var(fs,a)Var(fs,b)

.

  • It is easy to show that:

Cor(fs,a, fs,b) = ψ⊤

s,aψs,b

||ψs,a||2||ψs,b||2 .

  • Therefore, the sample correlation estimator ˆ

Cab,η satisfies: lim

η→+∞

ˆ Cab,η = ψ⊤

s,aψs,b

||ψs,a||2||ψs,b||2 = 1 − Ds(a, b).

  • This leads to a faster algorithm.
  • p. 51
slide-59
SLIDE 59

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

  • As a function of η, the number of random vectors used

20 40 60 80 100 0.5 1 η <ratios> LS Recall MS Recall SS Recall 20 40 60 80 100 3.2 3.4 3.6 3.8 4 4.2 η <comp. time> (sec)

  • p. 52
slide-60
SLIDE 60

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Stability of the communities

  • Not all partitions are relevant: only those stable enough

convey information about the network

  • Lambiotte’s approach to stability:

Create B resampled graphs by randomly adding ±p% (typically p = 10) to the weight of each link and computing the corresponding B sets of partitions {Pb

s }b∈[1,B],s∈S.

Then, stability: γr(s) = 2 B(B − 1)

  • (b,c)∈[1,B]2,b=c

ari(Pb

s , Pc s ),

(1)

  • New approach: we have a stochastic algorithm.

Consider J sets of η random signals and compute the associated sets of partitions {Pj

s}j∈[1,J],s∈S. Let stability be:

γa(s) = 2 J(J − 1)

  • (i,j)∈[1,J]2,i=j

ari(Pi

s, Pj s).

(2)

  • p. 53
slide-61
SLIDE 61

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Results with stabilities on the Sales-Pardo benchmark

10 15.8 25.1 39.8 0.5 1 scale s

  • Adj. Rand index

Large Scale Medium Scale Small Scale

2 3 1

10 15.8 25.1 39.8 0.5 1 scale s Instabilities 1−γ

r

1−γ

a

1 2 3

  • p. 54
slide-62
SLIDE 62

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

In addition: statistical test of relevance of the communities

  • It is possible to design a data-driven test on γa (not

explained here).

  • Result: threshold for 1 − γa above which the partition in

communities is irrelevant. Sales-Pardo graph Chung-Lu graph

10 15.8 25.1 39.8 0.5 1 scale s 1−γa

3 2 1

1.9 2.2 2.5 0.5 1 scale s 1−γa

  • p. 55
slide-63
SLIDE 63

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Comparison on larger Sales-Pardo graphs

N = 6400 nodes Schaub-Delvenne

0.1 1 0.5 1 Markov time

  • Adj. Rand index

LS MS SS 0.1 1 0.5 1 Markov time

  • Var. Information

Wavelets

6.3 10 15.8 25.1 0.5 1 scale s

  • Adj. Rand index

LS MS SS 6.3 10 15.8 25.1 0.5 1 scale s 1−γa

  • p. 56
slide-64
SLIDE 64

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

More elaborate graphs

  • p. 57
slide-65
SLIDE 65

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Sensor network on the swiss roll manifold

  • Three scale ranges of relevant community structure

10000 100000 1e+06 1e+07 0.1 0.2 0.3 0.4 scale s 1−γa

  • p. 58
slide-66
SLIDE 66

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

The dynamic social network of a primary school

Collaboration with A. Barrat (CPT Marseille)

  • p. 59
slide-67
SLIDE 67

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multi-scale Communities in Primary School

Collaboration with A. Barrat (CPT Marseille) 20 28 37 51 74 103 0.5 1 scale s 1−γa

scale s 20 28 37 51 74 103 1st a 1st b 2nd a 2nd b 3rd a 3rd b 4th a 4th b 5th a 5th b

  • p. 60
slide-68
SLIDE 68

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Multi-scale Communities in Primary School

Collaboration with A. Barrat (CPT Marseille)

  • p. 61
slide-69
SLIDE 69

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Intra-chromosomic interaction data

Collaboration with R. Boulos, B. Audit (ENS Lyon)

10 100 1000

  • p. 62
slide-70
SLIDE 70

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Evolution of the correlation matrix of the wavelets with respect to scale

Collaboration with R. Boulos, B. Audit (ENS Lyon)

  • p. 63
slide-71
SLIDE 71

Introduction Spectral Graph Wavelets Multiscale community mining Developments; Stability of communities Conclusion

Conclusion

  • Wavelet ψs,a gives an ”egocentered view“ of the network

seen from node a at scale s

  • Correlation between these different views gives us a

distance between nodes at scale s

  • This enables multi-scale clustering of nodes in

communities

  • I hope that you were attracted to

the emerging field of graph signal processing for networks. http://perso.ens-lyon.fr/pierre.borgnat Acknowledgements: thanks to Nicolas Tremblay for borrowing many of his figures or slides.

  • p. 64