Expected Nodes : a quality function for the detection of link - - PowerPoint PPT Presentation

expected nodes a quality function for the detection of
SMART_READER_LITE
LIVE PREVIEW

Expected Nodes : a quality function for the detection of link - - PowerPoint PPT Presentation

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6 Expected Nodes : a quality function for the detection of link communities e Gaumont , Fran cois Queyroi, Cl emence Magnien and No Matthieu Latapy LIP6 - CNRS & UPMC ,


slide-1
SLIDE 1

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Expected Nodes: a quality function for the detection of link communities

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy

LIP6 - CNRS & UPMC , Universit´ e Pierre et Marie Curie – Sorbonne Universit´ es, Paris, France.

CompleNet 2015

25 March 2015

slide-2
SLIDE 2

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Summary

1 Link community 2 Expected Nodes : a new quality function 3 Tests with LF benchmark 4 Conclusion and perspectives

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 2/19

slide-3
SLIDE 3

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Community Detection

Node community Link community

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 3/19

slide-4
SLIDE 4

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Node community

Example in a email dataset. Communities : groups of people. Input : A graph, G = (V , E). Output : A partition P of V .

  • S. Fortunato.

Community detection in graphs.

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 4/19

slide-5
SLIDE 5

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Link community

Example in a email dataset. Communities : threads. Input : A graph, G = (V , E). Output : A partition P of E.

T.S. Evans et R. Lambiotte. Line graphs, link partitions, and overlapping communities. Y.-Y. Ahn, J. P. Bagrow, et S. Lehmann. Link communities reveal multiscale complexity in networks.

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 5/19

slide-6
SLIDE 6

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Expected Nodes : a new quality function

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 6/19

slide-7
SLIDE 7

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Outline of the quality function

Why is the group of blue links relevant ?

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 7/19

slide-8
SLIDE 8

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Outline of the quality function

Why is the group of blue links relevant ? Dense blue links and sparse pink links compare to what could be expected in the configuration model.

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 7/19

slide-9
SLIDE 9

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

The idea behind Expected Nodes

Compare observed nodes to expected one :

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 8/19

slide-10
SLIDE 10

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

The idea behind Expected Nodes

Compare observed nodes to expected one :

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 8/19

slide-11
SLIDE 11

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

The idea behind Expected Nodes

Compare observed nodes to expected one :

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 8/19

slide-12
SLIDE 12

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Internal quality function

L : set of links, V (L) : internal nodes of L. The internal quality of group L is : Qin(L) = E[V (L)] − |V (L)| E[V (L)] E[V (L)] : sum of random variable with hypergeometric distribution.

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 9/19

slide-13
SLIDE 13

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

External quality function

Compare adjacent nodes to expected ones :

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 10/19

slide-14
SLIDE 14

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

External quality function

Compare adjacent nodes to expected ones :

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 10/19

slide-15
SLIDE 15

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Combining both quality functions

|Lout| : set of adjacent links to L. Qext(Lout) computed in a similar way as Qin(L).

Good internal quality Bad external quality Good external quality Bad interal quality

Q∗(L) = 2|L|Qin(L) + |Lout|Qext(Lout) |L| + |Lout|

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 11/19

slide-16
SLIDE 16

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Tests with LF benchmark

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 12/19

slide-17
SLIDE 17

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Test method

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 13/19

slide-18
SLIDE 18

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Ground truth generation

LF generation

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 14/19

slide-19
SLIDE 19

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Results for Evans et al.

T A T B E 2 L C . 5 . 5 5 . 6 . 6 5 . 7 . 7 5 . 8

Ep TA TB LC Evans function

Figure – Evaluation of Ef from Evans et al. on several partitions.

Highlight : Q(TA) < Q(E2) Q(TA) = Q(TB)

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 15/19

slide-20
SLIDE 20

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Results for the Partition density

T A T B E 2 L C . 0 . 1 . 2 . 3 . 4 . 5 . 6

TA TB Ep LC Partition density

Figure – Evaluation of the partition density from Ahn et al. on several partitions.

Highlight :

  • Q(TA) ≤ Q(LC)
  • Q(TA) = Q(TB)

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 16/19

slide-21
SLIDE 21

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Results for Expected Nodes

T A T B E 2 L C . 0 . 2 . 4 . 6 . 8 1 . 0 1 . 2 1 . 4

TA TB Ep LC Expected Nodes

Figure – Evaluation of Expected Nodes on several partitions.

Highlight :

  • Q(TA) > Q(X)
  • Q(TA) = Q(TB)

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 17/19

slide-22
SLIDE 22

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Conclusion and perspectives

To sum up :

  • Consider community of links instead of nodes.
  • Definition of Expected Nodes to evaluate link partitions.
  • On the tests, the ground truth is the best choice only for Expected

Nodes. Perspectives

  • Design an algorithm for maximizing Expected Nodes.
  • More detailed comparisons between quality functions

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 18/19

slide-23
SLIDE 23

Questions ?

slide-24
SLIDE 24

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

LF generation example

Green group : a community in the ground truth

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 20/19

slide-25
SLIDE 25

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Partition Ep

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 21/19

slide-26
SLIDE 26

u n i v e r s i t e p i e r r e e t m a r i e c u r i e - l i p 6

Partition LC

No´ e Gaumont, Fran¸ cois Queyroi, Cl´ emence Magnien and Matthieu Latapy — 25 March 2015 22/19