Community detection in complex networks Vinh Loc DAO Summary 1 - - PowerPoint PPT Presentation

community detection in complex networks
SMART_READER_LITE
LIVE PREVIEW

Community detection in complex networks Vinh Loc DAO Summary 1 - - PowerPoint PPT Presentation

Community detection in complex networks Vinh Loc DAO Summary 1 Introduction 2 Datasets and Benchmarks 3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives 2/24 04/05/2016 Vinh Loc DAO Community


slide-1
SLIDE 1

Community detection in complex networks

Vinh Loc DAO

slide-2
SLIDE 2

Summary

1 Introduction 2 Datasets and Benchmarks 3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives

2/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-3
SLIDE 3

Summary

1 Introduction

What is a network or a graph ? Some notions Structure properties of real networks

2 Datasets and Benchmarks 3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives

3/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-4
SLIDE 4

What is a network or a graph ?

Example : Internet, transport network, power grid, food web, social network

Figure – A very simple network G(V , E) with |V | Vertices and |E| Edges

Node : Entity in real life Edge : Relation between two entities to which it connects A natural language to describe complex systems.

4/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-5
SLIDE 5

A local mesh network - very small network

Node : Computer Edge : Connection between two computer

5/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-6
SLIDE 6

Student network in a university - relatively small network

Node : Student Edge : Relation (Could be any kind of relation)

6/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-7
SLIDE 7

The French highway network - large network

Node : City Edge : Highway

7/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-8
SLIDE 8

The Internet - international network

8/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-9
SLIDE 9

Some notions

Complex network : Graph with non-trivial topology features Network analysis : Studies of graph to extract non-trivial features Community detection algorithm : Divide nodes into groups called communities whose members are connected densely.

Figure – Uncover graph modules without specifying clusters’ size

9/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-10
SLIDE 10

Structure properties of real networks

  • Graph representing real systems are normaly neither regular nor random.
  • Degree (nb of connections of a node) distribution often follows a power law,

as connections often follow preferential patterns.

  • Nodes are often found to cluster into high density groups.

Figure – Regular lattice graph Figure – Graph with modular structure

10/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-11
SLIDE 11

Why community detection ?

  • Comprehend network global organization
  • Reveal modular structures
  • Reveal hidden properties between nodes
  • Understand information diffusion process throughout network

APPLICATIONS :

  • Detect web clients with similar interests
  • Prevent epidemic transmission
  • Managing collaboration network
  • etc,
  • You name it

11/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-12
SLIDE 12

Summary

1 Introduction 2 Datasets and Benchmarks

Datasets - Real networks Datasets - Artificial networks

3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives

12/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-13
SLIDE 13

Datasets - Real networks

  • Social network of friendships between 34 members of a karate club at a US

university in the 1970s. (left)

  • An identity graph with 25 vertices and 31 edges. An identity graph has a

single graph auto-morphism, the trivial one. (right)

Figure – Zachary karate network Figure – Walther network

13/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-14
SLIDE 14

Datasets - Artificial networks

Parameters : Graph size, node distribution, link distribution, density distribution, etc.

Figure – GN benchmark network Figure – LFR benchmark network

14/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-15
SLIDE 15

Summary

1 Introduction 2 Datasets and Benchmarks 3 Community detection in network

Dense structure in modular network From dense structure to community

4 Evaluating partition quality 5 Objectives and Perspectives

15/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-16
SLIDE 16

An example of graph partitioning by K-means

Apply K-means graph partitioning on karate club network (K = 2)

Figure – The Zachary karate network Figure – K-means partition on the Zachary karate club

  • Good solution ?
  • How do we know the club has 2 communities ?
  • Wait, tell me again what is a community ?

16/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-17
SLIDE 17

Did we say a ”community” ? Isn’t it a ”cluster” ?

  • What is a community ? - Answer : ”I know it when I see it”
  • No universal accepted definition.
  • More edges inside the community than edges linking its vertices with the rest
  • f the network.
  • Many detection methods : overlapping/non-overlapping, fast/slow,

single-scale/multi-scale.

Figure – A graph division Figure – The karate club is separated due to a conflict coach/president

17/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-18
SLIDE 18

Summary

1 Introduction 2 Datasets and Benchmarks 3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives

18/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-19
SLIDE 19

Partition quality

  • What is a good partition of a network into modules ?
  • Quality function : assigns score to each partition of a graph
  • The most popular quality function is modularity

Q =

1 2|V |Σij(Aij − Pij)δ(Ci, Cj)

Aij : Adjacency matrix, Pij : Expected connection adjacency matrix Q = Fraction of edges within communities - expected fraction of such edges in a random Modularity favors inter community links.

19/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-20
SLIDE 20

Community detection on the karate club

Edge betweenness O(|V ||E|2) Fast greedy O(|V ||E|log(|V |)) Label propagation O(|V | + |E|) Louvain method O(|V |log(|V |) Leading eigenvector O(|V |2 + |E|) Modular optimization (NP-complete) Infomap O (|V |(|V | + |E|)) Walktrap O(|E||V |2)

20/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-21
SLIDE 21

Community detection on the Walther network

Many possible meaning divisions on a less modular network.

21/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-22
SLIDE 22

Quality is relative ? Goodness is subjective

  • Community characteristics :

Community density Community connectiveness Robustness to perturbation, etc Question : How to choose appropriate method to satisfy certain characteristics and utilizing as much as possible available information ? Ex : Which method to chose to :

  • Divide students into the most cohesive groups.
  • Establish geographic sites to minimize remote works of collaborators.
  • Compromise between community density and calculation time.
  • Maximize range of ages in a dancing community.

22/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-23
SLIDE 23

Summary

1 Introduction 2 Datasets and Benchmarks 3 Community detection in network 4 Evaluating partition quality 5 Objectives and Perspectives

23/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks

slide-24
SLIDE 24

Objectives and Perspectives

Guide to choose best methods base on expected quality indicators, graph characteristics and available resources Create generative model to summarize community characteristics Create a benchmark base on generative model Construct criteria for evaluating partition quality base on end user point

  • f view

Propose methods to improve detection quality Tools :

  • R for network analyzing, data analyzing, visualization
  • Gephi : Visualization

24/24 04/05/2016 Vinh Loc DAO Community detection in Complex Networks