Temporal Graph Clustering Fabrice Rossi, Romain Guigours et Marc - PowerPoint PPT Presentation

Temporal Graph Clustering Fabrice Rossi, Romain Guigourès et Marc Boullé SAMM (Université Paris 1) et Orange Labs (Lannion) October 20, 2015

Temporal Graphs A variable notion... ◮ a time series of graphs? (e.g., one per day) ◮ transient nodes with permanent connections ◮ edges with duration ◮ etc.

Temporal Graphs A variable notion... ◮ a time series of graphs? (e.g., one per day) ◮ transient nodes with permanent connections ◮ edges with duration ◮ etc. with a unifying model (Casteigts et al. [2012]) ◮ a set of vertices V and a set of edges E ◮ a time domain T ◮ a presence function ρ from E × T to { 0 , 1 } ◮ a latency function ζ from E × T to R +

Temporal Interaction Data Time stamped interactions between actors ◮ X sends a SMS to Y at time t ◮ X sends an email to Y at time t ◮ X likes/answers to Y ’s post at time t ◮ and also: citations (patents, articles), web links, tweets, moving objects, etc. Temporal Interaction Data ◮ a set of sources S (emitters) ◮ a set of destinations D (receivers) ◮ a temporal interaction data set E = ( s n , d n , t n ) 1 ≤ n ≤ m with s n ∈ S , d n ∈ D and t n ∈ R (time stamps)

Time-Varying Graph Graph point of view ◮ interactions as edges in a directed graph G = ( V , E ′ ) ◮ vertices V = S ∪ D , edges E ′ ≃ E E ′ = { ( s , d ) ∈ V 2 | ∃ t ( s , d , t ) ∈ E } ◮ presence function ρ from V 2 × R to { 0 , 1 } : ρ ( s , d , t ) = 1 if and only if ( s , d , t ) ∈ E Complex time-varying graphs ◮ directed graph (possibly bipartite) ◮ multiple edges: s can send several messages to d (at different times) ◮ no “snapshot” assumption: time stamps are continuous

Example S = { 1 , 2 , 3 } D = { a , b , c , d , e } source dest. time 2 4 10 a 20 2 5 d 1 2 3 2 7 d 4 1 8 5 7 b 14 1 10 e 2 14 8 a c e b d b 3 20 a

Outline Introduction Static Graph Analysis Temporal Extensions Proposed Model Experiments

Static Graph Analysis Role based analysis ◮ Groups of “equivalent” actors (roles) ◮ Structure based equivalence: interacting in the same way with other (groups of) actors ◮ Strongly related to graph clustering

Static Graph Analysis Role based analysis ◮ Groups of “equivalent” actors (roles) ◮ Structure based equivalence: interacting in the same way with other (groups of) actors ◮ Strongly related to graph clustering Notable patterns ◮ community : internal connections and no external ones ◮ bipartite : external connections and no internal ones ◮ hub : very high degree vertex

Block Models Principles ◮ Each actor (vertex) has a hidden role chosen among a finite set of possibilities (classes) ◮ The connectivity is explained only by the hidden roles Stochastic Block Model ◮ K classes (roles) ◮ Z i ∈ { 1 , . . . , K } role of vertex/actor i ◮ conditional independence of connections i � = j P ( X ij | Z i , Z j ) where X ij = 1 when i and j are P ( X | Z ) = � connected ◮ P ( X ij = 1 | Z i = k , Z j = l ) = γ kl connection probability between roles k and l ◮ given X , we infer Z (clustering) and γ

Example

Example 1 8 1 1 3 3 1 3 1 1 3 3 3 1 1 1 2 3 1 1 1 5 1 8 2 2 4 5 5 5 8 5 8 10 5 8 5 5 8 7 5 7 7 11 12 7 11 6 7 11 7 8 11 8 8 11 8 11 8 11 8 8 8 9 8 11 9 8 9 9 8 9 9 9 9 9 9

Temporal Models Snapshot Assumption ◮ Time series of static graphs: G 1 , G 2 , . . . , G T ◮ Each graph covers a time interval ◮ Nothing happens (on a temporal point of view) during a time interval A Naive Analysis... ◮ Analyze each graph G k independently ◮ Hope for the results to show some consistency

Temporal Models Snapshot Assumption ◮ Time series of static graphs: G 1 , G 2 , . . . , G T ◮ Each graph covers a time interval ◮ Nothing happens (on a temporal point of view) during a time interval A Naive Analysis... ◮ Analyze each graph G k independently ◮ Hope for the results to show some consistency Fails 1. Fitting a model is a complex combinatorial optimization problem: results are unstable 2. Intrinsic redundancy: what is evolving?

What is Evolving? Evolving clusters, fixed patterns Day 1 Day 2

What is Evolving? Fixed clustering, evolving patterns Day 1 Day 2 Community bipartite

Possible solutions Soft Constraints ◮ Clusters (roles) at time t + 1 are influenced by clusters at time t : Markov chain models for instance ◮ Constrained evolution of connection probabilities (e.g. friendship increases with the number of encounters) Hard Constraints ◮ Fixed patterns: modularity ◮ Fixed clustering

Possible solutions Soft Constraints ◮ Clusters (roles) at time t + 1 are influenced by clusters at time t : Markov chain models for instance ◮ Constrained evolution of connection probabilities (e.g. friendship increases with the number of encounters) Hard Constraints ◮ Fixed patterns: modularity ◮ Fixed clustering Lifting the Snapshot Constraint ◮ Continuous time models ◮ Change detection point of view: find intervals on which the connectivity pattern is stable

Temporal Block Models Main principle ◮ S : source vertices, D : destination vertices ◮ k S source roles, k D destination roles and k T time intervals ◮ µ ijl is the number of interactions between sources with role i and destinations with role j that take place during the time interval l ◮ given the roles and the time intervals, the µ ijl are independent Non parametric approach ◮ we do not use a parametric distribution for µ ijl ◮ µ ijl becomes a parameter in (discrete) generative model ◮ implies a rank based representation of the time stamps

A Generative Model for Temporal Interaction Data Parameters ◮ three partitions C S , C D and C T ◮ an edge/interaction count 3D table µ : µ ijl is the number of interactions between sources in c S i and destinations in c D j that take place during c T l ◮ out-degrees δ S of sources and in-degrees δ D of destinations ◮ consistency constraints Over parametrized ◮ allows switching from a clustering point of view to a numerical one ◮ ease the design of the generative model ◮ ease the design of a prior distribution

An example ◮ S = { 1 , . . . , 6 } , D = { a , b , . . . , h } . ◮ C S = {{ 1 , 2 , 3 } , { 4 , 5 } , { 6 }} , C D = {{ a , b , c , d , e } , { f , g , h }} ◮ C T = {{ 1 , . . . , 12 } , { 13 , . . . , 33 } , { 34 , . . . , 50 }} ◮ µ c D c D c D c D c D c D 1 2 1 2 1 2 c S 5 1 c S 2 2 c S 0 0 1 1 1 c S 2 0 c S 2 5 c S 1 0 2 2 2 4 0 5 5 1 15 c S c S c S 3 3 3 c T c T c T 1 2 3 ◮ degrees 1 2 3 4 5 6 s d a b c d e f g h δ S 3 6 1 2 8 30 δ D 3 6 2 6 5 13 8 7 s d

Generation process Principles ◮ hierarchical model ◮ independence inside each level ◮ uniform distribution for each independent part The distribution Generating E = ( s n , d n , t n ) 1 ≤ n ≤ ν from a parameter list (with ijl µ ijl ) ν = � 1. assign each ( s n , d n , t n ) to a tri-cluster c S i × c S j × c S l while fulfilling µ constraints 2. independently on each variable ( S , D and T ), assign s n , d n and t n based on the tri-cluster constraints, on δ D and on δ S

A MAP approach Generative model 101 ◮ chose probability distribution over set of objects, with a parameter “vector” M ◮ quality measure for M given an object E , the likelihood L ( M ) = P ( E |M )

A MAP approach Generative model 101 ◮ chose probability distribution over set of objects, with a parameter “vector” M ◮ quality measure for M given an object E , the likelihood L ( M ) = P ( E |M ) Maximum A Posteriori ◮ P ( M| E ) = P ( E |M ) P ( M ) P ( E ) ◮ we use a MAP (maximum a posteriori) approach M ∗ = arg max M P ( E |M ) P ( M ) ◮ M can include what would be meta-parameters in other approaches (the number of clusters, for instance) ◮ strongly related to regularization approaches

MAP implementation Difficult Combinatorial Optimization Problem ◮ large parameter space ◮ discrete and complex criterion Simple Heuristic ◮ greedy block merging ◮ starts with the most refined triclustering ◮ choose the best merge at each step ◮ specific data structures: O ( m ) operations for evaluating a parameter list and O ( m √ m log m ) for the full merging operation Extensions ◮ local improvements (vertex swapping for instance) ◮ greedy merging starting from semi-random partitions

Experiments Synthetic Data ◮ block structure [ 0 , 20 [ [ 20 , 30 [ [ 30 , 60 [ [ 60 , 100 ] ◮ cluster sizes cluster 1 2 3 4 size 5 5 10 20 ◮ edges are built according to this model, with 30 % of random rewiring ◮ results as a function of m , the number of edges

Results 1. With the data just described

Results 1. With the data just described 2. When the temporal structured is removed

Temporal Graph Clustering Fabrice Rossi, Romain Guigours et Marc - PowerPoint PPT Presentation

Temporal Graph Clustering Fabrice Rossi, Romain Guigours et Marc Boull SAMM (Universit Paris 1) et Orange Labs (Lannion) October 20, 2015 Temporal Graphs A variable notion... a time series of graphs? (e.g., one per day) transient

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Graph Clustering Why graph clustering is useful? Distance matrices are graphs as useful as

PAC-Bayesian Analysis of Co-clustering, Graph Clustering and Pairwise Clustering Yevgeny Seldin

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

Spatio-Temporal Statistics with R Chapter Two: Exploring Spatio-Temporal Data Spatio-Temporal

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

CSCE 478/878 Lecture 8: Stephen Scott Clustering Introduction Outline Clustering Stephen

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Lecture 23: Spectral clustering Hierarchical clustering What is a good clustering?

Income Inequality in Cte dIvoire from 1985 to 2014 Lo Czajka Universit Catholique de

DeeMe Search for Muon-Electron Conversion in Nuclear Field

PVMD Delft University of Technology Learning objective 1. Maximum Power Point Tracking 1.

Informatics 2D Reasoning and Agents Semester 2, 20192020 Alex Lascarides

Travel Sentiment Study Wave 12 JUNE 2, 2020 COVID-19 TRAVEL SENTIMENT STUDY WAVE 12 Fielded

Iterators An iterator permits you to examine the elements of a data structure one at a time.

NP and NP Completeness Lecture 23 April 20, 2017 Chandra Chekuri (UIUC) CS374 1 Spring 2017

http://cs246.stanford.edu Goal: Given a large number (N in the millions or billions) of text

Sambuz

Useful Links

Newsletter

Mail Us