bayesian methods for graph clustering
play

Bayesian Methods for Graph Clustering P. Latouche, E. Birmel e - PowerPoint PPT Presentation

Bayesian Methods for Graph Clustering P. Latouche, E. Birmel e Laboratoire Statistique et G enome (UMR CNRS 8071, INRA 1152) Journ ees MAS, August 2008 P. Latouche (Stat. & G enome) Bayesian Methods for Graph Clustering


  1. Bayesian Methods for Graph Clustering P. Latouche, E. Birmel´ e Laboratoire ”Statistique et G´ enome” (UMR CNRS 8071, INRA 1152) Journ´ ees MAS, August 2008 P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 1 / 22

  2. Outline Introduction 1 Real networks Random graph models The MixNet model Maximum likelihood estimation Bayesian View of MixNet 2 Bayesian probabilistic model Variational inference Model selection Applications 3 Affiliation models Metabolic network of E. coli P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 2 / 22

  3. Outline Introduction 1 Real networks Random graph models The MixNet model Maximum likelihood estimation Bayesian View of MixNet 2 Bayesian probabilistic model Variational inference Model selection Applications 3 Affiliation models Metabolic network of E. coli P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 3 / 22

  4. Real networks Many scientific fields : World Wide Web, Biology, sociology, physics. Nature of data under study : interactions between n objects, O ( n 2 ) possible interactions. Network topology : describes the way nodes interact, structure/function Sample of 250 blogs (nodes) with their links (edges) relationship. of the French political Blogosphere. P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 4 / 22

  5. Estimation of random graph models Random graph models Random graph approaches (Govaert 1977, Frank and Harary 1982, Handcock 2006, Newman and Leicht 2007, Hofman and Wiggins 2008). The Block-Clustering model (Snijders and Nowicki 1997). Erd¨ os-R´ enyi Mixture Model for Network (MixNet; Daudin et al 2008). Estimation of the model parameters Bayesian strategies cannot handle large networks. Maximum likelihood strategies. P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 5 / 22

  6. The MixNet probabilistic model Origin model developed by J. Daudin et al. (2008), ER model generalization, application fields: biology, internet, social network... Modelling connection heterogeneity hyp.: there exists a hidden structure with Q classes, Z = ( Z i ) i , Z iq = I { i ∈ q } are indep. hidden variables, α = { α q } , the prior proportions of groups, ( Z i ) ∼ M (1 , α ). Distribution of X k Conditional distribution: X ij |{ Z iq Z j ℓ = 1 } ∼ B ( π ql ) X jk=0 where B () is the Bernoulli distribution , j X ij=1 X ij | Z are independant. i Marginal distribution: X ij ∼ � q ℓ α q α ℓ B ( π ql ), P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 6 / 22

  7. Maximum likelihood estimation Likelihood(s) of the model : → Observed data : p ( X | α , π ) = � Z p ( X , Z | α , π ). → Complete data : p ( X , Z | α , π ). → EM-like strategies require the knowledge of p ( Z | X , α , π ). Problem In our case, p ( Z | X , α , π ) is not tractable (no conditional independence). P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 7 / 22

  8. Variational method Daudin et. al, 2008 Decomposition and variational EM � � � � ln p ( X | α , π ) = L q ( . ); α , π + KL q ( . ) || p ( . | X , α , π ) , where q ( Z ) ln { p ( X , Z | α , π ) � � � L q ( . ); α , π = } , q ( Z ) Z and q ( Z ) ln { p ( Z | X , α , π ) � � � KL q ( . ) || p ( . | X , α , π ) = − } . q ( Z ) Z Approximation N N � � q ( Z ) = q ( Z i ) = M ( Z i ; 1 , τ i ) . i =1 i =1 P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 8 / 22

  9. Estimation of the number of classes Criteria Since p ( X | α , π ) is not tractable, many criteria cannot be used: 1 Akaike Information Criterion: AIC = ln p ( X | α ML , π ML ) − M . 2 Bayesian Information Criterion: BIC = ln p ( X | α MAP , π MAP ) − 1 2 M ln n . Integrated Classification Likelihood (ICL) 1 Following the work of Biernacki et al. (2000), Mariadassou and Robin (2007) used a criterion based on an asymptotic approximation of the Integrated Classification Likelihood (ICL). � � 2 ICL = max α , π ln p ( X , ˜ Q ( Q +1) Z | α , π ) − 1 � � ln n ( n +1) − ( Q − 1) ln( n ) 2 2 P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 9 / 22

  10. Outline Introduction 1 Real networks Random graph models The MixNet model Maximum likelihood estimation Bayesian View of MixNet 2 Bayesian probabilistic model Variational inference Model selection Applications 3 Affiliation models Metabolic network of E. coli P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 10 / 22

  11. Conjugate prior distributions Mixing coefficients : α ∼ Dirichlet ( α ; n 0 ) n 0 = ( n 0 1 , . . . , n 0 → Q ). n 0 → q is the prior number of vertices in class q . Connectivity matrix : π ∼ � Q q , l Beta( π ql ; η 0 ql , ζ 0 ql ) η 0 → ql is the prior number of edges connecting vertices of class q to vertices of class l . ζ 0 → ql is the prior number of non -edges connecting vertices of class q to vertices of class l . P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 11 / 22

  12. Variational Bayes Decomposition � � � � ln p ( X ) = L q ( . ) + KL q ( . ) || p ( . | X , where q ( Z , α , π ) ln { p ( X , Z , α , π ) � � � � � L q ( . ) = q ( Z , α , π ) } d α d π , Z and � � q ( Z , α , π ) ln { p ( Z , α , π | X ) � � � KL q ( . ) || p ( . | X ) = − q ( Z , α , π ) } d α d π . Z P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 12 / 22

  13. Variational Bayes Factorization N � q ( Z , α , π ) = q ( α ) q ( π ) q ( Z ) = q ( α ) q ( π ) q ( Z i ) . i =1 Optimization 1 ln ˜ q ( Z i ) = E Z \ i , α , π [ln p ( X , Z , α , π )] + cste . 2 ln ˜ q ( α ) = E Z , π [ln p ( X , Z , α , π )] + cste . 3 ln ˜ q ( π ) = E Z , α [ln p ( X , Z , α , π )] + cste . P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 13 / 22

  14. Optimization Variational Bayes E-step q ( Z i ) = M ( Z i ; 1 , τ i = { τ i 1 , . . . , τ iQ } ) . Variational Bayes M-step (1) q ( α ) = Dir( α ; n ) , q + � N where n q = n 0 i =1 τ iq . Variational Bayes M-step (2) Q � q ( π ) = Beta( π ql | η ql , ζ ql ) , q , l where η ql = η 0 ql + � N i � = j X ij τ iq τ jl and ζ ql = ζ 0 ql + � N i � = j (1 − X ij ) τ iq τ jl . P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 14 / 22

  15. Model selection The model evidence depends on Q . Bayes’ rule leads to p ( Q | X ) ∝ p ( X | Q ) p ( Q ). If p ( Q ) is broad, maximizing p ( Q | X ) is equivalent to maximizing p ( X | Q ). Since p ( X | Q ) is intractable we propose to use the lower bound � � L q ( . ) and to add a term ln Q ! to take the multimodality into account . First non-asymptotic criterion based on an approximation of the model evidence. P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 15 / 22

  16. Outline Introduction 1 Real networks Random graph models The MixNet model Maximum likelihood estimation Bayesian View of MixNet 2 Bayesian probabilistic model Variational inference Model selection Applications 3 Affiliation models Metabolic network of E. coli P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 16 / 22

  17. Experiments on affiliation models Probability of intra-connection : λ . Probability of inter-connection : ǫ . Number of vertices : n = 50 . For each graph model ( λ + ǫ = 1 ) and for each number of classes Q True ∈ { 2 , 3 , 4 , 5 } , we generated 100 graphs . 5 initializations using spectral clustering techniques . Select the best number of estimated classes according to each criterion . P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 17 / 22

  18. Affiliation model (1) 1 2 3 4 5 6 2 0 100 0 0 0 0 a ) Q True / Q ICL 3 0 0 100 0 0 0 4 0 0 1 98 1 0 5 0 0 10 61 29 0 1 2 3 4 5 6 2 0 100 0 0 0 0 b ) Q True / Q VB 3 0 0 100 0 0 0 4 0 0 0 98 2 0 5 0 0 1 29 65 5 Table: λ = 0 . 85 and ǫ = 0 . 15. P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 18 / 22

  19. Affiliation model (2) 1 2 3 4 5 6 2 0 100 0 0 0 0 a ) Q True / Q ICL 3 0 0 100 0 0 0 4 0 0 14 86 0 0 5 0 17 36 44 3 0 1 2 3 4 5 6 2 0 100 0 0 0 0 b ) Q True / Q VB 3 0 0 100 0 0 0 4 0 0 5 94 1 0 5 0 4 18 43 29 6 Table: λ = 0 . 8 and ǫ = 0 . 2. P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 19 / 22

  20. Metabolic network of E. coli 593 vertices . 1782 edges . 60 initializations . Compute the lower bound . P. Latouche (Stat. & G´ enome) Bayesian Methods for Graph Clustering Journ´ ees MAS, August 2008 20 / 22

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend