a nearly linear time algorithm for exact community
play

A Nearly-Linear Time Algorithm for Exact Community Recovery in - PowerPoint PPT Presentation

A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model Peng Wang 1 , Zirui Zhou 2 , Anthony Man-Cho So 1 1 Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong 2


  1. A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model Peng Wang 1 , Zirui Zhou 2 , Anthony Man-Cho So 1 1 Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong 2 Department of Mathematics, Hong Kong Baptist University June 14, 2020

  2. Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

  3. Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

  4. Community Detection • Community detection refers to the problem of inferring similarity classes of vertices (i.e., communities) in a network by observing their local interactions (Abbe 2017); see the below graphs. • Broad applications in machine learning, biology, social science and many areas. • Exact recovery requires to identify the entire partition correctly.

  5. Overview • Problem : exactly recover the communities in the binary symmetric stochastic block model (SBM), where n vertices are partitioned into two equal-sized communities and the vertices are connected with probability p = α log( n ) / n within communities and q = β log( n ) / n across communities. • Goal : propose an efficient algorithm that achieves exact recovery at the information-theoretic limit, i.e., √ α − √ β > √ 2. • Proposed Method : a two-stage iterative algorithm: (i) 1st-stage: power method, coarse estimate, (ii) 2nd-stage: generalized power method, refinement. • Theoretic Results : the proposed method can achieve exact recovery at the information-theoretic limit within ˜ O ( n ) time complexity.

  6. Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

  7. Stochastic Block Model Given n nodes in two equal-sized clusters, we denote by x ∗ its true community structures, e.g., for every i ∈ [ n ], x ∗ i = 1 if the node i belongs to the first cluster and x ∗ i = − 1 if it belongs to the second one. Model 1 (Binary symmetric SBM) The elements { a ij : 1 ≤ i ≤ j ≤ n } of A are generated independently by � if x ∗ i x ∗ Bern ( p ) , j = 1 , a ij ∼ if x ∗ i x ∗ Bern ( q ) , j = − 1 , where p = α log n q = β log n and n n for some constants α > β > 0 . Besides, we have a ij = a ji for all 1 ≤ j < i ≤ n. The problem of achieving exact recovery is to develop efficient methods that can find x ∗ or − x ∗ with high probability given the adjacency matrix A .

  8. Phase Transition The maximum likelihood (ML) estimator of x ∗ in the binary symmetric SBM is the solution of the following problem: � � x T Ax : 1 T max n x = 0 , x i = ± 1 , i = 1 , . . . , n . (1) Theorem 1 (Abbe et al. (2016), Mossel et al. (2014)) In the binary symmetric SBM, exact recovery is impossible if √ √ α − √ β < 2 , while it is possible and can be achieved by the ML √ estimator if √ α − √ β > 2 . √ In literature, √ α − √ β > 2 is called the information-theoretic limit. Question: Is it possible to develop efficient methods for achieving exact recovery at the information-theoretic limit?

  9. Related Works Table: Methods above the information-theoretic limit Authors Methods Time complexity Recovery bounds ( α − β ) 2 / ( α + β ) > 72 Boppana, 1987 spectral algo. polynomial time ( α − β ) 2 / ( α + β ) > 64 McSherry, 2001 spectral algo. polynomial time 3( α − β ) 2 > 24( α + β )+ Abbe et al., 2016 SDP polynomial time 8( α − β ) ( p − q ) / √ p + q ≥ cn − 1 / 6 Bandeira et al., 2016 manifold opti. polynomial time Table: Methods at the information-theoretic limit Authors Methods Time complexity Recovery bounds √ α − √ β > √ Hajek et al., 2016 SDP polynomial time 2 √ √ α − √ β > Abbe et al., 2017 spectral algo. polynomial time 2 √ α − √ β > √ Gao et al., 2017 two-stage algo. polynomial time 2 √ √ α − √ β > Our paper two-stage algo. nearly-linear time 2

  10. Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

  11. Algorithm Algorithm 1 A Two-Stage Algorithm for Exact Recovery 1: Input: adjacency matrix A , positive integer N n A 1 n / n 2 and B ← A − ρ E n 2: set ρ ← 1 T 3: choose y 0 randomly with uniform distribution over the unit sphere 4: for k = 1 , 2 , . . . , N do power method set y k ← By k − 1 / � By k − 1 � 2 5: (PM): coarse 6: end for estimate 7: set x 0 ← √ n y N 8: for k = 1 , 2 , . . . do set x k ← Bx k − 1 / | Bx k − 1 | 9: generalized if x k = x k − 1 then 10: power method terminate and return x k 11: stopping criteria (GPM): re- end if 12: finement 13: end for For any v ∈ R n , v / | v | denotes the vector of R n defined as � v � � 1 , if v i ≥ 0 , = otherwise , i = 1 , . . . , n . | v | − 1 , i

  12. Main Theorem Theorem 2 (Iteration Complexity for Exact Recovery) √ Let A be randomly generated by Model 1. If √ α − √ β > 2 , then the following statement holds with probability at least 1 − n − Ω(1) : Algorithm 1 finds x ∗ or − x ∗ in O (log n / log log n ) power iterations and O (log n / log log n ) generalized power iterations. Consequences: • Algorithm 1 achieves exact recovery at the information-theoretic limit. • Explicit iteration complexity bound for Algorithm 1 to achieve exact recovery. The number of non-zero entries in A is, with high probability, in the order of n log n . Corollary 3 (Time Complexity for Exact Recovery) Let A be randomly generated by Model 1. If √ α − √ β > √ 2 , then with probability at least 1 − n − Ω(1) , Algorithm 1 finds x ∗ or − x ∗ in O ( n log 2 n ) time complexity.

  13. Analysis of Power Method Proposition 1 (Convergence Rate of Power Method) Let { y k } k ≥ 0 be the sequence generated in the first-stage of Algorithm 1. Then, it holds with probability at least 1 − n − Ω(1) that s ∈{± 1 } � y k − s u 1 � 2 � n / (log n ) k / 2 , ∀ k ≥ 0 , min (2) where u 1 is an eigenvector of B associated with the largest eigenvalue. • { y k } k ≥ 0 with high probability converges at least linearly to u 1 . • Equation (2) shows that the ratio in the linear rate of convergence tends to 0 as n → ∞ . Lemma 4 (Distance from Leading Eigenvalue of B to Ground Truth) It holds with probability at least 1 − n − Ω(1) that � √ n u 1 − s x ∗ � � � min 2 � n / log n . (3) � s ∈{± 1 } • It suffices to compute y N p such that min s ∈{± 1 } � y N p − s u 1 � 2 � 1 / √ log n . By (2), we have N p = O (log n / log log n ).

  14. Analysis of Generalized Power Method Proposition 2 (Convergence Rate of Generalized Power Method) Let α > β > 0 be fixed such that √ α − √ β > √ 2 . Suppose that the x 0 in Algorithm 1 satisfies � x 0 � 2 = √ n and � x 0 − x ∗ � 2 � � n / log n . Then, it holds with probability at least 1 − n − Ω(1) that � x k − x ∗ � 2 ≤ � x 0 − x ∗ � 2 / (log n ) k / 2 . (4) • Note that � x 0 − x ∗ � 2 ≤ � x 0 − √ n u 1 � 2 + �√ n u 1 − x ∗ � 2 � � n / log n . Lemma 5 (One-step Convergence of Generalized Power Iterations) √ For any fixed α > β > 0 such that √ α − √ β > 2 , the following event happens with probability at least 1 − n − Ω(1) : for all x ∈ {± 1 } n such that � x − x ∗ � 2 ≤ 2 , it holds that Bx / | Bx | = x ∗ . (5) • This lemma indicates that the GPM exhibits finite termination. • If � x 0 − x ∗ � 2 / (log n ) N g / 2 ≤ 2, by (4), we have � x N g − x ∗ � 2 ≤ 2. Then, x N g +1 = x ∗ . One can verify N g = O (log n / log log n ).

  15. Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

  16. Phase Transition and Computation Efficiency • Benchmark methods: • SDP-based approach in Amini et al. (2018) solved by ADMM. • Manifold optimization (MFO) based approach in Bandeira et al. (2016) solved by manifold gradient descent (MGD) method. • Spectral clustering approach in Abbe et al. (2017) solved by Matlab function eigs . • Parameters setting: • n = 300; α and β vary from 0 to 30 and 0 to 10, with increments 0 . 5 and 0 . 4, respectively. • For fixed ( α, β ), we generate 40 instances and calculate the ratio of exact recovery. GPM SDP MGD SC 1 1 1 1 30 30 30 30 0.9 0.9 0.9 0.9 25 25 25 25 0.8 0.8 0.8 0.8 0.7 0.7 0.7 0.7 20 20 20 20 0.6 0.6 0.6 0.6 15 0.5 15 0.5 15 0.5 15 0.5 0.4 0.4 0.4 0.4 10 10 10 10 0.3 0.3 0.3 0.3 0.2 0.2 0.2 0.2 5 5 5 5 0.1 0.1 0.1 0.1 0 0 0 0 0 0 0 0 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 running time: 25 s running time: 9313 s running time: 1064 s running time: 118 s Figure: Phase transition: the x -axis is β , the y -axis is α , and darker pixels represent √ lower empirical probability of success. The red curve is √ α − √ β = 2.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend