A Nearly-Linear Time Algorithm for Exact Community Recovery in - PowerPoint PPT Presentation

A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model Peng Wang 1 , Zirui Zhou 2 , Anthony Man-Cho So 1 1 Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong 2 Department of Mathematics, Hong Kong Baptist University June 14, 2020

Table of Contents 1 Overview 2 Introduction 3 Main Results 4 Experimental Results 5 Conclusions

Community Detection • Community detection refers to the problem of inferring similarity classes of vertices (i.e., communities) in a network by observing their local interactions (Abbe 2017); see the below graphs. • Broad applications in machine learning, biology, social science and many areas. • Exact recovery requires to identify the entire partition correctly.

Overview • Problem : exactly recover the communities in the binary symmetric stochastic block model (SBM), where n vertices are partitioned into two equal-sized communities and the vertices are connected with probability p = α log( n ) / n within communities and q = β log( n ) / n across communities. • Goal : propose an efficient algorithm that achieves exact recovery at the information-theoretic limit, i.e., √ α − √ β > √ 2. • Proposed Method : a two-stage iterative algorithm: (i) 1st-stage: power method, coarse estimate, (ii) 2nd-stage: generalized power method, refinement. • Theoretic Results : the proposed method can achieve exact recovery at the information-theoretic limit within ˜ O ( n ) time complexity.

Stochastic Block Model Given n nodes in two equal-sized clusters, we denote by x ∗ its true community structures, e.g., for every i ∈ [ n ], x ∗ i = 1 if the node i belongs to the first cluster and x ∗ i = − 1 if it belongs to the second one. Model 1 (Binary symmetric SBM) The elements { a ij : 1 ≤ i ≤ j ≤ n } of A are generated independently by � if x ∗ i x ∗ Bern ( p ) , j = 1 , a ij ∼ if x ∗ i x ∗ Bern ( q ) , j = − 1 , where p = α log n q = β log n and n n for some constants α > β > 0 . Besides, we have a ij = a ji for all 1 ≤ j < i ≤ n. The problem of achieving exact recovery is to develop efficient methods that can find x ∗ or − x ∗ with high probability given the adjacency matrix A .

Phase Transition The maximum likelihood (ML) estimator of x ∗ in the binary symmetric SBM is the solution of the following problem: � � x T Ax : 1 T max n x = 0 , x i = ± 1 , i = 1 , . . . , n . (1) Theorem 1 (Abbe et al. (2016), Mossel et al. (2014)) In the binary symmetric SBM, exact recovery is impossible if √ √ α − √ β < 2 , while it is possible and can be achieved by the ML √ estimator if √ α − √ β > 2 . √ In literature, √ α − √ β > 2 is called the information-theoretic limit. Question: Is it possible to develop efficient methods for achieving exact recovery at the information-theoretic limit?

Related Works Table: Methods above the information-theoretic limit Authors Methods Time complexity Recovery bounds ( α − β ) 2 / ( α + β ) > 72 Boppana, 1987 spectral algo. polynomial time ( α − β ) 2 / ( α + β ) > 64 McSherry, 2001 spectral algo. polynomial time 3( α − β ) 2 > 24( α + β )+ Abbe et al., 2016 SDP polynomial time 8( α − β ) ( p − q ) / √ p + q ≥ cn − 1 / 6 Bandeira et al., 2016 manifold opti. polynomial time Table: Methods at the information-theoretic limit Authors Methods Time complexity Recovery bounds √ α − √ β > √ Hajek et al., 2016 SDP polynomial time 2 √ √ α − √ β > Abbe et al., 2017 spectral algo. polynomial time 2 √ α − √ β > √ Gao et al., 2017 two-stage algo. polynomial time 2 √ √ α − √ β > Our paper two-stage algo. nearly-linear time 2

Algorithm Algorithm 1 A Two-Stage Algorithm for Exact Recovery 1: Input: adjacency matrix A , positive integer N n A 1 n / n 2 and B ← A − ρ E n 2: set ρ ← 1 T 3: choose y 0 randomly with uniform distribution over the unit sphere 4: for k = 1 , 2 , . . . , N do power method set y k ← By k − 1 / � By k − 1 � 2 5: (PM): coarse 6: end for estimate 7: set x 0 ← √ n y N 8: for k = 1 , 2 , . . . do set x k ← Bx k − 1 / | Bx k − 1 | 9: generalized if x k = x k − 1 then 10: power method terminate and return x k 11: stopping criteria (GPM): re- end if 12: finement 13: end for For any v ∈ R n , v / | v | denotes the vector of R n defined as � v � � 1 , if v i ≥ 0 , = otherwise , i = 1 , . . . , n . | v | − 1 , i

Main Theorem Theorem 2 (Iteration Complexity for Exact Recovery) √ Let A be randomly generated by Model 1. If √ α − √ β > 2 , then the following statement holds with probability at least 1 − n − Ω(1) : Algorithm 1 finds x ∗ or − x ∗ in O (log n / log log n ) power iterations and O (log n / log log n ) generalized power iterations. Consequences: • Algorithm 1 achieves exact recovery at the information-theoretic limit. • Explicit iteration complexity bound for Algorithm 1 to achieve exact recovery. The number of non-zero entries in A is, with high probability, in the order of n log n . Corollary 3 (Time Complexity for Exact Recovery) Let A be randomly generated by Model 1. If √ α − √ β > √ 2 , then with probability at least 1 − n − Ω(1) , Algorithm 1 finds x ∗ or − x ∗ in O ( n log 2 n ) time complexity.

Analysis of Power Method Proposition 1 (Convergence Rate of Power Method) Let { y k } k ≥ 0 be the sequence generated in the first-stage of Algorithm 1. Then, it holds with probability at least 1 − n − Ω(1) that s ∈{± 1 } � y k − s u 1 � 2 � n / (log n ) k / 2 , ∀ k ≥ 0 , min (2) where u 1 is an eigenvector of B associated with the largest eigenvalue. • { y k } k ≥ 0 with high probability converges at least linearly to u 1 . • Equation (2) shows that the ratio in the linear rate of convergence tends to 0 as n → ∞ . Lemma 4 (Distance from Leading Eigenvalue of B to Ground Truth) It holds with probability at least 1 − n − Ω(1) that � √ n u 1 − s x ∗ � � � min 2 � n / log n . (3) � s ∈{± 1 } • It suffices to compute y N p such that min s ∈{± 1 } � y N p − s u 1 � 2 � 1 / √ log n . By (2), we have N p = O (log n / log log n ).

Analysis of Generalized Power Method Proposition 2 (Convergence Rate of Generalized Power Method) Let α > β > 0 be fixed such that √ α − √ β > √ 2 . Suppose that the x 0 in Algorithm 1 satisfies � x 0 � 2 = √ n and � x 0 − x ∗ � 2 � � n / log n . Then, it holds with probability at least 1 − n − Ω(1) that � x k − x ∗ � 2 ≤ � x 0 − x ∗ � 2 / (log n ) k / 2 . (4) • Note that � x 0 − x ∗ � 2 ≤ � x 0 − √ n u 1 � 2 + �√ n u 1 − x ∗ � 2 � � n / log n . Lemma 5 (One-step Convergence of Generalized Power Iterations) √ For any fixed α > β > 0 such that √ α − √ β > 2 , the following event happens with probability at least 1 − n − Ω(1) : for all x ∈ {± 1 } n such that � x − x ∗ � 2 ≤ 2 , it holds that Bx / | Bx | = x ∗ . (5) • This lemma indicates that the GPM exhibits finite termination. • If � x 0 − x ∗ � 2 / (log n ) N g / 2 ≤ 2, by (4), we have � x N g − x ∗ � 2 ≤ 2. Then, x N g +1 = x ∗ . One can verify N g = O (log n / log log n ).

Phase Transition and Computation Efficiency • Benchmark methods: • SDP-based approach in Amini et al. (2018) solved by ADMM. • Manifold optimization (MFO) based approach in Bandeira et al. (2016) solved by manifold gradient descent (MGD) method. • Spectral clustering approach in Abbe et al. (2017) solved by Matlab function eigs . • Parameters setting: • n = 300; α and β vary from 0 to 30 and 0 to 10, with increments 0 . 5 and 0 . 4, respectively. • For fixed ( α, β ), we generate 40 instances and calculate the ratio of exact recovery. GPM SDP MGD SC 1 1 1 1 30 30 30 30 0.9 0.9 0.9 0.9 25 25 25 25 0.8 0.8 0.8 0.8 0.7 0.7 0.7 0.7 20 20 20 20 0.6 0.6 0.6 0.6 15 0.5 15 0.5 15 0.5 15 0.5 0.4 0.4 0.4 0.4 10 10 10 10 0.3 0.3 0.3 0.3 0.2 0.2 0.2 0.2 5 5 5 5 0.1 0.1 0.1 0.1 0 0 0 0 0 0 0 0 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 running time: 25 s running time: 9313 s running time: 1064 s running time: 118 s Figure: Phase transition: the x -axis is β , the y -axis is α , and darker pixels represent √ lower empirical probability of success. The red curve is √ α − √ β = 2.

A Nearly-Linear Time Algorithm for Exact Community Recovery in - PowerPoint PPT Presentation

A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model Peng Wang 1 , Zirui Zhou 2 , Anthony Man-Cho So 1 1 Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong 2

COMMUNITY MANAGEMENT jono bacon COMMUNITY COMMUNITY COMMUNITY COMMUNITY COMMUNITY COMMUNITY

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Algebraic Tools for Exact Geometric Computing I - Exact Arithmetic and Filtering Michael Hemmer

can be done in linear time? Yuri Gurevich Tallinn, April 29, 2014 Agenda 1. What linear time?

Exact asymptotics for linear processes Magda Peligrad University of Cincinnati October 2011

Exact Indexing of Dynamic Exact Indexing of Dynamic Time Warping Time Warping Eamonn Keogh

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

BEST OF EXACT GLOBE Jos Suijkens Michiel Beek Best of Exact Globe 2 Agenda 1. A fresh look for

Notes on exact meets and joins R. N. Ball, J. Picado and A. Pultr 1 Exact meets and joins.

Extending Nearly-Linear Models Chiara Corsato, Renato Pelessoni and Paolo Vicig University of

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Cycle time: 40 sec Cycle time: 12 sec Cycle time: 0.75 sec Cycle time: 1.25 sec Cycle time: 5

How to Detect How to Actually . . . Linear Dependence Resulting Algorithm Heavy-Tailed . . .

Linear Manifold Clustering Robert Haralick and Rave Harpaz Outline Background The linear

SCHOOL BOARD PROFILE Nearly 170 years of excellence in Nearly Catholic Catholic 91,000

Categorical Semantics for Linear Logic Categorical semantics for linear logic Interaction

CSE 473: Artificial Intelligence Hanna Hajishirzi

Particle Competition and Cooperation to Prevent Error Propagation from Mislabeled Data in Semi-

StayingFIT: StayingFIT: EfficientLoadSheddingTechniquesfor

A Contextual Query Expansion Approach by Term Clustering for Robust Text Summarization Massih

Shortest-Weight Paths in Random Graphs Hamed Amini EPFL Nice Random Graphs Workshop , May 2014

HLSaaS: High-Level Video Streaming as a Service Mohsen Amini-Salehi, Xiangbo Li High

Chapter 11. Network Community Detection Wei Pan Division of Biostatistics, School of Public

Cach: Caching Location-Enhanced Content to Improve User Privacy Carnegie Mellon University

A Nearly-Linear Time Algorithm for Exact Community Recovery in - PowerPoint PPT Presentation

A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model Peng Wang 1 , Zirui Zhou 2 , Anthony Man-Cho So 1 1 Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong 2

COMMUNITY MANAGEMENT jono bacon COMMUNITY COMMUNITY COMMUNITY COMMUNITY COMMUNITY COMMUNITY

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Algebraic Tools for Exact Geometric Computing I - Exact Arithmetic and Filtering Michael Hemmer

can be done in linear time? Yuri Gurevich Tallinn, April 29, 2014 Agenda 1. What linear time?

Exact asymptotics for linear processes Magda Peligrad University of Cincinnati October 2011

Exact Indexing of Dynamic Exact Indexing of Dynamic Time Warping Time Warping Eamonn Keogh

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

BEST OF EXACT GLOBE Jos Suijkens Michiel Beek Best of Exact Globe 2 Agenda 1. A fresh look for

Notes on exact meets and joins R. N. Ball, J. Picado and A. Pultr 1 Exact meets and joins.

Extending Nearly-Linear Models Chiara Corsato, Renato Pelessoni and Paolo Vicig University of

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

Cycle time: 40 sec Cycle time: 12 sec Cycle time: 0.75 sec Cycle time: 1.25 sec Cycle time: 5

How to Detect How to Actually . . . Linear Dependence Resulting Algorithm Heavy-Tailed . . .

Linear Manifold Clustering Robert Haralick and Rave Harpaz Outline Background The linear

SCHOOL BOARD PROFILE Nearly 170 years of excellence in Nearly Catholic Catholic 91,000

Categorical Semantics for Linear Logic Categorical semantics for linear logic Interaction

CSE 473: Artificial Intelligence Hanna Hajishirzi

Particle Competition and Cooperation to Prevent Error Propagation from Mislabeled Data in Semi-

StayingFIT: StayingFIT: EfficientLoadSheddingTechniquesfor

A Contextual Query Expansion Approach by Term Clustering for Robust Text Summarization Massih

Shortest-Weight Paths in Random Graphs Hamed Amini EPFL Nice Random Graphs Workshop , May 2014

HLSaaS: High-Level Video Streaming as a Service Mohsen Amini-Salehi, Xiangbo Li High

Chapter 11. Network Community Detection Wei Pan Division of Biostatistics, School of Public

Cach: Caching Location-Enhanced Content to Improve User Privacy Carnegie Mellon University

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE