A Tensor Spectral Approach to Learning Mixed Membership Community Models - PowerPoint PPT Presentation



slide-1
SLIDE 1

A Tensor Spectral Approach to Learning Mixed Membership Community Models

Anima Anandkumar

U.C. Irvine

Joint work with Rong Ge, Daniel Hsu, Furong Huang, Niranjan UN, Mohammad Hakeem, Sham Kakade.

slide-2
SLIDE 2

Network Communities in Various Domains

Social Networks

Social ties: e.g. friendships, co-authorships

Biological Networks

Functional relationships: e.g. gene regulation, neural activity.

Recommendation Systems

Recommendations: e.g. Yelp reviews.

Community Detection: infer hidden communities from the observed network.

slide-3
SLIDE 3

Community Formation Models

Basic Intuition: Nodes connect due to their community memberships

slide-4
SLIDE 4

Community Formation Models

Basic Intuition: Nodes connect due to their community memberships

Classical: Stochastic Block Model

Edges conditionally independent given node community memberships.
Single membership model: nodes belong to at most one community.

slide-5
SLIDE 5

Community Formation Models


Modeling Overlapping Communities

People belong to multiple communities.
Community formation models? Detection algorithms? Computational/sample complexities?

[Figure: overlapping community memberships, e.g. MIT, Microsoft, UC Irvine, Cornell]

slide-6
SLIDE 6

Pure vs. Mixed Membership Community Models

Stochastic Block Model

slide-7
SLIDE 7

Pure vs. Mixed Membership Community Models

Stochastic Block Model Mixed Membership Model

slide-8
SLIDE 8

Mixed Membership Community Models

Node Membership Model

Mixed memberships: nodes can belong to multiple communities.
Fractional memberships: node memberships normalized to sum to one.

slide-11
SLIDE 11

Mixed Membership Community Models

Edge Formation Model

Edges conditionally independent given node community memberships.
Linearity: edge probability averaged over community memberships.

slide-12
SLIDE 12

Mixed Membership Dirichlet Model (Airoldi et al.)

Independent draws of community membership vectors {πu}u∈V from the Dirichlet distribution:

P[πu] ∝ ∏_{j=1}^{k} πu(j)^{αj − 1},  with ∑_{j=1}^{k} πu(j) = 1.

Dirichlet distribution Dir(α) supported over the simplex.
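The effect of the Dirichlet parameters on membership sparsity can be seen in a minimal NumPy sketch (variable names and values are illustrative, not from the talk):

```python
import numpy as np

rng = np.random.default_rng(0)
k = 4  # number of communities

# Draw membership vectors pi_u ~ Dir(alpha); each draw lies on the
# simplex: entries are nonnegative and sum to one.
for alpha_j in [0.025, 1.0, 25.0]:
    alpha = np.full(k, alpha_j)
    pis = rng.dirichlet(alpha, size=5)
    # Small alpha_j -> draws near the simplex corners (sparse memberships);
    # large alpha_j -> draws near the center (nearly uniform memberships).
    print(alpha_j, np.round(pis[0], 3))
```

Small αj yields nearly pure (one-hot) memberships, matching the regime of interest in the talk.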

slide-13
SLIDE 13

Mixed Membership Dirichlet Model (Airoldi et al.)

Shape of Dir(α) over the simplex: αj → 0 pushes mass to the corners (sparse, nearly pure memberships); αj = 1 gives the uniform distribution over the simplex; large αj (αj → ∞) concentrates mass at the center (dense, nearly equal memberships).

slide-18
SLIDE 18

Mixed Membership Dirichlet Model (Airoldi et al.)

Dirichlet concentration parameter: α0 := ∑_j αj.

Roughly, the level of sparsity in π is O(α0). Regime of interest: small αj.

slide-19
SLIDE 19

Employing Mixed Membership Models

Advantages

Generative model: can be used for prediction.
Dirichlet distribution: can model sparse memberships.
Sparsity level related to the concentration parameter α0 = ∑_i αi.
Stochastic block model is a special case (αi → 0).

slide-20
SLIDE 20

Employing Mixed Membership Models


Challenges in Learning Mixed Membership Models

Identifiability: when can parameters be estimated?
Guaranteed learning? What input is required?
Potentially large sample and computational complexities.

slide-21
SLIDE 21

Overview of the Approach

Method of Moments and Spectral Approach

Inverse moment method: solve equations relating parameters to observed moments.

Spectral approach: reduce equation solving to computing the “spectrum” of the observed moments.

Non-convex but computationally tractable approaches.

slide-22
SLIDE 22


Spectral Approach to Learning Mixed Membership Models

Edge and Subgraph Counts: moments of the observed network.
Tensor Spectral Approach: low-rank tensor form and efficient decomposition via the power method.
Parallel Implementation: linear algebraic operations and iterative tensor decomposition techniques.

slide-23
SLIDE 23

Outline

1. Introduction
2. Summary of Theoretical Guarantees
3. Graph Moments: Tensor Form of Subgraph Counts
4. Algorithms for Tensor Decomposition
5. GPU Implementation and Experimental Results
6. Conclusion

slide-24
SLIDE 24

Summary of Results

Contributions

First guaranteed learning method for overlapping (probabilistic) community models.
Correctness under exact moments: edge and 3-star counts.
Efficient sample and computational complexity.

slide-25
SLIDE 25

Summary of Results

Scaling Requirements

k communities, n nodes, uniform communities. Dirichlet concentration parameter: α0 := ∑_i αi. p, q: intra-/inter-community connectivity.

n = Ω̃(k²(α0 + 1)²),   (p − q)/√p = Ω̃((α0 + 1)k / √n).

For the stochastic block model (α0 = 0), the results are tight.
Performance degrades as α0 increases.
Efficient method for sparse community overlaps.

slide-27
SLIDE 27

Main Results: Recovery Guarantees

k communities, n nodes, uniform communities. Dirichlet concentration parameter: α0 := ∑_i αi.

p, q: intra-/inter-community connectivity.

Scaling Requirements

n = Ω̃(k²(α0 + 1)²),   (p − q)/√p = Ω̃((α0 + 1)k / √n).
slide-28
SLIDE 28

Main Results: Recovery Guarantees

Recovery Bounds (Anandkumar, Ge, Hsu, Kakade ‘13)

ε_{π,n} := (1/n) max_i ‖Π̂_i − Π_i‖₁ = Õ( (α0 + 1)^{3/2} √p / ((p − q) √n) )

ε_P := max_{i,j∈[n]} |P̂_{i,j} − P_{i,j}| = Õ( (α0 + 1)^{3/2} k √p / ((p − q) √n) )

slide-29
SLIDE 29

Support Recovery Guarantees (Homophilic Models)

k communities, n nodes, uniform communities.
εP: error in recovering P.
Π: the true community membership matrix.
Homophilic models: p > q.

slide-30
SLIDE 30


Support Recovery Guarantee (AGHK ‘13)

For a threshold ξ = Ω(εP), for all nodes j ∈ [n] and all communities i ∈ [k], the estimated support Ŝ satisfies (w.h.p.):

Π(i, j) ≥ ξ ⇒ Ŝ(i, j) = 1  and  Π(i, j) ≤ ξ/2 ⇒ Ŝ(i, j) = 0.

Zero-error support recovery of significant memberships of all nodes.
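The thresholding step behind this guarantee is simple to sketch in NumPy; `recover_support` and the membership values below are illustrative, not code or data from the talk:

```python
import numpy as np

def recover_support(Pi_hat, xi):
    """Threshold an estimated membership matrix Pi_hat (k x n):
    entries at least xi are declared significant memberships."""
    return (Pi_hat >= xi).astype(int)

# Hypothetical estimates: rows = communities, columns = nodes.
Pi_hat = np.array([[0.70, 0.05, 0.45],
                   [0.30, 0.90, 0.02],
                   [0.00, 0.05, 0.53]])
S = recover_support(Pi_hat, xi=0.2)
```

Memberships above ξ are recovered; those below ξ/2 are suppressed, which is exactly the two-sided condition in the guarantee.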

slide-31
SLIDE 31

Outline

1. Introduction
2. Summary of Theoretical Guarantees
3. Graph Moments: Tensor Form of Subgraph Counts
4. Algorithms for Tensor Decomposition
5. GPU Implementation and Experimental Results
6. Conclusion

slide-32
SLIDE 32

Subgraph Counts as Graph Moments

slide-33
SLIDE 33

Subgraph Counts as Graph Moments

slide-34
SLIDE 34

Subgraph Counts as Graph Moments

3-star counts for learning communities

slide-35
SLIDE 35

Subgraph Counts as Graph Moments

3-star counts for learning communities

3-Star Count Tensor

M3(a, b, c) = |X|⁻¹ · (# of 3-stars with center in X and leaves a, b, c) = |X|⁻¹ ∑_{x∈X} G(x, a) G(x, b) G(x, c)

M3 = |X|⁻¹ ∑_{x∈X} [G⊤_{x,A} ⊗ G⊤_{x,B} ⊗ G⊤_{x,C}]

[Figure: 3-star with center x ∈ X and leaves a ∈ A, b ∈ B, c ∈ C]
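Assuming NumPy and 0/1 biadjacency blocks for the partitions X, A, B, C, the 3-star count tensor above is a single `einsum`; this is a sketch, not the authors' implementation:

```python
import numpy as np

def three_star_tensor(G_XA, G_XB, G_XC):
    """M3(a,b,c) = |X|^{-1} sum_{x in X} G(x,a) G(x,b) G(x,c):
    normalized count of 3-stars centered in X with leaves a, b, c.
    G_XA is the |X| x |A| biadjacency block, and similarly for B, C."""
    nX = G_XA.shape[0]
    return np.einsum('xa,xb,xc->abc', G_XA, G_XB, G_XC) / nX
```

The einsum sums over the star centers x, exactly mirroring the formula on the slide.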

slide-36
SLIDE 36

Moments Under Stochastic Block Model

One-hot encoding: πu = ei if node u is in community i.

P[Gu,v = 1 | πu, πv] = πu⊤ P πv. E.g., if πu = ei and πv = ej, the probability is Pi,j.

slide-37
SLIDE 37

Moments Under Stochastic Block Model

Expected Edge Counts

Community matrix: ΠA := [πa]_{a∈A}

E[G⊤_{x,A} | Π] = πx⊤ P ΠA = ΠA⊤ P⊤ πx = FA πx,  where FA := ΠA⊤ P⊤

slide-38
SLIDE 38

Moments Under Stochastic Block Model

Expected 3-Star Tensor

λi = P[π = ei]: probability of community i

E[M3 | ΠA,B,C] = ∑_{i∈[k]} λi [(FA)i ⊗ (FB)i ⊗ (FC)i]

slide-39
SLIDE 39

Moments Under Stochastic Block Model

Multi-linear Model for Edge Generation
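The multi-linear edge-generation model can be sampled end to end in a few NumPy lines (Dirichlet memberships plus the linear edge-probability rule); the parameter values here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 200, 3
p, q = 0.6, 0.1
P = q * np.ones((k, k)) + (p - q) * np.eye(k)  # community connectivity

# Memberships pi_u ~ Dir(alpha); small alpha gives nearly pure nodes.
Pi = rng.dirichlet(np.full(k, 0.1), size=n)    # n x k

# Linearity: edge probability is the membership-averaged connectivity.
probs = Pi @ P @ Pi.T
G = (rng.random((n, n)) < probs).astype(int)
G = np.triu(G, 1)
G = G + G.T                                    # undirected, no self-loops
```

Setting αj → 0 recovers the stochastic block model: every row of Pi becomes one-hot and each edge probability is exactly Pi,j.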

slide-40
SLIDE 40

Geometric Picture for 3-Star Tensor Form

E[M3 | ΠA,B,C,X] = |X|⁻¹ ∑_{x∈X} [(FA πx) ⊗ (FB πx) ⊗ (FC πx)]

Linear Multiview Model

Each draw of πx, for x ∈ X, generates the three views FA · πx, FB · πx, FC · πx.

slide-41
SLIDE 41

Geometric Picture for 3-Star Tensor Form

E[M3 | ΠA,B,C] = ∑_{i∈[k]} λi [(FA)i ⊗ (FB)i ⊗ (FC)i]

Goal: recover FA, FB, FC, λ through CP tensor decomposition.

slide-42
SLIDE 42

Outline

1. Introduction
2. Summary of Theoretical Guarantees
3. Graph Moments: Tensor Form of Subgraph Counts
4. Algorithms for Tensor Decomposition
5. GPU Implementation and Experimental Results
6. Conclusion

slide-43
SLIDE 43

Low-rank Tensor Decomposition

[Figure: tensor E[M3|ΠA,B,C] = λ1(FA)1 ⊗ (FB)1 ⊗ (FC)1 + λ2(FA)2 ⊗ (FB)2 ⊗ (FC)2 + …]

E[M3 | ΠA,B,C] = ∑_{i∈[k]} λi [(FA)i ⊗ (FB)i ⊗ (FC)i]

Rank-k tensor decomposition, with typically k ≪ n. Here u ⊗ v ⊗ w is a rank-1 tensor whose (i, j, k)-th entry is ui vj wk.
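The rank-1 building block and the rank-k CP form can be written directly in NumPy (a sketch with made-up factors, not the talk's moments):

```python
import numpy as np

def outer3(u, v, w):
    """u ⊗ v ⊗ w: rank-1 tensor with (i, j, k)-th entry u_i v_j w_k."""
    return np.einsum('i,j,k->ijk', u, v, w)

rng = np.random.default_rng(0)
k = 2
lam = np.array([0.6, 0.4])
FA, FB, FC = (rng.random((5, k)) for _ in range(3))

# Rank-k CP form: sum_i lam_i (FA)_i ⊗ (FB)_i ⊗ (FC)_i
M3 = sum(lam[i] * outer3(FA[:, i], FB[:, i], FC[:, i]) for i in range(k))
```

Only nk + k numbers (the factor columns and weights) determine the full O(n³) tensor, which is what makes the dimensionality reduction on the next slides possible.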

slide-44
SLIDE 44

Low-rank Tensor Decomposition

Challenges

Guaranteed algorithm for tensor decomposition?
Efficient and scalable implementation?
Noisy tensor decomposition: exact moments are not available.
Sample complexity: how large must n be compared to k?

slide-45
SLIDE 45

Dimensionality Reduction for Tensor Decomposition

E[M3 | ΠA,B,C] = ∑_{i∈[k]} λi [(FA)i ⊗ (FB)i ⊗ (FC)i]

Rank-k tensor decomposition, with typically k ≪ n. M3 has size O(n³), but the number of free parameters is only nk + k.

First Step: Dimensionality Reduction

Convert M3 of size |A| × |B| × |C| to a tensor T of size k × k × k, then carry out the decomposition of T.

Advantages

Reduced computation.
Robustness to noise.

[Figure: tensor M3 converted to tensor T via multi-linear transforms]

slide-46
SLIDE 46

Dimensionality Reduction through Whitening

E[M3 | ΠA,B,C] = ∑_i λi [(FA)i ⊗ (FB)i ⊗ (FC)i]

Whitening: Conversion of (expected) M3 to an Orthogonal Symmetric Tensor T

Whitening: WA s.t. WA⊤ FA Diag(λ)^{1/2} = R, with R⊤ R = I ∈ ℝ^{k×k}. Similarly for WB, WC.

Symmetrization: W̃B := WB (WB⊤ FB Diag(λ) FA⊤ WA). Similarly W̃C.

SVD of the (expected) GX,A gives Col(FA): E[G⊤_{X,A} | ΠA,X] = FA ΠX.

slide-47
SLIDE 47

Dimensionality Reduction through Whitening

Multi-linear Transformation of the 3-star Tensor

T := E[M3 | ΠA,B,C](WA, W̃B, W̃C) = ∑_i ρi ri^{⊗3}

T is a symmetric orthogonal tensor: the {ri} are orthonormal.
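The whitening principle can be illustrated in a simplified single-view, symmetric setting: one factor matrix F in place of the three views FA, FB, FC, with the whitening matrix obtained from the second moment by eigendecomposition rather than the slide's three-view SVD construction. A sketch under those assumptions, not the talk's algorithm:

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 8, 3
F = rng.random((n, k))
lam = np.array([0.5, 0.3, 0.2])

# Symmetric second and third moments built from the factors.
M2 = (F * lam) @ F.T                                  # sum_i lam_i f_i f_i^T
M3 = np.einsum('i,ai,bi,ci->abc', lam, F, F, F)       # sum_i lam_i f_i^{x3}

# Whitening: W such that W.T @ M2 @ W = I_k, from the top-k eigenpairs.
vals, vecs = np.linalg.eigh(M2)
vals, vecs = vals[-k:], vecs[:, -k:]
W = vecs / np.sqrt(vals)

# Multi-linear transform: T = M3(W, W, W) is a k x k x k
# orthogonally decomposable tensor sum_i rho_i r_i^{x3}.
T = np.einsum('abc,ai,bj,ck->ijk', M3, W, W, W)
```

Here the orthonormal components are r_i = √λi · W⊤ f_i with weights ρi = λi^{−1/2}, so the decomposition of the small tensor T recovers the factors up to the known whitening transform.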

slide-48
SLIDE 48

Tensor Decomposition Through Eigen Analysis

Orthogonal symmetric tensor: T = ∑_i ρi ri^{⊗3}

T(I, ri, ri) = ∑_j ρj ⟨ri, rj⟩² rj = ρi ri

slide-49
SLIDE 49

Tensor Decomposition Through Eigen Analysis

Obtaining eigenvectors through power iterations:

u ← T(I, u, u) / ‖T(I, u, u)‖

slide-50
SLIDE 50

Tensor Decomposition Through Eigen Analysis

Basic Algorithm

Pick random initialization vectors.
Run power iterations.
Go with the winner; deflate and repeat.
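The basic algorithm above (random restarts, power iterations, go with the winner, deflate) can be sketched in plain NumPy; this is an illustration of the idea, not the robust implementation analyzed in the talk:

```python
import numpy as np

def tensor_power_method(T, k, n_restarts=10, n_iters=100, seed=0):
    """Recover eigenpairs (rho_i, r_i) of an orthogonally decomposable
    symmetric tensor T = sum_i rho_i r_i^{x3} via the power iteration
    u <- T(I, u, u) / ||T(I, u, u)||, with deflation between factors."""
    rng = np.random.default_rng(seed)
    T = T.copy()
    dim = T.shape[0]
    eigvals, eigvecs = [], []
    for _ in range(k):
        best_u, best_val = None, -np.inf
        for _ in range(n_restarts):            # random initializations
            u = rng.standard_normal(dim)
            u /= np.linalg.norm(u)
            for _ in range(n_iters):           # power iterations
                u = np.einsum('abc,b,c->a', T, u, u)
                u /= np.linalg.norm(u)
            val = np.einsum('abc,a,b,c->', T, u, u, u)
            if val > best_val:                 # go with the winner
                best_u, best_val = u, val
        eigvals.append(best_val)
        eigvecs.append(best_u)
        # Deflate: subtract the recovered rank-1 term and repeat.
        T -= best_val * np.einsum('a,b,c->abc', best_u, best_u, best_u)
    return np.array(eigvals), np.array(eigvecs).T
```

Each inner loop converges to one robust eigenvector of T; taking the best restart and deflating peels off the components one at a time.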

slide-51
SLIDE 51

Tensor Decomposition Algorithm

Analysis and Implementation of Tensor Power Method

Initialization: different from matrix eigen-analysis.
Robust implementation for noisy tensor decomposition.
Implicit manipulation of the tensor.
Stochastic updates for online tensor decomposition.

Summary of the Tensor Approach

Use the 3-star count tensor for community detection.
Dimensionality reduction through whitening (SVD operations).
Tensor power method (stochastic updates).
Post-processing (linear transformations and thresholding).

slide-52
SLIDE 52

Outline

1. Introduction
2. Summary of Theoretical Guarantees
3. Graph Moments: Tensor Form of Subgraph Counts
4. Algorithms for Tensor Decomposition
5. GPU Implementation and Experimental Results
6. Conclusion

slide-53
SLIDE 53

GPU-based implementation

Overview of GPUs

Hundreds of cores; parallelism for matrix/tensor operations.
Speed-up: order-of-magnitude gains.
Big-data challenge: GPU memory ≪ CPU memory ≪ hard disk. GPU memory is not expandable; CPU memory and hard disk are.

[Figure: storage hierarchy and partitioned matrix, with blocks moved between GPU memory, CPU memory, and hard disk]

slide-54
SLIDE 54

Summary of Results

Datasets

Facebook (friendship among users): n ∼ 20k
Yelp (users reviewing businesses): n ∼ 40k
DBLP (co-authorship among authors): n ∼ 120k

Error (E) and Recovery ratio (R)

Dataset           k̂    Method       Running Time   E        R
Facebook (k=360)  500   ours         468            0.0175   100%
Facebook (k=360)  500   variational  86,808         0.0308   100%
Yelp (k=159)      100   ours         190            0.0185   100%
Yelp (k=159)      100   variational  –              –        –
DBLP (k=250)      100   ours         7,664          0.0722   35.60%
DBLP (k=250)      100   variational  69,156         7.4527   98.80%
DBLP (k=250)      500   ours         10,157         0.1393   88.80%
DBLP (k=250)      500   variational  558,723        16.3874  99.20%

slide-55
SLIDE 55

Summary of Results - Yelp Dataset

Lowest error business categories & largest weight businesses

Rank   Category         Business                    Stars   Review Count
1      Latin American   Salvadoreno Restaurant      4.0     36
2      Gluten Free      P.F. Chang’s China Bistro   3.5     55
3      Hobby Shops      Make Meaning                4.5     14
4      Mass Media       KJZZ 91.5FM                 4.0     13
5      Yoga             Sutra Midtown               4.5     31

slide-56
SLIDE 56

Summary of Results - Yelp Dataset

Bridgeness: distance from the uniform vector [1/k̂, …, 1/k̂]⊤

Top-5 bridging nodes (businesses)

Business                 Categories
Four Peaks Brewing Co    Restaurants, Bars, American, Nightlife, Food, Pubs, Tempe
Pizzeria Bianco          Restaurants, Pizza, Phoenix
FEZ                      Restaurants, Bars, American, Nightlife, Mediterranean, Lounges, Phoenix
Matt’s Big Breakfast     Restaurants, Phoenix, Breakfast & Brunch
Cornish Pasty Company    Restaurants, Bars, Nightlife, Pubs, Tempe
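Bridgeness as defined above is a one-liner over the estimated membership matrix; the matrix below is hypothetical, chosen so one node is nearly uniform:

```python
import numpy as np

def bridgeness(Pi_hat):
    """Distance of each node's estimated membership vector from the
    uniform vector [1/k, ..., 1/k]; small distance = bridging node."""
    k = Pi_hat.shape[0]
    uniform = np.full(k, 1.0 / k)
    return np.linalg.norm(Pi_hat - uniform[:, None], axis=0)

# Hypothetical 3-community estimates for 4 nodes (columns).
Pi_hat = np.array([[1.0, 0.34, 0.0, 0.5],
                   [0.0, 0.33, 1.0, 0.5],
                   [0.0, 0.33, 0.0, 0.0]])
d = bridgeness(Pi_hat)
top = np.argsort(d)   # most bridging (smallest distance) first
```

Pure nodes sit at the simplex corners and score high; nodes spread evenly across communities score near zero and surface as bridges, like the businesses in the table.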

slide-57
SLIDE 57

Yelp Visualization

slide-58
SLIDE 58

Outline

1. Introduction
2. Summary of Theoretical Guarantees
3. Graph Moments: Tensor Form of Subgraph Counts
4. Algorithms for Tensor Decomposition
5. GPU Implementation and Experimental Results
6. Conclusion

slide-59
SLIDE 59

Conclusion

Mixed Membership Models

Can model overlapping communities.
Efficient to learn from low-order moments: edge counts and 3-star counts.

Tensor Spectral Method

Whitened 3-star count tensor is an orthogonal symmetric tensor.
Efficient decomposition through the power method.
Perturbation analysis: tight for the stochastic block model.
Zero-error support recovery guarantees.

[Logos: MIT, Microsoft, UC Irvine, Cornell]