Generative Models for Complex Network Structure


SLIDE 1

Generative Models for Complex Network Structure

Aaron Clauset (@aaronclauset)
Computer Science Dept. & BioFrontiers Institute, University of Colorado, Boulder
External Faculty, Santa Fe Institute

4 June 2013 NetSci 2013, Complex Networks meets Machine Learning

SLIDE 2
  • what is structure?
  • generative models for complex networks
    general form; types of models
  • opportunities and challenges
  • weighted stochastic block models
    a parable about thresholding; checking our models; learning from data (approximately)

SLIDE 3
what is structure?

  • what makes data different from noise
    what makes a network different from a random graph

SLIDE 4
what is structure?

  • what makes data different from noise
    what makes a network different from a random graph
  • what helps us compress the data
    describe the network succinctly; capture the most relevant patterns

SLIDE 5
what is structure?

  • what makes data different from noise
    what makes a network different from a random graph
  • what helps us compress the data
    describe the network succinctly; capture the most relevant patterns
  • what helps us generalize, from data we’ve seen to data we haven’t seen:
      i. from one part of the network to another
      ii. from one network to others of the same type
      iii. from small scales to large scales (coarse-grained structure)
      iv. from past to future (dynamics)

SLIDE 6
statistical inference

  • imagine the graph G is drawn from an ensemble or generative model:
    a probability distribution P(G | θ) with parameters θ
  • θ can be continuous or discrete; it represents the structure of the graph

SLIDE 7
statistical inference

  • imagine the graph G is drawn from an ensemble or generative model:
    a probability distribution P(G | θ) with parameters θ
  • θ can be continuous or discrete; it represents the structure of the graph
  • inference (MLE): given G, find the θ that maximizes P(G | θ)
  • inference (Bayes): compute or sample from the posterior distribution P(θ | G)
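To make the MLE recipe concrete, here is a minimal sketch (my addition, not from the slides) for the simplest one-parameter generative model, the Erdős–Rényi graph: with P(G | p) = ∏_{i<j} p^Aij (1 − p)^(1−Aij), the maximizing p is just the observed edge density.

```python
# Minimal sketch (my addition): MLE for the one-parameter Erdos-Renyi
# model, where P(G | p) = prod_{i<j} p^Aij (1-p)^(1-Aij).
import numpy as np

def erdos_renyi_mle(A):
    """Return the maximum-likelihood p for an undirected adjacency matrix A."""
    n = A.shape[0]
    m = np.triu(A, k=1).sum()      # observed edges (upper triangle)
    pairs = n * (n - 1) / 2        # possible edges
    return m / pairs

# toy usage: a 4-node path graph has 3 edges out of 6 possible pairs
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]])
print(erdos_renyi_mle(A))          # -> 0.5
```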

SLIDE 8

statistical inference

  • imagine the graph G is drawn from an ensemble or generative model:
    a probability distribution P(G | θ) with parameters θ
  • θ can be continuous or discrete; it represents the structure of the graph
  • inference (MLE): given G, find the θ that maximizes P(G | θ)
  • inference (Bayes): compute or sample from the posterior distribution P(θ | G)
  • if θ is partly known, constrain the inference and determine the rest
  • if G is partly known, infer θ and use it to generate the rest of G
  • if the model P(G | θ) is a good fit (application dependent), we can
    generate synthetic graphs structurally similar to G
  • if part of G has low probability under the model, flag it as a possible anomaly

SLIDE 9
  • what is structure?
  • generative models for complex networks
    general form; types of models
  • opportunities and challenges
  • weighted stochastic block models
    a parable about thresholding; checking our models; learning from data (approximately)

SLIDE 10

generative models for complex networks: general form

  • assumptions about “structure” go into P(Aij | θ)

    P(G | θ) = ∏_{i<j} P(Aij | θ)

  • consistency, lim_{n→∞} Pr(θ̂ ≠ θ) = 0, requires that edges be
    conditionally independent [Shalizi, Rinaldo 2011]
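As a sketch of this general form (my addition), the log-likelihood under any edge-independent model reduces to a sum over node pairs; here the per-pair probabilities P(Aij | θ) are assumed to be precomputed into a matrix.

```python
# Sketch (my addition): log P(G | theta) = sum_{i<j} log P(Aij | theta)
# for any edge-independent model, with the per-pair Bernoulli
# probabilities P(Aij | theta) precomputed into a matrix Pm.
import numpy as np

def edge_independent_loglik(A, Pm, eps=1e-12):
    i, j = np.triu_indices_from(A, k=1)
    a = A[i, j]
    p = np.clip(Pm[i, j], eps, 1 - eps)
    return np.sum(a * np.log(p) + (1 - a) * np.log(1 - p))
```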

SLIDE 11

assortative modules

[Figure: a hierarchical random graph (D, {pr}): a dendrogram D whose internal nodes r each carry a probability pr]

SLIDE 12

hierarchical random graph: model instance

Pr(i, j connected) = pr, where r = lowest common ancestor of i and j

SLIDE 13

L(D, {pr}) = ∏_r pr^Er (1 − pr)^(Lr·Rr − Er)

Lr = number of nodes in the left subtree of r
Rr = number of nodes in the right subtree of r
Er = number of edges with r as lowest common ancestor
SLIDE 14

classes of generative models

  • stochastic block models
    k types of vertices; P(Aij | zi, zj) depends only on the types of i, j
  • originally invented by sociologists [Holland, Laskey, Leinhardt 1983]
  • many, many flavors, including
    mixed-membership SBM [Airoldi, Blei, Fienberg, Xing 2008]
    hierarchical SBM [Clauset, Moore, Newman 2006, 2008]
    restricted hierarchical SBM [Leskovec, Chakrabarti, Kleinberg, Faloutsos 2005]
    infinite relational model [Kemp, Tenenbaum, Griffiths, Yamada, Ueda 2006]
    restricted SBM [Hofman, Wiggins 2008]
    degree-corrected SBM [Karrer, Newman 2011]
    SBM + topic models [Ball, Karrer, Newman 2011]
    SBM + vertex covariates [Mariadassou, Robin, Vacher 2010]
    SBM + edge weights [Aicher, Jacobs, Clauset 2013]
    + many others

SLIDE 15

classes of generative models

  • latent space models
    nodes live in a latent space; P(Aij | f(xi, xj)) depends only on vertex-vertex proximity
  • many, many flavors, including
    logistic function on vertex features [Hoff, Raftery, Handcock 2002]
    social status / ranking [Ball, Newman 2013]
    nonparametric metadata relations [Kim, Hughes, Sudderth 2012]
    multiple attribute graphs [Kim, Leskovec 2010]
    nonparametric latent feature model [Miller, Griffiths, Jordan 2009]
    infinite multiple memberships [Morup, Schmidt, Hansen 2011]
    ecological niche model [Williams, Anandanadesan, Purves 2010]
    hyperbolic latent spaces [Boguna, Papadopoulos, Krioukov 2010]

SLIDE 16
opportunities and challenges

  • richly annotated data
    edge weights, node attributes, time, etc. = new classes of generative models
  • generalize from an n = 1 observed graph to an ensemble
    useful for model checking, simulating other processes, etc.
  • many familiar techniques
    frequentist and Bayesian frameworks
    probabilistic statements about observations and models
    predicting missing links; leave-k-out cross-validation
    approximate inference techniques (EM, VB, BP, etc.)
    sampling techniques (MCMC, Gibbs, etc.)
  • learn from partial or noisy data
    extrapolation, interpolation, hidden data, missing data

SLIDE 17
opportunities and challenges

  • only two classes of models
    stochastic block models; latent space models
  • bootstrap / resampling for network data
    a critical missing piece; depends on what is independent in the data
  • model comparison
    naive AIC, BIC, marginalization, LRT can be wrong for networks
    what is the goal of modeling: realistic representation or accurate prediction?
  • model assessment / checking
    how do we know a model has done well? what do we check?
  • what is v-fold cross-validation for networks?
    omit n²/v edges? omit n/v nodes? what?

SLIDE 18
  • what is structure?
  • generative models for complex networks
    general form; types of models
  • opportunities and challenges
  • weighted stochastic block models
    a parable about thresholding; learning from data (approximately); checking our models

SLIDE 19

stochastic block models

functional groups, not just clumps:
  • social “communities” (large, small, dense, or empty)
  • social: leaders and followers
  • word adjacencies: adjectives and nouns
  • economics: suppliers and customers

SLIDE 20

classic stochastic block model

  • nodes have discrete attributes: each vertex i has a type ti ∈ {1, . . . , k}
  • a k × k matrix p of connection probabilities
  • if ti = r and tj = s, the edge (i → j) exists with probability prs
  • p is not necessarily symmetric, and we do not assume prr > prs
  • given some G, we want to simultaneously label the nodes (infer the type
    assignment t : V → {1, . . . , k}) and learn the latent matrix p

SLIDE 21

classic stochastic block model: model instance

[Figure: assortative modules; adjacency matrix with block structure, rows and columns labeled 1–6]

likelihood:

P(G | t, θ) = ∏_{(i,j)∈E} p_{ti,tj} · ∏_{(i,j)∉E} (1 − p_{ti,tj})
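A short sketch (my addition) of drawing one model instance like the one pictured, given a type vector t and block matrix p:

```python
# Sketch (my addition): sample one instance of the classic SBM.
import numpy as np

def sample_sbm(t, p, rng=None):
    """t: length-n vector of types in {0, ..., k-1}; p: k x k probability
    matrix. Returns a symmetric binary adjacency matrix."""
    rng = rng or np.random.default_rng(0)
    P = p[np.ix_(t, t)]                    # P[i, j] = p[t_i, t_j]
    U = rng.random(P.shape)
    A = np.triu((U < P).astype(int), k=1)  # sample each pair once
    return A + A.T

# assortative two-block example: dense within blocks, sparse between
t = np.repeat([0, 1], 50)
p = np.array([[0.30, 0.02],
              [0.02, 0.30]])
A = sample_sbm(t, p)
```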

SLIDE 22
thresholding edge weights

  • 4 groups, with µ1 < µ2 < µ3 < µ4
  • edge weights ∼ N(µi, σ²)
  • what threshold t should we choose? t = 1, 2, 3, 4

SLIDE 23
  • 4 groups, with µ1 < µ2 < µ3 < µ4
  • edge weights ∼ N(µi, σ²)
  • set threshold t ≤ 1, fit SBM

SLIDE 24
  • 4 groups, with µ1 < µ2 < µ3 < µ4
  • edge weights ∼ N(µi, σ²)
  • set threshold t = 2, fit SBM

SLIDE 25
  • 4 groups, with µ1 < µ2 < µ3 < µ4
  • edge weights ∼ N(µi, σ²)
  • set threshold t = 3, fit SBM

SLIDE 26

  • 4 groups, with µ1 < µ2 < µ3 < µ4
  • edge weights ∼ N(µi, σ²)
  • set threshold t ≥ 4, fit SBM
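A sketch of the parable (my addition; the slides do not show the exact generative construction, so the mixing rule below is an illustrative assumption): whatever single threshold t we pick, the binarized graph keeps only part of the four-group structure.

```python
# Sketch (my addition): the thresholding parable. Pairwise weights are
# Normal with a group-dependent mean; binarizing at any single threshold t
# keeps only part of the four-group structure. The rule "weight mean =
# smaller of the two group means" is an illustrative assumption, not the
# slides' exact construction.
import numpy as np

rng = np.random.default_rng(1)
groups = np.repeat([0, 1, 2, 3], 25)       # four latent groups
mu = np.array([1.0, 2.0, 3.0, 4.0])        # mu1 < mu2 < mu3 < mu4
sigma = 0.5

M = np.minimum.outer(mu[groups], mu[groups])
W = rng.normal(M, sigma)
W = np.triu(W, 1) + np.triu(W, 1).T        # symmetric weight matrix

for t in [1, 2, 3, 4]:
    A = (W >= t).astype(int)               # binarize, then fit an SBM to A
    print(t, int(A.sum()) // 2)            # edges surviving each threshold
```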

SLIDE 27

weighted stochastic block model

adding auxiliary information: each edge (i, j) has a weight w(i, j)

let w(i, j) ∼ f(x | θ) = h(x) exp( T(x) · η(θ) )

this covers all exponential-family distributions:
  bernoulli, binomial (classic SBM), multinomial
  poisson, beta
  exponential, power law, gamma
  normal, log-normal, multivariate normal
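For concreteness (my addition), here is the Normal case written in this exponential-family form, with sufficient statistics T(x) = (x, x²) and natural parameters η = (µ/σ², −1/(2σ²)); the log-partition term A(η) is kept explicit.

```python
# Sketch (my addition): the Normal distribution in the exponential-family
# form used by the WSBM. With T(x) = (x, x^2) and
# eta = (mu/sigma^2, -1/(2 sigma^2)),
# N(x; mu, sigma^2) = exp( T(x) . eta - A ), with log-partition A.
import numpy as np

def normal_logpdf_expfam(x, mu, s2):
    T = np.array([x, x * x])                      # sufficient statistics
    eta = np.array([mu / s2, -1.0 / (2 * s2)])    # natural parameters
    A = mu * mu / (2 * s2) + 0.5 * np.log(2 * np.pi * s2)  # log-partition
    return T @ eta - A

# agrees with the usual log-density, e.g. at x = 0.3, mu = 0, sigma^2 = 1
print(normal_logpdf_expfam(0.3, 0.0, 1.0))        # ~ -0.9639
```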

SLIDE 28

weighted stochastic block model

adding auxiliary information: each edge (i, j) has a weight w(i, j)

let w(i, j) ∼ f(x | θ) = h(x) exp( T(x) · η(θ) )

examples of weighted graphs:
  frequency of social interactions (calls, texts, proximity, etc.)
  cell-tower traffic volume
  other similarity measures
  time-varying attributes
  missing edges, active learning, etc.

SLIDE 29

weighted stochastic block model

  • given a weighted graph G and a choice of weight distribution f, learn
    the block assignment z and the block-structure parameters θ
  • likelihood function, where R : k × k → {1, . . . , R} maps each pair of
    blocks to a bundle of parameters:

    P(G | z, θ, f) = ∏_{i<j} f( Gij | θ_{R(zi, zj)} )

  • technical difficulties: degeneracies in the likelihood function
    (the variance can go to zero. oops)
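A minimal sketch (my addition) of evaluating this likelihood for Normal edge weights, using the simplest choice of the map R: one (µ, σ²) bundle per block pair.

```python
# Sketch (my addition): WSBM log-likelihood with Normal edge weights and
# the simplest choice of R: one (mu, sigma^2) bundle per block pair.
import numpy as np
from scipy.stats import norm

def wsbm_loglik(W, z, mu, s2):
    """W: n x n symmetric weight matrix; z: length-n block labels in
    {0..k-1}; mu, s2: k x k per-block-pair Normal parameters."""
    z = np.asarray(z)
    i, j = np.triu_indices_from(W, k=1)
    m, v = mu[z[i], z[j]], s2[z[i], z[j]]
    return norm.logpdf(W[i, j], loc=m, scale=np.sqrt(v)).sum()
```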

SLIDE 30

approximate learning

  • edge generative model P(G | z, θ, f)
  • estimate the model via variational Bayes
  • conjugate priors solve the degeneracy problem
  • algorithms for dense and sparse graphs

SLIDE 31

dense weighted SBM

  • approximate the posterior distribution:
    π∗(z, θ | G) ≈ q(z, θ) = ∏_i qi(zi) ∏_r q(θr)
  • estimate q by minimizing DKL(q ‖ π∗) = ln P(G | f) − G(q), where

    G(q) = Eq(L) + Eq( log [ π(z, θ) / q(z, θ) ] )

    with L the log-likelihood ln P(G | z, θ, f) and π the (conjugate) prior
    for the exponential-family distribution f
  • taking derivatives yields update equations for z, θ
  • iterating these equations yields local optima
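To show the alternate-update mechanics (my addition; this is not the WSBM algorithm itself), here is coordinate-ascent variational inference for a stripped-down relative: a Gaussian mixture over scalar weights with unit observation variance and a N(0, σ0²) prior on each component mean. The q(z) and q(θ) updates alternate exactly as in the dense WSBM.

```python
# Minimal CAVI sketch (my addition; not the WSBM algorithm itself):
# Gaussian mixture on scalar weights x_i with unit observation variance
# and prior mu_k ~ N(0, s0). Alternates q(c_i) and q(mu_k) updates,
# mirroring the z / theta alternation in the dense WSBM.
import numpy as np

def cavi_gmm(x, K, s0=10.0, iters=50, seed=0):
    rng = np.random.default_rng(seed)
    m = rng.normal(0.0, 1.0, K)        # variational means of each mu_k
    s2 = np.ones(K)                    # variational variances of each mu_k
    for _ in range(iters):
        # update q(c_i): responsibility of component k for weight x_i
        logphi = np.outer(x, m) - (s2 + m**2) / 2
        logphi -= logphi.max(axis=1, keepdims=True)
        phi = np.exp(logphi)
        phi /= phi.sum(axis=1, keepdims=True)
        # update q(mu_k): conjugate Normal update
        Nk = phi.sum(axis=0)
        s2 = 1.0 / (1.0 / s0 + Nk)
        m = s2 * (phi.T @ x)
    return m, phi

rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(0.0, 1.0, 100), rng.normal(5.0, 1.0, 100)])
m, phi = cavi_gmm(x, K=2)
print(np.sort(m))                      # component means near 0 and 5
```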

SLIDE 32

checking the model: synthetic network with known structure

  • given a synthetic graph with known structure
  • run the VB algorithm to convergence
  • compare against choose-a-threshold + SBM (and others)
  • compute the variation of information (a partition distance)

    VI(P1, P2) ∈ [0, ln N]
    VI(P1, P2) ∈ [0, ln k∗ + 1.5] = [0, 3.1] in this case
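For reference (my addition), variation of information computed directly from two label vectors, VI(P1, P2) = H(P1 | P2) + H(P2 | P1):

```python
# Sketch (my addition): variation of information between two partitions,
# VI(P1, P2) = H(P1 | P2) + H(P2 | P1), computed from two label vectors.
import numpy as np

def variation_of_information(a, b):
    a, b = np.asarray(a), np.asarray(b)
    vi = 0.0
    for x in np.unique(a):
        for y in np.unique(b):
            pxy = np.mean((a == x) & (b == y))   # joint frequency
            if pxy > 0:
                px, py = np.mean(a == x), np.mean(b == y)
                vi -= pxy * (np.log(pxy / px) + np.log(pxy / py))
    return vi

print(variation_of_information([0, 0, 1, 1], [0, 0, 1, 1]))  # 0.0
print(variation_of_information([0, 0, 1, 1], [0, 1, 0, 1]))  # 2 ln 2 ~ 1.386
```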

SLIDE 33

checking the model: synthetic network with known structure

  • a variation of Newman’s four-groups test
  • k∗ = 5 latent groups with sizes nr = [48, 16, 32, 48, 16]
  • Normal edge weights: f = N(µr, σr²)

VI(P1, P2) ∈ [0, ln k∗ + 1.5] = [0, 3.1] in this case

SLIDE 34

learn better with more data: increase the network size N

  • fix k = k∗, f = Normal
  • bigger network, more data
  • we keep the group proportions nr/N constant

SLIDE 35

learn better with more data: increase the network size N

[Plot: variation of information (VI) vs. number of nodes (100-200), for WBM, SBM+Thresh, Kmeans, Cluster(Max), Cluster(Avg)]

  • fix k = k∗, f = Normal; we keep the group proportions nr/N constant
  • bigger network, more data
  • the WSBM converges on the correct solution more quickly
  • thresholding + SBM is particularly bad

SLIDE 36

learning the number of groups: vary the number of groups found, k

  • fix f = Normal
  • too few / too many blocks?

SLIDE 37

learning the number of groups: vary the number of groups found, k

[Plot: variation of information (VI) vs. number of groups K (2-8), for WBM, SBM+Thresh, Kmeans, Cluster(Max), Cluster(Avg)]

  • fix f = Normal
  • too few / too many blocks?
  • the WSBM converges on the correct solution
  • the WSBM fails gracefully when k > k∗
    (in fact, Bayesian marginalization will correctly choose k = k∗ in this case)
  • the others do poorly

SLIDE 38

learning despite noise: increase the variance σr² in the edge weights

  • fix k = k∗, f = Normal
  • bigger variance, less signal

SLIDE 39

learning despite noise: increase the variance σr² in the edge weights

[Plot: variation of information (VI) vs. variance (50-200), for WBM, SBM+Thresh, Kmeans, Cluster(Max), Cluster(Avg)]

  • fix k = k∗, f = Normal
  • bigger variance, less signal
  • the WSBM fails more gracefully than the alternatives, even for very high variance
  • thresholding + SBM is particularly bad

SLIDE 40
comments

  • single-scale structural inference
    mixtures of assortative and disassortative groups
  • inference is cheap (VB); approximate inference works well
  • thresholding edge weights is bad, bad, bad
    one threshold (SBM) vs. many (WSBM)
  • generalizations also for sparse graphs, degree-corrections, etc.

SLIDE 41
generative models

  • auxiliary information
    node & edge attributes, temporal dynamics (beyond static binary graphs)
  • scalability
    fast algorithms for fitting models to big data (methods from physics, machine learning)
  • model selection
    which model is better? is this model bad? how many communities?
  • model checking
    have we learned correctly? check via generating synthetic networks
  • partial or noisy data
    extrapolation, interpolation, hidden data, missing data
  • anomaly detection
    low-probability events under the generative model

SLIDE 43

acknowledgments

Thanks: Cris Moore (Santa Fe), Mark Newman (Michigan), Cosma Shalizi (Carnegie Mellon),
Abigail Jacobs (Colorado), Christopher Aicher (Colorado). Funding from [funder logos].

some references

  • Aicher, Jacobs, Clauset, “Adapting the Stochastic Block Model to Edge-Weighted Networks.” ICML (2013)
  • Moore, Yan, Zhu, Rouquier, Lane, “Active learning for node classification in assortative and disassortative networks.” KDD (2011)
  • Park, Moore, Bader, “Dynamic Networks from Hierarchical Bayesian Graph Clustering.” PLoS ONE 5(1): e8118 (2010)
  • Clauset, Moore, Newman, “Hierarchical structure and the prediction of missing links in networks.” Nature 453, 98-101 (2008)
  • Clauset, Moore, Newman, “Structural Inference of Hierarchies in Networks.” ICML (2006)

SLIDE 44

fin
