Exponential Random Graph Models for (Social) Network Data Analysis - - PowerPoint PPT Presentation

Outline Exponential Random Graph Models for (Social) Network Data Analysis - • Statistical Models for (Social) Network Data • p 0 , p 1 , p 2 and and p* Overview, Challenges, Research Problems • Graphons and Networks • ERGM, bERGM, tERGM, gERGM G¨ oran Kauermann • Research questions 22. September 2016 Ribno, 22.09.16 2 ”Social” Network Analysis Definition of Network • Nodes: A network consists of a set of nodes (actors) Common statistical models trace from sociology: A = { 1 , . . . , N } • There is a set of actors A • Edges: A network can be described with the adjacency • The actors interact, that is they build links or destroy links matrix • The links (edges) are of interest Y ∈ R N ⇥ N , with ”Social” networks are classical friendship networks but also ( • business networks 1 if there is an edge/link from node i to node j Y ij = • ecological networks 0 otherwise • economic networks • Direction: For simplicity we first assume an undirected • etc. network, which implies Y ij = Y ji . Ribno, 22.09.16 3 Ribno, 22.09.16 4

”Classical” Network Models: The p 0 Model ”Classical” Network Models: The p 1 Model • p 1 Model (Holland and Leinhardt, 1981) ✓ ◆ • Erd¨ os-Renyi Model (1959) P ( Y ij = 1 ) � � = α i + α j + z t logit P ( Y ij = 1 ) = log ij β 1 − P ( Y ij = 1 ) P ( Y ij = 1 ) = π • The p 1 Model assumes conditional independence of the • Independence of edges (and nodes) edges • Parameter π gives the average density • Node (actor) specific effects α i , i = 1 , . . . , N • Very simplistic model, may serve as intercept or null model • Edge (pair) specific covariate effects β • The model is a standard logit model • Can be fitted with standard software Ribno, 22.09.16 5 Ribno, 22.09.16 6 ”Classical” Network Models: The p 2 Model ”Classical” Network Models: The p* Model • p 2 Model (Duijn et al., 2004 and Zijlstra et al., 2006) • p* model or better known as Exponential Random Graph � � = φ i + φ j + z t Model (ERGM) (Frank and Strauss, 1986) logit P ( Y ij = 1 | Φ ) (1) ij β , Φ = ( φ 1 , . . . , φ n ) t ∼ N ( 0 , σ 2 � � φ I n ) θ T s ( y ) P ( Y = y | θ ) = exp κ ( θ ) • The model reduces the number of parameters for large networks • κ ( θ ) is a normalizing constant • The p 2 model induces nodal heterogeneity • s ( y ) is a vector of so-called network statistics • The modal results in a standard generalized linear mixed model (GLMM) • The model is an exponential family • Can be fitted with standard software Ribno, 22.09.16 7 Ribno, 22.09.16 8

Features of ERGM A First Comparison Modelling Unobserved Network Usability in Estimation • Unlike in the p 1 and p 2 model the edge Y ij depends on the Flexibility Modal Dependence Large rest of the network Y \ Y ij Heterogeneity networks (N → ∞ ) • Edge between node i and j depends on the ”individual” network of the two nodes p0 - - - X X • Conditional model p1 only parametric - X X covariates logit [ P ( Y ij = 1 | Y \{ Y ij } , θ )] = θ T [ s ( y ij = 1 , Y \{ Y ij } ) − s ( y ij = 0 , Y \{ Y ij } )] p2 only random - X X | {z } covariates := ∆ s ij ( y ) p* network and - X - - where ∆ s ij ( y ) denotes the vector of change statistics covariates Ribno, 22.09.16 9 Ribno, 22.09.16 10 ERGM: Estimation Problem (1) Estimation Problem (2) • Pseudo likelihood (Ikeda and Strauss, 1990): One assume independence of the edges, i.e. • The normalization constant κ ( θ ) is numerically infeasible, logit P ( Y ij = 1 | Y \{ Y ij } ) = logit P ( Y ij = 1 ) = ∆ s ij ( y ) θ since X κ ( θ ) = exp ( s ( y ) θ ) y 2 Y where Y = set of possible networks with N nodes • |Y| = 2 N ( N + 1 ) / 2 , for N = 10 ⇒ 3 · 10 13 networks • Estimation requires numerical simulation tools Ribno, 22.09.16 11 Ribno, 22.09.16 12

Estimation Problem (2) Estimation Problem (2) • Pseudo likelihood (Ikeda and Strauss, 1990): • Pseudo likelihood (Ikeda and Strauss, 1990): One assume independence of the edges, i.e. One assume independence of the edges, i.e. logit P ( Y ij = 1 | Y \{ Y ij } ) = logit P ( Y ij = 1 ) = ∆ s ij ( y ) θ logit P ( Y ij = 1 | Y \{ Y ij } ) = logit P ( Y ij = 1 ) = ∆ s ij ( y ) θ ⇒ Estimation is simple, but estimates are biased and ⇒ Estimation is simple, but estimates are biased and inference is invalid inference is invalid • Simulation based (Hunter and Handcock, 2006): We approximate X exp ( θ s ( y ⇤ )) κ ( θ ) ≈ s ( y ∗ ) where y ⇤ are random draws from the ERGM Ribno, 22.09.16 13 Ribno, 22.09.16 14 Estimation Problem (2) Estimation of ERGM (3) • Pseudo likelihood (Ikeda and Strauss, 1990): One assume independence of the edges, i.e. • Fully Bayesian Estimation (Caimo and Friel, 2011): logit P ( Y ij = 1 | Y \{ Y ij } ) = logit P ( Y ij = 1 ) = ∆ s ij ( y ) θ We are interested in the posterior distribution ⇒ Estimation is simple, but estimates are biased and π ( θ | y ) ∝ π ( y | θ ) π ( θ ) , inference is invalid • Simulation based (Hunter and Handcock, 2006): with π ( θ ) as prior distribution on θ . We approximate X exp ( θ s ( y ⇤ )) κ ( θ ) ≈ s ( y ∗ ) where y ⇤ are random draws from the ERGM ⇒ Estimation is unstable and numerically demanding Ribno, 22.09.16 15 Ribno, 22.09.16 16

Estimation of ERGM (3) Estimation of ERGM (4) Solution: Bergm: Exchange algorithm - We sample from an augmented distribution • Fully Bayesian Estimation (Caimo and Friel, 2011): π ( θ 0 , y 0 , θ | y ) ∝ π ( y | θ ) π ( θ ) h ( θ 0 | θ ) π ( y 0 | θ 0 ) . We are interested in the posterior distribution π ( θ | y ) ∝ π ( y | θ ) π ( θ ) , 1 Gibbs update of ( θ 0 , y 0 ) : with π ( θ ) as prior distribution on θ . i. Draw θ 0 ∼ h ( ·| θ ) . ii. Draw y 0 ∼ π ( ·| θ 0 ) . Problem: This posterior is “doubly-intractable”, because 2 Propose the exchange move from θ to θ 0 with probability neither the normalisation constant of π ( y | θ ) nor of π ( θ | y ) is known. ✓ ◆ 1 , q ( y 0 | θ ) π ( θ 0 ) h ( θ | θ 0 ) q ( y | θ 0 ) q ( y | θ ) π ( θ ) h ( θ 0 | θ ) q ( y 0 | θ 0 ) × κ ( θ 0 ) κ ( θ ) α = min . κ ( θ ) κ ( θ 0 ) Ribno, 22.09.16 17 Ribno, 22.09.16 18 Problems in ERGM Extension: Heterogeneity of Actors • ERGMs are notoriously unstable, i.e. the reasonable We have extended the model to allow for heterogeneous actors parameter space (Thiemichen et al., 2016) Θ 0 = { θ : density(Network) is bounded away form 0 and 1 } ⇥ � ⇤ = θ T ∆ s ij ( y ) + φ i + φ j , logit P ( Y ij = 1 � Y \ { Y ij } , θ , φ ) is getting smaller for N → ∞ with φ i ∼ N ( µ φ , σ 2 φ ) , for i = 1 , ..., n . • As a consequence: simulated networks are either full or empty This leads to the entire model • Bayesian approaches circumvent this problem for the price � � θ T s ( y ) + φ T t ( y ) P ( Y = y | θ , φ ) = exp of heavy computation (i.e. low acceptance rate) , κ ( θ , φ ) Two reasons for instability: ! 1 The models assume that the nodes are homogeneous P y 1 j , P y 2 j , . . . , P where t ( y ) = y nj . 2 Network statistics are unstable , i.e. there is an avalanche j 6 = 1 j 6 = 2 j 6 = n effect. Ribno, 22.09.16 19 Ribno, 22.09.16 20

Fully Bayesian Inference Instable Network Statistics We are interested in • Network statistics ought to be s ( y ) = 0 ( N 2 ) π ( θ , φ | y ) ∝ π ( y | θ , φ ) π ( θ ) π ( φ ) . (Schweinberger, 2011) This can be estimated with the exchange algorithm from above • two-star, triangle, etc. are all unstable (Bergm). • Geometrically weighted statistics (Snijders et al., 2006), We are additionally interested in σ 2 φ , i.e. e.g. • geometrically weighted degree (gwd) π ( θ , φ , σ 2 φ | y ) ∝ π ( y | θ , φ ) π ( θ ) π ( φ ) π ( σ 2 φ ) . • geometrically weighted edgewise shared parameter (gwesp) with π ( σ 2 φ ) as inverse gamma. • Smooth statistics (Talk on Friday, Thiemichen) Problem: Estimation is numerically very demanding Ribno, 22.09.16 21 Ribno, 22.09.16 22 Stable Network Statistics What happens if Networks are Large? Snijders, Pattison, Robins and Handcock (2006) proposed new geometrically downweigted network statistics which behave stable . • Geometrically weighted degree (gwd) • ”Classical” models hardly scale to large networks with • Geometrically weighted edgewise shared partners (gwesp) 1000 or more actors N � 2 • Estimation becomes computationally too demanding X { 1 − q l ) ESP l ( y ) s ( y , q ) = • Homogeneity of actors is questionalble l = 1 • Clustering (grouping) of actors seems more useful where q is a decay parameter and ESP l ( y ) is the number of edges with l joint partners. • Note: The gwesp statistics is edge based and not node based . • Note: Interpretation gets clumsy . Ribno, 22.09.16 23 Ribno, 22.09.16 24

Exponential Random Graph Models for (Social) Network Data Analysis - - PowerPoint PPT Presentation

Outline Exponential Random Graph Models for (Social) Network Data Analysis - Statistical Models for (Social) Network Data p 0 , p 1 , p 2 and and p* Overview, Challenges, Research Problems Graphons and Networks ERGM, bERGM, tERGM,

Exponential Random Graph Models and Their Polytopes Johannes Rauh York University (the one in

Exponential Families Leila Wehbe March 19, 2013 Leila Wehbe Exponential Families Exponential

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows

Exponential Growth Exponential Growth Introduction Exponential Growth vs. Linear Growth

Applications of exponential functions Applications of exponential functions abound throughout the

Exponential Family Distributions CMSC 691 UMBC Exponential Family Form Exponential Family Form

Network/Graph Network/Graph Informally a graph is a set of nodes Theory Theory joined by a

Exponential smoothing and non-negative data Muhammad Akram Rob J Hyndman J Keith Ord Business

Exponential distribution STAT 587 (Engineering) Iowa State University September 17, 2020

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Lesson 5.5: Exponential and Logarithmic Models Five Most Common Models bx y ae ,

Graphical Models Graphical Models Exponential family & Variational Inference I Siamak

CSci 8980: Advanced Topics in Graphical Models Mixture Models, EM, Exponential Families

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Section5.2 Exponential Functions and Graphs Graphing Definition The exponential function with

Deep Visual Models with Interpretable Features and Modularized Structures Quanshi Zhang John

Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman

Optical Networking Activities in NetherLight TERENA Networking Conference 2003 Zagreb, May

Suhaimi Napis, PhD Chief Technology Officer (CTO) and Director, Chief Technology Officer (CTO)

VigilNet: An Integrated Sensor 70 Mica2 motes along 280 foot long perimeter Network System

Teaching testing to programmers. What sticks, and what slides off? A journey from Teflon to

Social and Technological Networks Rik Sarkar University of Edinburgh, 2017. Course specifics

CS 525M Mobile and Ubiquitous Computing Seminar Ted Goodwin Mitigating Routing Misbehavior

Exponential Random Graph Models for (Social) Network Data Analysis - - PowerPoint PPT Presentation

Outline Exponential Random Graph Models for (Social) Network Data Analysis - Statistical Models for (Social) Network Data p 0 , p 1 , p 2 and and p* Overview, Challenges, Research Problems Graphons and Networks ERGM, bERGM, tERGM,

Exponential Random Graph Models and Their Polytopes Johannes Rauh York University (the one in

Exponential Families Leila Wehbe March 19, 2013 Leila Wehbe Exponential Families Exponential

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows

Exponential Growth Exponential Growth Introduction Exponential Growth vs. Linear Growth

Applications of exponential functions Applications of exponential functions abound throughout the

Exponential Family Distributions CMSC 691 UMBC Exponential Family Form Exponential Family Form

Network/Graph Network/Graph Informally a graph is a set of nodes Theory Theory joined by a

Exponential smoothing and non-negative data Muhammad Akram Rob J Hyndman J Keith Ord Business

Exponential distribution STAT 587 (Engineering) Iowa State University September 17, 2020

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Lesson 5.5: Exponential and Logarithmic Models Five Most Common Models bx y ae ,

Graphical Models Graphical Models Exponential family &amp; Variational Inference I Siamak

CSci 8980: Advanced Topics in Graphical Models Mixture Models, EM, Exponential Families

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Section5.2 Exponential Functions and Graphs Graphing Definition The exponential function with

Deep Visual Models with Interpretable Features and Modularized Structures Quanshi Zhang John

Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman

Optical Networking Activities in NetherLight TERENA Networking Conference 2003 Zagreb, May

Suhaimi Napis, PhD Chief Technology Officer (CTO) and Director, Chief Technology Officer (CTO)

VigilNet: An Integrated Sensor 70 Mica2 motes along 280 foot long perimeter Network System

Teaching testing to programmers. What sticks, and what slides off? A journey from Teflon to

Social and Technological Networks Rik Sarkar University of Edinburgh, 2017. Course specifics

CS 525M Mobile and Ubiquitous Computing Seminar Ted Goodwin Mitigating Routing Misbehavior

Graphical Models Graphical Models Exponential family & Variational Inference I Siamak