exponential family random network models ernm
play

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA - PowerPoint PPT Presentation

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows Exponential-family Random Network Models (ERNM) The landscape Random graphs == Random connections, Fixed nodal attributes Gibbs/Markov


  1. Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows Exponential-family Random Network Models (ERNM)

  2. The landscape ◮ Random graphs == Random connections, Fixed nodal attributes ◮ Gibbs/Markov random fields == Fixed connections, Random nodal attributes ◮ ERNM == Random connections, Random nodal attributes Ian Fellows Exponential-family Random Network Models (ERNM)

  3. ERNM Formulation Let Y be an n by n matrix who’s entries Y i , j indicate whether subject i and j are connected, where n is the size of the population. Further let X be a n × q matrix of nodal variates. We define the network to be the random variable ( Y , X ). Then a joint exponential family model for the network may be written as: 1 c ( η ) e η h ( x , y ) , P ( X = x , Y = y | η ) = ( x , y ) ∈ N (1) Ian Fellows Exponential-family Random Network Models (ERNM)

  4. Uninteresting Example: Separable Models Suppose that h is composed such that the model can be expressed as 1 c ( η 1 , η 2 ) e η 1 h 1 ( x )+ η 2 h 2 ( y ) P ( X = x , Y = y | η 1 , η 2 ) = ( x , y ) ∈ N . (2) Then 1 c 1 ( η 1 ) e η 1 h 1 ( x ) P ( X = x | η 1 ) = 1 c 2 ( η 2 ) e η 2 h 2 ( y ) . P ( Y = y | η 2 ) = Ian Fellows Exponential-family Random Network Models (ERNM)

  5. Pathological Example: Ising as a Joint Model Joint model: � � j y i , j + η 2 � � P ( X = x , Y = y | η 1 , η 2 ) ∝ e η 1 j x i y i , j x j . i i With conditional distributions being: e η 1 y i , j + η 2 x i y i , j x j P ( Y i , j = y i , j | X = x , η 1 , η 2 ) ∝ e η 2 � � j x i y i , j x j P ( X = x | Y = y , η 2 ) ∝ i Ian Fellows Exponential-family Random Network Models (ERNM)

  6. Pathological Example: Ising as a Joint Model Degeneracy: Oh My!!! # of edges within x = 1 # of edges within x = -1 Frequency Frequency 20000 20000 0 0 0 50 100 150 200 0 50 100 150 200 Count Count # of edges between x = 1 and x = -1 # of nodes with x = 1 Frequency Frequency 6000 4000 0 0 0 20 40 60 80 100 120 0 5 10 15 20 Count Count Figure: 100,000 draws from an Ising Joint Model with η 1 = 0 and η 2 = 0 . 13. Mean values are marked in red. Ian Fellows Exponential-family Random Network Models (ERNM)

  7. Dispare Oh well, better give up. Ian Fellows Exponential-family Random Network Models (ERNM)

  8. Hope But wait, is there a better measure of homophily which doesn’t display degeneracy? Ian Fellows Exponential-family Random Network Models (ERNM)

  9. ... ... 6 months pass .... Ian Fellows Exponential-family Random Network Models (ERNM)

  10. Regularized Homophily � � � d i , l − E binom ( reg homophily ( k , l ) = d i , l ) , i : x i = k where d i , l is the number of edges connecting node i to nodes in group l , and E binom ( � d i , l ) is the expectation of the square root of a binomial variable, with probability equal to the proportion of nodes in group l and size equal to the out-degree of node i . Ian Fellows Exponential-family Random Network Models (ERNM)

  11. Logistic Regression in Network Data 1 c ( β, η, λ ) e zx β · + η h ( x , y )+ λ g ( z , y ) . P ( Z = z , X = x , Y = y | η, β, λ ) = (3) e x i β P ( z i = 1 | z − i , x i , Y = y , β, λ ) = (4) e λ [ g ( z − , y ) − g ( z + , y )] + e x i β . where z − i represents the set of z not including z i , z + represents z where z i = 1, z − is z where z i = 0,and x i represents the i th row of X . Ian Fellows Exponential-family Random Network Models (ERNM)

  12. A Super-population Model for an Add Health High School Std. Error Z p-value η Mean Degree -167.90 8.51 -19.73 < 0.001 Log Variance of Degree 22.18 10.01 2.22 0.027 Degree = 0 3.91 0.47 8.28 < 0.001 Degree = 1 2.20 0.38 5.86 < 0.001 Degree = 2 0.73 0.35 2.05 0.041 Grade = 9 0.88 0.78 1.13 0.258 Grade = 10 1.74 0.92 1.89 0.058 Grade = 11 2.53 0.79 3.20 0.001 Within Grade Homophily 3.97 0.47 8.44 < 0.001 +1 Grade Homophily 0.50 0.33 1.54 0.125 +2 Grade Homophily -1.07 0.27 -4.03 < 0.001 +3 Grade Homophily -0.59 0.40 -1.47 0.143 Table: ERNM Model with Standard Errors Based on the Fisher Information Ian Fellows Exponential-family Random Network Models (ERNM)

  13. A Super-population Model for an Add Health High School Figure: Model-Based Simulated High School Ian Fellows Exponential-family Random Network Models (ERNM)

  14. A Super-population Model for an Add Health High School In-Degree Out-Degree 25 25 20 20 15 15 10 10 5 5 0 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 # of Edges Between Grades Grade Counts 140 30 120 25 100 80 20 60 15 40 20 10 0 10-10 11-10 12-10 9-10 10-11 11-11 12-11 9-11 10-12 11-12 12-12 9-12 10-9 11-9 12-9 9-9 9 10 11 12 Figure: Model Diagnostics Ian Fellows Exponential-family Random Network Models (ERNM)

  15. Logistic Regression on Substance Use: Naive model Std. Error Z p-value β Intercept -1.70 0.44 -3.84 < 0.001 Male 1.18 0.57 2.09 0.037 Table: Simple Logistic Regression Model Ignoring Network Structure Ian Fellows Exponential-family Random Network Models (ERNM)

  16. Logistic Regression on Substance Use: ERNM model Bootstrap Asymptotic η Std. Error Std. Error Z p-value Mean Degree -164.18 7.86 8.07 -20.36 < 0.001 Log Variance of Degree 20.35 8.85 9.07 2.24 0.025 Degree 0 4.01 0.45 0.44 9.12 < 0.001 Degree 1 2.25 0.37 0.35 6.44 < 0.001 Degree 2 0.74 0.36 0.35 2.08 0.038 Grade Homophily 3.85 0.46 0.46 8.41 < 0.001 +1 Grade Homophily 0.45 0.33 0.33 1.39 0.166 +2 Grade Homophily -1.14 0.28 0.25 -4.50 < 0.001 +3 Grade Homophily -0.58 0.39 0.38 -1.52 0.129 Sex Homophily 0.98 0.28 0.27 3.56 < 0.001 Substance Homophily 0.88 0.25 0.26 3.44 < 0.001 Intercept -1.79 0.49 0.43 -4.11 < 0.001 Male 0.94 0.56 0.52 1.81 0.070 Table: ERNM Model Inference Ian Fellows Exponential-family Random Network Models (ERNM)

  17. Logistic Regression on Substance Use: homophily diagnostics # of edges within non-substance users # of edges within substance users 200 200 Frequency Frequency 100 100 0 0 100 150 200 250 300 350 0 20 40 60 80 Count Count # of edges between users and non-users # of non-substance users 200 200 Frequency Frequency 100 100 0 0 20 40 60 80 100 120 140 45 50 55 60 65 Count Count Figure: Substance Use Homophily Diagnostics. The values of the observed statistics are marked in red. Ian Fellows Exponential-family Random Network Models (ERNM)

  18. Conclusion ERNM is a framework for inference about networks, including both the graph and the nodal characteristics. Ian Fellows Exponential-family Random Network Models (ERNM)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend