Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Benedek András Rózemberczki
Central European University
Supervisor: Professor Rosario Nunzio Mantegna 2016.06.13.
Homophily Rearrangement Algorithms and Similarity Based Diffusion on - - PowerPoint PPT Presentation
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense Benedek Andrs Rzemberczki Central European University Supervisor: Professor Rosario Nunzio Mantegna 2016.06.13. Introduction Informal model
Benedek András Rózemberczki
Central European University
Supervisor: Professor Rosario Nunzio Mantegna 2016.06.13.
Introduction Informal model descriptions Simulations Summary References
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Initial state Pseudo-ordered state Diffusion process started Diffusion process ended Ordered state Homophily rearrangement Diffusion initialized Diffusion Randomization
Figure 1: The schematics of the modeling framework used in my thesis
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
◮ Birds of a feather (McPherson et al., 2001). ◮ Not just a social phenomenon. ◮ Micro-level similarity results in a macro-level outcome
(Jackson et al., 2016). It is present in numerous socio-economic and non socio-economic networks, such as:
◮ Corporate governance networks (Kogut et al., 2012). ◮ Friendships (Epstein, 1986). ◮ Labor market referrals (Fernandez & Fernandez-Mateo, 2006). ◮ Blogs and webpages (Bisgin et al., 2010). ◮ Interactomes (Navlakha & Kingsford, 2010).
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
◮ Homophilous network generation (van Eck & Jager, 2010;
Quayle et al., 2006).
◮ Homophily rearrangement is used for randomized experiments
(Centola, 2011).
◮ Peer-effects are measurable – it would be nice to have
large-scale homophily rearrangement algorithms.
◮ The Schelling (1969) and Fagiolo et al. (2007) models are
actually homophily rearrangement algorithms.
◮ Later diffusion can be initiated on the network (Yavas & Yusel,
2014).
◮ The diffusion in my model is probabilistic not relative threshold
based (Yavas & Yusel, 2014; Halberstam & Knight, 2014).
◮ The seeders have multiple infection trials – unlike in Kempe
et al. (2003).
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
(a) Perfect heterophily (b) Homophily (c) Strong homophily Figure 2: Different levels of universal homophily on a 4 × 4 square lattice without periodic boundary conditions
The network is defined by the adjacency matrix (W) and the generic feature vector or matrix (x and X respectively).
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
The algorithm design has to include:
◮ Homophily measurement function – H(x, W) or H(X, W). ◮ The type of the generic vertex feature matters – for example
continuous or categorical.
◮ The target homophily level(s) – φ or Φ. ◮ If there are multiple homophily targets they must have the
same sign. The following homophily rearrangement algorithms were implemented for univariate and multivariate systems:
◮ Heuristic ◮ Heuristic with bag of indices ◮ Greedy
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
The model setup consists:
The probability that i transmits the information to agent j is epxressed by the pairwise transmission probability equations. See Equation (1). Pi,j = P0 · Ψ(−γ · d(Xi, Xj))
(1) Specifically Equation (2) describes the pairwise transmission proba- bility equation that I use later during the simulations. Pi,j = P0 · exp (−γ · |xi − xj|) (2)
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
network has one single component.
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
(a) Simulation run 1.
2500 5000
0.05 Iterative steps Inbreeding homophily
(b) Simulation run 2.
1500 3000
0.05 Iterative steps Inbreeding homophily
F M Figure 3: The convergence of gender based homophily to a target in separate simulation runs – friendship network from Harris (2009)
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
(a) Simulation run 1.
250 500
Iterative steps Inbreeding homophily
(b) Simulation run 2.
125 250
Iterative steps Inbreeding homophily
9th 10th 11th 12th Figure 4: The convergence of grade based inbreeding homophily to a target vector in two separate simulation runs – based on the school friendship network
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
(a) Mean solution time
0.25 0.5 100 200 300 400 φ E(t)
(b) Median solution time
0.25 0.5 100 200 300 400 φ Me(t)
Figure 5: Expected average and median convergence times of the heuristic algorithm on a square lattice with periodic boundary conditions as a function of target homophily
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
(a) Mean solution time
0.25 0.5 5 10 φ E(t)
(b) Median solution time
0.25 0.5 5 10 φ Me(t)
Figure 6: Expected average and median convergence times of the greedy algorithm on a square lattice with periodic boundary conditions as a function of target homophily
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
0.2 0.25 0.3 0.35 0.4 0.45 0.5 400 800 1,200 1,600
P(X = 1) E(t) N = 121 N = 196 N = 256
Figure 7: Expected convergence time of the heuristic algorithm as a function of system size and balancedness of the feature distribution
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
−0.5 −0.4 −0.3 −0.2 −0.1 0.1 0.2 0.3 0.4 0.5 1,000 1,500 2,000 2,500 3,000
ρ E(t)
N = 100 N = 144 N = 196 N = 256 Figure 8: Expected solution time of the multivariate heuristic homophily rearrangement algorithms as a function of feature correlation and lattice size
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
0.1 0.2 0.3 0.4 2 4 Transmission probability Density φ = −0.5 φ = 0 φ = 0.5
Figure 9: The distribution of pairwise transmission probabilities
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
5 10 15 20 25 30 0.5 1 t E(Yt)/N φ = −0.8 φ = 0 φ = 0.8
Figure 10: The ratio of infected nodes as a function of time
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
0.2 0.4 0.6 0.8 20 30 40 50 60 γ E(t) φ = −0.8 φ = 0 φ = 0.8
Figure 11: Expected solution time as a function of sensitivity to dissimilarity
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
0.4 0.5 0.6 0.7 0.8 0.9 12 14 16 18 20 P0 E(t) φ = −0.8 φ = 0 φ = 0.8
Figure 12: Expected solution time as a function of baseline transmission probability
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
20 40 60 80 100 120 0.5 1 t E(Yt)/N Discriminator seeder Non-discriminator seeder
Figure 13: Non-homophilous state with discrimination – expected ratio of infected nodes
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
10 20 30 40 50 60 70 80 90 100 110 0.5 1 t E(Yt)/N Discriminator seeder Non-discriminator seeder
Figure 14: Homophilous state with discrimination – expected ratio of infected nodes
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
◮ Models
◮ Homophily rearrangement simulation results
convergence.
◮ Similarity based diffusion simulation results
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Homophily rearrangement algorithms
Similarity based diffusion
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Limitations:
◮ Theoretical:
◮ Simulations:
features.
Further research ideas:
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
works’. In IEEE (ed.), Web Intelligence and Intelligent Agent Technology (WI-IAT), IEEE/WIC/ACM International Conference, vol. 1.
The Small Worlds of Corporate Governance,
Scandinavia, pp. 183–202. MIT Press.
Process and Outcome in Peer Relationships, chap. Friendship Selection: Developmental and Environmental Influences., pp. 129–160. New York: Academic Press.
Behavior and Organization 64:316–336.
‘Networks, Race, and Hiring’. American Sociological Review 71(1):42–71.
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
‘Homophily, Group Size, and the Diffusion of Political Information in Social Networks’. National Bureau
‘The National Longitudinal Study of Adolescent to Adult Health (Add Health), Waves I and II, 1994–1996; Wave III, 2001–2002; Wave IV, 2007-2009’. Chapel Hill, NC: Carolina Population Center, University of North Carolina at Chapel Hill. DOI: 10.3886/ICPSR27021.v9. M. O. Jackson, et al. (2016). ‘The Economic Conse- quences
Social Network Structure’. Available at SSRN: http://ssrn.com/abstract=2467812.
a Social Network’. In Proceedings of the ninth ACM SIGKDD interna- tional conference on Knowledge discovery and data mining., pp. 137–
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Generating Rules and the Social Science of Governance, pp. 259–299. MIT Press.
Networks’. Annual Review of Sociology 27:415–444.
works for Associating Genes with Diseases’. Bioinformatics 26(8):1057– 1063.
works’. American Journal of Sociology 106(3):763–816.
Mixing’. The European Physical Journal B - Condensed Matter and Complex Systems 50(4):617–630.
59:488–493.
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense
Introduction Informal model descriptions Simulations Summary References
Modelling: Finding an Optimal Structure Based on Survey Data (or Finding the Network That Does Not Exist).’. In Proceedings of the 3rd World Congress on social simulation.
Over Social Networks’. Social Science Computer Review 33(3):354–372.
Homophily Rearrangement Algorithms and Similarity Based Diffusion on Networks Thesis defense