23rd International Symposium on
A Tale of two communities
Los Angeles, September 26, 2015 Graph-Drawing and Network Visualization Wouter Meulemans City University London Andr´ e Schulz FernUniversit¨ at in Hagen
A Tale of two communities Assessing Homophily in Node-Link Diagrams - - PowerPoint PPT Presentation
A Tale of two communities Assessing Homophily in Node-Link Diagrams 23rd International Symposium on Graph-Drawing and Network Visualization Los Angeles, September 26, 2015 Wouter Meulemans City University London Andr e Schulz
23rd International Symposium on
Los Angeles, September 26, 2015 Graph-Drawing and Network Visualization Wouter Meulemans City University London Andr´ e Schulz FernUniversit¨ at in Hagen
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
(example: same-gender links are more likely in a friendship-networks)
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
(example: same-gender links are more likely in a friendship-networks)
reason 1 for homophily: “Birds of feather flock together”
(social selection)
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
(example: same-gender links are more likely in a friendship-networks)
reason 1 for homophily: “Birds of feather flock together”
(social selection)
reason 2 for homophily: we form characteristics similar to
(social influence)
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
(example: same-gender links are more likely in a friendship-networks)
reason 1 for homophily: “Birds of feather flock together”
(social selection)
reason 2 for homophily: we form characteristics similar to
(social influence)
also effects opposite to homophily can occur (heterophily)
A Tale of two Communities Meulemans and Schulz, GD15
homophily is a concept in social network analysis more likely that two individuals with a common charactristic
(example: same-gender links are more likely in a friendship-networks)
reason 1 for homophily: “Birds of feather flock together”
(social selection)
reason 2 for homophily: we form characteristics similar to
(social influence)
also effects opposite to homophily can occur (heterophily) homophily is not restricted to social networks
(Question: groups = clusters?)
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
Group A Group B fraction p of the individuals fraction q of the individuals
A Tale of two Communities Meulemans and Schulz, GD15
Group A Group B fraction p of the individuals fraction q of the individuals
p2 q2 2pq
A Tale of two Communities Meulemans and Schulz, GD15
Group A Group B fraction p of the individuals fraction q of the individuals
p2 q2 2pq Homophily Test
A Tale of two Communities Meulemans and Schulz, GD15
we want to measure the degree of homophily in a network
A Tale of two Communities Meulemans and Schulz, GD15
we want to measure the degree of homophily in a network
A Tale of two Communities Meulemans and Schulz, GD15
we want to measure the degree of homophily in a network
A Tale of two Communities Meulemans and Schulz, GD15
we want to measure the degree of homophily in a network
1/2 1
degree of homophily
fraction of cross-group links
2pq 1
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
Which node-link diagram layout is best suitable for
A Tale of two Communities Meulemans and Schulz, GD15
Is there a tendency for overestimation or underestimation? Which node-link diagram layout is best suitable for
A Tale of two Communities Meulemans and Schulz, GD15
Are there general design principles to improve homophily
Is there a tendency for overestimation or underestimation? Which node-link diagram layout is best suitable for
A Tale of two Communities Meulemans and Schulz, GD15
Are there general design principles to improve homophily
Is there a tendency for overestimation or underestimation? Which node-link diagram layout is best suitable for
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite
layout based on the Fruchtermann–Reingold Algorithm implementation taken from the d3.js library
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite
modification of the force-directed layout additional forces pull blue vertices to the left and red vertices to
the right
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite
groups are placed on opposing vertical lines barycentric layout + sifting to remove crossings different shapes for cross-group/within-group edges
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite group separation
A Tale of two Communities Meulemans and Schulz, GD15
force-directed polarized bipartite group separation homophily detection easier?
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
x < y means y is better than x
A Tale of two Communities Meulemans and Schulz, GD15
x < y means y is better than x
A Tale of two Communities Meulemans and Schulz, GD15
x < y means y is better than x
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
http://tutte.fernuni-hagen.de/~schulza
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
Users have an internal “scale” for the degree of homophily
A Tale of two Communities Meulemans and Schulz, GD15
Users have an internal “scale” for the degree of homophily
True Degree of Homophily Estimation good estimation bad estimation
but overestimated good estimation (different personal scale)
A Tale of two Communities Meulemans and Schulz, GD15
0% 50% 100% 0% 50% 100%
All Sizes
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
A Tale of two Communities Meulemans and Schulz, GD15
0% 50% 100% 0% 50% 100%
All Sizes
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
polarized < bipartite, force-directed
A Tale of two Communities Meulemans and Schulz, GD15
0% 50% 100% 0% 50% 100%
All Sizes
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
polarized < bipartite, force-directed
A Tale of two Communities Meulemans and Schulz, GD15
0% 50% 100% 0% 50% 100%
All Sizes
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
polarized < bipartite, force-directed
no difference between force-direced and bipartite
A Tale of two Communities Meulemans and Schulz, GD15
0% 50% 100% 0% 50% 100%
All Sizes
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
polarized < bipartite, force-directed
no difference between force-direced and bipartite
Deviation Response time
15% 0% All Size 1 Size 2 Size 3 20s 0s All Size 1 Size 2 Size 3 −15%
B P FD
A Tale of two Communities Meulemans and Schulz, GD15
Size 3
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite All Sizes
A Tale of two Communities Meulemans and Schulz, GD15
Size 3
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
individual results, decreasing parts = defects (red)
All Sizes
A Tale of two Communities Meulemans and Schulz, GD15
Size 3
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
individual results, decreasing parts = defects (red) many inconsistencies (not clear from the aggregated data)
All Sizes
A Tale of two Communities Meulemans and Schulz, GD15
Size 3
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
individual results, decreasing parts = defects (red) many inconsistencies (not clear from the aggregated data) evidence that bipartite > force-directed
All Sizes
A Tale of two Communities Meulemans and Schulz, GD15
Size 3
0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100% 0% 50% 100%
Force-Directed Polarized Bipartite
individual results, decreasing parts = defects (red) many inconsistencies (not clear from the aggregated data) evidence that bipartite > force-directed tendency to overestimate in the polarized layout
All Sizes
A Tale of two Communities Meulemans and Schulz, GD15
100% 0% All Size 1 Size 2 Size 3 20s 0s All Size 1 Size 2 Size 3
B P FD
A Tale of two Communities Meulemans and Schulz, GD15
100% 0% All Size 1 Size 2 Size 3 20s 0s All Size 1 Size 2 Size 3
B P FD
(again supported by statistical evidence)
A Tale of two Communities Meulemans and Schulz, GD15
100% 0% All Size 1 Size 2 Size 3 20s 0s All Size 1 Size 2 Size 3
B P FD
(again supported by statistical evidence)
there was one problematic instance in size group 3 for the
A Tale of two Communities Meulemans and Schulz, GD15
100% 0% All Size 1 Size 2 Size 3 20s 0s All Size 1 Size 2 Size 3
B P FD
(again supported by statistical evidence)
there was one problematic instance in size group 3 for the
size was not a big influence
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
we can only partially accept H1:
we can only partially accept H1: internal consistency data supports
A Tale of two Communities Meulemans and Schulz, GD15
we can only partially accept H1:
we can only partially accept H1: internal consistency data supports
A Tale of two Communities Meulemans and Schulz, GD15
we can only partially accept H1:
we can only partially accept H1: internal consistency data supports
we can accept H2 and H3 based on our statistical analysis
A Tale of two Communities Meulemans and Schulz, GD15
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
the bipartite layout helped to assess homophily at the
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
the bipartite layout helped to assess homophily at the
node seperation, was not the primary reason for this, since
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
the bipartite layout helped to assess homophily at the
node seperation, was not the primary reason for this, since
the unbalanced case is harder
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
the bipartite layout helped to assess homophily at the
node seperation, was not the primary reason for this, since
the unbalanced case is harder there is a tendency to overestimate in the polarized layout
A Tale of two Communities Meulemans and Schulz, GD15
homophily is difficult to assess, but when averaging over a
the bipartite layout helped to assess homophily at the
node seperation, was not the primary reason for this, since
the unbalanced case is harder there is a tendency to overestimate in the polarized layout