a tale of two communities
play

A Tale of two communities Assessing Homophily in Node-Link Diagrams - PowerPoint PPT Presentation

A Tale of two communities Assessing Homophily in Node-Link Diagrams 23rd International Symposium on Graph-Drawing and Network Visualization Los Angeles, September 26, 2015 Wouter Meulemans City University London Andr e Schulz


  1. A Tale of two communities Assessing Homophily in Node-Link Diagrams 23rd International Symposium on Graph-Drawing and Network Visualization Los Angeles, September 26, 2015 Wouter Meulemans City University London Andr´ e Schulz FernUniversit¨ at in Hagen

  2. Homophily A Tale of two Communities Meulemans and Schulz, GD15

  3. Homophily � homophily is a concept in social network analysis A Tale of two Communities Meulemans and Schulz, GD15

  4. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily A Tale of two Communities Meulemans and Schulz, GD15

  5. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) A Tale of two Communities Meulemans and Schulz, GD15

  6. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) A Tale of two Communities Meulemans and Schulz, GD15

  7. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) A Tale of two Communities Meulemans and Schulz, GD15

  8. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) � also effects opposite to homophily can occur (heterophily) A Tale of two Communities Meulemans and Schulz, GD15

  9. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) � also effects opposite to homophily can occur (heterophily) � homophily is not restricted to social networks (Question: groups = clusters?) A Tale of two Communities Meulemans and Schulz, GD15

  10. Formalizing Homophily A Tale of two Communities Meulemans and Schulz, GD15

  11. Formalizing Homophily Group B Group A fraction p of the individuals fraction q of the individuals A Tale of two Communities Meulemans and Schulz, GD15

  12. Formalizing Homophily q 2 2 pq Group B Group A p 2 fraction p of the individuals fraction q of the individuals A random link is - with probability p 2 : A ↔ A - with probability q 2 : B ↔ B - with probability 2 pq : A ↔ B A Tale of two Communities Meulemans and Schulz, GD15

  13. Formalizing Homophily q 2 2 pq Group B Group A p 2 fraction p of the individuals fraction q of the individuals A random link is - with probability p 2 : A ↔ A - with probability q 2 : B ↔ B - with probability 2 pq : A ↔ B Homophily Test If the fraction of the between-group links is significantly smaller than 2 pq we have homophily. A Tale of two Communities Meulemans and Schulz, GD15

  14. Degree of Homophily � we want to measure the degree of homophily in a network A Tale of two Communities Meulemans and Schulz, GD15

  15. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases � 2 pq cross-group links (balanced) 2 � no cross-group links (homophily) 3 A Tale of two Communities Meulemans and Schulz, GD15

  16. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases 0 � 2 pq cross-group links (balanced) 2 1/2 � no cross-group links (homophily) 3 1 Degree of Homophily A Tale of two Communities Meulemans and Schulz, GD15

  17. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases 0 � 2 pq cross-group links (balanced) 2 1/2 � no cross-group links (homophily) 3 1 Degree of Homophily degree of homophily 1 1 / 2 interpolate all other values linearly 0 fraction of cross-group links 1 2 pq 0 A Tale of two Communities Meulemans and Schulz, GD15

  18. Research Questions A Tale of two Communities Meulemans and Schulz, GD15

  19. Research Questions Can an observer assess homophily in a node-link diagram? A Tale of two Communities Meulemans and Schulz, GD15

  20. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? A Tale of two Communities Meulemans and Schulz, GD15

  21. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? A Tale of two Communities Meulemans and Schulz, GD15

  22. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? � Are there general design principles to improve homophily detection? A Tale of two Communities Meulemans and Schulz, GD15

  23. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? � Are there general design principles to improve homophily detection? � We only consider node-link diagrams and the ! “two-groups-scenario” A Tale of two Communities Meulemans and Schulz, GD15

  24. Layouts force-directed polarized bipartite A Tale of two Communities Meulemans and Schulz, GD15

  25. Layouts force-directed polarized bipartite � layout based on the Fruchtermann–Reingold Algorithm � implementation taken from the d3.js library A Tale of two Communities Meulemans and Schulz, GD15

  26. Layouts force-directed polarized bipartite � modification of the force-directed layout � additional forces pull blue vertices to the left and red vertices to the right A Tale of two Communities Meulemans and Schulz, GD15

  27. Layouts force-directed polarized bipartite � groups are placed on opposing vertical lines � barycentric layout + sifting to remove crossings � different shapes for cross-group/within-group edges A Tale of two Communities Meulemans and Schulz, GD15

  28. Layouts force-directed polarized bipartite group separation A Tale of two Communities Meulemans and Schulz, GD15

  29. Layouts force-directed polarized bipartite group separation homophily detection easier? other tasks more difficult? A Tale of two Communities Meulemans and Schulz, GD15

  30. Hypothesis A Tale of two Communities Meulemans and Schulz, GD15

  31. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x A Tale of two Communities Meulemans and Schulz, GD15

  32. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x H2 For Homophily assesment we have unbalanced < balanced A Tale of two Communities Meulemans and Schulz, GD15

  33. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x H2 For Homophily assesment we have unbalanced < balanced H3 For shortest path queries we have force-directed > polarized > bipartite A Tale of two Communities Meulemans and Schulz, GD15

  34. User Study Design A Tale of two Communities Meulemans and Schulz, GD15

  35. User Study Design mixed design (too much trials otherwise) A Tale of two Communities Meulemans and Schulz, GD15

  36. User Study Design mixed design (too much trials otherwise) between subject - 3 graph sizes (20-28 nodes, 20-40 edges) A Tale of two Communities Meulemans and Schulz, GD15

  37. User Study Design mixed design (too much trials otherwise) between subject - 3 graph sizes (20-28 nodes, 20-40 edges) within subjects - 3 layouts - balanced (50:50) and unbalanced (25:75) - 5 degree of homophily levels (only 3 for unbalanced) - 2 tasks (homophily / length of shortest path) A Tale of two Communities Meulemans and Schulz, GD15

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend