please feel free to include these slides in your own
play

Please feel free to include these slides in your own material, or - PowerPoint PPT Presentation

S OCIAL M EDIA M INING Influence and Homophily Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate these slides into your


  1. S OCIAL M EDIA M INING Influence and Homophily

  2. Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate these slides into your presentations, please include the following note: R. Zafarani, M. A. Abbasi, and H. Liu, Social Media Mining: An Introduction , Cambridge University Press, 2014. Free book and slides at http://socialmediamining.info/ or include a link to the website: http://socialmediamining.info/ Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 2 2

  3. Social Forces • Social Forces connect individuals in different ways • When individuals get connected, we observe distinguishable patterns in their connectivity networks. – Assortativity , also known as social similarity • In networks with assortativity: – Similar nodes are connected to one another more often than dissimilar nodes. • Social networks are assortative – A high similarity between friends is observed – We observe similar behavior, interests, activities, or shared attributes such as language among friends Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 3 3

  4. Why are connected people similar? Influence • The process by which a user (i.e., influential) affects another user • The influenced user becomes more similar to the influential figure. • Example: If most of our friends/family members switch to a cellphone company, we might switch [i.e., become influenced] too. Homophily • Similar individuals becoming friends due to their high similarity Example: Two musicians are more likely to • become friends. Confounding • The environment’s effect on making individuals similar Example: Two individuals living in the same city are more likely to become • friends than two random individuals Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 4 4

  5. Influence, Homophily, and Confounding Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 5 5

  6. Source of Assortativity in Networks Both influence and Homophily generate similarity in social networks Influence Makes connected nodes similar to each other Homophily Selects similar nodes and links them together Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 6 6

  7. Assortativity Example The city's draft tobacco control strategy says more than 60% of under-16s in Plymouth smoke regularly Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 7 7

  8. Why? • Smoker friends influence their Influence non-smoker friends • Smokers become friends Homophily – Can this explain smoking behavior? • There are lots of places that Confounding people can smoke Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 8 8

  9. Our goal? 1. How can we measure assortativity ? 2. How can we measure influence or homophily ? 3. How can we model influence or homophily ? 4. How can we distinguish between the two ? Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 9 9

  10. Measuring Assortativity Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 10 10

  11. Assortativity: An Example • The friendship network in a US high school in 1994 • Colors represent races, : whites – Grey : blacks – Light Grey : hispanics – Black : others • High assortativity between individuals of the same race Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 11 11

  12. Measuring Assortativity for Nominal Attributes • Assume nominal attributes are assigned to nodes – Example: race • Edges between nodes of the same type can be used to measure assortativity of the network – Same type = nodes that share an attribute value – Node attributes could be nationality, race, sex, etc. 𝑢(𝑤 𝑗 ) denotes type of vertex 𝑤 𝑗 Kronecker delta function Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 12 12

  13. Assortativity Significance • Assortativity significance – The difference between measured assortativity and expected assortativity – The higher this difference, the more significant the assortativity observed Example – In a school, 50% of the population is white and the other 50% is hispanic . – We expect 50% of the connections to be between members of different races. – If all connections are between members of different races, then we have a significant finding Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 13 13

  14. Assortativity Significance Assortativity Expected assortativity (according to configuration model) This is modularity Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 14 14

  15. Normalized Modularity [Finding the Maximum] The maximum happens when all vertices of the same type are connected to one another Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 15 15

  16. Modularity: Matrix Form • Let ∆∈ ℝ 𝑜×𝑙 denote the indicator matrix and let 𝑙 denote the number of types • The Kronecker delta function can be reformulated using the indicator matrix • Therefore, Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 16 16

  17. Normalized Modularity: Matrix Form Let Modularity matrix be 𝒆 ∈ ℝ 𝒐 ×𝟐 is the degree vector Modularity can be reformulated as Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 17 17

  18. Modularity Example The number of edges between nodes of the same color is less than the expected number of edges between them Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 18 18

  19. Measuring Assortativity for Ordinal Attributes • A common measure for analyzing the relationship between ordinal values is covariance • It describes how two variables change together • In our case, we have a network – We are interested in how values assigned to nodes that are connected (via edges) are correlated Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 19 19

  20. Covariance Variables • The value assigned to node 𝑤 𝑗 is 𝑦 𝑗 • We construct two variables 𝑌 𝑀 and 𝑌 𝑆 • For any edge (𝑤 𝑗 , 𝑤 𝑘 ) , we assume that 𝑦 𝑗 is observed from variable 𝑌 𝑀 and 𝑦 𝑘 is observed from variable 𝑌 𝑆 • 𝑌 𝑀 represents the ordinal values associated with the left-node (the first node) of the edges • 𝑌 𝑆 represents the values associated with the right-node (the second node) of the edges • We need to compute the covariance between variables 𝑌 𝑀 and 𝑌 𝑆 Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 20 20

  21. Covariance Variables: Example List of edges: (A, C) (C, A) (C, B) (B, C) 𝑌 𝑀 : (18, 21, 21, 20) 𝑌 𝑆 : (21, 18, 20, 21) Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 21 21

  22. Covariance For two given column variables 𝑌 𝑀 and 𝑌 𝑆 the covariance is 𝐹(𝑌 𝑀 ) is the mean of the variable and 𝐹(𝑌 𝑀 𝑌 𝑆 ) is the mean of the multiplication 𝑌 𝑀 and 𝑌 𝑆 Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 22 22

  23. Covariance Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 23 23

  24. Normalizing Covariance Pearson correlation 𝜍(𝑌, 𝑍) is the normalized version of covariance In our case: Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 24 24

  25. Correlation Example Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 25 25

  26. Influence • Measuring Influence • Modeling Influence Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 26 26

  27. Influence: Definition Influence The act or power of producing an effect without apparent exertion of force or direct exercise of command Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 27 27

  28. Measuring Influence Social Media Mining Social Media Mining http://socialmediamining.info/ Influence and Homophily Measures and Metrics 28 28

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend