a distance measure for the analysis of polar opinion
play

A Distance Measure for the Analysis of Polar Opinion Dynamics in - PowerPoint PPT Presentation

A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks Victor Amelkin University of California, Santa Barbara Department of Computer Science victor@cs.ucsb.edu 1 / 26 Contributors 1,2 Petko Bogdanov Ambuj K. Singh


  1. A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks Victor Amelkin University of California, Santa Barbara Department of Computer Science victor@cs.ucsb.edu 1 / 26

  2. Contributors 1,2 Petko Bogdanov Ambuj K. Singh Victor Amelkin UC Santa Barbara University at Albany, SUNY UC Santa Barbara pbogdanov@albany.edu ambuj@cs.ucsb.edu victor@cs.ucsb.edu 1 Victor Amelkin, Petko Bogdanov, and Ambuj K Singh. “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks”. In: Proc. IEEE ICDE . 2017, pp. 159–162. 2 Victor Amelkin, Petko Bogdanov, and Ambuj K. Singh. “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks (Extended Paper)”. In: arXiv:1510.05058 [cs.SI] (2015). 2 / 26

  3. Table of Contents ◮ Polar Opinion Dynamics in Social Networks ◮ Distance Measure-Based Analysis ◮ Social Network Distance (SND) ◮ Using SND in Applications ◮ Conclusions and Future Work 3 / 26

  4. Introduction • Directed social network, | V | = n users, | E | = m social ties • Network is sparse: m = O ( n ) • User opinions are polar (e.g., the Republicans vs. the Democrats ) • Opinion ∈ { +1 , 0 , − 1 } • Network structure does not change much, but user opinions evolve Figure: Zachary’s Karate Club network 3 3 Wayne Zachary. “An information flow model for conflict and fission in small groups”. In: Journal of Anthropological Research (1977), pp. 452–473. 4 / 26

  5. Polar Opinion Dynamics • Network state G t ∈ { +1 , 0 , − 1 } n : opinions of all users at time t • A time series of network states + + + + + + - - - - + + + + 5 / 26

  6. Polar Opinion Dynamics • Network state G t ∈ { +1 , 0 , − 1 } n : opinions of all users at time t • A time series of network states + + + + + + - - - - + + + + Questions: • How does the network evolve? • What will be the future opinions of individual users? • When does the network “behave” unexpectedly? 5 / 26

  7. Application I: Anomalous Event Detection • d t = d ( G t , G t +1 ) : “the amount of change” in the network’s state • d t measures the unexpectedness of transition G t → G t +1 • What is expected is determined by a given opinion dynamics model → → expected unexpected • Anomaly: an unexpected value in the series d 0 , d 1 , d 2 , . . . , d t • A distance-based approach to anomaly detection 4 4 Stephen Ranshous et al. “Anomaly detection in dynamic networks: a survey”. In: Wiley Interdisciplinary Reviews: Computational Statistics 7.3 (2015), pp. 223–247. 6 / 26

  8. Application II: User Opinion Prediction • d t = d ( G t , G t +1 ) – “the amount of change” in the network’s state • d t measures the unexpectedness of transition G t → G t +1 • What is expected is determined by a given opinion dynamics model • Having observed the network state’s evolution G 0 , G 1 , . . . , G now we would like to predict G future • Distance-based approach to future network state prediction: extrapolate reconstruct d 0 , d 1 , . . . , d now − − − − − − − → d future − − − − − − − → G future 7 / 26

  9. Distance Measure-Based Analysis • Central question: How to measure the distance d ( G 1 , G 2 ) between network states ? • The distance measure d ( • , • ) should ⊲ capture how polar opinions evolve in the network; ⊲ be efficiently computable; ⊲ be a metric. 8 / 26

  10. Existing Vector Space Distance Measures • Coordinate-wise comparison i | x i − y i | p ) 1 /p ⊲ ℓ p d ( x, y ) = ( � ⊲ Hamming d ( x, y ) = � i δ x i ,y i | x i − y i | ⊲ Canberra d ( x, y ) = � i | x i | + | y i | d ( x, y ) = | x ∩ y | ⊲ Jaccard | x ∪ y | d ( x, y ) = cos � � x,y � ⊲ Cosine ( x, y ) = � x � � y � ⊲ Kullback-Leibler d ( x, y ) = ( d KL ( x || y )) = � i ln [ x i /y i ] x i • Using the difference vector ( x − y ) T A ( x − y ) � ⊲ Quadratic Form d ( x, y ) = ( x − y ) T cov − 1 ( x, y )( x − y ) ⊲ Mahalanobis � d ( x, y ) = 9 / 26

  11. Existing Network-Specific Distance Measures • Isomorphism-based distance measures 5 • Graph Edit Distance 6 • Iterative distance measures 7 • Graph Kernels 8 • Feature-based distance measures 9 5 Horst Bunke and Kim Shearer. “A graph distance metric based on the maximal common subgraph”. In: Pattern recognition letters 19.3 (1998), pp. 255–259. 6 Xinbo Gao et al. “A survey of Graph Edit Distance”. In: Pattern Analysis and Applications 13.1 (2010), pp. 113–129. 7 Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm. “Similarity flooding: A versatile graph matching algorithm and its application to schema matching”. In: IEEE Data Engineering . 2002, pp. 117–128. 8 S Vichy N Vishwanathan et al. “Graph kernels”. In: The Journal of Machine Learning Research 11 (2010), pp. 1201–1242. 9 Owen Macindoe and Whitman Richards. “Graph comparison using fine structure analysis”. In: IEEE SocialCom . IEEE. 2010, pp. 193–200. 10 / 26

  12. Existing Network-Specific Distance Measures • Isomorphism-based distance measures ⊲ compare networks structurally ⊲ disregard node states • Graph Edit Distance ⊲ edit distance over node/edge insertion, deletion, substitution operations ⊲ mostly, structure-driven; expensive to compute • Iterative distance measures ⊲ nodes are similar if their neighborhoods are similar ⊲ hard to account for node state differences in a socially meaningful way; expensive to compute • Graph Kernels ⊲ compare substructures—walks, paths, cycles, trees—of non-aligned (small) networks ⊲ opinion dynamics-unaware; expensive to compute • Feature-based distance measures ⊲ compare degree, clust. coeff., betweenness, diameter, frequent substructures, spectra ⊲ only look at summaries; does not capture opinion dynamics 10 / 26

  13. Social Network Distance (SND): Overview 5 5 Amelkin, Bogdanov, and Singh, “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks (Extended Paper)”. 11 / 26

  14. Social Network Distance (SND): Overview • Exact computation of P : computationally hard • Assume user activations are independent • Assume activations happens via the most likely scenarios 5 0 0 . 3 0 0 . 11 / 26

  15. Social Network Distance (SND): Overview • Exact computation of P : computationally hard • Assume user activations are independent ∼ “opinion flows” in the network do not interfere with each other • Assume activations happens via the most likely scenarios ∼ opinions spread via shortest paths 5 0 0 . 3 0 0 . • ⇒ SND is defined as a transportation problem 11 / 26

  16. Social Network Distance (SND): Overview • Exact computation of P : computationally hard • Assume user activations are independent ∼ “opinion flows” in the network do not interfere with each other • Assume activations happens via the most likely scenarios ∼ opinions spread via shortest paths 5 0 0 . 3 0 0 . • ⇒ SND is defined as a transportation problem that can be exactly solved in O ( n ) /*under some reasonable assumptions*/ 11 / 26

  17. Earth Mover’s Distance ( EMD ) as a Basic Primitive • Earth Mover’s Distance ( EMD ): “edit distance for histograms” • Edit: transportation of a mass unit from i ’th to j ’th bin at cost D ij ... (network state) (histogram) (ground distance) (histogram) (network state) � n n � � D ij � � EMD( P, Q, D ) = f ij f ij , i,j =1 i,j =1 � n � � n � n � � n f ij D ij → min , f ij = min P i , Q i i,j =1 i,j =1 i =1 i =1 � n � n f ij ≥ 0 , f ij ≤ P i , f ij ≤ Q j , (1 ≤ i, j ≤ n ) j =1 i =1 12 / 26

  18. Earth Mover’s Distance ( EMD ) as a Basic Primitive • Earth Mover’s Distance ( EMD ): “edit distance for histograms” • Edit: transportation of a mass unit from i ’th to j ’th bin at cost D ij ... (network state) (histogram) (ground distance) (histogram) (network state) � n n � � D ij � � EMD( P, Q, D ) = f ij f ij , i,j =1 i,j =1 � n � � n � n � � n f ij D ij → min , f ij = min P i , Q i i,j =1 i,j =1 i =1 i =1 � n � n f ij ≥ 0 , f ij ≤ P i , f ij ≤ Q j , (1 ≤ i, j ≤ n ) j =1 i =1 12 / 26

  19. Social Network Distance (SND) – Definition Ground distance computed in: Opinion type + – “transported”: 13 / 26

  20. Social Network Distance (SND) – Definition Ground distance computed in: Opinion type + – “transported”: 13 / 26

  21. Social Network Distance (SND) – Definition Ground distance computed in: Opinion type + – “transported”: 13 / 26

  22. Social Network Distance (SND) – Definition Ground distance computed in: Opinion type + – “transported”: 13 / 26

  23. EMD ⋆ – Redesign of Earth Mover’s Distance for SND • EMD has 2 problems: (i) cannot adequately compare histograms with different total mass (ii) cannot express a single user infecting multiple other users • EMD ⋆ —generalization of EMD —resolves both issues. 1 0 1 1 1/2 1/2 1 0 1 0 1/2 1/2 “bank bins” “bank bins” 14 / 26

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend