slide-1
SLIDE 1

A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks

Victor Amelkin

University of California, Santa Barbara Department of Computer Science victor@cs.ucsb.edu

1 / 26

slide-2
SLIDE 2

Contributors1,2

Victor Amelkin, UC Santa Barbara, victor@cs.ucsb.edu
Petko Bogdanov, University at Albany, SUNY, pbogdanov@albany.edu
Ambuj K. Singh, UC Santa Barbara, ambuj@cs.ucsb.edu

1Victor Amelkin, Petko Bogdanov, and Ambuj K. Singh. “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks”. In: Proc. IEEE ICDE. 2017, pp. 159–162.

2Victor Amelkin, Petko Bogdanov, and Ambuj K. Singh. “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks (Extended Paper)”. In: arXiv:1510.05058 [cs.SI] (2015).

2 / 26

slide-3
SLIDE 3

Table of Contents

◮ Polar Opinion Dynamics in Social Networks
◮ Distance Measure-Based Analysis
◮ Social Network Distance (SND)
◮ Using SND in Applications
◮ Conclusions and Future Work

3 / 26

slide-4
SLIDE 4

Introduction

  • Directed social network, |V| = n users, |E| = m social ties
  • Network is sparse: m = O(n)
  • User opinions are polar (e.g., the Republicans vs. the Democrats)
  • Opinion ∈ {+1, 0, −1}
  • Network structure does not change much, but user opinions evolve
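The model above can be sketched concretely in a few lines; the variable names (`adjacency`, `opinions`) are illustrative, not from the paper:

```python
import numpy as np

n = 5  # |V| = n users
# Directed social ties as an adjacency list; m = O(n) edges (sparse network)
adjacency = {0: [1, 2], 1: [2], 2: [3], 3: [4], 4: [0]}

# One network state: each user's opinion is +1, 0 (neutral/undecided), or -1
opinions = np.array([+1, +1, 0, -1, 0])

assert set(np.unique(opinions)) <= {-1, 0, +1}
m = sum(len(v) for v in adjacency.values())
print(m)  # → 6 edges
```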

Figure: Zachary’s Karate Club network3

3Wayne Zachary. “An information flow model for conflict and fission in small groups”.

In: Journal of Anthropological Research (1977), pp. 452–473.

4 / 26

slide-6
SLIDE 6

Polar Opinion Dynamics

  • Network state Gt ∈ {+1, 0, −1}n: opinions of all users at time t
  • A time series of network states

[Figure: a time series of network states; each node is labeled with its user’s polar opinion (+/−)]

Questions:

  • How does the network evolve?
  • What will be the future opinions of individual users?
  • When does the network “behave” unexpectedly?

5 / 26

slide-7
SLIDE 7

Application I: Anomalous Event Detection

  • dt = d(Gt, Gt+1): “the amount of change” in the network’s state
  • dt measures the unexpectedness of transition Gt → Gt+1
  • What is expected is determined by a given opinion dynamics model

[Figure: examples of expected and unexpected transitions between network states]

  • Anomaly: an unexpected value in the series d0, d1, d2, . . . , dt
  • A distance-based approach to anomaly detection4
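A minimal sketch of such a distance-based detector: flag the values of the series d0, d1, . . . , dt that deviate strongly from the series mean. The z-score rule and the threshold are illustrative assumptions; the paper does not prescribe this particular detection rule.

```python
import statistics

def flag_anomalies(d_series, z_thresh=3.0):
    """Flag indices t where d_t deviates from the series mean by more than
    z_thresh standard deviations (a generic detector, not the paper's)."""
    mu = statistics.fmean(d_series)
    sigma = statistics.pstdev(d_series)
    if sigma == 0:
        return []
    return [t for t, d in enumerate(d_series) if abs(d - mu) > z_thresh * sigma]

d = [0.10, 0.12, 0.11, 0.13, 0.95, 0.12, 0.10]  # toy distance series
print(flag_anomalies(d, z_thresh=2.0))  # → [4]
```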

4Stephen Ranshous et al. “Anomaly detection in dynamic networks: a survey”.

In: Wiley Interdisciplinary Reviews: Computational Statistics 7.3 (2015), pp. 223–247.

6 / 26

slide-8
SLIDE 8

Application II: User Opinion Prediction

  • dt = d(Gt, Gt+1) – “the amount of change” in the network’s state
  • dt measures the unexpectedness of transition Gt → Gt+1
  • What is expected is determined by a given opinion dynamics model
  • Having observed the network state’s evolution G0, G1, . . . , Gnow

we would like to predict Gfuture

  • Distance-based approach to future network state prediction:

d0, d1, . . . , dnow −−extrapolate−→ dfuture −−reconstruct−→ Gfuture

7 / 26

slide-9
SLIDE 9

Distance Measure-Based Analysis

  • Central question:

How to measure the distance d(G1, G2) between network states?

  • The distance measure d(•, •) should

⊲ capture how polar opinions evolve in the network;
⊲ be efficiently computable;
⊲ be a metric.

8 / 26

slide-10
SLIDE 10

Existing Vector Space Distance Measures

  • Coordinate-wise comparison

⊲ ℓp: d(x, y) = (Σi |xi − yi|^p)^(1/p)
⊲ Hamming: d(x, y) = Σi 1[xi ≠ yi]
⊲ Canberra: d(x, y) = Σi |xi − yi| / (|xi| + |yi|)
⊲ Jaccard: d(x, y) = |x ∩ y| / |x ∪ y|
⊲ Cosine: d(x, y) = cos(x, y) = ⟨x, y⟩ / (‖x‖ ‖y‖)
⊲ Kullback–Leibler: d(x, y) = dKL(x ‖ y) = Σi xi ln (xi / yi)

  • Using the difference vector

⊲ Quadratic Form: d(x, y) = √((x − y)ᵀ A (x − y))
⊲ Mahalanobis: d(x, y) = √((x − y)ᵀ cov⁻¹(x, y) (x − y))
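For reference, the coordinate-wise measures above can be written in a few lines of plain Python over two equal-length real vectors (textbook definitions, not code from the paper):

```python
import math

def lp(x, y, p=2):
    # l_p distance: (sum_i |x_i - y_i|^p)^(1/p)
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1 / p)

def hamming(x, y):
    # Number of coordinates where the vectors differ
    return sum(a != b for a, b in zip(x, y))

def canberra(x, y):
    # sum_i |x_i - y_i| / (|x_i| + |y_i|), skipping 0/0 terms
    return sum(abs(a - b) / (abs(a) + abs(b))
               for a, b in zip(x, y) if abs(a) + abs(b) > 0)

def cosine_sim(x, y):
    # <x, y> / (||x|| * ||y||)
    dot = sum(a * b for a, b in zip(x, y))
    return dot / (math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y)))

x, y = [1, 0, -1], [1, 1, 0]
print(lp(x, y, 2))    # → 1.4142135623730951
print(hamming(x, y))  # → 2
```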

9 / 26

slide-11
SLIDE 11

Existing Network-Specific Distance Measures

  • Isomorphism-based distance measures5
  • Graph Edit Distance6
  • Iterative distance measures7
  • Graph Kernels8
  • Feature-based distance measures9

5Horst Bunke and Kim Shearer. “A graph distance metric based on the maximal common

subgraph”. In: Pattern recognition letters 19.3 (1998), pp. 255–259.

6Xinbo Gao et al. “A survey of Graph Edit Distance”.

In: Pattern Analysis and Applications 13.1 (2010), pp. 113–129.

7Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm. “Similarity flooding: A versatile

graph matching algorithm and its application to schema matching”. In: IEEE Data Engineering. 2002, pp. 117–128.

8S. V. N. Vishwanathan et al. “Graph kernels”.

In: The Journal of Machine Learning Research 11 (2010), pp. 1201–1242.

9Owen Macindoe and Whitman Richards. “Graph comparison using fine structure analysis”.

In: IEEE SocialCom. IEEE. 2010, pp. 193–200.

10 / 26

slide-12
SLIDE 12

Existing Network-Specific Distance Measures

  • Isomorphism-based distance measures

⊲ compare networks structurally
⊲ disregard node states

  • Graph Edit Distance

⊲ edit distance over node/edge insertion, deletion, and substitution operations
⊲ mostly structure-driven; expensive to compute

  • Iterative distance measures

⊲ nodes are similar if their neighborhoods are similar
⊲ hard to account for node state differences in a socially meaningful way; expensive to compute

  • Graph Kernels

⊲ compare substructures—walks, paths, cycles, trees—of non-aligned (small) networks
⊲ opinion dynamics-unaware; expensive to compute

  • Feature-based distance measures

⊲ compare degrees, clustering coefficients, betweenness, diameter, frequent substructures, spectra
⊲ only look at summaries; do not capture opinion dynamics

10 / 26

slide-13
SLIDE 13

Social Network Distance (SND): Overview5

5Amelkin, Bogdanov, and Singh, “A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks (Extended Paper)”.

11 / 26

slide-16
SLIDE 16

Social Network Distance (SND): Overview

  • Exact computation of P: computationally hard
  • Assume user activations are independent

∼ “opinion flows” in the network do not interfere with each other

  • Assume activations happen via the most likely scenarios

∼ opinions spread via shortest paths

  • ⇒ SND is defined as a transportation problem that can be exactly

solved in O(n) (under some reasonable assumptions)

11 / 26

slide-17
SLIDE 17

Earth Mover’s Distance (EMD) as a Basic Primitive

  • Earth Mover’s Distance (EMD): “edit distance for histograms”
  • Edit: transportation of a mass unit from i’th to j’th bin at cost Dij

[Figure: two histograms P and Q over n bins compared under ground distance D; in SND, the histograms are network states]

EMD(P, Q, D) = (Σi,j Dij fij) / (Σi,j fij), where the flows fij solve

Σi,j fij Dij → min,
subject to fij ≥ 0, Σj fij ≤ Pi, Σi fij ≤ Qj (1 ≤ i, j ≤ n),
Σi,j fij = min(Σi Pi, Σi Qi).
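The transportation program above can be solved directly with an off-the-shelf LP solver. Below is a small reference implementation, assuming `scipy` is available; it encodes the stated LP verbatim and is a sketch, not the paper's optimized solver.

```python
import numpy as np
from scipy.optimize import linprog

def emd(P, Q, D):
    """Earth Mover's Distance via the linear transportation program above.
    P, Q: nonnegative mass vectors; D: n x n ground-distance matrix."""
    P, Q, D = np.asarray(P, float), np.asarray(Q, float), np.asarray(D, float)
    n = len(P)
    c = D.ravel()                                # objective: sum_ij D_ij f_ij
    A_ub = np.zeros((2 * n, n * n))
    for i in range(n):
        A_ub[i, i * n:(i + 1) * n] = 1           # sum_j f_ij <= P_i
        A_ub[n + i, i::n] = 1                    # sum_i f_ij <= Q_j  (column i)
    b_ub = np.concatenate([P, Q])
    A_eq = np.ones((1, n * n))                   # total flow = min(sum P, sum Q)
    b_eq = [min(P.sum(), Q.sum())]
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=(0, None))
    return res.fun / res.x.sum()                 # normalize by the total flow

# Moving one unit of mass one bin over at ground distance 1:
D = [[0, 1], [1, 0]]
print(emd([1, 0], [0, 1], D))  # ≈ 1.0
```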

12 / 26

slide-19
SLIDE 19

Social Network Distance (SND) – Definition

[Figure: the four EMD⋆ instances, varying the network state in which the ground distance is computed (P or Q) and the opinion type “transported” (+ or −)]

13 / 26

slide-24
SLIDE 24

EMD⋆– Redesign of Earth Mover’s Distance for SND

  • EMD has 2 problems:

(i) cannot adequately compare histograms with different total mass
(ii) cannot express a single user infecting multiple other users

  • EMD⋆—generalization of EMD—resolves both issues.

1/2

1 1

1/2

1 1 1

“bank bins”

(i) the mass mismatch penalty is related to the network’s structure
(ii) users can spend “extra mass” to infect more neighbors

14 / 26

slide-25
SLIDE 25

EMD⋆– Redesign of Earth Mover’s Distance for SND

  • EMD has 2 problems:

(i) cannot adequately compare histograms with different total mass
(ii) cannot express a single user infecting multiple other users

  • EMD⋆—generalization of EMD—resolves both issues.

EMD⋆(P, Q) = EMD(P̂, Q̂, D̂) / max(Σi Pi, Σj Qj), where

P̂ = [P, P̄(1), . . . , P̄(n)], Q̂ = [Q, Q̄(1), . . . , Q̄(n)],

D̂ = [ D              D + 1n ⊗ γᵀ
      D + 1nᵀ ⊗ γ    D + 1n ⊗ γᵀ + 1nᵀ ⊗ γ − 2 diag(γ) ],

P̄(i) = Pi (Σj Qj − Σk Pk) / Σk Pk if Σj Qj > Σk Pk, and 0 otherwise.

P̄(i): capacity of the i’th bank bin; γ = [γ1, . . . , γn]ᵀ: ground distances to/from the bank bins.
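Under one reading of the definition above (the mass surplus is assumed to be split across bank bins proportionally to the bin values), the augmented histograms and ground distance can be assembled as follows. After augmentation, both histograms carry the same total mass:

```python
import numpy as np

def augment(P, Q, D, gamma):
    """Build the augmented histograms and ground distance used by EMD*.
    The proportional split of the surplus across bank bins is our reading
    of the definition, not verified against the paper's exact formula."""
    P, Q, gamma = (np.asarray(v, float) for v in (P, Q, gamma))
    n = len(P)
    surplus_P = max(Q.sum() - P.sum(), 0.0)  # bank bins of P absorb extra Q mass
    surplus_Q = max(P.sum() - Q.sum(), 0.0)
    P_hat = np.concatenate([P, P / P.sum() * surplus_P])
    Q_hat = np.concatenate([Q, Q / Q.sum() * surplus_Q])
    one = np.ones((n, 1))
    g = gamma.reshape(1, n)
    # Block ground distance: moving to/from a bank bin adds gamma
    D_hat = np.block([
        [D,               D + one @ g],
        [D + g.T @ one.T, D + one @ g + g.T @ one.T - 2 * np.diag(gamma)],
    ])
    return P_hat, Q_hat, D_hat

P, Q = [1.0, 0.0, 0.0], [1.0, 1.0, 1.0]
D = np.ones((3, 3)) - np.eye(3)
gamma = np.array([0.5, 0.5, 0.5])
P_hat, Q_hat, D_hat = augment(P, Q, D, gamma)
# After augmentation both histograms carry the same total mass:
assert np.isclose(P_hat.sum(), Q_hat.sum())
```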

14 / 26

slide-28
SLIDE 28

EMD⋆ vs. EMD

[Figure: three states G1, G2, G3 of a network with two clusters, L and R, connected by bridge edges]

  • Mass distribution in cluster L is identical in all G1, G2, G3
  • G1 → G2: mass propagates from L to R through “the bridges”
  • G1 → G3: same amount of mass randomly distributed over R
  • Expected: d(G1, G2) < d(G1, G3)
  • Of all existing versions of EMD, only EMD⋆ captures this intuition

15 / 26

slide-32
SLIDE 32

Computation of SND – Overview

  • Computing SND ∼ computing 4 instances of EMD⋆

SND(P, Q) = EMD⋆(P +, Q+, D(P, +)) + EMD⋆(P −, Q−, D(P, −))+ EMD⋆(Q+, P +, D(Q, +)) + EMD⋆(Q−, P −, D(Q, −)).

  • Computation of a single instance of EMD⋆ involves:

⊲ computing the ground distance D
⊲ solving the underlying transportation problem

  • Direct computation:

⊲ ground distance D – all-to-all shortest paths: O(n^2 log n)
⊲ transportation problem – Karmarkar’s algorithm / transportation simplex: worse than O(n^3)

  • Solution: exploit the problem’s structure; use specialized algorithms

16 / 26

slide-39
SLIDE 39

Efficient Computation of SND / EMD⋆

  • Challenge: efficiently compute EMD⋆(P, Q, D) over a sparse network

⊲ D (few-to-most shortest paths): O(n^2 log n) → O(n∆ n √(log U))
⊲ EMD⋆ (BP min-cost flow): O(n^3 log n) → O(n∆ m + n∆^3 log (n∆ n U))

  • Assumption 1: the number n∆ of users who changed their opinions is ≪ n
  • Assumption 2: Dij ∈ Z+, Dij < U = const

⊲ discard inactive bins
⊲ discard bins having similar values (⇐ D is a semimetric)
⊲ use Dijkstra with radix + Fibonacci heaps5 (⇐ Assumption 2)
⊲ use a modified Goldberg-Tarjan algorithm6 (⇐ Assumptions 1, 2)

  • Achieved T = O(n∆ (n √(log U) + n∆^2 log (n∆ n U)))

  • If n∆ < const < ∞, then T = O(n)
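The "few-to-most shortest paths" step runs single-source shortest paths from each of the n∆ changed users. A plain binary-heap Dijkstra stands in here for the radix/Fibonacci-heap variant cited on the slide; the graph is illustrative:

```python
import heapq

def dijkstra(adj, source):
    """Standard binary-heap Dijkstra, O((n + m) log n).
    adj: {node: [(neighbor, weight), ...]} with nonnegative weights."""
    dist = {source: 0}
    pq = [(0, source)]
    while pq:
        d, u = heapq.heappop(pq)
        if d > dist.get(u, float("inf")):
            continue  # stale queue entry
        for v, w in adj.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(pq, (nd, v))
    return dist

adj = {0: [(1, 2), (2, 5)], 1: [(2, 1)], 2: [(3, 2)], 3: []}
# Ground distances only from the n_delta "changed" users (here just node 0):
changed = [0]
D = {s: dijkstra(adj, s) for s in changed}
print(D[0])  # → {0: 0, 1: 2, 2: 3, 3: 5}
```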

5Ravindra K. Ahuja et al. “Faster algorithms for the shortest path problem”. In: Journal of the ACM 37.2 (1990), pp. 213–223.
6Ravindra K. Ahuja et al. “Improved algorithms for bipartite network flow”. In: SIAM Journal on Computing 23.5 (1994), pp. 906–933.

17 / 26

slide-40
SLIDE 40

Experimental Setting

  • Synthetic data

⊲ scale-free network, n = |V| = 10k . . . 200k, γ = −2.9 · · · − 2.1
⊲ about equal numbers of initial adopters for + and −
⊲ subsequent network states generated via the Independent Cascade model

  • Twitter data

⊲ crawled tweets mentioning “Obama” from May’08 to Aug’11
⊲ network of 10k politically active users
⊲ each user has 130 neighbors, on average
⊲ user opinions are tracked over the entire period, quarter-wise

  • Competing distance measures

⊲ hamming(P, Q)
⊲ quad-form(P, Q, L) = √((P − Q) L (P − Q)ᵀ)
⊲ walk-dist(P, Q): summarizes how different the network’s users are from their respective neighbors
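The synthetic states are generated with the Independent Cascade model; below is a generic IC sketch (the adjacency list, seed set, and activation probability p are illustrative, not the paper's exact generator):

```python
import random

def independent_cascade(adj, seeds, p, rng):
    """One Independent Cascade run: each newly activated user gets a single
    chance to activate each inactive out-neighbor with probability p."""
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        nxt = []
        for u in frontier:
            for v in adj.get(u, []):
                if v not in active and rng.random() < p:
                    active.add(v)
                    nxt.append(v)
        frontier = nxt
    return active

adj = {0: [1, 2], 1: [3], 2: [3], 3: [4], 4: []}
rng = random.Random(7)
activated = independent_cascade(adj, seeds={0}, p=0.5, rng=rng)
assert {0} <= activated <= {0, 1, 2, 3, 4}
```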

18 / 26

slide-41
SLIDE 41

Application I: Anomaly Detection (Synthetic Data)

Figure: Anomaly detection on synthetic data (scaled distances between adjacent network states for SND, hamming, walk-dist, and quad-form). |V| = 20k, scale-free exponent γ = −2.3. A series of 40 network states is generated using Pnbr = 0.12 and Pext = 0.01 for normal and Pnbr = 0.08 and Pext = 0.05 for anomalous network states’ generation, respectively. The three simulated anomalies are displayed as solid vertical lines.

  • SND is good at detecting the anomalies not easily revealed just by

looking at the rate of new user activation

19 / 26

slide-42
SLIDE 42

Application I: Anomaly Detection (Synthetic Data)

Figure: ROC curves (False Positive Rate vs. True Positive Rate) comparing the quality of anomaly detection by SND, hamming, walk-dist, and quad-form in a series of 300 network states over a synthetic network with |V| = 30k and scale-free exponent γ = −2.3. The network states are generated using Pnbr = 0.08 and Pext = 0.001 for normal and Pnbr = 0.07 and Pext = 0.011 for anomalous instances.

19 / 26

slide-43
SLIDE 43

Application I: Anomaly Detection (Twitter Data)

Figure: Anomaly detection on Twitter data (May’08–Aug’11): scaled distances between adjacent quarterly network states (topic “Obama”) for SND, hamming, walk-dist, and quad-form. The distance series are accompanied by the curve showing Google Trends’ scaled interest in topic “Obama”; annotated events include the election, inauguration, Economic Stimulus Bill, Nobel Prize, “Obama Care”, the tax plan, and bin Laden. Network states detected to be anomalous by at least one distance measure are displayed as solid vertical lines.

  • SND typically spikes and disagrees with other distance measures

during “polarizing events” (e.g., “Obama Care”)

  • Events accompanied by drastic change in the rate of new user

activation can be detected by any distance measure

20 / 26

slide-44
SLIDE 44

Application II: User Opinion Prediction

  • Given a series G0, G1, . . . , Gt−1, Gt of network states
  • Goal: predict opinions of select users in Gt based on G0...t−1
  • Approach

⊲ Compute distances (SND) between adjacent network states: d(G0, G1), . . . , d(Gt−2, Gt−1)
⊲ Extrapolate (LS) the distance series to get the expected dexp = dexp(Gt−1, Gt)
⊲ Assign opinions in Gt to minimize |d(Gt−1, Gt) − dexp|

  • Baselines

⊲ the same approach with other distance measures
⊲ simulation until convergence (IC, LT) [Najar12]
⊲ (shallow) max-likelihood [Saito11]
⊲ based on community detection via label propagation [Conover11]
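The extrapolation step above can be sketched as an ordinary least-squares line fit over the distance series; this is a toy stand-in for the paper's LS extrapolation, and the opinion-assignment step minimizing |d − dexp| is model-specific and omitted:

```python
def ls_extrapolate(d):
    """Fit d_k ~ a*k + b by least squares (closed form, t >= 2 points)
    and return the prediction for the next index len(d)."""
    t = len(d)
    ks = range(t)
    k_mean = sum(ks) / t
    d_mean = sum(d) / t
    denom = sum((k - k_mean) ** 2 for k in ks)
    a = sum((k - k_mean) * (dk - d_mean) for k, dk in zip(ks, d)) / denom
    b = d_mean - a * k_mean
    return a * t + b

d = [0.10, 0.12, 0.14, 0.16]          # toy distance series
print(round(ls_extrapolate(d), 4))    # → 0.18
```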

21 / 26

slide-45
SLIDE 45

Application II: User Opinion Prediction

User Opinion Prediction Accuracy, %

Method               Synthetic Data      Twitter Data
                       µ       σ           µ       σ
SND                  74.33    2.65       75.63    5.60
hamming              68.44   12.34       68.13    5.80
quad-form            66.67   13.58       67.50    9.63
walk-dist            56.22   15.35       31.88    9.98
icc-simulation       76.25    9.54       59.38    4.17
ltc-simulation       67.50   11.65       58.75    5.18
icc-max-likelihood   67.41    7.03       57.50    8.02
ltc-max-likelihood   57.50    8.45       55.63   11.78
community-lp         65.25    9.43       56.87    8.43

Table: Means µ and standard deviations σ of user opinion prediction accuracies. Synthetic data generated using Independent Cascade.

22 / 26

slide-46
SLIDE 46

Scalability of SND

  • MATLAB/C++ implementation of SND publicly available (email us)
  • Uses a simpler Dijkstra and an unmodified Goldberg-Tarjan
  • Still scales well in practice

Figure: Time for computing SND (log-log scale) by our method vs. CPLEX, when the number of users having different opinions is fixed at n∆ = 1000 and the total number of users n in the network grows up to 200k.

Figure: Time for computing SND using our method when the network size is fixed at n = 20k, and the number n∆ of users having changed their opinions grows up to 10k.

23 / 26

slide-47
SLIDE 47

Conclusion

  • SND is the first distance measure designed for the comparison of

network states that captures the dynamics of polar opinions.

  • SND quantifies how likely it is that one state of a social network has

evolved into another state under a given model of polar opinion propagation.

  • It is computable in time linear in |V| and, as such, applicable to

real-world online social networks.

  • In anomalous event detection, SND reliably detects the events

that have likely caused opinion polarization in the network. SND is most useful when simple summaries (e.g., the number of new activations) are not informative enough.

  • In user opinion prediction, SND performs reasonably well (75%

accuracy), and outperforms baselines on real-world data.

24 / 26

slide-48
SLIDE 48

Future Work

  • Using SND in applications such as classification, clustering, and

search.

  • Extending SND to capture changes in both user opinions and

network structure.

25 / 26

slide-49
SLIDE 49

Implementation of SND

http://cs.ucsb.edu/~victor/pub/ucsb/dbl/snd/

26 / 26