uncovering news twitter reciprocity via interaction
play

Uncovering News-Twitter Reciprocity via Interaction Patterns Yue - PowerPoint PPT Presentation

Uncovering News-Twitter Reciprocity via Interaction Patterns Yue Ning 1 Sathappan Muthiah 1 Ravi Tandon 2 Naren Ramakrishnan 1 1 Discovery Analytics Center, Department of Computer Science, Virginia Tech 2 Now with Department of Electrical and


  1. Uncovering News-Twitter Reciprocity via Interaction Patterns Yue Ning 1 Sathappan Muthiah 1 Ravi Tandon 2 Naren Ramakrishnan 1 1 Discovery Analytics Center, Department of Computer Science, Virginia Tech 2 Now with Department of Electrical and Computer Engineering, The University of Arizona

  2. Outline Introduction Problem Definition Methodology Story Chaining Retrieval of Tweets Identify Interaction Patterns Clustering Topic Modeling Experiments and Results Dataset Results Conclusion

  3. Problem Introduction News Media Social Media

  4. Problem Introduction News Media Social Media

  5. Problem Introduction News Media Social Media

  6. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  7. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  8. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  9. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  10. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  11. Motivation ◮ News -> Twitter ◮ Twitter -> News Media ◮ Explosion of information to comment/feed upon ◮ Cause for variations in such interdependencies ◮ Temporal popularity of a "topic" ◮ Geo-location (Africa vs Asia)

  12. One Example D 1 1/27/2013: Fire at Kiss night- club: security guards tried to stop nightclub output

  13. One Example N D 1 1/27/2013: Fire at Kiss night- club: security guards tried to stop nightclub output

  14. One Example N D 1 D 2 1/27/2013: Fire 1/27/2013: at Kiss night- Rises to 233 club: security the number guards tried to killed in the stop nightclub fire nightclub output in Santa Maria (RS)

  15. One Example N T D 1 D 2 1/27/2013: Fire 1/27/2013: at Kiss night- Rises to 233 club: security the number guards tried to killed in the stop nightclub fire nightclub output in Santa Maria (RS)

  16. One Example N T D 1 D 2 D 3 1/27/2013: Fire 1/27/2013: 1/27/2013: at Kiss night- Bodies of fire Rises to 233 club: security victims in Santa the number guards tried to Maria began to killed in the stop nightclub be veiled fire nightclub output in Santa Maria (RS)

  17. One Example N T N D 1 D 2 D 3 1/27/2013: Fire 1/27/2013: 1/27/2013: at Kiss night- Bodies of fire Rises to 233 club: security victims in Santa the number guards tried to Maria began to killed in the stop nightclub be veiled fire nightclub output in Santa Maria (RS)

  18. One Example N T N D 1 D 2 D 3 D 4 1/27/2013: Fire 1/27/2013: 1/27/2013: 1/28/2013: Rel- at Kiss night- Bodies of fire Rises to 233 atives fill for in- club: security victims in Santa the number formation about guards tried to Maria began to killed in the fire victims stop nightclub be veiled fire nightclub output in Santa Maria (RS)

  19. One Example N T N T D 1 D 2 D 3 D 4 1/27/2013: Fire 1/27/2013: 1/27/2013: 1/28/2013: Rel- at Kiss night- Bodies of fire Rises to 233 atives fill for in- club: security victims in Santa the number formation about guards tried to Maria began to killed in the fire victims stop nightclub be veiled fire nightclub output in Santa Maria (RS)

  20. One Example N T N T D 1 D 2 D 3 D 4 D 5 1/27/2013: Fire 1/27/2013: 1/27/2013: 1/28/2013: 1/28/2013: Rel- at Kiss night- Bodies of fire Rises to 233 Gaucho police atives fill for in- club: security victims in Santa the number formation about arrest owner of guards tried to Maria began to killed in the Kiss nightclub fire victims stop nightclub be veiled and two band fire nightclub output in Santa Maria members (RS)

  21. One Example N T N T B D 1 D 2 D 3 D 4 D 5 1/27/2013: Fire 1/27/2013: 1/27/2013: 1/28/2013: 1/28/2013: Rel- at Kiss night- Bodies of fire Rises to 233 Gaucho police atives fill for in- club: security victims in Santa the number formation about arrest owner of guards tried to Maria began to killed in the Kiss nightclub fire victims stop nightclub be veiled and two band fire nightclub output in Santa Maria members (RS)

  22. Goals 1. Understanding the type of information flow between news and Twitter. 2. Chaining similar news articles together. 3. Identifying major interaction patterns ◮ Cluster story chains and understanding their differences ◮ Identify main topics of interest within such clusters.

  23. System Framework Ne ws re ports (docume nts)

  24. System Framework Ne ws re ports Story Chaining (docume nts)

  25. System Framework Ne ws re ports Story Chaining Twe e t Re trie val (docume nts)

  26. System Framework Ne ws re ports Inte raction Patte rn Story Chaining Twe e t Re trie val Mining (docume nts)

  27. System Framework ... C1 C0 CK Inte raction Patte rn Base d Cluste ring Ne ws re ports Inte raction Patte rn Story Chaining Twe e t Re trie val Mining (docume nts)

  28. System Framework Topic Mode ling Cluste r Topic Distributions ... Story-chain Cluste rs C1 C0 CK Inte raction Patte rn Base d Cluste ring Ne ws re ports Inte raction Patte rn Story Chaining Twe e t Re trie val Mining (docume nts)

  29. Story Chaining Algorithm 1 Goal: identifying all documents related to a news story and to keep track of the news story as new documents arrive. Method: To assess if two documents are referring to the same underlying context, we calculate their similarity scores with respect to three features: ◮ - textual features, denoted by T ( D i ) ◮ - spatial features, denoted by L ( D i ) , e.g. city, state, country ◮ - actors, denoted by A ( D i ) , e.g. Hillary Clinton. 1 J. Schlachter, A. Ruvinskya, L. Asencios Reynoso, S. Muthiah, and N. Ramakrishnan, “Leveraging topic models to develop metrics for evaluating the quality of narrative threads extracted from news stories”, in Proc. of the 6th International Conference on Applied Human Factors and Ergonomics , AHFE, Elsevier, 2015.

  30. Story Chaining Algorithm (Cont.) The total weighted similarity measure between two documents, D i and D j , is then defined as follows: sim ( D i , D j ) � α f ( T ( D i ) , T ( D j )) + β f ( L ( D i ) , L ( D j )) � �� � � �� � textul features spatial features + η f ( A ( D i ) , A ( D j )) � �� � actor features The coherence between a chain C j and document D i is defined as coh ( D i , C j ) = θ g ( L ( D i ) , L ( C j )) + φ g ( A ( D i ) , A ( C j )) where g is any similarity measure and the coefficients θ, φ are chosen such that θ + φ = 1.

  31. Twitter Profile for News 1. Collect tweets based on URL. 2. Extract entity keywords from news. 3. Filter keywords together. 4. Download hourly count metrics.

  32. Interaction Patterns ◮ Peak detection 2 ◮ Incoming influence ( W pre ) and outgoing influence ( W post ): v s v s � � W pre = W post = (1) , t A − t s t s − t A s ∈ S pre s ∈ S post t A -t s1 t A -t s2 v s1 Post Peaks t A -t s3 Pre Peaks t s2 t s3 t s1 t A 2 M. Duarte, “Notes on scientific computing for biomechanics and motor control”, 2015.

  33. Interaction States N T E B

  34. Interaction States (Cont.)  N , if W pre < ρ, W post ≥ ( 1 + λ ) W pre     E , if W pre < ρ, W post < ( 1 + λ ) W pre  State ( D i ) = T , if W pre ≥ ρ, W post < ( 1 + λ ) W pre    B , if W pre ≥ ρ, W post ≥ ( 1 + λ ) W pre  

  35. Interaction States (Cont.) W post B Twitter ↔ News W pre W pre N W post News → Twitter ) λ + 1 ( = W post t A : article publish time T Twitter → News E W pre ρ Figure: Geometric Interpretation of States

  36. Clustering on Encoded Chains Clustering via qualitative encoding (e.g. “NNTNTBBE”) ◮ Levenshtein distance ◮ Jaro-Winkler distance ◮ Ratcliff-Obershelp pattern recognition Clustering via quantitative encoding ( e.g. “0.5, -0.9, 0.88,0.3,-0.4”) ◮ Multi-dimensional Dynamic Time Warping(DTW)

  37. Interpretation from a Different Dimension ◮ Are sports events always related with Bi-directional Interactions? ◮ Do Twitter users focus more on sports and entertainment? ◮ Latent Dirichlet Allocation (LDA) for hidden topic analysis on clusters. The topic distributions for one cluster is defined by: � d ij ∈ c j n d ij θ ( d ij , k ) C j , k = (2) , � d ij n d ij where ◮ n d ij refers to the frequency of d i in cluster C j . ◮ θ ( d ij , k ) refers to the topic proportions for this document. ◮ k is the topic index.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend