Detecting Events and Patterns in the Social Web with Statistical - PowerPoint PPT Presentation

Detecting Events and Patterns in the Social Web with Statistical Learning Vasileios Lampos Computer Science Department University of Sheffield 1 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 1/34

Outline ⊥ Motivation, Aims ⊥ Data ⊣ Nowcasting Events from the Social Web ⊣ Extracting Mood Patterns from the Social Web | = Conclusions 2 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 2/34

Facts We started to work on this idea in 2008, when... • Web contained 1 trillion unique pages (Google) • Social Networks were rising, e.g. ◦ Facebook : 100m users in 2008, 955m in 2012 (June) ◦ Twitter : 6m users in 2008, 500m active users in 2012 (April) • User behaviour was changing ◦ Socialising via the Web ◦ Giving up privacy (Debatin et al. , 2009) 3 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 3/34

Questions • Does user generated text posted on Social Web platforms include useful information ? • How can we extract this useful information... ... automatically ? Therefore, not we, but a machine . • Practical / real-life applications ? • Can those large samples of human input assist studies in other scientific fields ? Social Sciences , Psychiatry ... 4 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 4/34

One slide on @Twitter. What does a ‘tweet’ look like? Figure 1: Some biased and anonymised examples of tweets (limit of 140 characters /tweet, # denotes a topic ) (a) (user will remain anonymous) (b) they live around us (c) citizen journalism (d) flu attitude 5 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 5/34

Data Collection • Considered to be the easiest part of the process... ... not true ! ◦ Storage space ◦ Crawler implementation, parallel data processing ◦ Equipment, new technologies ( e.g. Map-Reduce) • Data collected and used in the following experiments ◦ tweets geo-located in 54 urban centres in the UK ◦ collected periodically (every 3 or 5 minutes per urban centre) ◦ approx. 0.5 billion tweets by 10 million users (06/2009 to 01/2012) ◦ ground truth (regional flu & local rainfall rates) 6 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 6/34

Nowcasting Events from the Social Web 7 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 7/34

‘Nowcasting’? We do not predict the future, but infer the present − δ i.e. the very recent past State of the World ( u ) W M  ( u ) ( ) ( u ) S Figure 2: Nowcasting the magnitude of an event ( ε ) emerging in the real world from Web information Our case studies: nowcasting (a) flu rates & (b) rainfall rates ( ?! ) 8 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 8/34

What do we get in the end? 16 Rainfall rate (mm) − Bristol 14 Actual Inferred 12 10 8 6 4 2 0 0 5 10 15 20 25 30 Days Figure 3: Inferred rainfall rates for Bristol, UK (October, 2009) 9 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 9/34

Core Methodology (1/3) – Turning text into numbers Candidate features ( n -grams): C = { c i } Set of Twitter posts for a time interval u : P ( u ) = { p j } Frequency of c i in p j : � ϕ if c i ∈ p j , g ( c i , p j ) = 0 otherwise. – g Boolean, maximum value for ϕ is 1 – Score of c i in P ( u ) : |P ( u ) | � g ( c i , p j ) j =1 � c i , P ( u ) � s = |P ( u ) | 10 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 10/34

Core Methodology (2/3) Set of time intervals : U = { u k } ∼ 1 hour, 1 day, ... Time series of candidate features scores : x ( u 1 ) ... x ( u |U| ) � T , X ( U ) = � where c |C| , P ( u i ) �� T x ( u i ) = � � c 1 , P ( u i ) � � s ... s Target variable (event): � T y ( U ) = � y 1 ... y |U| 11 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 11/34

Core Methodology (3/3) – Feature selection Solve the following optimisation problem : �X ( U ) w − y ( U ) � 2 min ℓ 2 w s.t. � w � ℓ 1 ≤ t, t = α · � w OLS � ℓ 1 , α ∈ (0 , 1] . • Least Absolute Shrinkage and Selection Operator ( LASSO ) (Tibshirani, 1996) • Enforce sparsity on w (feature selection) • Least Angle Regression ( LARS ) – computes entire regularisation path (Efron et al. , 2004) 12 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 12/34

How do we form candidate features? • Commonly formed by indexing the entire corpus (Manning, Raghavan and Schütze, 2008) • We extract them from Wikipedia, Google Search results, Public Authority websites ( e.g. NHS) Why? ◦ reduce dimensionality to bound the error of LASSO � W 2 N , W 2 N + p N + W 1 � 1 1 L ( w ) ≤ L ( ˆ w ) + Q , with Q ∼ min √ N p candidate features, N samples, empirical loss L ( ˆ w ) and � ˆ w � ℓ 1 ≤ W 1 (Bartlett, Mendelson and Neeman, 2011) ◦ Harry Potter Effect! 13 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 13/34

The ‘Harry Potter’ effect (1/2) Figure 4: Events co-occurring with the inference target may affect feature selection, especially when the sample size is small. Flu (England & Wales) 300 Hypothetical Event I Hypothetical Event II 250 Event Score 200 150 100 50 0 180 200 220 240 260 280 300 320 340 Day Number (2009) (Lampos, 2012a) 14 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 14/34

The ‘Harry Potter’ effect (2/2) Table 1: Top-20 1-grams correlated with flu rates in England/Wales (06–12/2009) 1-gram Event Corr. Coef. latitud Latitude Festival 0.9367 flu Flu epidemic 0.9344 swine 0.9212 � harri Harry Potter Movie 0.9112 slytherin 0.9094 � potter 0.8972 � benicassim Benicàssim Festival 0.8966 graduat Graduation (?) 0.8965 dumbledor Harry Potter Movie 0.8870 hogwart 0.8852 � quarantin Flu epidemic 0.8822 gryffindor Harry Potter Movie 0.8813 ravenclaw 0.8738 � princ 0.8635 � swineflu Flu epidemic 0.8633 ginni Harry Potter Movie 0.8620 weaslei 0.8581 � hermion 0.8540 � draco 0.8533 � snape 0.8486 � Solution : ground truth with as many peaks/troughs as possible (Lampos, 2012a) 15 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 15/34

About n -grams 1-grams : • decent (dense) representation in the Twitter corpus • unclear semantic interpretation Example: “ I am not sick. But I don’t feel great either! ” 2-grams : • very sparse representation in tweets • possibly clearer semantic interpretation Based on our experimental process... a hybrid combination of 1 -grams and 2 -grams improves inference performance 16 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 16/34

Flu rates – Example of selected features Figure 5: Font size is proportional to the weight of each feature; flipped n-grams are negatively weighted. All words are stemmed (Porter, 1980) . (Lampos and Cristianini, 2012) 17 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 17/34

Rainfall rates – Example of selected features Figure 6: Font size is proportional to the weight of each feature; flipped n-grams are negatively weighted. All words are stemmed (Porter, 1980) . (Lampos and Cristianini, 2012) 18 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 18/34

Examples of inferences Flu Rate − C.England & Wales 120 120 Actual Actual Flu Rate − S.England 100 100 Inferred Inferred 80 80 60 60 40 40 20 20 0 0 0 5 10 15 20 25 30 0 5 10 15 20 25 30 Days Days (a) Central England/Wales (flu) (b) South England (flu) 16 Rainfall rate (mm) − Bristol 14 Actual Inferred 12 10 8 6 4 2 0 0 5 10 15 20 25 30 Days (c) Bristol (rain) Figure 7: Examples of flu and rainfall rates inferences from Twitter content (Lampos and Cristianini, 2012) 19 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 19/34

Flu Detector URL: http://geopatterns.enm.bris.ac.uk/epidemics Figure 8: Flu Detector uses the content of Twitter to nowcast flu rates in several UK regions (Lampos, De Bie and Cristianini, 2010) 20 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 20/34

Extracting Mood Patterns from the Social Web 21 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 21/34

Computing a mood score Table 2: Mood terms from WordNet Affect Fear Sadness Joy Anger afraid depressed admire angry fearful discouraged cheerful despise frighten disheartened enjoy enviously horrible dysphoria enthousiastic harassed panic gloomy exciting irritate ... ... ... ... ( 92 terms ) ( 115 terms ) ( 224 terms ) ( 146 terms ) Mood score computation for a time interval u using n mood terms and a sample of D days : | D | � n � 1 1 sf ( t j,u ) � � M s ( u ) = i | D | n j =1 i =1 = f ( t d,u ) − ¯ f i sf ( t d,u ) i , i ∈ { 1 , ..., n } . i σ f i ( t d,u ) : normalised frequency of a mood term i during time interval u in day d ∈ D f i 22 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 22/34

Detecting Events and Patterns in the Social Web with Statistical - PowerPoint PPT Presentation

Detecting Events and Patterns in the Social Web with Statistical Learning Vasileios Lampos Computer Science Department University of Sheffield 1 / 34 V. Lampos bill@lampos.net Detecting Events and Patterns in the Social Web 1/34 Outline

Detecting Events and Patterns in the Social Web with Statistical Learning Vasileios Lampos

Detecting Spammers and Content Detecting Spammers and Content Detecting Spammers and Content

12/6/2013 Detecting Fakes Image Forensics: Detecting Forged Photos 1.Detecting photorealistic

Factory Patterns: Factory Method and Abstract Factory Design Patterns In Java Bob Tarr

Detecting abnormal events Detecting abnormal events Jaechul Kim Purpose Purpose Introduce

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Principles and Patterns 26 February, 2020 Recap Principles Patterns Inheritance Anti-patterns

NetFlow Analysis: Detecting covert channels on the network Detecting malicious traffic by using

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

1 Closed Patterns and Max-Patterns Closed Patterns and Max-Patterns A long pattern contains a

Design Patterns 1 What are Design Patterns? Design patterns describe common (and successful)

Design Patterns in Eiffel Dr. Till Bay design patterns? [Design Patterns] are

Design Patterns Applications Programming What is design patterns? The design patterns are

Software, Faster Patterns of Effective Delivery Dan North @tastapod Patterns of Effective

More Design Patterns Horstmann ch.10.1,10.4 Design patterns Structural design patterns

Lecture 1: Semantic Web and RDF Aidan Hogan aidhog@gmail.com THE WEB The Web is now 26 years

Global Software Engineering: The Future of Socio- technical Coordination Jim Herbsleb School

Integrating Individual and Social Creativity Creativity Research in the USA and at L3D

SELF-DISCLOSURE AND PERCEIVED TRUSTWORTHINESS OF AIRBNB HOST PROFILES Xiao Ma [1] Jeff Hancock [2]

Research on Social Network Sites: Facebook Use and Social Capital Nicole Ellison Dept. of

Biases in Decision Making Alexander Felfernig alexander.felfernig@ist.tugraz.at Decision Biases

Dialogsysteme mit Gefhlen: Wie, warum und wohin? Dr. Felix Burkhardt Research director,

+ T owards Access Control for Isolated Applications SECRYPT 2016, Lisbon, Portugal Kirill

Integrated Microlearning Concepts, Cases, and Scenarios Theo Hug, Innsbruck (Austria) Prof.