The Geography of Online News Engagement Martin Saveski, MIT Media - - PowerPoint PPT Presentation

the geography of online news engagement
SMART_READER_LITE
LIVE PREVIEW

The Geography of Online News Engagement Martin Saveski, MIT Media - - PowerPoint PPT Presentation

The Geography of Online News Engagement Martin Saveski, MIT Media Lab, Cambridge, USA Daniele Quercia, Yahoo Labs, Barcelona, Spain Amin Mantrach, Yahoo Labs, Barcelona, Spain 2 Motivations Despite its importance, the geographic


slide-1
SLIDE 1

The Geography of Online News Engagement

Martin Saveski, MIT Media Lab, Cambridge, USA Daniele Quercia, Yahoo Labs, Barcelona, Spain Amin Mantrach, Yahoo Labs, Barcelona, Spain

slide-2
SLIDE 2

2

slide-3
SLIDE 3

Motivations

3

§ Despite its importance, § the geographic processes of online engagement

  • n news platforms have not been widely studied;

§ à Yahoo News site for more than two years: articles and user comments.

slide-4
SLIDE 4

Contributions

4

In Online news platforms, Users engage with each other depending on where they live

slide-5
SLIDE 5

Contributions: Socio economic factors and user engagement

5

Users in states with: high levels of education and well-being comment articles about research & technology, but not politics, gossip or sport; high levels of crime and unemployment comment on articles about sports, but not those about economy or research and technology;

slide-6
SLIDE 6

Contributions: Personality Traits and user engagement

6

Users from states low in neuroticism (emotionally stable) comment on articles about music ;

  • pen and extravert states comment on articles about sport;

conscientious states on articles about economics.

slide-7
SLIDE 7

Related work

7

§ Influence of time on our actions online:

› “emotion words by Twitter users influenced by times

zones” § News in tweets and geographics spread on Twitter:

› “Reciprocal relations between people who live no more

than 3 times zones away”

› Physical distance constrained the spread of hashtags

slide-8
SLIDE 8

Data description

8

§ Random sample of 200K news articles and 41M associated comments; § Published from August 2010 to February 2013; § On Yahoo! News US; § English articles; § Sources: Reuters, ABC News, AP, etc.; § From anonymous user we have IP address à state(Yahoo places Web service).

slide-9
SLIDE 9

State commenting graph

9

§ Nodes: US states § Egde weight: number of times two users in state i and j comment on the same article

25 30 35 40 45 50 −120 −100 −80

long lat

breaks (−0.003,0.749] (0.749,1.5] (1.5,2.25] (2.25,3]

slide-10
SLIDE 10

The time zone effect

10

§ Users in the same TZ preferentially engage with the same articles, while users in different time zones engage with different articles; § Engagement in k-time zone apart: count the number of times users from k-time zone apart engage in the same articles; § Null model: We shuffle user’s time zone assignment.

0.0 0.1 0.2 0.3

Same TZ 1 TZ Apart 2 TZ Apart 3 TZ Apart

Expected Observed

slide-11
SLIDE 11

The Geography of News Engagement

11

§ The gravity model § F measures the estimated engagement between two states; § g is a scaling constant; § di,j is the euclian distance between two states centers; § mk is number of users of state k. § F correlates with number of the observed number of comments with a pearson correlation of 0.70

slide-12
SLIDE 12

12

slide-13
SLIDE 13

Assigning topic distributions to states

13

§ 13.8% of articles are editorially labeled with IPTC categories § IPTC consists of 1400 topics organized in a taxonomy; § Average category per articles is 5; § By using user engagement (on comments) we aggregated articles topics to form states topics:

› Each time a user from state comment on an article, the tags contribute to the state

topical distribution;

› Tags contribution is normalized by the number of times they are used (to discount

popular tags, similar to idf to discount stop words).

slide-14
SLIDE 14

14

slide-15
SLIDE 15

The Big Five Personnality Traits

15

1. Openess: Imaginative, spontaneous, and adventurous individuals; 2. Conscientiousness: Ambitious, resourceful and persistent individuals; 3. Extraversion: Individuals who are sociable and tend to seek excite- ment;

  • 4. Agreeableness: are trusting, altruistic, tender-minded, and are

motivated to maintain positive relationships with others 5. Neuroticism: Finally, emotionally liable and impulsive individuals. These factors have been also studied at the state level (strongly correlated with socio-economic indicators) [Rentfrow et al.]

slide-16
SLIDE 16

Correlation of news user engagement factors with state personality scores

16

Economy Conflict.and.Unrest Religion Music Sports Agree. Cons. Em.St. Extrav Openness

Correlation

0.3 0.0 −0.3 −0.6

slide-17
SLIDE 17

17

slide-18
SLIDE 18

Socioeconomic Indicators

18

§ 1. Well-being index; § 2. Crime level § 3. Rate of unemployment; § 4. Gross State Product; § 5. Education level.

slide-19
SLIDE 19

Correlation of news user engagement factors with socio economic indicators

19

Economy Elections Gossip Research.and.Tech. Sports Crime Education GSP Unemployment Well being

Correlation

0.50 0.25 0.00 −0.25 −0.50

slide-20
SLIDE 20

Conclusions: Putting all together

20

0.70 0.75 0.79 0.82

0.62 0.64 0.66 0.68 0.7 0.72 0.74 0.76 0.78 0.8 0.82 0.84

Gravitational Model + Time Zone Difference + Socio-economical Factors + Personality Traits

slide-21
SLIDE 21

Questions

21