Predic'ng Responses to Microblog Posts Yoav Artzi 1 , - PowerPoint PPT Presentation

Predic'ng ¡Responses ¡to ¡ Microblog ¡Posts ¡ Yoav ¡Artzi 1 , ¡Patrick ¡Pantel 2 ¡and ¡Michael ¡Gamon 2 ¡ ¡ 1 University ¡of ¡Washington, ¡MicrosoC ¡Research ¡ ¡ Work ¡conducted ¡at ¡MicrosoC ¡Research ¡ ¡

Twee'ng ¡on ¡TwiGer ¡ A ¡tweet ¡is ¡140 ¡characters ¡long ¡ social ¡network ¡ TwiGer ¡is ¡a ¡ news ¡agency ¡ replying ¡ Users ¡respond ¡by ¡ retwee'ng ¡

The ¡Problem ¡ • Given ¡a ¡tweet ¡

The ¡Problem ¡ • Given ¡a ¡tweet ¡ • Predict ¡response ¡ – Reply ¡ – Retweet ¡ Retweet ¡ Reply ¡

No ¡Response ¡ Retweeted ¡

Mo'va'on ¡ • Good ¡indica'on ¡of ¡impact ¡ • Increases ¡impact ¡ • So ¡who ¡might ¡care ¡about ¡this? ¡ – Adver'sers ¡ – Celebri'es ¡ – Media ¡organiza'ons ¡ • Also, ¡a ¡way ¡to ¡rank ¡tweets ¡

Goal ¡ • What ¡triggers ¡a ¡response? ¡ • What ¡features ¡are ¡good ¡for ¡predic'on? ¡ • Empirical ¡explora'on ¡

Our ¡Approach: ¡Learning ¡ Social ¡ Network ¡ Tweets ¡ Extract ¡ + ¡ Model ¡ Learner ¡ Features ¡ Response ¡ Boosted ¡Decision ¡Trees ¡ Maximum ¡Entropy* ¡ *MaxEnt ¡by ¡Chris ¡Quirk, ¡Boosted ¡Decision ¡Trees ¡by ¡Qiang ¡Wu ¡ ¡

Our ¡Approach: ¡Tes'ng ¡ Social ¡ Network ¡ Tweet ¡ Model ¡ Predic7on ¡

Experimental ¡Setup ¡ • One ¡week ¡of ¡TwiGer ¡data ¡ • Searched ¡for ¡response ¡over ¡two ¡weeks ¡ • Randomly ¡sampled ¡training ¡and ¡tes'ng ¡sets: ¡ – 750K ¡tweets ¡for ¡training ¡ – 188K ¡tweets ¡for ¡tes'ng ¡

Results ¡ 1 ¡ Boosted ¡ 0.8 ¡ Decision ¡ Precision ¡ 0.6 ¡ Trees ¡ MaxEnt ¡ 0.4 ¡ 0.2 ¡ 0 ¡ 0 ¡ 0.2 ¡ 0.4 ¡ 0.6 ¡ 0.8 ¡ 1 ¡ Recall ¡

Results ¡ 1 ¡ Boosted ¡ 0.8 ¡ Decision ¡ Precision ¡ 0.6 ¡ Trees ¡ MaxEnt ¡ 0.4 ¡ 0.2 ¡ 0 ¡ 0 ¡ 0.2 ¡ 0.4 ¡ 0.6 ¡ 0.8 ¡ 1 ¡ Recall ¡ Hard ¡to ¡predict ¡response, ¡for ¡most ¡tweet, ¡but ¡… ¡

Results ¡ 1 ¡ Boosted ¡ 0.8 ¡ Decision ¡ Precision ¡ 0.6 ¡ Trees ¡ MaxEnt ¡ 0.4 ¡ 0.2 ¡ 0 ¡ 0 ¡ 0.2 ¡ 0.4 ¡ 0.6 ¡ 0.8 ¡ 1 ¡ Recall ¡ Hard ¡to ¡predict ¡response, ¡for ¡most ¡tweet, ¡but ¡there ¡ exists ¡a ¡large ¡set ¡for ¡which ¡we ¡can ¡predict ¡accurately ¡

Results ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ Precision ¡ 0.5 ¡ 0.4 ¡ 0.3 ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Building ¡the ¡Model ¡ • What ¡can ¡we ¡get ¡form ¡the ¡language ¡of ¡the ¡ tweet? ¡ • Can ¡we ¡use ¡the ¡social ¡network ¡for ¡predic'on? ¡

Features: ¡Sen'ment ¡ • How ¡the ¡sen'ment ¡of ¡a ¡tweet ¡influences ¡the ¡ response ¡behavior? ¡ • Count ¡of ¡nega've/posi've ¡sen'ment ¡words* ¡ +1 +1 +1 +1 *Sen'ment ¡lexicon ¡provided ¡by ¡Livia ¡Polanyi ¡

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ Precision ¡ 0.5 ¡ sen'ment ¡ 0.4 ¡ 0.3 ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Features: ¡Pos'ng ¡ • Tweeter ¡pos'ng ¡trends ¡are ¡influenced ¡by ¡'me ¡ and ¡day ¡of ¡the ¡week ¡ • Does ¡it ¡influence ¡response ¡behavior? ¡ • Included ¡features: ¡ – Local ¡'me ¡of ¡pos'ng ¡ – Day ¡of ¡the ¡week ¡

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ Precision ¡ 0.5 ¡ +pos'ng ¡ 0.4 ¡ sen'ment ¡ 0.3 ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Features: ¡Content ¡ • 45 ¡simple ¡features ¡over ¡the ¡content ¡of ¡the ¡tweet ¡ • Manually ¡developed ¡by ¡observing ¡large ¡number ¡ of ¡tweets ¡ ¡ # ¡stop ¡words ¡ # ¡user ¡references ¡ # ¡hash ¡tags ¡ % ¡non ¡English* ¡ # ¡tokens ¡ … ¡ ¡ *English ¡lexicon ¡provided ¡by ¡Lucy ¡Vanderwend ¡

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ Precision ¡ +content ¡ 0.5 ¡ +pos'ng ¡ 0.4 ¡ sen'ment ¡ 0.3 ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Features: ¡Lexical ¡Ra'o ¡Buckets ¡ • Detect ¡lexical ¡items ¡indica'ng ¡towards ¡certain ¡ response ¡behavior ¡ – 14M ¡bigrams ¡ – 400K ¡hashtags ¡ – Collected ¡from ¡186M ¡tweets ¡ • Use ¡as ¡flags ¡on ¡each ¡tweet ¡that ¡has ¡them ¡ • Issues: ¡ – Scalability ¡of ¡learning ¡ – Sparsity ¡

Features: ¡Lexical ¡Ra'o ¡Buckets ¡ Collapsing ¡ • For ¡every ¡lexical ¡item ¡ l : ¡ tweets ¡containing ¡ l ¡that ¡ received ¡ no ¡response ¡ tweets ¡containing ¡ l ¡that ¡ received ¡a ¡response ¡ • Define ¡each ¡such ¡ n ¡as ¡a ¡feature ¡ • Trigger ¡feature ¡ n ¡for ¡each ¡sample ¡that ¡ contains ¡ l

Features: ¡Lexical ¡Ra'o ¡Buckets ¡ Collapsing ¡ • For ¡every ¡lexical ¡item ¡ l : ¡ tweets ¡containing ¡ l ¡that ¡ received ¡ no ¡response ¡ n tweets ¡containing ¡ l ¡that ¡ received ¡a ¡response ¡ • Define ¡each ¡such ¡ n ¡as ¡a ¡feature ¡ • Trigger ¡feature ¡ n ¡for ¡each ¡sample ¡that ¡ contains ¡ l

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ Precision ¡ +lexical ¡ 0.5 ¡ +content ¡ 0.4 ¡ +pos'ng ¡ 0.3 ¡ sen'ment ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Features: ¡Social ¡ • What ¡are ¡the ¡characteris'cs ¡of ¡the ¡user’s ¡ network? ¡ • Simple ¡social ¡sta's'cs ¡ – Number ¡of ¡followers ¡ – Number ¡of ¡followings ¡

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ 0.6 ¡ +socialNet ¡ Precision ¡ +lexical ¡ 0.5 ¡ +content ¡ 0.4 ¡ +pos'ng ¡ 0.3 ¡ sen'ment ¡ 0.2 ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Features: ¡User ¡History ¡ • Aggregate ¡historical ¡ response ¡to ¡user ¡ • 3 ¡months ¡of ¡TwiGer ¡data ¡ – Over ¡2 ¡billion ¡tweets ¡ • Compute ¡sta's'cs ¡ – For ¡example: ¡ra'o ¡of ¡tweets ¡ retweeted ¡

Building ¡the ¡Model ¡ 1 ¡ 0.9 ¡ 0.8 ¡ 0.7 ¡ +history ¡ 0.6 ¡ Precision ¡ +socialNet ¡ 0.5 ¡ +lexical ¡ 0.4 ¡ +content ¡ 0.3 ¡ +pos'ng ¡ 0.2 ¡ sen'ment ¡ 0.1 ¡ 0 ¡ 0 ¡ 0.02 ¡ 0.04 ¡ 0.06 ¡ 0.08 ¡ 0.1 ¡ 0.12 ¡ 0.14 ¡ 0.16 ¡ 0.18 ¡ 0.2 ¡ Recall ¡

Predic'ng Responses to Microblog Posts Yoav Artzi 1 , - PowerPoint PPT Presentation

Predic'ng Responses to Microblog Posts Yoav Artzi 1 , Patrick Pantel 2 and Michael Gamon 2 1 University of Washington, MicrosoC Research Work conducted at

Branch Predic,on J. Nelson Amaral Why Branch Predic,on?

Nested Resources July 2012 by Anton Nested resources resources :pages do resources :posts

Mesa Master Planning Convocation Spring 2019 Padlet Responses Responses Total Responses, n

A Joint Model for Chinese Microblog Sentiment Analysis Yuhui Cao, Zhao Chen, Ruifeng Xu, Tao Chen

Placing images on the world map: a microblog- based enrichment approach Claudia Hau ff &

Using WordNet for Query Expansion: ADAPT @ FIRE 2016 Microblog Track Wei Li , Debasis Ganguly,

The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions Tao Zhu

Paraphrasing 4 Microblog Normalization Wang Ling Carnegie Mellon University Chris Dyer

for Microblog Search A Preliminary Study Maram Hasanain, Rana Malhas, Tamer Elsayed 11 July 2014

A Semi-Supervised Bayesian Network Model for Microblog Topic Classification Yan Chen 1 , 2 Zhoujun

wPod Weibo Public Opinion (Polarity) Detection Haotian He & Sanae Sato Microblog is

Text REtrieval Conference (TREC) TREC TRACKS Crowdsourcing Personal Blog, Microblog documents

Responses Immediately after the Accident Outline of the Accident Responses by Tokyo Electric

microblogging posts Jasmina Smailovi Joef Stefan Institute Department of Knowledge Technologies

Agenda Historical Voyage of MSEB to MSEDCL Work force of MSEDCL Organizational

2020 Share Allocation - Amersham Deanery 2019 Costs of Ministry Number of posts Cost per post

Authenticated Encryption Atul Luykx COSIC, ESAT, KU Leuven, Belgium July 15, 2016 1 2 2 2 2

IO on Lustre and GPFS David Henty and Adrian Jackson (EPCC, The University of Edinburgh) Charles

Log-Structured Merge Trees CSCI 333 How Should I Organize My Stuff (Data)? How Should I

Utilizing Micr Utilizing Microblogs f oblogs for A r Automatic matic Ne News Highlights

Microblogs as Parallel Corpora Wang Ling, Guang Xiang, Chris Dyer, Isabel Trancoso, Alan W Black

Real-time #SemanticWeb in <= 140 chars Linked Data on the Web (LDOW2010) April 27 th , 2010

DETECTING RUMORS FROM MICROBLOGS WITH RECURRENT NEURAL NETWORKS 515030910611 INTRODUCTION

Diffusion of Following Links in Microblogging Networks Jing Zhang Tsinghua University

Sambuz

Useful Links

Newsletter

Mail Us

Predic'ng Responses to Microblog Posts Yoav Artzi 1 , - PowerPoint PPT Presentation

Predic'ng Responses to Microblog Posts Yoav Artzi 1 , Patrick Pantel 2 and Michael Gamon 2 1 University of Washington, MicrosoC Research Work conducted at

Branch Predic,on J. Nelson Amaral Why Branch Predic,on?

Nested Resources July 2012 by Anton Nested resources resources :pages do resources :posts

Mesa Master Planning Convocation Spring 2019 Padlet Responses Responses Total Responses, n

A Joint Model for Chinese Microblog Sentiment Analysis Yuhui Cao, Zhao Chen, Ruifeng Xu, Tao Chen

Placing images on the world map: a microblog- based enrichment approach Claudia Hau ff &amp;

Using WordNet for Query Expansion: ADAPT @ FIRE 2016 Microblog Track Wei Li , Debasis Ganguly,

The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions Tao Zhu

Paraphrasing 4 Microblog Normalization Wang Ling Carnegie Mellon University Chris Dyer

for Microblog Search A Preliminary Study Maram Hasanain, Rana Malhas, Tamer Elsayed 11 July 2014

A Semi-Supervised Bayesian Network Model for Microblog Topic Classification Yan Chen 1 , 2 Zhoujun

wPod Weibo Public Opinion (Polarity) Detection Haotian He &amp; Sanae Sato Microblog is

Text REtrieval Conference (TREC) TREC TRACKS Crowdsourcing Personal Blog, Microblog documents

Responses Immediately after the Accident Outline of the Accident Responses by Tokyo Electric

microblogging posts Jasmina Smailovi Joef Stefan Institute Department of Knowledge Technologies

Agenda Historical Voyage of MSEB to MSEDCL Work force of MSEDCL Organizational

2020 Share Allocation - Amersham Deanery 2019 Costs of Ministry Number of posts Cost per post

Authenticated Encryption Atul Luykx COSIC, ESAT, KU Leuven, Belgium July 15, 2016 1 2 2 2 2

IO on Lustre and GPFS David Henty and Adrian Jackson (EPCC, The University of Edinburgh) Charles

Log-Structured Merge Trees CSCI 333 How Should I Organize My Stuff (Data)? How Should I

Utilizing Micr Utilizing Microblogs f oblogs for A r Automatic matic Ne News Highlights

Microblogs as Parallel Corpora Wang Ling, Guang Xiang, Chris Dyer, Isabel Trancoso, Alan W Black

Real-time #SemanticWeb in &lt;= 140 chars Linked Data on the Web (LDOW2010) April 27 th , 2010

DETECTING RUMORS FROM MICROBLOGS WITH RECURRENT NEURAL NETWORKS 515030910611 INTRODUCTION

Diffusion of Following Links in Microblogging Networks Jing Zhang Tsinghua University

Sambuz

Useful Links

Newsletter

Mail Us

Placing images on the world map: a microblog- based enrichment approach Claudia Hau ff &

wPod Weibo Public Opinion (Polarity) Detection Haotian He & Sanae Sato Microblog is

Real-time #SemanticWeb in <= 140 chars Linked Data on the Web (LDOW2010) April 27 th , 2010