Sentiment Analysis & Computational Argumentation (CMU 11-411/611)



SLIDE 1

Sentiment Analysis & Computational Argumentation

CMU 11-411/611 Natural Language Processing April 7, 2020 Yohan Jo


Lecture 21

SLIDE 2

everyone should go vegan because vegan diets are healthier than eating meat

positive or negative toward veganism? argument or no argument? fact or opinion?

SLIDE 3

  • Many times what we say is not merely an objective fact
  • Instead, it often expresses our attitudes or opinions toward some topic

▶︎ What attitude/opinion? — Sentiment Analysis ▶︎ Why that attitude/opinion? — Argumentation

SLIDE 4

Part 1. Sentiment Analysis

SLIDE 5

[Figure: time series of the DJIA z-score and the Twitter "Calm" mood z-score, with events such as the bank bail-out marked]

Twitter mood predicts the stock market

(Bollen et al. 2011)

We will see how they computed the mood score soon!

SLIDE 6

Outline

  • What is sentiment analysis?
  • How to do sentiment analysis?

▶︎ Lexicon Approach ▶︎ Machine Learning Approach

SLIDE 7

What is sentiment analysis?

Wikipedia (2020-04-06)

  • Sentiment analysis (also known as opinion mining or emotion AI) refers to
▶︎ the use of natural language processing, text analysis, computational linguistics, and biometrics
▶︎ to systematically identify, extract, quantify, and study
  • affective states and
  • subjective information.

SLIDE 8

Affective States

(Scherer 1984, 2005)

  • Emotion: Brief and intense episode of organic response to the evaluation of an event (angry, sad, joyful, fearful, ashamed, proud)
  • Mood: Enduring and less intense predominance of subjective feelings, often without apparent cause (cheerful, gloomy, irritable, depressed)
  • Interpersonal stance: Affective stance occurring in the interaction with another person, coloring the interpersonal exchange of that situation (polite, distant, cold, warm, supportive)
  • Attitudes: Enduring beliefs and predispositions towards specific objects or persons (like, hate, value, desire)
  • Personality traits (affect disposition): Stable personality dispositions and behavior tendencies (nervous, anxious, reckless, morose, hostile, envious, jealous)

SLIDE 9

Subjective Information

(Hovy 2014)

  • Judgment/Evaluation: determination of the value, nature, character, or quality of something or someone (this movie is garbage / unfortunately the battery lasts less than 5 minutes)
  • Personal belief (I don't think the earth is round)
▶︎ Not the main focus of many sentiment analysis systems
▶︎ But argumentation systems care!

SLIDE 10

Structure of Sentiment Information

  • Holder: Experiencer of the affective state or opinion (usually the speaker if not stated otherwise)
  • Target: Target of the affective state or opinion
  • Type: Type of the affective state or opinion (often simplified as positive/negative/neutral)
  • Claim: Text that contains the affective state or opinion

their new ice cream is awful but Chris loves it

Holder: speaker | Target: their new ice cream | Type: negative | Claim: their new ice cream is awful
Holder: Chris | Target: their new ice cream | Type: positive | Claim: Chris loves it

Most sentiment analysis systems focus on these.
SLIDE 11

How to do sentiment analysis

Lexicon Approach

SLIDE 12

SLIDE 13

Sentiment Lexicons

  • General Inquirer (Stone et al. 1966, http://www.wjh.harvard.edu/~inquirer)
▶︎ Positive (1,915), negative (2,291), hostile, strong, weak, active, passive, arousal, ...
  • Linguistic Inquiry and Word Count (LIWC) (Pennebaker et al. 2007, http://www.liwc.net/)
▶︎ Positive (408), negative (498), affective (917), social (456), causal (108), certainty (83), ...
  • MPQA Subjectivity Lexicon (Wilson et al. 2005, http://www.cs.pitt.edu/mpqa/subj_lexicon.html)
▶︎ Positive (2,718), negative (4,912), neutral
▶︎ Strongly subjective and weakly subjective
  • SentiWordNet (Baccianella et al. 2010, http://sentiwordnet.isti.cnr.it/)
▶︎ WordNet synsets automatically annotated for positivity, negativity, and neutrality

This sentiment information is only a tendency! The actual sentiment of a word is context-dependent!
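As a concrete illustration of the lexicon approach, a minimal scorer can count matches against positive and negative word lists and compare the counts. The word lists below are tiny made-up stand-ins, not entries from the lexicons above:

```python
# Minimal lexicon-approach sketch: count hits in hand-picked word lists.
# These lists are illustrative only; real systems use lexicons like MPQA.
POSITIVE = {"good", "great", "love", "friendly", "happy"}
NEGATIVE = {"bad", "awful", "hate", "rude", "sad"}

def lexicon_sentiment(text: str) -> str:
    """Label text by comparing positive vs. negative lexicon matches."""
    tokens = text.lower().split()
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"
```

Note that this is exactly where the context-dependence warning bites: a bare word list cannot tell "served hot" (good for food) from "gets hot" (bad for electronics).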

SLIDE 14

Target-Specific Lexicons

Restaurant Reviews
  • 👏 My dish was served hot
  • 👏 The food is generally cheap
  • 👎 The music was so loud that we couldn't really talk

Speaker Reviews
  • 👎 It gets hot very easily
  • 👎 The company uses cheap materials
  • 👏 The speaker is very loud

Building target-specific lexicons manually is very time-consuming and expensive.

SLIDE 15

Target-Specific Lexicons

[Figure: restaurant-specific words (cash-only, cheap, loud, hot, bland) and speaker-specific words (loud, big, cheap materials, bluetooth, clear, hot) linked by co-occurrence to general positive words (love, like, impressed, happy, friendly) and general negative words (disappointed, hate, sorry)]

SLIDE 16

How to quantify co-occurrence between words?

  • Pointwise Mutual Information (PMI)
▶︎ The difference between "the probability of w1 occurring when w2 is observed" and "the marginal probability of w1 occurring":

  PMI(w1, w2) = log [ p(w1, w2) / (p(w1) p(w2)) ] = log [ p(w1|w2) / p(w1) ]

  • If w1 and w2 are completely independent: p(w1|w2) = p(w1), and PMI is low (zero).
  • If w1 and w2 always occur together: p(w1|w2) > p(w1), and PMI is high.
  • Choose words (e.g., adjectives) that have high PMI scores with general sentiment words.
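The PMI formula above can be sketched with document-level co-occurrence counts. The toy corpus and the choice to estimate probabilities as document frequencies are illustrative assumptions:

```python
import math

def pmi(docs, w1, w2):
    """PMI(w1, w2) = log p(w1, w2) / (p(w1) p(w2)), with probabilities
    estimated as the fraction of documents containing the word(s)."""
    n = len(docs)
    sets = [set(d.split()) for d in docs]
    p1 = sum(w1 in s for s in sets) / n
    p2 = sum(w2 in s for s in sets) / n
    p12 = sum(w1 in s and w2 in s for s in sets) / n
    return math.log(p12 / (p1 * p2))
```

In practice one would smooth the counts so that pairs that never co-occur do not produce log(0).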

SLIDE 17

General Sentiment Words


  • Sentiment lexicons
  • Emoticons/Emoji

:) <3 ^^ :/ :( T-T

  • Hashtags

#happy #excited #sad #exhausted

SLIDE 18

Expanding a lexicon

  • Synonyms and antonyms (from a thesaurus, WordNet)
▶︎ fast = expeditious, rapid, quick, speedy
▶︎ calm ↔ agitated, angry, inclement

SLIDE 19

Expanding a lexicon

(Hatzivassiloglou and McKeown 1997)

  • Conjunctions: and, or, but
▶︎ this laptop is fast and quiet
▶︎ the staff was friendly but too busy

[Figure: word graph over good, risky, fine, sound, selfish, harmful, with edge weights (0.1–0.8) giving the probability that the connected words share polarity]

P(good and sound have the same polarity) is estimated from the counts of different conjunctions between them ("good and sound", "good or sound", "good but sound").

SLIDE 20

[Figure: time series of the DJIA z-score and the Twitter "Calm" mood z-score, with events such as the bank bail-out marked]

Twitter mood predicts the stock market

(Bollen et al. 2011)

Profile of Mood States

(72 terms for six mood types)

  • Tension (↔calm): tense, on-edge
  • Depression (↔happy): unhappy, sad
  • Anger (↔kind): angry, grouchy
  • Vigor (vital): lively, active
  • Fatigue (↔alert): worn out, fatigued
  • Confusion (↔sure): confused, bewildered

Expanded Lexicon

(964 terms for six mood types)

  • Tension (↔calm): word1 (score1), ...
  • Depression (↔happy): ...
  • Anger (↔kind): ...
  • Vigor (vital): ...
  • Fatigue (↔alert): ...
  • Confusion (↔sure): ...

Weighted Sum of Mood Scores:

  Score(tweet, Tension) = Σ_{(w,s) ∈ Tension} s × 1(tweet has w)

(then average the Tension scores over tweets)
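The weighted sum above can be sketched directly. The lexicon entries and scores below are invented for illustration (the real expanded lexicon has 964 terms):

```python
# Sketch of the weighted-sum mood score:
#   Score(tweet, mood) = sum of s * 1(tweet has w) over lexicon entries (w, s).
# Entries and weights here are made up for illustration.
TENSION_LEXICON = {"tense": 1.0, "on-edge": 0.9, "anxious": 0.8}

def mood_score(tweet: str, lexicon: dict) -> float:
    """Sum the weights of lexicon words that appear in the tweet."""
    tokens = set(tweet.lower().split())
    return sum(s for w, s in lexicon.items() if w in tokens)
```

Per-day mood signals are then obtained by averaging this score over all tweets from that day.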

SLIDE 21

How to do sentiment analysis

Machine Learning Approach

SLIDE 22

We need: (1) labeled data

  • Collect already-labeled data: cheap, but limited domains
  • Annotate your own data: any domain, but expensive

(6) The criteria set by Rice are the following: the three countries in question are repressive (negative) and grave human rights violators (negative) . . . (Wilson et al., 2005)

@MargaretsBelly Amy Schumer is the stereotypical 1st world Laci Green feminazi. Plus she's unfunny
  • Target: amy shumer
  • Sentiment: negative
(Rosenthal et al., 2017)

SLIDE 23

We need: (1) labeled data

  • Use noisy labels: many domains and cheap, but can be inaccurate
▶︎ Sunny Again Work Tomorrow :-| TV Tonight
▶︎ #Shanghai has the most amazing skyline! <3
▶︎ #sad because i just realized i'll never be an uncle
▶︎ Great idea! I mean... California state government is so effective at everything else. #sarcasm
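Noisy labeling of this kind can be sketched as a simple marker lookup. The marker lists below are illustrative, not from a specific system:

```python
# Distant-supervision sketch: derive noisy labels from emoticons/hashtags.
# Marker lists are illustrative examples, not an exhaustive inventory.
POS_MARKERS = (":)", "<3", "#happy", "#excited")
NEG_MARKERS = (":(", "T-T", "#sad", "#exhausted")

def noisy_label(tweet: str):
    """Return a noisy sentiment label, or None if no marker is found."""
    if any(m in tweet for m in POS_MARKERS):
        return "positive"
    if any(m in tweet for m in NEG_MARKERS):
        return "negative"
    return None  # unlabeled; excluded from training
```

As the #sarcasm example shows, such labels can be wrong, which is the price paid for cheap coverage of many domains.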

SLIDE 24

We need: (2) class type and classifier

  • Binary (positive/negative)
▶︎ Logistic regression: Y = positive if p(Y = positive|x) > 0.5, negative otherwise
  • Continuous (star rating: 1..5)
▶︎ Linear regression: Y = w ⋅ x
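The logistic-regression decision rule can be sketched as follows, assuming the weight vector w and bias b have already been trained:

```python
import math

def logistic_predict(w, x, b=0.0):
    """Binary decision: positive iff p(Y=positive|x) = sigmoid(w.x + b) > 0.5
    (equivalently, iff the raw score w.x + b > 0)."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-score))
    return "positive" if p > 0.5 else "negative"
```

For the continuous case, linear regression simply returns the raw score w ⋅ x instead of thresholding it.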

SLIDE 25

We need: (2) class type and classifier

  • Multiclass (positive/neutral/negative)
▶︎ Two-step binary classification (neutral vs. non-neutral, then positive vs. negative)
▶︎ One-versus-rest
  • Train a binary classifier for each class (positive vs. other, negative vs. other, neutral vs. other)
  • Choose the decision of the classifier with the highest confidence score (probability)
▶︎ One-step: neural network, support vector machine, decision tree, ...
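The one-versus-rest decision can be sketched as picking the per-class classifier with the highest confidence:

```python
def one_vs_rest(prob_by_class):
    """One-versus-rest: given each binary classifier's confidence that its
    own class applies (vs. 'other'), pick the most confident class."""
    return max(prob_by_class, key=prob_by_class.get)
```

The confidences need not sum to one, since each comes from an independently trained binary classifier.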

SLIDE 26

We need: (3) evaluation metrics

  • Binary
▶︎ Balanced: Accuracy = (N_PP + N_NN) / (N_PP + N_PN + N_NP + N_NN)
▶︎ Skewed (e.g., "negative" is the majority):
  • Precision = N_PP / (N_PP + N_NP)
  • Recall = N_PP / (N_PP + N_PN)
  • F1-score = 2 × Precision × Recall / (Precision + Recall)
  • Multiclass
▶︎ Balanced: Accuracy = (N_PP + N_UU + N_NN) / Σ N
▶︎ Skewed: AverageRecall = (1/3) × ( N_PP/(N_PP + N_PU + N_PN) + N_UU/(N_UP + N_UU + N_UN) + N_NN/(N_NP + N_NU + N_NN) )

Binary confusion matrix (rows = true, columns = predicted):
          Pos    Neg
  Pos    N_PP   N_PN
  Neg    N_NP   N_NN

Multiclass confusion matrix (rows = true, columns = predicted):
          Pos    Neu    Neg
  Pos    N_PP   N_PU   N_PN
  Neu    N_UP   N_UU   N_UN
  Neg    N_NP   N_NU   N_NN
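These metrics can be computed directly from the confusion counts N_xy, where x is the true class and y the predicted class:

```python
def binary_metrics(n_pp, n_pn, n_np, n_nn):
    """Accuracy, precision, recall, and F1 from binary confusion counts
    (first index = true class, second index = predicted class)."""
    accuracy = (n_pp + n_nn) / (n_pp + n_pn + n_np + n_nn)
    precision = n_pp / (n_pp + n_np)   # correct positives / predicted positives
    recall = n_pp / (n_pp + n_pn)      # correct positives / true positives
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

def average_recall(conf):
    """Average recall from a multiclass confusion table conf[true][pred]:
    mean over classes of (diagonal count / row total)."""
    return sum(row[c] / sum(row.values()) for c, row in conf.items()) / len(conf)
```

Average recall gives every class equal weight, which is why it is preferred over accuracy on skewed class distributions.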

SLIDE 27

We need: (3) evaluation metrics

  • Continuous
▶︎ Mean Absolute Error: MAE = (1/N) Σ_{i=1}^{N} |y_i − ŷ_i|, where y_i and ŷ_i are the true and predicted scores of the i-th instance, respectively
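MAE can be sketched in a few lines:

```python
def mae(y_true, y_pred):
    """Mean Absolute Error: average of |y_i - y_hat_i| over paired scores."""
    assert len(y_true) == len(y_pred)
    return sum(abs(y - yh) for y, yh in zip(y_true, y_pred)) / len(y_true)
```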

SLIDE 28

Example 1 — Hand-Crafted Features

(Wilson et al. 2009)

  • The criteria set by Rice are the following: the three countries in question are repressive and grave human rights violators.

  • Jerome says the hospital feels no different than a hospital in the states.

Classify the sentiment of subjective expressions (pre-selected) in the MPQA corpus into pos/neg/neu

  • Word token / POS tag
  • Word context: (prev word, word, next word)
  • Prior polarity: From the sentiment lexicon
  • Is the word a noun preceded by an adjective?
  • Is the preceding word an adverb other than not?
  • Is the preceding word an intensifier?
  • Is the current word an intensifier?
  • Polarity of the word that the current word modifies
  • Polarity of the word that modifies the current word
  • Polarity of the word that is connected with the current word by a conjunct

  • Is the word in a subject? (polluters are allowed)
  • Is the word after a copular? (you are polluters)
  • Document's topic

Step 1. Classify neutral vs. non-neutral

SLIDE 29

Example 1 — Hand-Crafted Features

(Wilson et al. 2009)


Classify the sentiment of subjective expressions (pre-selected) in the MPQA corpus into pos/neg/neu.

  • Word token / POS tag
  • Prior polarity: From the sentiment lexicon
  • Is a negation word or phrase within the four words preceding the current word (where the negation word is not used as an intensifier, e.g., not only)?
  • Is the subject negated?
  • Is a general polarity shifter (little threat) within the four words preceding the current word?
  • Is a negative polarity shifter (lack of understanding) within the four words preceding the current word?
  • Is a positive polarity shifter (abate the damage) within the four words preceding the current word?

Step 2. Classify positive vs. negative

  • The criteria set by Rice are the following: the three countries in question are repressive and grave human rights violators.
  • Jerome says the hospital feels no different than a hospital in the states.
SLIDE 30

Example 2 — Multilayer Perceptron

Bag-of-words input x → hidden layer h (weights W_H) → output z = (z1, z2, z3) (weights W_Z)

  ŷ = softmax(z) = ( e^z1 / (e^z1 + e^z2 + e^z3), e^z2 / (e^z1 + e^z2 + e^z3), e^z3 / (e^z1 + e^z2 + e^z3) )
  ⇒ probability distribution over three classes

Given the true y (e.g., (1, 0, 0)) and the predicted ŷ (e.g., (0.6, 0.1, 0.3)), the loss for this instance is calculated using cross entropy:

  H(y, ŷ) = − Σ_{c=1}^{3} y_c log ŷ_c

  Total loss = Σ_{(y, ŷ) ∈ D} H(y, ŷ)

Goal: find the weights W_H and W_Z that minimize the total loss.
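Softmax and the per-instance cross-entropy loss can be sketched as:

```python
import math

def softmax(z):
    """Turn raw scores z into a probability distribution."""
    exps = [math.exp(zi) for zi in z]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(y, y_hat):
    """H(y, y_hat) = -sum_c y_c * log(y_hat_c) for one instance."""
    return -sum(yc * math.log(yhc) for yc, yhc in zip(y, y_hat) if yc > 0)
```

With a one-hot true label, the loss reduces to −log of the probability assigned to the correct class.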

SLIDE 31

Example 3 — Recurrent Neural Network

Input tokens: w1 = the, w2 = staff, w3 = was, w4 = not, w5 = friendly

  h0 = (0, ..., 0) → RNN → h1 → RNN → h2 → RNN → h3 → RNN → h4 → RNN → h5

RNN Cell: given the previous state h_{t−1}, the input w_t, weight matrices W_A and W_B, and bias b:

  h_t = tanh(W_A h_{t−1} + W_B w_t + b)
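The RNN cell update can be sketched with plain-list linear algebra; the weight values used when calling it would be arbitrary stand-ins for trained parameters:

```python
import math

def rnn_step(h_prev, x, W_A, W_B, b):
    """One RNN cell update: h_t = tanh(W_A h_{t-1} + W_B x_t + b)."""
    def matvec(W, v):
        return [sum(wij * vj for wij, vj in zip(row, v)) for row in W]
    a = matvec(W_A, h_prev)
    c = matvec(W_B, x)
    return [math.tanh(ai + ci + bi) for ai, ci, bi in zip(a, c, b)]

def run_rnn(xs, W_A, W_B, b, hidden_size):
    h = [0.0] * hidden_size  # h_0 = (0, ..., 0)
    for x in xs:             # one embedded token per step
        h = rnn_step(h, x, W_A, W_B, b)
    return h                 # final state feeds the classifier
```

The final hidden state h5 summarizes the whole sequence and is passed to a classification layer such as the softmax above.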

SLIDE 32

Example 4 — Recursive Neural Network

(Socher et al. 2013)

[Figure: sentiment composition over the parse tree of "This film does n't care about cleverness, wit or any other kind of intelligent humor." An RNN cell combines the representations of each pair of children; nodes are labeled + or −, and the negation "does n't" flips the positive "care about ... intelligent humor" subtree, yielding − at the root.]

SLIDE 33

Example 4 — Recursive Neural Network

(Socher et al. 2013)

[Figure: the same parse tree of "This film does n't care about cleverness, wit or any other kind of intelligent humor." with only the node polarities (+/−) shown]

SLIDE 34

Example 5 — BERT

(Devlin et al. 2018)


  • the staff was not friendly
SLIDE 35

Example 5 — BERT

(Devlin et al. 2018)

apples are better than grapes

SLIDE 36

Part 2. Computational Argumentation

SLIDE 37

SLIDE 38

SLIDE 39

Outline

  • What is an argument?
  • Argument Structures
  • Argument Mining — how to identify argument structure?
  • Argument Quality — how to measure argument quality?

[Image: Aristotle]

SLIDE 40

What is an argument?

(Hitchcock 2007)

  • A claim-reason complex consisting of
▶︎ an act of concluding
▶︎ one or more acts of premising (which assert propositions in favor of the conclusion)
▶︎ a stated or implicit inference word that indicates that the conclusion follows from the premises

Example: Korea is not a police state because the Korean elections in 1948 and 1950 were free and fair

SLIDE 41

What is an argument?

  • Proposition (in logic): A statement proposing an idea that can be true or false (Wikipedia). Sometimes a proposition is just called a statement. The basic unit of an argument.
    ✔ snow is white / snow is green   ✘ ouch!   ✘ where are we?   ✘ open the door!
  • Conclusion/Claim: A proposition that indicates what the arguer is trying to convince the reader of.
  • Premise/Evidence: A proposition that provides reason or support for the conclusion.

Example of a chained argument:
  P1: Chipotle makes a lot of money
  P2: many CMU students go to Chipotle
  P3: I have at least five CMU friends who go to Chipotle every day
(P3 is a premise for P2; P2 is the conclusion of P3 and, in turn, a premise for the conclusion P1)

SLIDE 42

Argument Structures

  • Modus ponens
    if Harry was born in Bermuda, Harry is a British subject (P → Q)
    Harry was born in Bermuda (P)
    ∴ Harry is a British subject (Q)

  • Modus tollens
    if Harry was born in Bermuda, Harry is a British subject (P → Q)
    Harry is not a British subject (¬Q)
    ∴ Harry was not born in Bermuda (¬P)

SLIDE 43

Argument Structures

  • Toulmin (1958)
▶︎ Datum: Evidence such as facts, reports, etc. (Harry was born in Bermuda)
▶︎ Claim: Assertion that the arguer wants to prove (Harry is a British subject)
▶︎ Warrant: Inferential link between datum and claim (because a man born in Bermuda will generally be a British subject)
▶︎ Backing: Justification for the warrant (on account of the legal statuses of A, B, C and legal provisions X, Y, Z)
▶︎ Rebuttal: Exceptions (unless both his parents were aliens)
▶︎ Qualifier: Certainty (presumably)

SLIDE 44

Argument Structures

  • Walton's Argument Schemes (Walton et al. 2008): Common reasoning patterns people use daily

Argument from Sign (e.g., detective novels)
  Premise: A generally causes B, and B is observed → Conclusion: A is true
  Example: here are some bear tracks in the snow, so a bear passed this way

Argument from Example (e.g., scientific papers)
  Premise: An example case B is true → Conclusion: A is true
  Example: pigeons can fly, so birds can fly

Argument from Consequence (e.g., policy making)
  Premise: A will yield a positive consequence B → Conclusion: A should be carried out
  Example: nuclear power will reduce air pollution, so we should build nuclear power plants

SLIDE 45

Argument Structures

  • NLP: the simplistic notions of support/attack are often used

Example: "Harry was born in Bermuda" supports "Harry is a British subject"; "both his parents were aliens" attacks it.

SLIDE 46

Argument Mining

How to identify argument structure from text automatically?

SLIDE 47

Argument Mining – Relation Classification

  • Given two propositions, classify their relation as support, attack, or unrelated.

Attack: "passing stricter gun control laws will not reduce crime because criminals will ignore those laws" attacks "stricter gun control laws will reduce crime"

Support: "a vegan society would be better for the environment" supports "all humans should be vegan"

Unrelated: "video games have a positive impact on society" is unrelated to "the government should respect the confidentiality of people's medical histories"

SLIDE 48

Argument Mining – Relation Classification

  • Step 1: Classify between unrelated and related. (Stab and Gurevych 2017)
▶︎ Overlap: Whether the conclusion and the premise share a noun.
▶︎ Discourse markers: Which class of discourse markers is used.
  • Consequence: therefore, thus, consequently
  • Epistemic: in my opinion, I believe that
  • Reason: because, in addition
  • Contradiction: although, but
▶︎ Syntactic features: Binary POS features of the conclusion and premise; 500 most frequent production rules extracted from the parse trees of the conclusion and premise (e.g., VP → VB NN)
▶︎ Structural features: # of tokens in the conclusion and premise.
▶︎ Discourse features: Has a PDTB-style discourse relation (e.g., causal, contrast)?
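Two of the Step-1 features can be sketched as follows. The noun-overlap feature is approximated here with plain word overlap (a real implementation would use a POS tagger to keep only nouns), and the marker and stopword lists are abbreviated illustrations:

```python
# Sketch of two relatedness features; lists are abbreviated for illustration.
CONSEQUENCE = ("therefore", "thus", "consequently")
REASON = ("because", "in addition")

def overlap_feature(conclusion: str, premise: str) -> bool:
    """Rough stand-in for the shared-noun feature: do the two propositions
    share any non-stopword? (A real system would filter to nouns via POS.)"""
    stop = {"the", "a", "an", "is", "are", "will", "not"}
    c = set(conclusion.lower().split()) - stop
    p = set(premise.lower().split()) - stop
    return len(c & p) > 0

def discourse_marker_class(text: str):
    """Return the class of the first discourse marker found, if any."""
    t = text.lower()
    if any(m in t for m in CONSEQUENCE):
        return "consequence"
    if any(m in t for m in REASON):
        return "reason"
    return None
```

Features like these are then fed to an ordinary classifier (e.g., an SVM) for the related/unrelated decision.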

SLIDE 49

Argument Mining – Relation Classification

  • Step 2: Classify between support and attack. (Gemechu and Reed 2019)

"Cooking over a fire makes burgers and potatoes taste better" (+) and "Campers have an opportunity to try some interesting food" (+): matching sentiments toward related concepts signal Support (sentiment consistency).

The concept matching uses word embeddings, synonym-based approaches, etc.

SLIDE 50

Argument Mining – Relation Classification

  • Step 2: Classify between support and attack. (Gemechu and Reed 2019)

"Advertising alcohol and cigarettes with adult content should be prohibited" (−) and "Modern society needs advertising" (+): conflicting sentiments toward the same concept signal Attack (sentiment conflict).

SLIDE 51

Argument Quality

How to measure the quality of an argument automatically?

SLIDE 52

Argument Quality — Pairwise Ranking

(Habernal and Gurevych 2016)

  • Which argument is more convincing?

Prompt: Should physical education be mandatory in schools?

Argument A: physical education should be mandatory cuhz 112,000 people have died in the year 2011 so far and it's because of the lack of physical activity and people are becoming obese!!!!

Argument B: YES, because some children don't understand anything excect physical education especially rich children of rich parents.

SLIDE 53

Argument Quality — Pairwise Ranking

(Habernal and Gurevych 2016)

  • Features
▶︎ Unigrams/bigrams
▶︎ Formality: distribution of POS tags
▶︎ Ratio of exclamation/quotation marks
▶︎ Ratio of modal verbs
▶︎ Readability: ARI, Coleman-Liau, Flesch, ...
▶︎ Spelling errors
▶︎ Counts of named entity types
▶︎ Sentiment scores
▶︎ Sentence length

Accuracy = 0.78 (SVM)

Argument A: physical education should be mandatory cuhz 112,000 people have died in the year 2011 so far and it's because of the lack of physical activity and people are becoming obese!!!!

Argument B: YES, because some children don't understand anything excect physical education especially rich children of rich parents.

SLIDE 54

Argument Quality — Scoring

(Persing and Ng 2017)

  • Assertion: Acting as a warning signal for children at risk.
  • Justification: It is very difficult for a child to realize that he is being groomed; they are unlikely to know the risk. After all, ...

This House would ban teachers from interacting with students via social networking websites.

Grading Criteria

  • Grammar Error
  • Lack of Objectivity
  • Inadequate Support
  • Unclear Assertion
  • Unclear Justification
SLIDE 55

Argument Quality — Scoring

(Persing and Ng 2017)

  • Features
▶︎ # of grammar errors
▶︎ # of subjectivity indicators
▶︎ # of definite articles
▶︎ # of first person plural pronouns
▶︎ # of citations
▶︎ # of content lemmas
▶︎ # of subject matches between assertion and justification
▶︎ ...

Criteria

  • Grammar Error
  • Lack of Objectivity
  • Inadequate Support
  • Unclear Assertion
  • Unclear Justification

Each criterion is scored by a regression model, and a second regression combines the criterion scores into the Final Score. MAE = 1.03 (scores range from 1–6).

SLIDE 56

Today we learned...


  • Computational Argumentation

▶︎ What is an argument? ▶︎ Argument Structures ▶︎ Argument Mining ▶︎ Argument Quality

  • Sentiment Analysis

▶︎ What is sentiment analysis? ▶︎ How to do sentiment analysis

  • Lexicons
  • Machine Learning