Maintaining sentiment polarity in translation of user-generated - PowerPoint PPT Presentation

Maintaining sentiment polarity in translation of user-generated content Pintu Lohar, Haithem Afli and Andy Way ADAPT Centre, School of Computing, Dublin City University The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.

Contents www.adaptcentre.ie  Objective & Motivation  Sentiment analysis of user-generated content  Data Preparation  Corpus development  Sentiment annotation and classification  Experiments  Sentiment Translation Architecture  Results  Discussion  Conclusions and future work

Objective www.adaptcentre.ie  Analyse sentiment preservation & MT quality in the context of user-generated content (UGC)

Objective www.adaptcentre.ie  Analyse sentiment preservation & MT quality in the context of user-generated content (UGC)  Focus on whether sentiment classification helps improve sentiment preservation in MT of UGC

Motivation www.adaptcentre.ie • Translation quality per se is not the main concern

Motivation www.adaptcentre.ie • Translation quality per se is not the main concern  Sentiment preservation is (arguably more) important e.g. companies want to know what their customers think of their products and services. It is crucial that user sentiment in one language is preserved in the target language (typically, English).

Motivation www.adaptcentre.ie Customer feedback in Japanese

Motivation www.adaptcentre.ie Customer feedback in Japanese Japanese English Sentiment Translate Sentiment analysis data data classes

Track Record in UGC www.adaptcentre.ie

Track Record in UGC www.adaptcentre.ie 13 languages and 24 language pairs 85,047,110 tweets in total Irish Spanish Korean Italian Farsi German English French Portuguese Greek Croatian Japanese Chinese

Sentiment analysis of UGC www.adaptcentre.ie  UGC includes blog posts, podcasts, online videos, tweets etc.  UGC is usually multilingual and of varying quality (sometimes deliberately)  Sentiment analysis of UGC has many applications

Sentiment analysis of UGC www.adaptcentre.ie Crosslingual sentiment analysis(CLSA):  The task of predicting the polarity of the opinion of a text in a language using a classifier trained on the corpus of another language (Balamurli et al. (2012))

Sentiment analysis of UGC www.adaptcentre.ie Crosslingual sentiment analysis(CLSA):  The task of predicting the polarity of the opinion of a text in a language using a classifier trained on the corpus of another language (Balamurli et al. (2012)) MT-based CLSA:  MT is utilized to leverage its capability, existing SA resources available in English to classify sentiment in other languages (Mihalcea et al. (2012))

Related work www.adaptcentre.ie MT can alter the sentiment (Mohammad et al. (2016)) Google Translate from English to German on 25/05/2017 English: he is out of the world cup negative German: Er ist aus des weltmeisterschaft neutral

Sentiment Analysis of UGC www.adaptcentre.ie • Can a sentiment classification approach help improve sentiment preservation in the target language ?

Sentiment Analysis of UGC www.adaptcentre.ie • Can a sentiment classification approach help improve sentiment preservation in the target language ? • Is it useful to select a specific-sentimented MT model to translate the UGC with the same sentiment ?

Data preparation www.adaptcentre.ie Corpus development:  Twitter data set comprising 4,000 English tweets from the FIFA World Cup 2014 and their manual translations into German

Data preparation www.adaptcentre.ie Corpus development:  Twitter data set comprising 4,000 English tweets from the FIFA World Cup 2014 and their manual translations into German  Informal translations of English tweets into German e.g. English tweet German tweet Goaaaal Toooor

Sentiment annotation and classification www.adaptcentre.ie  Sentiment annotation Manually annotated sentiment scores between 0 and 1

Sentiment annotation and classification www.adaptcentre.ie  Sentiment annotation Manually annotated sentiment scores between 0 and 1  Sentiment classes (i) Negative: sentiment score ≤ 0.4 (ii) Neutral: sentiment score ≈ 0.5 (iii) Positive: sentiment score ≥ 0.6 e.g. Tweet Sentiment score injured Neymar out of World Cup 0.2

Sentiment annotation and classification www.adaptcentre.ie  Manual annotation of Twitter data is considered as the “gold - standard”

Sentiment annotation and classification www.adaptcentre.ie  Manual annotation of Twitter data is considered as the “gold - standard”  50 tweets per sentiment (negative, neutral and positive) are held out for tuning and testing purposes Development Test Data Train Total #neg #neu #pos #neg #neu #pos Twitter 3,700 50 50 50 50 50 50 4,000 Data distribution of Twitter data for Training, development and test

Sentiment annotation and classification www.adaptcentre.ie  Flickr and News commentary (``News’’) data are used as additional resources  Automatic sentiment analysis tool (Afli et. al. (2017)) is applied to Flickr and News data

Sentiment annotation and classification www.adaptcentre.ie  Flickr and News commentary (``News’’) data are used as additional resources  Automatic sentiment analysis tool (Afli et. al. (2017)) is applied to Flickr and News data Performance accuracy:  2,994 tweets out of 4,000 correctly classified by this tool when compared to the ‘gold standard’ data  Accuracy = 74.85%

Sentiment annotation and classification www.adaptcentre.ie Data Sentiment #neg #neu #pos #total classification Twitter manual 919 1,308 1,473 3,700 Flickr automatic 9,677 11,065 8,258 29,000 News automatic 111,337 14,306 113,200 238,843 Data distribution after sentiment classification

Experiments www.adaptcentre.ie I. Translation without sentiment classification

Experiments www.adaptcentre.ie I. Translation without sentiment classification II. Translation with sentiment classification i. Manual sentiment classification (only Twitter data) ii. Automatic sentiment classification (Flickr & News data)

Experiments www.adaptcentre.ie I. Translation without sentiment classification II. Translation with sentiment classification i. Manual sentiment classification (only Twitter data) ii. Automatic sentiment classification (Flickr & News data) III. Translation by wrong MT engines i. Negative tweets by positive model ii. Neutral tweets by negative model iii. Positive tweets by neutral model

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive model model model

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive model model model model model model

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive Baseline model model model model model model model

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive Baseline model model model model model model model Translate

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive Baseline model model model model model model model Translate Negative Neutral Positive test test test

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive Baseline model model model model model model model Translate Negative Neutral Positive Neutral Negative Positive test test test test test test

Sentiment Translation Architecture www.adaptcentre.ie Parallel corpus Sentiment No Sentiment Classification Classification Manual Automatic Negative Neutral Positive Negative Neutral Positive Baseline model model model model model model model Translate Negative Neutral Positive Neutral Negative Positive whole test test test test test test test data

Maintaining sentiment polarity in translation of user-generated - PowerPoint PPT Presentation

Maintaining sentiment polarity in translation of user-generated content Pintu Lohar, Haithem Afli and Andy Way ADAPT Centre, School of Computing, Dublin City University The ADAPT Centre is funded under the SFI Research Centres Programme (Grant

MAINTAINING COMPLIANCE MAINTAINING COMPLIANCE MAINTAINING COMPLIANCE MAINTAINING MAINTAINING

Learning Sentiment Polarity of Multiword Expressions M A X K A U F M A N N , N I C K C H E N ,

Exploiting Syntax in Sentiment Polarity Classification Wolfgang Seeker joint work with Adam

Constructing Sentiment Sensitive Vectors for Word Polarity Classification Speaker: Johann Chu

Twitter Sentiment Analysis Twitter Sentiment Analysis Presented by: Loitongbam Gyanendro Singh

Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis Presented by Kay Lu, Ashley

Solenoid Polarity Yngve Inntjore Levinsen CERN 14. of December, 2012 Solenoid Polarity

Polarity By adding the individual bond dipoles, one can determine the overall dipole moment

Sentiment analysis Christopher Potts CS 244U: Natural language understanding May 19 1 / 83

Linguistic Expressions of Sentiment, Subjectivity & Stance Ling575 Sentiment April 1, 2014

Pl u tchik ' s w heel of emotion , polarit y v s . sentiment SE N TIME N T AN ALYSIS IN R Ted K

Ontology-Driven Sentiment Analysis of Product and Service Aspects Kim Schouten and Flavius

Bag-of-Words Models and Beyond Sentiment, Subjectivity, and Stance Ling 575 April 8, 2014

1 Budapest Workshop, Oct. 2015 / 38 Quantifier Polarity and Verification Goal To gain

Polarity of points for systems of linear spdes in critical dimensions Robert C. Dalang Ecole

Surface Reasoning Lecture 4: Negative Polarity and Antitonicity Thomas Icard June 18-22, 2012

Matthew Series Lesson #115 April 3, 2016 Dean Bible Ministries www.deanbibleministries.org

Third-Person Visual Imitation Learning via Decoupled Hierarchical Control Pratyusha Sharma,

The Ultimate Friend LESSON 12 Your Response to the Lesson What was most interesting in the Bible

Jesus sets up the Seder Jesus sets up the Seder Bir irkat Haner er (Blessing of

Bayesian Networks and Decision Graphs Chapter 1 Chapter 1 p. 1/13 Two perspectives on

CS260-002: Spatial Data Modeling and Analysis Course Outline Instructor: Amr Magdy Computer

Basics of Functional Programming 1 / 12 A Motivating Example: Cafe 1 class Cafe { 2 def

Topic 7 parameters "We're flooding people with information. We need to feed it through a

Maintaining sentiment polarity in translation of user-generated - PowerPoint PPT Presentation

Maintaining sentiment polarity in translation of user-generated content Pintu Lohar, Haithem Afli and Andy Way ADAPT Centre, School of Computing, Dublin City University The ADAPT Centre is funded under the SFI Research Centres Programme (Grant

MAINTAINING COMPLIANCE MAINTAINING COMPLIANCE MAINTAINING COMPLIANCE MAINTAINING MAINTAINING

Learning Sentiment Polarity of Multiword Expressions M A X K A U F M A N N , N I C K C H E N ,

Exploiting Syntax in Sentiment Polarity Classification Wolfgang Seeker joint work with Adam

Constructing Sentiment Sensitive Vectors for Word Polarity Classification Speaker: Johann Chu

Twitter Sentiment Analysis Twitter Sentiment Analysis Presented by: Loitongbam Gyanendro Singh

Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis Presented by Kay Lu, Ashley

Solenoid Polarity Yngve Inntjore Levinsen CERN 14. of December, 2012 Solenoid Polarity

Polarity By adding the individual bond dipoles, one can determine the overall dipole moment

Sentiment analysis Christopher Potts CS 244U: Natural language understanding May 19 1 / 83

Linguistic Expressions of Sentiment, Subjectivity &amp; Stance Ling575 Sentiment April 1, 2014

Pl u tchik ' s w heel of emotion , polarit y v s . sentiment SE N TIME N T AN ALYSIS IN R Ted K

Ontology-Driven Sentiment Analysis of Product and Service Aspects Kim Schouten and Flavius

Bag-of-Words Models and Beyond Sentiment, Subjectivity, and Stance Ling 575 April 8, 2014

1 Budapest Workshop, Oct. 2015 / 38 Quantifier Polarity and Verification Goal To gain

Polarity of points for systems of linear spdes in critical dimensions Robert C. Dalang Ecole

Surface Reasoning Lecture 4: Negative Polarity and Antitonicity Thomas Icard June 18-22, 2012

Matthew Series Lesson #115 April 3, 2016 Dean Bible Ministries www.deanbibleministries.org

Third-Person Visual Imitation Learning via Decoupled Hierarchical Control Pratyusha Sharma,

The Ultimate Friend LESSON 12 Your Response to the Lesson What was most interesting in the Bible

Jesus sets up the Seder Jesus sets up the Seder Bir irkat Haner er (Blessing of

Bayesian Networks and Decision Graphs Chapter 1 Chapter 1 p. 1/13 Two perspectives on

CS260-002: Spatial Data Modeling and Analysis Course Outline Instructor: Amr Magdy Computer

Basics of Functional Programming 1 / 12 A Motivating Example: Cafe 1 class Cafe { 2 def

Topic 7 parameters &quot;We're flooding people with information. We need to feed it through a

Linguistic Expressions of Sentiment, Subjectivity & Stance Ling575 Sentiment April 1, 2014

Topic 7 parameters "We're flooding people with information. We need to feed it through a