Text Mining Paper Presentation: Determining the Sentiment of Opinions
Soo-Min Kim Eduard Hovy Presenters: Karthik Chinnathambi (kc4bf) Prashant Bhanu Gorthi (pg3bh) Sofia Francis Xavier (sf4uh)
Text Mining Paper Presentation: Determining the Sentiment of - - PowerPoint PPT Presentation
Text Mining Paper Presentation: Determining the Sentiment of Opinions Soo-Min Kim Eduard Hovy Presenters: Karthik Chinnathambi (kc4bf) Prashant Bhanu Gorthi (pg3bh) Sofia Francis Xavier (sf4uh) Background Opinion: [Topic, Holder, Claim,
Soo-Min Kim Eduard Hovy Presenters: Karthik Chinnathambi (kc4bf) Prashant Bhanu Gorthi (pg3bh) Sofia Francis Xavier (sf4uh)
Given a Topic and a set of texts about the topic, find the Sentiments expressed about the Topic in each text, and identify the people who hold each sentiment.
Given a topic and a set of texts, the system operates in four steps.
classifier
positive and 19 negative), adding nouns later.
appropriate seed lists.
negative verbs. Challenge: Some words are both positive and negative!
Given a new word use WordNet to obtain a synonym set of the unseen word argmaxP ( c | w ) ≅argmaxP ( c | syn1, syn2,....synn)
Model 1 : Model 2 :
fk: kth feature of sentiment class c and a member of the synonym set of w count(fk,synset(w)): total number of occurrences of fk in the synonym set of w.
P(w|c) : Probability of word w given a sentiment class c.
Two sets of experiments to examine the performance of:
Classification task defined as assigning each word / sentence as:
Training Data
Methodology
○ Baseline for evaluating models proposed in the paper
○ Model that randomly assigns a sentiment category to each word (averaged over 10 iterations) ○ Model 1 proposed in slide 9 - statistical model that takes into account both polarity and strength of the sentiment
Testing the models
Evaluation
○ Strict agreement - Agree over all 3 categories ○ Lenient agreement - Merge positive and neutral into one category. Differentiate words with negative sentiment.
Results
Training Data
Testing and Evaluation
○ 3 models for sentence classification ○ 4 different window definitions ○ 4 variations of word level classifiers
Observations
○ 81% accuracy with manually annotated holder ○ 67% accuracy with automatic holder identification
Best performance achieved using:
Effect of sentiment categories:
○ E.g., ‘Term Limits really hit at democracy’, says Prof. Fenno
○ E.g., She thinks term limits will give women more opportunities in politics
sentence
Future work identified by the authors