Twe Tweet ets Didem Demira Anisa Halimi Nora von Thenen 2 - - PowerPoint PPT Presentation

twe tweet ets
SMART_READER_LITE
LIVE PREVIEW

Twe Tweet ets Didem Demira Anisa Halimi Nora von Thenen 2 - - PowerPoint PPT Presentation

Emot otion ion De Dete tectio ction n fr from om Twe Tweet ets Didem Demira Anisa Halimi Nora von Thenen 2 Problem Description Tweet Emotion Enjoy the little things in life! Happy In the end you have to be your own hero


slide-1
SLIDE 1

Emot

  • tion

ion De Dete tectio ction n fr from

  • m

Twe Tweet ets

Didem Demirağ Anisa Halimi Nora von Thenen

slide-2
SLIDE 2

Problem Description

Tweet Emotion Enjoy the little things in life! Happy In the end you have to be your own hero because everyone else is too busy trying to save themselves. Sad Tomorrow will be a better day! Hopeful Bought 5 things from whole foods and it cost $230… Complainable Can’t wait to work again! Complainable

2

Table 1: Examples of emotions in tweets

slide-3
SLIDE 3

Motivation

  • Market researchers and companies
  • Political campaigns
  • People's reactions in a crisis
  • Keeping track of the emotional state of a patient with a certain disease
  • Define certain psychological disorders
  • Sociologist can infer the life quality of a population.

3

slide-4
SLIDE 4

Selected Emotions Based on Ekman’s Model

  • Happy
  • Sad
  • Angry
  • Afraid
  • Hopeful
  • Complainable

4

slide-5
SLIDE 5

Methodology

  • 1. Classify the tweets into three main groups; 'positive', 'negative' and 'neutral'.
  • 'Natural Language Toolkit' of Python (Bayesian classifier)
  • 2. Process tweets.
  • Change hashtags into normal words.
  • Apply stemming and filter out stop words.
  • 3. Use keyword lists to assign a more specific emotion to the as 'neutral', 'positive' or

'negative‘ pre-labeled tweets.

  • The keyword list with the most matches within the tweet will determine the emotional

label.

5

slide-6
SLIDE 6

Methodology

  • After labeling:
  • Use different machine learning algorithms to train and test our dataset
  • SVM (Support Vector Machine)
  • Naive Bayes
  • KNN (K-Nearest Neighbor)
  • Compute the recall and precision rates to measure the accuracy of the algorithms.
  • Compare these algorithms to decide which works most efficiently.

6

slide-7
SLIDE 7

Data

Datasets Number of tweets positive labeled tweets 2.949 negative labeled tweets 3.293 neutral labeled tweets 6.353 total 12.595

7

Table 2: Dataset size

slide-8
SLIDE 8

First Results - pre-labeled Tweets

Tweet Label Enjoy the little things in life! positive In the end you have to be your own hero because everyone else is too busy trying to save themselves negative Tomorrow will be a better day positive Bought 5 things from whole foods and it cost $230… negative Can’t wait to work again! negative

8

Table 3: Some examples of pre-labeled tweets

slide-9
SLIDE 9

Expected Results

  • Aim: to group the tweets according to their emotion with high efficiency and

accuracy.

  • Obtain statistical data about which method is efficient for detecting emotions in

tweets.

  • Decide whether using hashtags, emoticons, keyword lists or a combination of

them is more precise in terms of detecting emotions.

9

slide-10
SLIDE 10

References

[1] M. Hasan, E. Rundensteiner, and E. Agu. Emotex: Detecting emotions in twitter

  • messages. In ASE BIGDATA/SOCIALCOM/CYBERSECURITY Conference, pages

27–31, May 2014. [2] P. Ekman. Basic emotions. Handbook of Cognition and Emotion, 98:45-60, 1999.

10