Emo$on Recogni$on in Images and Text Agata Lapedriza - - PowerPoint PPT Presentation

emo on recogni on in images and text
SMART_READER_LITE
LIVE PREVIEW

Emo$on Recogni$on in Images and Text Agata Lapedriza - - PowerPoint PPT Presentation

Emo$on Recogni$on in Images and Text Agata Lapedriza alapedriza@uoc.edu / agata@mit.edu Associate Professor Visi$ng Researcher https://pxhere.com/en/photo/686169 Recognizing others emotions Why is this capacity useful? Recognizing others


slide-1
SLIDE 1

Emo$on Recogni$on in Images and Text

Agata Lapedriza

alapedriza@uoc.edu / agata@mit.edu

Associate Professor Visi$ng Researcher

slide-2
SLIDE 2

https://pxhere.com/en/photo/686169

slide-3
SLIDE 3

Why is this capacity useful?

Recognizing others emotions…

slide-4
SLIDE 4

Why is this capacity useful?

  • Social Interactions
  • Detecting people’s needs
  • Predicting people’s reactions

Recognizing others emotions…

slide-5
SLIDE 5

h"ps://www.maxpixel.net/Quiz-Think-Ques7on-Thinking-Brain-Answer-2004314 h"ps://www.maxpixel.net/Calm-Smiley-Ball-Angry-Anxiety-Emo7con-Anger-2979107

Cogni$on Emo$ons

slide-6
SLIDE 6

AI in Science Fic$on Movies

slide-7
SLIDE 7

Emo$on Recogni$on

A lot of signals in our bodies change when our emo$ons change Wri$ng Voice Typing

ETC…

Pose Gestures

slide-8
SLIDE 8

SENSORS (ex: cameras, …) SIGNALS (ex: face, heart rate) EMOTIONS (ex: happiness,…)

Emo$on Recogni$on

slide-9
SLIDE 9

SENSORS (ex: cameras, …)

Emo$on Recogni$on

slide-10
SLIDE 10

Facial expression analysis

hLps://www.affec$va.com AffdexMe app

slide-11
SLIDE 11

https://pxhere.com/en/photo/686169

slide-12
SLIDE 12

Happiness

Smile (0.99), Attention (0.54)

slide-13
SLIDE 13

Happiness

Smile (0.99), Attention (0.54)

?

slide-14
SLIDE 14

Facial expression analysis

!

  • 1. Non-frontal Faces and Par$al Occlusions.
slide-15
SLIDE 15

Surprise

Mouth Open (1.00)

slide-16
SLIDE 16
slide-17
SLIDE 17

Facial expression analysis

!

  • 1. Non-frontal Faces and Par$al Occlusions.
  • 2. Some Facial Expressions are not related to our

emo$ons, but to our ac$ons.

  • 3. Difficulty in giving Emo$onal Meaning to an

isolated Facial Expression.

slide-18
SLIDE 18

Aviezer, H., Hassin, R., Ryan, J., Grady, C., Susskind, J., Anderson, A., Moscovitch, M., & Ben$n, S. Angry, disgusted or afraid? Studies on the malleability of emo$on percep$on. Psychological Science, 19, 724-732 (2008a)

slide-19
SLIDE 19

Aviezer, H., Hassin, R., Ryan, J., Grady, C., Susskind, J., Anderson, A., Moscovitch, M., & Ben$n, S. Angry, disgusted or afraid? Studies on the malleability of emo$on percep$on. Psychological Science, 19, 724-732 (2008a)

  • Anger
  • Contempt
  • Disgust
  • Fear

?

slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22

Disgust Anger Sadness Fear

slide-23
SLIDE 23

Apparent Emo$onal States Recogni$on

slide-24
SLIDE 24

Apparent Emo$onal States Recogni$on

slide-25
SLIDE 25

Apparent Emo$onal States Recogni$on

Confidence feeling of being certain; convic$on that an outcome will be favorable; encouraged; proud

slide-26
SLIDE 26

From pictures to emotions

Emo$ons

Deep Learning Model

slide-27
SLIDE 27

From pictures to emotions Deep Learning Model

Emo$ons

Challenge: Training Data

slide-28
SLIDE 28

Collec$ng images

Happiness Esteem Excitement Confidence

V A D 5 10

Anticipation Excitement Engagement

V A D 5 10

Engagement

V A D 5 10

  • Images manually downloaded from search engines
  • Images from other public datasets: MSCOCO, Ade20k
slide-29
SLIDE 29

Back Go to Next Image (Image 1 of 20)

Peace (well being and relaxed/no worry/positive sensation/satisfjed) Afgection (fond feelings/tenderness/love/compassion) Expectation (state of anticipating/hoping on something or someone) Esteem (favorable opinion or judgment/gratefulness/admiration/respect) Confjdence (feeling of being certain/proud/encouraged/optimistic) Engagement (occupied/absorbed/interested/paying attention to something) Pleasure (feeling of delight in the senses) Happiness (feeling delighted/enjoyment/amusement) Excitement (pleasant and excited state/stimulated/energetic/enthusiastic) Surprise (sudden discovery of something unexpected) Sufgering (distressed/perturbed/anguished) Disapproval (think that something is wrong or reprehensible/contempt/hostile) Yearning (strong desire to have something/jealous/envious) Fatigue (weariness/tiredness/sleepy) Pain (physical sufgering) Doubt/Confusion (diffjculty to understand or decide/sceptical/lost) Fear (feeling afraid of danger/evil/pain/horror) Vulnerability (feeling of being physically or emotionally wounded) Disquitement (unpleasant restlessness/tense/worried/upset/stressed) Annoyance (bothered/iritated/impatient/troubled/frustrated) Anger (intense displeasure or rage/furious/resentful) Disgust (feeling dislike or repulsion/feeling hateful) Sadness (feeling unhappy/grief/disappointed/discouraged) Disconnection (not participating/indifgerent/bored/distracted) Embarrassment (feeling ashamed or guilty)

slide-30
SLIDE 30

Back Go to Next Image (Image 1 of 20)

Valence: Negative vs. Positive Arousal (awakeness): Calm vs. Ready to act Dominance: Dominated vs. In control

Gender and age of the person in the yellow box Positive (pleasant) Negative (unpleasant) Ready to act (active) Calm In control Dominated (no control) Male Female Kid (0-12) Teenager (13-20) Adult (more than 20)

slide-31
SLIDE 31

Crowdsourcing

slide-32
SLIDE 32

Emotic Database 34,320 Annotated People 23,571 Annotated Images

slide-33
SLIDE 33

Deep Learning Model Emotic Database

slide-34
SLIDE 34

Person features Context features

slide-35
SLIDE 35

Anticipation Engagement Excitement Confidence

slide-36
SLIDE 36

Pleasure Happiness Affection

slide-37
SLIDE 37

Happiness

slide-38
SLIDE 38

Pleasure Affection Happiness

Pleasure Disaproval Doubt/Confusion Disquietment Surprise Sensitivity Aversion Fatigue Sadness Esteem

slide-39
SLIDE 39

Emo$ons in Context

hLp://sunai.uoc.edu/emo$c/

Ronak Kos$ Jose Alvarez Adria Recasens Agata Lapedriza

  • R. Kos$, J.M Alvarez, A. Recasens, A.Lapedriza. "Emo$on Recogni$on in Context". Computer Vision and

PaLern Recogni$on (CVPR), 2017.

  • R. Kos$, J.M Alvarez, A. Recasens, A.Lapedriza. ”Context based Emo$on Recogni$on using EMOTIC

dataset". IEEE Transac$ons on PaLern Analysis and Machine Intelligence (PAMI), 2019.

slide-40
SLIDE 40

Text Sen$ment Analysis

slide-41
SLIDE 41

Text Sen$ment Analysis ☺ or ?

What a delight! Terrific menu, great cral cocktails, unpreten$ous atmosphere of mostly locals and college professors chamng over dinner.

slide-42
SLIDE 42

Text Sen$ment Analysis

Felbo, B., Mislove, A., Sogaard, A., Rahwan, I. and Lehmann, S., 2017. Using millions of emoji occurrences to learn any-domain representa$ons for detec$ng sen$ment, emo$on and

  • sarcasm. arXiv preprint arXiv:

1708.00524.

DeepMoji

slide-43
SLIDE 43

DeepMoji

1246 million tweets containing, at least, one

  • f the 64 common emojis

Example: U2 Wonderful concert yesterday in Barcelona U2 Wonderful concert yesterday in Barcelona

x y

slide-44
SLIDE 44
slide-45
SLIDE 45

Interac$ve scenarios

The capacity of recognizing emo$ons in sentences is also interes$ng in interac$ve scenarios.

Hello vs. Goal Oriented Open Domain

slide-46
SLIDE 46

Interac$ve scenarios

Hello Open Domain

slide-47
SLIDE 47

Open Domain Dialog Systems

How do we evaluate open domain dialog systems?

OPTION 1: Automa$c text metrics (word overlap metrics; ex: BLEU score), Embedding-distance based metrics (ex: Average, Greedy, Extrema)

Chia-Wei Liu, Ryan Lowe, Iulian Serban, Mike Noseworthy, Laurent Charlin, and Joelle

  • Pineau. How not to evaluate your dialogue system: An empirical study of unsupervised

evalua$on metrics for dialogue response genera$on. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2122–2132, 2016.

slide-48
SLIDE 48

How do we evaluate open domain dialog systems?

OPTION 2: Human Evalua$on

Open Domain Dialog Systems

OPTION 1: Automa$c text metrics (word overlap metrics; ex: BLEU score), Embedding-distance based metrics (ex: Average, Greedy, Extrema)

slide-49
SLIDE 49

Human evalua$on of open domain dialog systems

The common prac$ce is to use 1-turn evalua$on

A human rates how good the response: 7/10

slide-50
SLIDE 50

Human evalua$on of open domain dialog systems

The common prac$ce is to use 1-turn evalua$on A human rates how good the response: 7/10

slide-51
SLIDE 51

Evalua$ng Dialog Systems: Our Proposal

  • We need interac$ve evalua$on
  • We need to evaluate different aspects of the system
  • Quality: overall, how was the quality of the chat?
  • Diversity: how non-repe$$ve were the chat bot’s responses?
  • Fluency: how correct were the grammar and sentence structure
  • f the chat bot’s response?
  • ConJngency: how related to your messages were the chat bot’s

response?

  • Empathy: how emo$onally appropriate were the chat bot’s response?
slide-52
SLIDE 52

Our open source platorm for Mul$-turn evalua$on

slide-53
SLIDE 53

HRED VHRED VHCR

[1] Iulian V Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, and Joelle Pineau. Building end-to-end dialogue systems using genera$ve hierarchical neural network models. In Thir7eth AAAI Conference on Ar7ficial Intelligence, 2016. [2] Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, and Yoshua

  • Bengio. A hierarchical latent variable encoder-decoder model for genera$ng dialogues. In Thirty-First AAAI Conference
  • n Ar7ficial Intelligence, 2017.

[3] Yookoon Park, Jaemin Cho, and Gunhee Kim. A hierarchical latent structure for varia$onal conversa$on modeling. In Proceedings of the 2018 Conference of the North American Chapter of the Associa7on for Computa7onal Linguis7cs: Human Language Technologies, Volume 1 (Long Papers), pages 1792–1801, 2018.

References:

Dialog Models

Genera$ve Neural Network Models

slide-54
SLIDE 54

HRED VHRED VHCR HRED + EI VHRED + EI VHCR + EI

  • The text sen$ment
  • The text topic

RegularizaJon technique that makes the dialog model to be more aware of:

Dialog Models

Genera$ve Neural Network Models

slide-55
SLIDE 55

HRED

slide-56
SLIDE 56

HRED + EI Regulariza$on

Emo$on Seman$c Similarity [1]

[1] A. Conneau, D. Kiela, H. Schwenk, L. Barrault, and A. Bordes. Super- vised learning of universal sentence representa$ons from natural language inference data. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 670–680, 2017.

slide-57
SLIDE 57

Data

Cornell Dataset

  • The largest and most commonly used movie scripts dataset
  • ~ 200K conversa$onal exchanges between pairs of

characters engaging in at least 3 turns

Reddit Dataset

  • 109K conversa$ons of at least 3 turns from conversa$onal

exchanges on the platorm in 2018

  • Conversa$ons forum
slide-58
SLIDE 58

Human Interac$ve Evalua$on

slide-59
SLIDE 59

Tradi$onal Automa$c Metrics

slide-60
SLIDE 60

Examples of dialogs

slide-61
SLIDE 61
  • 1. SenJment Metrics

Sen$ment Score; Sen$ment Coherence; Sen$ment Transi$on; Sen$ment Min-Max; Laughter

  • 2. Engagement Metrics

Ques$on Score; #Words

  • 3. SemanJc Metrics

Seman$c Similarity; Average Word Coherence; Extrema Word Coherence; Greedy Word Coherence

New Automa$c Metrics to analyze conversa$ons

M1, M2, …, M11

Hybrid Metric

slide-62
SLIDE 62
  • 1. SenJment Metrics

Sen$ment Score; Sen$ment Coherence; Sen$ment Transi$on; Sen$ment Min-Max; Laughter

  • 2. Engagement Metrics

Ques$on Score; #Words

  • 3. SemanJc Metrics

Seman$c Similarity; Average Word Coherence; Extrema Word Coherence; Greedy Word Coherence

M1, M2, …, M11

Hybrid Metric

Goal: to approximate the human ra$ngs

New Automa$c Metrics to analyze conversa$ons

slide-63
SLIDE 63

H

Human evalua$on High correla$on

New Automa$c Metrics to analyze conversa$ons

Hybrid Metric

slide-64
SLIDE 64

New Automa$c Metrics to analyze conversa$ons

Hybrid Metric

slide-65
SLIDE 65

New Automa$c Metrics to analyze conversa$ons

Self-Play Scenario

slide-66
SLIDE 66

New Automa$c Metrics to analyze conversa$ons

Self-Play Scenario Hybrid Metric

slide-67
SLIDE 67

Correla$on between Metrics and Human Interac$ve Evalua$on

Tradi$onal Automa$c Metrics New Automa$c Metrics (on self play scenario)

slide-68
SLIDE 68

PAPER: Asma Ghandeharioun∗, Judy Hanwen Shen∗, Natasha Jaques∗, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard, “Approxima$ng Interac$ve Human Evalua$on with Self-Play for Open-Domain Dialog”, in Proceedings of the Conference on Neural Informa7on Processing Systems (NeurIPS), 2019. Asma Ghandeharioun Judy Shen Natasha Jaques Craig Ferguson Noah Jones Agata Lapedriza Roz Picard CODE: hLps://github.com/natashamjaques/neural_chat

slide-69
SLIDE 69

Applica$ons of Affec$ve Compu$ng

  • Emo$onal Wellbeing
  • User Experience
  • Suicide preven$on
  • Early detec$on of depression
  • Educa$on
slide-70
SLIDE 70

Emotional Wellbeing

slide-71
SLIDE 71

Mul$modal

Stress-free Driving

Javier Hernandez Diego Muñoz Vincent Chen Craig Ferguson

Mul$modal Sensing

  • Heart Rate
  • Respira$on
  • Pressure sensors on

the steering wheel

  • Face Analysis
  • Context Analysis

Car intervenJons for a bePer driving experience.

Emotional Navigation SIG, enavigation.media.mit.edu Sponsors: Hyundai, NTT Data, Daimler

slide-72
SLIDE 72

We are nowhere close to have machines that understand emo$ons as humans do.

Final Conclusions

We are slowly finding solu$ons to specific problems that require Emo$onal Intelligence. Emo$on AI technologies are useful and have a lot of applica$ons.

slide-73
SLIDE 73

References and Resources

hLp://places2.csail.mit.edu

[3] B. Zhou, A. Lapedriza, A. Khosla, A. Oliva, and A. Torralba. "Places: A 10 Million Image Database for Scene Recogni$on". IEEE Transac$ons on PaLern Analysis and Machine Intelligence (PAMI), July 2017.

CAM (Class AcJvaJon Maps)

[5] B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. "Learning Deep Features for Discrimina$ve Localiza$on". Computer Vision and PaLern Recogni$on (CVPR), 2016. [4] B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. "Object Detectors Emerge in Deep Scene CNNs." Interna$onal Conference on Learning Representa$ons (ICLR), 2015.

hLp://cnnlocaliza$on.csail.mit.edu

Emo$ons in Context

hLp://sunai.uoc.edu/emo$c/

[1] R. Kos$, J.M Alvarez, A. Recasens, A.Lapedriza. "Emo$on Recogni$on in Context". Computer Vision and PaLern Recogni$on (CVPR), 2017 [2] R. Kos$, J.M Alvarez, A. Recasens, A.Lapedriza. ”Context based Emo$on Recogni$on using EMOTIC dataset". IEEE Transac$ons on PaLern Analysis and Machine Intelligence (PAMI), 2019.

Dialog & Neural Chat

[7] Asma Ghandeharioun∗, Judy Hanwen Shen∗, Natasha Jaques∗, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard, “Approxima$ng Interac$ve Human Evalua$on with Self-Play for Open-Domain Dialog”, NeurIPS, Vancouver, Canada, 2019

hLps://www.media.mit.edu/ projects/elsa/overview/

Sequence Bias

[6] Judy Shen, Agata Lapedriza, Rosalind Picard, “Uninten$onal affec$ve priming during labeling may bias labels”, 8th Interna7onal Conference on Affec7ve Compu7ng and Intelligent Interac7on, Cambridge, UK, 2019.