We Are Humor Beings: Understanding and Predicting Visual Humor - - PowerPoint PPT Presentation

we are humor beings understanding and predicting visual
SMART_READER_LITE
LIVE PREVIEW

We Are Humor Beings: Understanding and Predicting Visual Humor - - PowerPoint PPT Presentation

We Are Humor Beings: Understanding and Predicting Visual Humor Shuai Wang University of Toronto March 29, 2016 1 / 31 Intro An integral part but not understood in detail 2 / 31 Intro An integral part but not understood in detail


slide-1
SLIDE 1

We Are Humor Beings: Understanding and Predicting Visual Humor

Shuai Wang

University of Toronto

March 29, 2016

1 / 31

slide-2
SLIDE 2

Intro

◮ An integral part but not understood in detail

2 / 31

slide-3
SLIDE 3

Intro

◮ An integral part but not understood in detail ◮ An adult laughs 18 times a day

3 / 31

slide-4
SLIDE 4

Intro

◮ An integral part but not understood in detail ◮ An adult laughs 18 times a day ◮ A good sense humor

◮ is related to communication competence ◮ helps raise an individual’s social status & popularity ◮ even helps attract compatible mates ◮ makes yourself happier :) 4 / 31

slide-5
SLIDE 5

What makes an image funny?

5 / 31

slide-6
SLIDE 6

Humor Techniques

◮ Animal doing something unusual ◮ Person doing something unusual ◮ Somebody getting hurt ◮ Somebody getting scared

6 / 31

slide-7
SLIDE 7

Animal doing something unusual

7 / 31

slide-8
SLIDE 8

Person doing something unusual

8 / 31

slide-9
SLIDE 9

Somebody getting hurt

9 / 31

slide-10
SLIDE 10

Somebody getting scared

10 / 31

slide-11
SLIDE 11

Changing objects can alter the funniness of a scene

11 / 31

slide-12
SLIDE 12

Removing Incongruities

An elderly person kicking a football while skateboarding is incongruous, but a young girl doing so is not

12 / 31

slide-13
SLIDE 13

Adding Incongruities

Add incongruities (and humor) by replacing the expected with the unexpected

13 / 31

slide-14
SLIDE 14

Two Tasks to Understand Visual Humor

◮ Predicting how funny a given scene is (scene-level) ◮ Changing the funniness of a scene (object-level)

14 / 31

slide-15
SLIDE 15

Object-level Features

◮ Object embedding (150-d): captures the context in which

an object usually occurs

◮ Local embedding (150-d): weighted sum of object

embeddings of all other instances

15 / 31

slide-16
SLIDE 16

Scene-level Features

◮ Cardinality (150-d): bag-of-words representation of how

many instances of each object are in the scene

◮ Location (300-d): horizontal and vertical coordinates of every

  • bject (closest to the center if multiple instance)

◮ Scene embedding (150-d): sum of object embeddings of all

  • bjects in the scene

16 / 31

slide-17
SLIDE 17

Predicting Funniness Score

◮ Dataset: 6,400 scenes, with funny score from 1-5 labelled by

workers from Amazon Mechanical Turk

17 / 31

slide-18
SLIDE 18

Predicting Funniness Score

◮ Dataset: 6,400 scenes, with funny score from 1-5 labelled by

workers from Amazon Mechanical Turk

◮ Support Vector Regressor (SVR) on scene-level features

18 / 31

slide-19
SLIDE 19

Predicting Funniness Score

◮ Dataset: 6,400 scenes, with funny score from 1-5 labelled by

workers from Amazon Mechanical Turk

◮ Support Vector Regressor (SVR) on scene-level features ◮ Metric: average relative error

19 / 31

slide-20
SLIDE 20

Predicting Funniness Score: Ablation Analysis

Different feature subsets perform about the same: slightly better than baseline (average score of the training scenes)

20 / 31

slide-21
SLIDE 21

Alter Funniness of a Scene

◮ Detect the objects that do (or do not) contribute to humor ◮ Identify which objects should replace the objects from step 1

21 / 31

slide-22
SLIDE 22

Predicting Objects to be Replaced

◮ On average, the model replaces 3.67 objects (2.54 ground

truth) → this bias towards replace ensures a large ‘margin’

22 / 31

slide-23
SLIDE 23

Predicting Objects to be Replaced

◮ On average, the model replaces 3.67 objects (2.54 ground

truth) → this bias towards replace ensures a large ‘margin’

◮ Animate objects like humans and animals are more likely

sources of humor → tends to replace these objects

23 / 31

slide-24
SLIDE 24

Funny → Unfunny

Old man dancing → young boy dancing Hawk stealing meat → baseball

24 / 31

slide-25
SLIDE 25

Funny → Unfunny

Cute puppy → Insect Watermelon → Ax

25 / 31

slide-26
SLIDE 26

Unfunny → Funny

Couple having dinner at the table → Puppies having dinner at the table

26 / 31

slide-27
SLIDE 27

Unfunny → Funny

Cating playing around → Racoon driving motorcycle

27 / 31

slide-28
SLIDE 28

Discussion

◮ Style/genre of an image or painting can make a difference

28 / 31

slide-29
SLIDE 29

Discussion

◮ Style/genre of an image or painting can make a difference ◮ Dataset is small: 6,400 images

29 / 31

slide-30
SLIDE 30

Discussion

◮ Style/genre of an image or painting can make a difference ◮ Dataset is small: 6,400 images ◮ Feature representation can be improved

30 / 31

slide-31
SLIDE 31

31 / 31