Where are they looking? Adria Recasens*, Aditya Khosla*, Carl - - PowerPoint PPT Presentation

where are they looking
SMART_READER_LITE
LIVE PREVIEW

Where are they looking? Adria Recasens*, Aditya Khosla*, Carl - - PowerPoint PPT Presentation

Where are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick, Antonio Torralba Presented by: Surbhi Goel Where are they looking? Follow the gaze of the person and identify the object being looked at Demo:


slide-1
SLIDE 1

Where are they looking?

Adria Recasens*, Aditya Khosla*, Carl Vondrick, Antonio Torralba

Presented by: Surbhi Goel

slide-2
SLIDE 2

Where are they looking?

Follow the gaze of the person and identify the object being looked at

slide-3
SLIDE 3

Demo: http://gazefollow.csail.mit.edu/demo.html

slide-4
SLIDE 4

Experiments

  • Dataset Visualizations

○ Images in the Dataset ○ Head Locations ○ Gaze Locations/Length

  • Model Experiments

○ Qualitative Evaluation ○ Visualizing Gaze Mask and Saliency Map ○ Animal Gaze Following ○ Extending to Short Video

slide-5
SLIDE 5

Dataset Visualizations

slide-6
SLIDE 6

Training Set Images

slide-7
SLIDE 7

Training Set Images

slide-8
SLIDE 8

Training Set Images

slide-9
SLIDE 9

Heatmaps for Head Location

Train Test

slide-10
SLIDE 10

Heatmaps for Gaze Location

Train Test

slide-11
SLIDE 11

Heatmaps for Relative Gaze Location

Train Test

slide-12
SLIDE 12

Histogram for Length of Gaze

Train Test

slide-13
SLIDE 13

Observations

  • Head/Gaze are concentrated for train and scattered for test
  • Relative gaze is concentrated for both
  • Gaze length relatively short (0.2 peak)
slide-14
SLIDE 14

Model Evaluation

slide-15
SLIDE 15

Good Cases

slide-16
SLIDE 16

Good Cases

slide-17
SLIDE 17

Bad Cases

Head fully tilted but missed

slide-18
SLIDE 18

Bad Cases

Face forward but eyes tilted No object of attention

slide-19
SLIDE 19

Bad Cases

Back facing

slide-20
SLIDE 20

Observations

  • Handle groups well
  • Gaze location is very accurate, head location often not
  • Unable to capture eye movement independent of face orientation
  • Fails at a lot of back facing cases
slide-21
SLIDE 21

Gaze Mask and Saliency Map

slide-22
SLIDE 22

Gaze Mask and Saliency Map

  • Gaze Mask incorporates the general direction of gaze
  • Saliency Map incorporates the salient objects in image
  • Element-wise product captures locations that satisfy both
slide-23
SLIDE 23

Gaze Mask and Saliency Map

Image with Gaze Gaze Mask Saliency Map

slide-24
SLIDE 24

Animal Gaze Follow

slide-25
SLIDE 25

Animal Gaze Follow

slide-26
SLIDE 26

Animal Gaze Follow

Works (almost) for even birds

slide-27
SLIDE 27

Animal Gaze Follow

Works even when more than one salient object

slide-28
SLIDE 28

Animal Gaze Follow

  • Model generalizes to animals

Initialized with ImageNet which has animal data

  • Able to learn properties based on orientation of head
  • Point of gaze is not always correct
slide-29
SLIDE 29

Extension to a Short Video

Apply model per frame of video

slide-30
SLIDE 30

Extension to a Short Video

Head detector often fails, could use temporal context to improve

slide-31
SLIDE 31

Conclusions

  • Can be confused with mixed orientations and back-facing
  • Model generalizes well to animals
  • Could be potentially extended to videos
  • Could be applied to other domains?
slide-32
SLIDE 32

Thank You!