Learning to Anticipate Gaze: Top-Down Approach Mentor: Dr. - PowerPoint PPT Presentation

Learning to Anticipate Gaze: Top-Down Approach Mentor: Dr. Amitabha Mukerjee Presented by Vempati Anurag Sai SE367 – Cognitive Science

Introduction  Humans deploy anticipatory gaze in many situations. While moving around, driving…  Google’s self driving car has a Kalman Filter that tracks each and every vehicle in its sight and anticipates their future positions so that it doesn’t run into them.  Human Gaze – Tightly connected to motor resonance system. [Sciuttu et al.]  Sports persons.  Batsmen’s eye movements monitor the moment when the ball is released, make a predictive saccade to the place where they expect it to hit the ground, wait for it to bounce, and follow its trajectory for 100 – 200 ms after the bounce. [Land & McLeod]

Introduction

Mechanism  Basically, hoping to achieve the degree of anticipation as in a professional cricketer  The model is learnt in unsupervised fashion.  Various sequences of a ball bouncing off the walls/floor viewed from different viewpoints is created for the training phase.

Mechanism  Then we search for any moving round objects. The pixel coordinates and size of the ball are stored to get a dataset for training phase.  Segmentation/ Optical flow will be a better choice in general. But, since we know the shape of object, better options are available.  ‘Canny edge detector’ + ‘Hough Transform’

Mechanism  Size of the ball gives ‘z’ component.  Using (x, y, z) pairs in the dataset, learn the state transition matrix F .  Regression problem. State Transition Matrix State vector

Mechanism  Kalman Filter is then used to predict the trajectory in advance.  Why Kalman Filter?  Takes care of Noisy Measurements  Just the measurement of position will do  Several cycles of prediction can be done before next measurement update

Kalman Filter  Assumes the true state at time k is evolved from the state at (k-1) according to:  F k is the state transition model which is applied to the previous state x k-1  B k is the control-input model which is applied to the control vector u k  w k is the process noise which is assumed to be drawn from a zero mean multivariate normal distribution with covariance Q k .  At time k an observation (or measurement) z k of the true state x k is made according to  where H k is the observation model which maps the true state space into the observed space and v k is the observation noise which is assumed to be zero mean Gaussian noise with covariance R k

What next?  Evaluate performance on real videos  Answer the bigger question!  Better Learning Paradigm  Compare human gaze anticipation with the developed model

REFERENCES Land, Michael F., and Peter McLeod. "From eye I. movements to actions: how batsmen hit the ball." Nature neuroscience 3.12 (2000): 1340-1345. Sciutti, Alessandra, et al. "Anticipatory gaze in II. human-robot interactions." Gaze in HRI from modeling to communication” workshop at the 7th ACM/IEEE international conference on human-robot interaction, Boston, Massachusetts, USA . 2012. Perse, Matej, et al. "Physics-based modelling of III. human motion using kalman filter and collision avoidance algorithm." International Symposium on Image and Signal Processing and Analysis, ISPA05, Zagreb, Croatia. 2005. http://en.wikipedia.org/wiki/Kalman_filter IV.

QUESTIONS??

Learning to Anticipate Gaze: Top-Down Approach Mentor: Dr. - PowerPoint PPT Presentation

Learning to Anticipate Gaze: Top-Down Approach Mentor: Dr. Amitabha Mukerjee Presented by Vempati Anurag Sai SE367 Cognitive Science Introduction Humans deploy anticipatory gaze in many situations. While moving around, driving

Gaze Tracking -Shashank Shekhar Aim To estimate a person's gaze using a webcam. Gaze

gaze-following and recognizing intentions from gaze Outline infant gaze following studies

a story telling robot: modelling and evaluation of human-like gaze behaviour 1 motivations

Learning video saliency from human gaze using candidate selection Rudoy,Goldman, Schechtman,

Learning to Predict Gaze in Egocentric Videos Yin Li, Alireza Fathi, James M. Rehg Outline: -

Saccade Tasks Visual Search Saccades Micro-Fixation Saccades Reading Gaze Shifts Reading Gaze

Outline Gaze-Based Interaction in Cinematic 360 VR Cinematic 360 VR Gaze-Based

Top-Down Parsing Slides modified from Louden Book and Dr. Scherger Top Down Parsing A

Agenda What is Top-down Web services? Benefit of top-down Web services How to develop

To TOP or NOT to TOP www.SAS.com To TOP or NOT to TOP Using the TOP command in Linux By Len van

The Presentaion-Based Paper The Paper A Top-Down Row Enumeration Approach of Top-Down

Down Syndrome by Birth Order and Moms Age 3/20/2017 V0 2017-Down-Syndrome 1 2017-Down-Syndrome

Lay Them Down Chorus: Lay them down, Lay them down, Lay your branches down for Him Spread them

Boosted Top Tagging Seung J. Lee Outline Introduction: top jets @ LHC Modern boosted top

DEEP UNCONSTRAINED GAZE ESTIMATION WITH SYNTHETIC DATA Shalini De Mello, Rajeev Ranjan, Jan Kautz

13 th November 2015 John Liddle Senior Account Manager Tobii Dynavox Tobii Dynavox Our

MSc International Migration and Public Policy (IMPP2021) IMPP Core Team Joseph Downing

Today HealthSystemPharm2 Sue Skledar P3 Assessment Colloquium #1 Spring 2018 dates

The Role and Future of Schools of Education Dr. Michael E. Spagna Dean Michael D. Eisner

2020 Effective Mentoring Program Refresher Program 1 2020 Refresher Workshop v1 How today will

Collaboration for Effective Educator Development, Accountability and Reform (CEEDAR)

Automatic induction of a PoS tagset for Italian R. Bernardi 1 , A. Bolognesi 2 , C. Seidenari 2 ,

LLMAT Class teacher Job Description The Class Teachers job description is in line with the

Car to car communication of autonomous driving vehicles in dangerous situations NAME: FABIAN

Sambuz

Useful Links

Newsletter

Mail Us