SLIDE 1

Following Gaze in Video

  • A. Recasens et al.

Presented by: Keivaun Waugh and Kapil Krishnakumar

SLIDE 2

Background

  • Given a face in one frame, how can we figure out where that person is looking?
  • The target object might not be in the same frame
SLIDE 3

Sample Results

(Figure panels: Input Video, Gaze Density, Gazed Area)

SLIDE 4

Architecture
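As a rough illustration of the multi-pathway design shown here (and referenced in the experiments and conclusions: a saliency pathway, a head-pose/gaze pathway, and a transformation pathway), below is a minimal PyTorch sketch. Every layer shape, module, and the element-wise fusion step are illustrative assumptions, not the paper's exact architecture.

```python
# A rough, illustrative sketch (NOT the authors' exact layers) of a
# three-pathway gaze-following model: saliency over the target frame,
# a gaze pathway over the head crop + head position, and a
# transformation pathway relating source and target frames.
import torch
import torch.nn as nn

class GazeFollowSketch(nn.Module):
    def __init__(self, grid=20):
        super().__init__()
        self.grid = grid
        # Saliency pathway: which regions of the target frame draw gaze?
        self.saliency = nn.Sequential(
            nn.Conv2d(3, 16, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(16, 1, 5, stride=2, padding=2),
            nn.AdaptiveAvgPool2d(grid),
        )
        # Gaze pathway: head crop + head position -> a "look cone" map.
        self.gaze = nn.Sequential(
            nn.Flatten(),
            nn.Linear(3 * 64 * 64 + 2, 256), nn.ReLU(),
            nn.Linear(256, grid * grid),
        )
        # Transformation pathway: how does the source frame map onto
        # the target frame?
        self.transform = nn.Sequential(
            nn.Flatten(),
            nn.Linear(2 * 3 * 64 * 64, 256), nn.ReLU(),
            nn.Linear(256, grid * grid),
        )

    def forward(self, target_frame, source_frame, head_crop, head_pos):
        # All frames assumed to be (B, 3, 64, 64); head_pos is (B, 2).
        sal = self.saliency(target_frame).flatten(1)             # (B, 400)
        cone = self.gaze(torch.cat([head_crop.flatten(1), head_pos], 1))
        warp = self.transform(torch.cat([source_frame, target_frame], 1))
        # Element-wise fusion into a gaze density grid (a simplification
        # of projecting the cone through the estimated transformation).
        return (sal * cone * warp).view(-1, self.grid, self.grid)

# Usage: GazeFollowSketch()(torch.rand(1, 3, 64, 64), torch.rand(1, 3, 64, 64),
#                           torch.rand(1, 3, 64, 64), torch.rand(1, 2))
```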

SLIDE 5

VideoGaze Dataset

  • 160k annotations of video frames from the MovieQA dataset

  • Annotations:

    ○ Source Frame
    ○ Head Location
    ○ Body
    ○ Target Frame (5 per source frame)
      ■ Gaze Location
      ■ Time difference between Source and Target
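To make the schema concrete, here is a hypothetical Python record type mirroring the annotation fields above; the field names and types are illustrative assumptions, not the dataset's actual file format.

```python
# Hypothetical record types mirroring the annotation schema above; field
# names and types are illustrative assumptions, not the dataset's actual
# file format.
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class TargetAnnotation:
    target_frame: int                     # frame index (5 per source frame)
    gaze_location: Tuple[float, float]    # annotated gaze point (x, y)
    time_offset: float                    # time difference from source frame

@dataclass
class SourceAnnotation:
    source_frame: int                                  # annotated frame index
    head_location: Tuple[float, float, float, float]   # head bounding box
    body_location: Tuple[float, float, float, float]   # body bounding box
    targets: List[TargetAnnotation]                    # the 5 target frames
```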

SLIDE 6

Experiments

  • Naive network architecture
    ○ Don't segment the network into different pathways
    ○ Concatenate all inputs and predict directly

  • Replace the transformation pathway with SIFT+RANSAC affine fitting
  • Various neighboring frame prediction windows
  • Examine failure cases

    ○ "Look cone" doesn't take into account the eye position
    ○ Other failures

SLIDE 7

Naive Model

SLIDE 8

Naive Architecture

  • Use fusion of target frame and source frame to predict gaze location

(Diagram: Target Frame and Source Frame through AlexNet, features concatenated, predicting a 20×20 output grid)
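A minimal sketch of how this naive baseline could be wired, assuming a shared AlexNet feature extractor whose source- and target-frame features are concatenated and mapped to a 20×20 gaze grid; the layer sizes are illustrative assumptions.

```python
# Minimal sketch of the naive baseline, assuming a shared AlexNet feature
# extractor whose source- and target-frame features are concatenated and
# mapped to a 20x20 gaze grid; layer sizes are illustrative assumptions.
import torch
import torch.nn as nn
from torchvision import models

class NaiveGazeModel(nn.Module):
    def __init__(self, grid=20):
        super().__init__()
        self.grid = grid
        # AlexNet backbone (load ImageNet weights for a real experiment).
        self.features = models.alexnet(weights=None).features
        self.pool = nn.AdaptiveAvgPool2d((6, 6))
        # Concatenated features from both frames -> flat 20x20 prediction.
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(2 * 256 * 6 * 6, 1024), nn.ReLU(),
            nn.Linear(1024, grid * grid),
        )

    def forward(self, source_frame, target_frame):
        # Frames are (B, 3, 224, 224) image tensors.
        f_src = self.pool(self.features(source_frame))
        f_tgt = self.pool(self.features(target_frame))
        heat = self.head(torch.cat([f_src, f_tgt], dim=1))
        return heat.view(-1, self.grid, self.grid)
```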

SLIDE 9

Alternate Transformation Pathway

SLIDE 10

Architecture

  • Replace deep CNN pathway with traditional SIFT+RANSAC affine warp

(Diagram: SIFT + RANSAC affine warp between source and target frames)
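A sketch of this classical replacement using standard OpenCV building blocks (SIFT keypoints, Lowe's ratio test, RANSAC affine fit); the matching and RANSAC thresholds are assumptions.

```python
# Sketch of the classical alternative: estimate an affine warp between the
# source and target frames with SIFT keypoints + RANSAC, using standard
# OpenCV calls. Thresholds (0.75 ratio, 3.0 px reprojection) are assumptions.
import cv2
import numpy as np

def estimate_affine_warp(source_gray, target_gray):
    """Return a 2x3 affine matrix mapping source -> target, or None."""
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(source_gray, None)
    kp2, des2 = sift.detectAndCompute(target_gray, None)
    if des1 is None or des2 is None:
        return None  # not enough texture to detect keypoints

    # Match descriptors and keep matches that pass Lowe's ratio test.
    matches = cv2.BFMatcher(cv2.NORM_L2).knnMatch(des1, des2, k=2)
    good = [pair[0] for pair in matches
            if len(pair) == 2 and pair[0].distance < 0.75 * pair[1].distance]
    if len(good) < 3:
        return None  # an affine fit needs at least 3 correspondences

    src_pts = np.float32([kp1[m.queryIdx].pt for m in good])
    dst_pts = np.float32([kp2[m.trainIdx].pt for m in good])
    # RANSAC rejects outlier matches while fitting the affine model.
    warp, _inliers = cv2.estimateAffine2D(
        src_pts, dst_pts, method=cv2.RANSAC, ransacReprojThreshold=3.0)
    return warp
```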

SLIDE 11

Quantitative Results

SLIDE 12

Results

| AUC (higher better) | KL Divergence (lower better) | L2 Dist (lower better) | Description |
| --- | --- | --- | --- |
| 73.7 | 8.048 | 0.225 | Normal model with transformation pathway |
| 60.2 | 6.604 | 0.294 | Normal model with sparse affine |
| 60.2 | 6.6604 | 0.294 | Normal model with dense affine |
| 60.9 | 6.641 | 0.242 | Naive model |
| 56.9 | 28.39 | 0.437 | Random |

SLIDE 13

Qualitative Results

SLIDE 14

Results

(Figure panels: Cropped Head, Full Video, "What I'm looking at")

  • Input video is 150 frames long

SLIDE 15

Results - Search 150 Neighboring Frames

(Figure panels: Original Transformation Pathway, Naive Model)

SLIDE 16

Results - Search 150 Neighboring Frames

(Figure panels: Sparse SIFT Affine Warp, Dense SIFT Affine Warp)

SLIDE 17

Results - Search 25 Neighboring Frames

(Figure panels: Original Transformation Pathway, Naive Model)

SLIDE 18

Results - Search 25 Neighboring Frames

(Figure panels: Sparse SIFT Affine Warp, Dense SIFT Affine Warp)

SLIDE 19

Target in Same Frame

(Figure panels: Original Video, Original Transformation Pathway, Naive Model)

SLIDE 20

Target in Same Frame

(Figure panels: Sparse SIFT Affine Warp, Dense SIFT Affine Warp)

SLIDE 21

Runtimes

  • GTX 1070 and Haswell Core i5
  • Generating results is CPU bound
  • 5-second video with a 150-frame search width
    ○ Deep transformation pathway: 6.5 minutes
    ○ Sparse affine: 10.5 minutes
    ○ Dense affine: 32 minutes

(Chart: CPU Usage 100%, GPU Usage 0% when running the model with the transformation pathway)

SLIDE 22

Failure Cases

(Figure panels: Input Video, Original Transformation Pathway)

SLIDE 23

Failure Cases

(Figure panels: Input Video, Original Transformation Pathway)

SLIDE 24

Conclusions

  • Separating input modalities for Saliency and Head Pose provides significant information to the model.
    ○ Illustrates the importance of a hand-crafted architecture even though features are discovered automatically

  • Head Direction != Eye Direction
  • Frame prediction window selection determines whether a match can be found.