Object tracking
CV3DST | Prof. Leal-Taixé 1
Problem statement: given a video, find out which parts of the image depict the same object in different frames. Often we use detectors as starting points.
Challenges:
– Occlusions
– Viewpoint/pose/blur/illumination variations (in a few frames of a sequence)
– Background clutter
Tracking enables prediction (is the person going to cross the street?)
– Story time: a young graduate student asked Takeo Kanade what the three most important problems in computer vision are. Kanade replied: "Correspondence, correspondence, correspondence!"
Appearance: we need to know what the target looks like
– Single object tracking
– Re-identification
Motion: we need to predict where the target goes
– Trajectory prediction (lecture 6)
– GOTURN: no online appearance modeling
– MDNet: quick online finetuning of the network
– ROLO = CNN + LSTM
Crop the object to be tracked – Initialization of our tracker
Assume smooth, "slow" motion: the object will not be far away from where it was in the previous frame. Use the position at t-1 to crop frame t.
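As a rough sketch of this cropping step (the box format (x, y, w, h) and the 2x window scale are illustrative assumptions, not the paper's exact parameterization):

```python
def search_region(prev_box, img_w, img_h, scale=2.0):
    """Crop window for frame t, centered on the box from frame t-1.

    prev_box: (x, y, w, h) of the target at t-1 (assumed format).
    scale: how much larger the search window is than the box
    (2.0 is an illustrative choice, not the paper's value).
    """
    x, y, w, h = prev_box
    cx, cy = x + w / 2, y + h / 2    # box center
    sw, sh = w * scale, h * scale    # enlarged search window
    x0 = max(0, int(cx - sw / 2))
    y0 = max(0, int(cy - sh / 2))
    x1 = min(img_w, int(cx + sw / 2))
    y1 = min(img_h, int(cy + sh / 2))
    return x0, y0, x1, y1

print(search_region((100, 100, 50, 50), 640, 480))  # -> (75, 75, 175, 175)
```

The clamping to the image borders handles targets near the edge of the frame.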
Check the original paper for the exact parameterization of the output
Pros:
– No online training required; tracking is done by comparison, so we do not need to retrain
– Close to the template-matching approach that we saw in the first lectures for object detection; this makes it very fast!
Cons:
– We have a motion assumption: if the object moves fast and goes out of our search window, we cannot recover
MDNet: within a sequence, the appearance of the target stays consistent! Idea: finetune your CNN at test time.
– Slow: not suitable for real-time applications
– Solution: train as few layers as possible
(Figures: multi-domain training on Sequence 1 … Sequence k, then finetuning on a new test sequence.)
R-CNN type of regression
Pros:
– No previous-location assumption: the object can move anywhere in the image
– The fine-tuning step is comparatively cheap
– Winner of the VOT Challenge 2015 (http://www.votchallenge.net)
Cons:
– Not as fast as GOTURN
ROLO: Recurrent YOLO
The LSTM receives the detection and the 4096-dimensional descriptor of the image.
Multiple object tracking: detections are provided
– Remember: detections are not perfect!
Goal: find detections that match and form a trajectory
Online tracking:
– Processes two frames at a time
– For real-time applications
– Prone to drifting → hard to recover from errors or occlusions
Batch tracking:
– Processes a batch of frames
– Good at recovering from occlusions (short ones, as we will see)
– Not suitable for real-time applications
– Suitable for video analysis
Initialization (e.g., using a detector)
Prediction
Matching predictions with detections (appearance model)
– Classic: Kalman filter
– Nowadays: recurrent architectures
– For now: we will assume a constant velocity model (spoiler alert: it works really well at high framerates and without occlusions!)
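A constant velocity model fits in a few lines (the box format (x, y, w, h) is an assumption for illustration; the model simply extrapolates the displacement observed between the last two frames):

```python
def constant_velocity_predict(box_prev, box_curr):
    """Predict the box at t+1 from the boxes at t-1 and t by
    assuming the per-coordinate velocity stays constant."""
    return tuple(2 * c - p for p, c in zip(box_prev, box_curr))

# The target moved +5 px in x between the two frames, so we
# expect it to move another +5 px.
print(constant_velocity_predict((10, 20, 30, 40), (15, 20, 30, 40)))
# -> (20, 20, 30, 40)
```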
Matching predictions with detections (appearance model):
– Define distances between boxes (e.g., IoU, pixel distance, 3D distance)
Example cost matrix (rows: detections, columns: predictions):
0.9 0.8 0.8 0.1
0.5 0.4 0.3 0.8
0.2 0.1 0.4 0.8
0.1 0.2 0.5 0.9
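For instance, an IoU-based cost between two boxes (corner format (x0, y0, x1, y1) assumed here) can be computed as:

```python
def iou(a, b):
    """Intersection over union of two boxes in (x0, y0, x1, y1) format."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix1 - ix0) * max(0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def iou_cost(a, b):
    """Turn the similarity into a cost for the matching step."""
    return 1.0 - iou(a, b)

print(iou((0, 0, 2, 2), (1, 0, 3, 2)))  # -> 0.333...
```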
– Bipartite matching with, e.g., the Hungarian algorithm (demo: http://www.hungarianalgorithm.com/solve.php)
– The Hungarian algorithm finds the assignments that minimize the total cost
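Using the cost matrix from the slides, the optimal assignment can be found by brute force for tiny problems (a sketch only; in practice one would use a proper Hungarian implementation such as scipy.optimize.linear_sum_assignment):

```python
from itertools import permutations

# Cost matrix from the slides: rows = detections, columns = predictions.
COST = [
    [0.9, 0.8, 0.8, 0.1],
    [0.5, 0.4, 0.3, 0.8],
    [0.2, 0.1, 0.4, 0.8],
    [0.1, 0.2, 0.5, 0.9],
]

def min_cost_assignment(cost):
    """Minimum-cost bipartite matching by enumerating permutations
    (exponential, fine only for tiny matrices)."""
    n = len(cost)
    best = min(permutations(range(n)),
               key=lambda p: sum(cost[i][p[i]] for i in range(n)))
    return list(best), sum(cost[i][best[i]] for i in range(n))

assignment, total = min_cost_assignment(COST)
print(assignment, round(total, 2))  # -> [3, 2, 1, 0] 0.6
```

Each detection is matched to the prediction that makes the overall sum of costs, not each individual cost, as small as possible.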
– What happens if we are missing a prediction? The cost matrix becomes rectangular (4 detections, 3 predictions):
0.9 0.8 0.8
0.5 0.4 0.3
0.2 0.1 0.4
0.1 0.2 0.5
– What happens if no prediction is suitable for the match (all its costs are high)?
– Solution: introduce extra (dummy) nodes with a threshold cost (here 0.3):
0.9 0.8 0.8 0.3 0.3
0.5 0.4 0.7 0.3 0.3
0.2 0.1 0.4 0.3 0.3
0.1 0.2 0.5 0.3 0.3
0.3 0.3 0.3 0.3 0.3
– Apply the Hungarian algorithm on the padded matrix
– Result: two detections have no matched prediction (they are assigned to dummy nodes)
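The dummy-node trick can be sketched the same way: pad the 4x3 cost matrix to the 5x5 matrix shown on the slide with the threshold cost, solve, and read off which detections landed on dummy columns (brute-force matching again, for illustration only):

```python
from itertools import permutations

def min_cost_assignment(cost):
    """Brute-force minimum-cost matching (tiny matrices only)."""
    n = len(cost)
    return list(min(permutations(range(n)),
                    key=lambda p: sum(cost[i][p[i]] for i in range(n))))

THRESH = 0.3  # threshold cost of the dummy nodes (value from the slides)
cost = [            # 4 detections x 3 predictions
    [0.9, 0.8, 0.8],
    [0.5, 0.4, 0.7],
    [0.2, 0.1, 0.4],
    [0.1, 0.2, 0.5],
]
n_pred = len(cost[0])
# Pad to 5x5: two dummy columns plus one dummy row, all at THRESH.
padded = [row + [THRESH, THRESH] for row in cost] + [[THRESH] * 5]

assignment = min_cost_assignment(padded)
unmatched = [i for i in range(len(cost)) if assignment[i] >= n_pred]
print(unmatched)  # -> [0, 1]: two detections have no matched prediction
```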
– Deep learning has provided us with better detectors
– Trajectory prediction will be covered in Lecture 6
– Improving appearance models → re-identification (in a few slides)
– Matching still happens separately from learning; we will see how to couple both steps in the next lecture → MOT as a graph problem
– Adding computational complexity
– Adding feature complexity
– Adding temporal complexity
(Figures: detection pipeline: input image → convolutions → feature representation → region proposals → classification head and regression head → regressed bounding box.)
Idea: exploit the detector's regression head to obtain tracking capabilities.
Frame t+1: use the detections of frame t as proposals.
Bounding-box regression tells us where the detection with ID 1 went in the next frame. Tracking!
Pros:
– We can reuse an extremely well-trained regressor
– We get well-positioned bounding boxes
– No need for tracking-specific annotation!
Challenges:
– Confusion in crowded spaces
– Hard to recover once the target becomes occluded
– Need to close small gaps and occlusions
– Assumes the object moves only by a small quantity between frames
– Large camera motions
– Large displacements due to low framerate
Remedies: motion model and re-identification (Re-ID)
Modeling motion → motion model
Modeling appearance → Re-ID
Re-identification: given a detected person, retrieve the top matches among images of different identities. How do we measure (learn) the distance between two images?
A → classification: person, face, female, brunette
B → classification: person, face, female, blonde
1) What if the classes used during inference are different from those used during training?
2) What if the dataset is not that big?
3) What if the classes used during inference change?
4) What if we are interested only in knowing whether two samples are from the same class (similar), rather than which class they belong to?
A, B: is it the same person?
Learning a distance function: a function that measures how similar two objects are.
Training
Testing: given A and B, output YES/NO. This can be solved as a classification problem.
Example: let registered students enter the exam room without the need for an ID check. Training: Person 1, Person 2, Person 3.
What is the problem with this approach? Scalability: we need to retrain our model every time a new student registers for the course.
Can we train one model and use it every year?
Different persons A and B: low similarity score (large distance).
Same person in A and B: high similarity score (small distance).
If d(A, B) > τ: not the same person.
If d(A, B) < τ: same person.
Compute distances between the query image and the database images, i.e., retrieve the top images based on distance.
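Retrieval is then a nearest-neighbor search in embedding space (a minimal sketch; the gallery layout, an id-to-embedding dict, is an assumed data structure):

```python
import math

def l2(u, v):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def retrieve_top_k(query_emb, gallery, k=2):
    """Rank gallery identities by distance to the query embedding."""
    ranked = sorted(gallery, key=lambda pid: l2(query_emb, gallery[pid]))
    return ranked[:k]

# Toy 2-D embeddings standing in for the 128-D network outputs.
gallery = {"p1": [0.0, 1.0], "p2": [1.0, 1.0], "p3": [5.0, 5.0]}
print(retrieve_top_k([0.1, 0.9], gallery))  # -> ['p1', 'p2']
```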
Taigman et al. "DeepFace: Closing the Gap to Human-Level Performance in Face Verification". CVPR 2014
A → CNN → FC → representation f(A): 128 values
A and B are passed through the same network, producing embeddings f(A) and f(B).
Define the distance d(A, B) = ||f(A) − f(B)||², where f(A) and f(B) are the embeddings of the images A and B, respectively.
– If A and B depict the same person, d(A, B) is small
– If A and B depict a different person, d(A, B) is large
– If A and B depict the same person, the distance should be small. Loss for a positive pair: L(A, B) = ||f(A) − f(B)||²
– If A and B depict a different person, the distance should be large. Better: use a hinge loss: L(A, B) = max(0, m² − ||f(A) − f(B)||²)
If two elements are already far apart (beyond the margin m), do not spend energy pushing them even further apart.
Contrastive loss: L(A, B) = y* ||f(A) − f(B)||² + (1 − y*) max(0, m² − ||f(A) − f(B)||²)
– Positive pair (y* = 1): reduce the distance between the elements
– Negative pair (y* = 0): push the elements further apart, up to a margin
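The contrastive loss translates directly into code (a sketch on plain Python lists; m = 1.0 is an illustrative margin, not a value from the lecture):

```python
def contrastive_loss(fa, fb, y, m=1.0):
    """y = 1 for a positive pair (same identity), 0 for a negative
    pair; m is the margin beyond which negatives incur no loss."""
    d2 = sum((a - b) ** 2 for a, b in zip(fa, fb))  # squared L2 distance
    return y * d2 + (1 - y) * max(0.0, m ** 2 - d2)

# Positive pair: penalized by its squared distance.
print(contrastive_loss([0.0, 0.0], [1.0, 0.0], y=1))  # -> 1.0
# Negative pair already beyond the margin: no gradient is wasted.
print(contrastive_loss([0.0, 0.0], [2.0, 0.0], y=0))  # -> 0.0
```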
Chopra, Hadsell and LeCun. “Learning a similarity metric discriminatively, with application to Face verification”, CVPR 2005.
Given a triplet of Anchor (A), Positive (P), and Negative (N), we want:
||f(A) − f(P)||² < ||f(A) − f(N)||²
Schroff et al. "FaceNet: A Unified Embedding for Face Recognition and Clustering". CVPR 2015
||f(A) − f(P)||² < ||f(A) − f(N)||²
||f(A) − f(P)||² − ||f(A) − f(N)||² < 0
With a margin m: ||f(A) − f(P)||² − ||f(A) − f(N)||² + m < 0
Triplet loss: L(A, P, N) = max(0, ||f(A) − f(P)||² − ||f(A) − f(N)||² + m)
L(A, P, N) = max(0, ||f(A) − f(P)||² − ||f(A) − f(N)||² + m)
The loss is active when d(A, P) ≈ d(A, N), i.e., when the positive and the negative are not yet separated by the margin.
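The triplet loss is just as direct (m = 0.2 is an assumed margin for illustration):

```python
def triplet_loss(fa, fp, fn, m=0.2):
    """Hinge on the gap between the anchor-positive and
    anchor-negative squared distances."""
    d_ap = sum((a - p) ** 2 for a, p in zip(fa, fp))
    d_an = sum((a - n) ** 2 for a, n in zip(fa, fn))
    return max(0.0, d_ap - d_an + m)

# Easy triplet: the negative is already far away, so the loss is 0.
print(triplet_loss([0.0], [0.1], [2.0]))  # -> 0.0
# Hard triplet: positive and negative are equally distant,
# so the loss equals the margin.
print(triplet_loss([0.0], [0.5], [-0.5]))  # -> 0.2
```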
(Figure: training batches built from anchor, positive, and negative samples.)
A mini-batch of n samples contains O(n²) pairwise relations, but the contrastive loss considers only O(n/2) relations, and the triplet loss only O(2n/3). To work well in practice, additional techniques are required: hard-negative mining, intelligent sampling, and multi-task learning.
Elezi et al. "The Group Loss for Deep Metric Learning". arXiv:1912.00385, 2019
We want to take into account the similarity of all samples with respect to all other samples!
1) Initialization: initialize X, the image-label assignment, using the softmax outputs of the neural network. Compute the n × n pairwise similarity matrix W using the neural network embedding.
2) Refinement: iteratively refine X, considering the similarities between all the mini-batch images, as encoded in W, as well as their labeling preferences.
3) Loss computation: compute the cross-entropy loss of the refined probabilities and update the weights of the neural network using backpropagation.
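The refinement step can be sketched with a replicator-dynamics update on plain Python lists (an illustrative implementation, not the paper's exact code): each sample's label distribution is multiplied elementwise by the support (W X) it receives from the mini-batch, then renormalized onto the simplex.

```python
def refine(X, W, iters=5):
    """X: n x c label probabilities, W: n x n pairwise similarities.
    Repeats: x_i <- x_i * (W X)_i, renormalized per sample."""
    n, c = len(X), len(X[0])
    for _ in range(iters):
        # Support that the batch gives to sample i for class l.
        support = [[sum(W[i][j] * X[j][l] for j in range(n))
                    for l in range(c)] for i in range(n)]
        X = [[X[i][l] * support[i][l] for l in range(c)]
             for i in range(n)]
        X = [[v / sum(row) for v in row] for row in X]  # back to simplex
    return X

# Sample 0 is confident about class 0; sample 1 is similar to it
# (W[0][1] = 0.9) and gets pulled toward class 0 as well.
W = [[1.0, 0.9], [0.9, 1.0]]
X = [[0.9, 0.1], [0.5, 0.5]]
refined = refine(X, W)
print(refined[1][0] > 0.5)  # -> True
```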
B and C are much more similar than A and B or A and C. Maybe the net is untrained and initialized the probabilities to a uniform distribution.
Propagate information and normalize in the standard simplex. Iterate t + 1 times.
*J.W. Weibull. Evolutionary Game Theory. MIT Press, 1997
Computed from the similarity matrix W, where λ denotes the class: it measures the support that the current mini-batch gives to image i belonging to class λ.
Compute the cross-entropy loss on the refined probabilities; backpropagation then propagates the gradients over the network.
Lecture 5: MOT with graphs, using multiple frames to compute the matching (to recover from errors, missing detections, wrong predictions)
Lecture 6: Trajectory prediction
Send an email to dst@dvl.in.tum.de with your CV and transcript, and tell us what your research interests are!