[PPT] - Tracking Tracking Many thanks to: H. Bischof, B. Leibe, V. Ferrari, PowerPoint Presentation

SLIDE 1

Computer Vision

Tracking Tracking

Many thanks to: H. Bischof, B. Leibe, V. Ferrari, K. Graumann, Y. Ukrainitz, D. Wagner, V Lepetit, M. Breitenstein, P. Sabzmeydani, Z. Kalal from whom I borrowed many slides and videos.

SLIDE 2

Computer Vision

’06]

We all know what tracking is, right?

[Grabner et al., VideoProc CVPR’

SLIDE 3

Computer Vision

Tracking

actual object position

Time t+1 Time t

„FIND IT AGAIN“

SLIDE 4

Computer Vision

What to track?

SLIDE 5

Computer Vision

What to track?

center point

SLIDE 6

Computer Vision

What to track?

multiple points

SLIDE 7

Computer Vision

What to track?

(body) parts

SLIDE 8

Computer Vision

What to track?

region

SLIDE 9

Computer Vision

What to track?

utline

SLIDE 10

Computer Vision

What to track?

structure

SLIDE 11

Computer Vision

Approaches

(i) Model-based tracking application-specific

human body, faces, space shuttle,…

SLIDE 12

Computer Vision

Approaches

(i) Model-based tracking application-specific

human body, faces, space shuttle,…

(ii) Feature tracking more generic

corner tracking blob/contour tracking intensity profile tracking region tracking

SLIDE 13

Computer Vision Saliency Object

Tracking Cues

Model/ Tracking History Scene

SLIDE 14

Computer Vision

Applications!

Structure-from-Motion
Gesture/Action Recognition
Video editing
Augmented Reality
Augmented Reality
Navigation
….

SLIDE 15

Computer Vision

Applications: Game Interface

SLIDE 16

Computer Vision

Applications: SfM

Tracked Points gives correspondences

SLIDE 17

Computer Vision

Applications: SfM

004] [Pollefeyes et al. IJCV 200

SLIDE 18

Computer Vision

Applications: Analysis of Motion Pattern

Single-Agent Level Multi-Agent Level Scene Level Detail Level

SLIDE 19

Computer Vision

[Ess et al. CVPR’08]

SLIDE 20

Computer Vision

Outline

Point Tracking
Template Tracking
Region Tracking
Model-based Tracking
Foreground vs. Background
Foreground vs. Background
Tracking-by-Detection

– Object classes – specific object

Combining Tracking and Detection
Context in Tracking

SLIDE 21

Computer Vision

x y

Vision = Inverse Graphics

x y z

SLIDE 22

Computer Vision

Motion

SLIDE 23

Computer Vision

Motion

SLIDE 24

Computer Vision

Motion

SLIDE 25

Computer Vision

x y

Vision = Inverse Graphics

x y z

Tracking = Inverse Animation

SLIDE 26

Computer Vision

Steps of Tracking

predict predict correct correct

Recap: Particle filtering

– Tracking can be seen as the process of propagating the posterior distribution of state given measurements across time.

SLIDE 27

Computer Vision

) | , (

1 1 1 − − − t t t

z p p p & ) | , (

1 − t t t

z p p p &

prediction C O N D E N Particle Filter

) | (

t t

p z p

weighing with

) | , (

t t t

z p p p &

update N S A T I O N

SLIDE 28

Computer Vision

General Tracking Loop

predict to t+1 time t measure at t+1 update location update model

SLIDE 29

Computer Vision

Point Tracking Point Tracking

SLIDE 30

Computer Vision

Estimate Optimal Transformation

SLIDE 31

Computer Vision

Simple 1D Problem

I0(x) I (x+h) I1(x+h)

SLIDE 32

Computer Vision

Sum of Squared Differences

I0(x) I (x+h) h I1(x+h)

E(h) = [I0(x) – I1(x+h) ]2

SLIDE 33

Computer Vision

Calculation of Displacement

E(h) [ I0 (x) – I1(x) – hI1’(x) ]2

≈

E(h) = [I0(x) – I1(x+h) ]2

2[I1’(x)(I0(x) – I1(x) ) – hI1’(x)2]

h E ∂ ∂

≈

I0(x) – I1(x) h I1’(x)

≈

SLIDE 34

Computer Vision

Interpretation

h I0(x) I (x+h) I0(x) – I1(x) I1’(x) I1(x+h)

I0(x) – I1(x) h I1’(x)

≈

I1’(x)

SLIDE 35

Computer Vision

Problem A: Local Minima

(a) (b)

SLIDE 36

Computer Vision

Problem A: Local Minima

SLIDE 37

Computer Vision

Problem B: Zero Gradient

I(x) - I0(x) h ≈ I0’(x)

?

SLIDE 38

Computer Vision

Problem B: Aperture problem

No gradient along

ne direction:

SLIDE 39

Computer Vision

Problem B: Aperture problem

No gradient along

ne direction:

SLIDE 40

Computer Vision

“Solving” the Aperture Problem

How to get more equations for a

pixel?

Spatial coherence constraint: Pixel’s

neighbors have the same movement neighbors have the same movement I0(x) I1(x+h)

SLIDE 41

Computer Vision

, I I x ∂ ∂ =

, ∂ ∂ = y I I y

t I It ∂ ∂ =

Recall: Optical Flow

= + +

t y x

I v I u I

1 equation in 2 unknowns

, = dt dx u

dt dy v =

, x I x ∂ =

, ∂ = y I y

t It ∂ =

SLIDE 42

Computer Vision

Least Squares Problem

Pseudo Inverse Over determined System of Equations

SLIDE 43

Computer Vision

Eigenvectors of ATA

Haven’t we seen an equation like this
Haven’t we seen an equation like this

before?

Recall the Harris corner detector!
“Good Features to Track”

SLIDE 44

Computer Vision

Interpreting the Eigenvalues

λ2

“Corner” λ λ λ λ1 ~ λ λ λ λ2 and large “Edge” λ λ λ λ2 >> λ λ λ λ1

λ1

“Edge” λ λ λ λ1 >> λ λ λ λ2 “Flat” region

SLIDE 45

Computer Vision

Samples: Edge / Low Texture / High Texture

SLIDE 46

Computer Vision

Example

SLIDE 47

Computer Vision

Template Tracking Template Tracking

SLIDE 48

Computer Vision

Lucas-Kanade Template Tracker

From Points to templates
Estimate „optimal“ warp W

SLIDE 49

Computer Vision

4, Lucas-Kanade ramework] [Baker & Matthews, IJCV’04 20 Years On: A Unifying Fr

SLIDE 50

Computer Vision

Example

SLIDE 51

Computer Vision

Tracking by Detection of Local Image Features Image Features

(specific target)

SLIDE 52

Computer Vision

3D Object Detection

Reference image(s) of the object to detect Test image

SLIDE 53

Computer Vision

Standard Approach

Step 1: Keypoint detection

– invariant to scale, rotation, or perspective

SLIDE 54

Computer Vision

Standard Approach

Step 2: Patch rectification

SLIDE 55

Computer Vision

Standard Approach

Step 3: Build descriptor vector

SLIDE 56

Computer Vision

Standard Approach

Step 4: Match descriptor vectors

Query Database

SLIDE 57

Computer Vision

Summary

Search in the Database Search in the Database

Keypoint Detection Keypoint Recognition

Database Database Pre-processing Make the actual classification easier Robust 3D Pose Calculation (RANSAC) Robust 3D Pose Calculation (RANSAC)

Geometric verification

SLIDE 58

Computer Vision

[Wagner et al. ISMAR’08]

SLIDE 59

Computer Vision

[Wagner et al. ‘09]

SLIDE 60

Computer Vision

Region Tracking Region Tracking

SLIDE 61

Computer Vision

Background Modeling

Input Background Model Moving Foreground Blobs (Objects)

SLIDE 62

Computer Vision

Mean Shift Tracking

The mean shift tracker tracks a region,

with a prescribed (color) distribution

The similarity between the tracked

region and the target region is

9]

region and the target region is maximized, through evolution towards higher density in a parameter space

Typically this search only takes a few

iterations

[Comaniciu and Meer, ICCV’99

SLIDE 63

Computer Vision

Meanshift Tracking

Region of interest (Kernel) Center of mass Mean Shift vector Measurements

SLIDE 64

Computer Vision

Intuitive Description

SLIDE 65

Computer Vision

Intuitive Description

SLIDE 66

Computer Vision

Intuitive Description

SLIDE 67

Computer Vision

Intuitive Description

SLIDE 68

Computer Vision

Intuitive Description

SLIDE 69

Computer Vision

Intuitive Description

SLIDE 70

Computer Vision

Intuitive Desciption

SLIDE 71

Computer Vision

Example

SLIDE 72

Computer Vision

Elderly People Monitoring

SLIDE 73

Computer Vision

Model based Model based Tracking

SLIDE 74

Computer Vision

Articulated Tracking with Part-Based Model

part appearance + relative geometry.

SLIDE 75

Computer Vision

Using Models

Goal

– Recover a person’s body articulation – Detailed parameterization in terms of joint locations or joint angles

Two basic classes of approaches

– Articulated tracking as high- dimensional inference – Part-based models

SLIDE 76

Computer Vision

[Ramanan et al. CVPR’05]

SLIDE 77

Computer Vision

Tracking as On-line Foreground vs. Background Background Classification

SLIDE 78

Computer Vision

Tracking as Classification

Learning current object appearance vs. local

background.

current background background

[Grabner et al. CVPR’06]

current

bject

appearance

SLIDE 79

Computer Vision

Tracking as Classification

bject

background vs.

SLIDE 80

Computer Vision

Tracking as Classification

bject

background vs.

SLIDE 81

Computer Vision

Tracking Loop

search Region actual object position from time t to t+1 evaluate classifier on sub-patches

+
create confidence map

analyze map and set new

bject position

update classifier (tracker)

SLIDE 82

Computer Vision

SLIDE 83

Computer Vision

“Simple tracking”

SLIDE 84

Computer Vision

“Tracking the Invisible”

SLIDE 85

Computer Vision

When does it fail…

SLIDE 86

Computer Vision

search Region actual object position from time t to t+1 evaluate classifier on sub-patches

+
create confidence map

analyze map and set new object position update classifier (tracker)

Self-learning!

SLIDE 87

Computer Vision

Drifting

Tracked Patches Confidence

SLIDE 88

Computer Vision

Drifting

SLIDE 89

Computer Vision

Tracking by Detection Detection

(object class)

SLIDE 90

Computer Vision

Traditional Tracking

t=1 initialization t=2 position in prev. frame candidate new positions (e.g., dynamics) best new position (e.g., max color similarity)

SLIDE 91

Computer Vision

Tracking-by-Detection

…

detect object(s) independently in each frame associate detections over time into tracks

SLIDE 92

Computer Vision

Multiple Objects

Frame 5 Frame 1 Frame 9

SLIDE 93

Computer Vision

Example: Multiple Object Tracking

SLIDE 94

Computer Vision

How to get the detections?

Persons Background

Supervised Learning

SLIDE 95

Computer Vision

Using the classifier

SLIDE 96

Computer Vision

How to link them?

Space-Time Analysis:

(a) collect detections

Detections Space Time Volume [Leibe et al. CVPR’07]

SLIDE 97

Computer Vision

Trajectory Estimation

(a) collect detections (b) trajectory growing and selection

t x t z

Space Time Volume

SLIDE 98

Computer Vision

Trajectory Estimation

(a) collect detections (b) trajectory growing and selection

t x t z

H1 H2

Space Time Volume

SLIDE 99

Computer Vision

Result

Input (Object Detections) “Tracking” Result

SLIDE 100

Computer Vision

SLIDE 101

Computer Vision

More information helps…

Articulated tracking

– “walking” people

3D Information

Ground Plane Depth verification

SLIDE 102

Computer Vision

[Gammeter et al. ECCV’08]

SLIDE 103

Computer Vision

Towards Scene Interpretation

SLIDE 104

Computer Vision

Combining Tracking and Tracking and Detection

SLIDE 105

Computer Vision

Refining an object model

Limit drift

Current Model Fix (initial) Model

[Grabner et al. ECCV’08]

SLIDE 106

Computer Vision

Recover from Drift

SLIDE 107

Computer Vision

Drifting

CLICK HERE TO START

SLIDE 108

Computer Vision

Combination: KLT & TbD

Use a KLT Tracker

to explore

Learn an object

detector on the fly.

[Kalal et al. CVPR’10]

SLIDE 109

Computer Vision

SLIDE 110

Computer Vision

SLIDE 111

Computer Vision

SLIDE 112

Computer Vision

Context in Tracking in Tracking

SLIDE 113

Computer Vision

I’m Carl – Track me…

[Grabner et al. CVPR’10]

SLIDE 114

Computer Vision

Tracking Carl

SLIDE 115

Computer Vision

SUPPORTERS…

… came with different strength.
… change over time.

SLIDE 116

Computer Vision

SUPPORTERS…

… came with different strength.
… change over time.

SLIDE 117

Computer Vision

SUPPORTERS help Tracking of…

… objects which

change there appearance very quickly.

… occluded
bjects or object
utside the image.
… small and/or

low textured

bjects or even

“virtual points”.

SLIDE 118

Computer Vision

ETH-Cup Sequenze

SLIDE 119

Computer Vision

ETH-Cup: Humans

SLIDE 120

Computer Vision

Of the Web Tracker

SLIDE 121

Computer Vision

ETH-Cup: Supporters

SLIDE 122

Computer Vision

Beyond the Image

Supporters

SLIDE 123

Computer Vision

Coupled Motion

Supporters

SLIDE 124

Computer Vision

Changing Supporters

Supporters

SLIDE 125

Computer Vision

Obviously, there are failure cases…

…. and magician knows that.

Supporters

SLIDE 126

Computer Vision

Tracking Issues Tracking Issues

SLIDE 127

Computer Vision

Tracking Requirements

Strongly depends on the application!

Robust, Accurate, Fast,…

Constrain the tracking task!

Information about the object, dynamics, environment,…

SLIDE 128

Computer Vision

Tracking Issues

Initialization
bject position

Time t = 0

bject position

SLIDE 129

Computer Vision

Tracking Issues

Prediction vs. Correction

– If the dynamics model is too strong, will end up ignoring the data – If the observation model is too strong, tracking is reduced to repeated detection is reduced to repeated detection

http://www.ethlife.ethz.ch/archive_articles/091008_kalman_per/index

SLIDE 130

Computer Vision

Tracking Issues

Obtaining observation…

– Generative: “render” the state on top of the image and compare – Discriminative: classifier or detector score score

…and dynamics model

– specify using domain knowledge – learn (very difficult)

SLIDE 131

Computer Vision

Tracking Issues

Nonlinear

dynamics

– Sometimes needed to keep multiple trackers in parallel trackers in parallel – E.g., for abrupt direction changes („Persons“)

Wrong prediction Correct prediction

SLIDE 132

Computer Vision

Tracking Issues

Data Association - Multiple Object

Tracking

– What if we don’t know which measurements to associate with which tracks? tracks?

SLIDE 133

Computer Vision

Tracking Issues

Data Association – Fast Motion

SLIDE 134

Computer Vision

Tracking Issues

Data Association – Background /

Appearance Change

– Cluttered Background – Changes in shape, orientation, color,… – Changes in shape, orientation, color,…

SLIDE 135

Computer Vision

Tracking Issues

Data Association – Occlusions / Self

Occlusions

SLIDE 136

Computer Vision

Tracking Issues

Model- vs. Model-free-Tracking

SLIDE 137

Computer Vision

Tracking Issues

Drift

– Errors caused by dynamical model,

bservation model, and data association

tend to accumulate over time

SLIDE 138

Computer Vision