Motion
St Stanfor
- rd University
19-Nov-2019 1
Lecture: Motion Juan Carlos Niebles and Ranjay Krishna Stanford - - PowerPoint PPT Presentation
Motion Lecture: Motion Juan Carlos Niebles and Ranjay Krishna Stanford Vision and Learning Lab 19-Nov-2019 1 St Stanfor ord University CS 131 Roadmap Motion Pixels Segments Images Videos Web Neural networks Convolutions Recognition
Motion
St Stanfor
19-Nov-2019 1
Motion
St Stanfor
19-Nov-2019 2
Convolutions Edges Descriptors
Resizing Segmentation Clustering Recognition Detection Machine learning
Motion Tracking
Neural networks Convolutional neural networks
Motion
St Stanfor
19-Nov-2019 3
Reading: [Szeliski] Chapters: 8.4, 8.5
[Fleet & Weiss, 2005] http://www.cs.toronto.edu/pub/jepson/teaching/vision/2503/opticalFlow.pdf
Motion
St Stanfor
19-Nov-2019 4
Reading: [Szeliski] Chapters: 8.4, 8.5
[Fleet & Weiss, 2005] http://www.cs.toronto.edu/pub/jepson/teaching/vision/2503/opticalFlow.pdf
Motion
St Stanfor
19-Nov-2019 5
Motion
St Stanfor
19-Nov-2019 6
Motion
St Stanfor
19-Nov-2019 7
Motion
St Stanfor
19-Nov-2019 8
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 9 Picture courtesy of Selim Temizer - Learning and Intelligent Systems (LIS) Group, MIT
Vector field function of the spatio-temporal image brightness variations
Motion
St Stanfor
19-Nov-2019 10
frame
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 11
* Slide from Michael Black, CS143 2003
Motion
St Stanfor
19-Nov-2019 12
* Slide from Michael Black, CS143 2003
Motion
St Stanfor
19-Nov-2019 13
* Slide from Michael Black, CS143 2003
Motion
St Stanfor
19-Nov-2019 14
t y x
Image derivative along x
T + It = 0
Source: Silvio Savarese
Image derivative along t
Motion
St Stanfor
19-Nov-2019 15
Motion
St Stanfor
19-Nov-2019 16
edge (u,v) (u’,v’) gradient (u+u’,v+v’)
If (u, v ) satisfies the equation, so does (u+u’, v+v’ ) if
T = 0
T + It = 0 Source: Silvio Savarese
∇𝐽
Motion
St Stanfor
19-Nov-2019 17
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 18
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 19
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 20
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 21
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 22
Motion
St Stanfor
19-Nov-2019 23
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 24
Reading: [Szeliski] Chapters: 8.4, 8.5
[Fleet & Weiss, 2005] http://www.cs.toronto.edu/pub/jepson/teaching/vision/2503/opticalFlow.pdf
Motion
St Stanfor
19-Nov-2019 25
– If we use a 5x5 window, that gives us 25 equations per pixel
679, 1981.
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 26
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 27
The summations are over all pixels in the K x K window
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 28
– Optimal (u, v) satisfies Lucas-Kanade equation
Does this remind anything to you?
– eigenvalues l1 and l 2 of ATA should not be too small
– l 1/ l 2 should not be too large (l 1 = larger eigenvalue)
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 29
the direction of fastest intensity change
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 30
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 31
– gradients very large or very small – large l1, small l2
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 32
– small l1, small l2
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 33
– large l1, large l2
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 34
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 35
– Can solve using Newton’s method (out of scope for this class) – Lukas-Kanade method does one iteration of Newton’s method
– To do better, we need to add higher order terms back in:
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 36
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 37
– the set is stationary yet things seem to move
– nothing seems to move, yet it is rotating
– for example, if the specular highlight on a rotating sphere moves.
– And infinitely more break downs of optical flow.
Motion
St Stanfor
19-Nov-2019 38
Motion
St Stanfor
19-Nov-2019 39
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 40
– Probably not—it’s much larger than one pixel (2nd order terms dominate) – How might we solve this problem?
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 41
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 42
image I image H
Gaussian pyramid of image 1 Gaussian pyramid of image 2 image 2 image 1
u=10 pixels u=5 pixels u=2.5 pixels u=1.25 pixels
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 43
image I image J
Gaussian pyramid of image 1 (t) Gaussian pyramid of image 2 (t+1) image 2 image 1
run iterative L-K run iterative L-K warp & upsample
. . .
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 44
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 45
* From Khurram Hassan-Shafique CAP5415 Computer Vision 2003
Motion
St Stanfor
19-Nov-2019 46
Reading: [Szeliski] Chapters: 8.4, 8.5
[Fleet & Weiss, 2005] http://www.cs.toronto.edu/pub/jepson/teaching/vision/2503/opticalFlow.pdf
Motion
St Stanfor
19-Nov-2019 47
Motion
St Stanfor
19-Nov-2019 48
Motion
St Stanfor
19-Nov-2019 49
Motion
St Stanfor
19-Nov-2019 50
Motion
St Stanfor
19-Nov-2019 51
Motion
St Stanfor
19-Nov-2019 52
Motion
St Stanfor
19-Nov-2019 53
Motion
St Stanfor
19-Nov-2019 54
Motion
St Stanfor
19-Nov-2019 55
Regularized flow Optical flow
Slide credit: Sebastian Thurn
Motion
St Stanfor
19-Nov-2019 56
Motion
St Stanfor
19-Nov-2019 57
Motion
St Stanfor
19-Nov-2019 58
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 59
Motion
St Stanfor
19-Nov-2019 60
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 61
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 62
6 5 4 3 2 1
t y x
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 63
6 5 4 3 2 1
t y x
6 5 4 3 2 1
2
t y x
6 5 4 3 2 1
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 64
– Divide the image into blocks and estimate affine motion parameters in each block by least squares
–Merge clusters that are close and retain the largest clusters to obtain a smaller set of hypotheses to describe all the motions in the scene
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 65
– Divide the image into blocks and estimate affine motion parameters in each block by least squares
–Merge clusters that are close and retain the largest clusters to obtain a smaller set of hypotheses to describe all the motions in the scene
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 66
– Divide the image into blocks and estimate affine motion parameters in each block by least squares
–Merge clusters that are close and retain the largest clusters to obtain a smaller set of hypotheses to describe all the motions in the scene
–Pixels with high residual error remain unassigned
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 67
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 68
Motion
St Stanfor
19-Nov-2019 69
Motion
St Stanfor
19-Nov-2019 70
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 71
– A static camera is observing a scene – Goal: separate the static background from the moving foreground
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 72
– Segment the video into multiple coherently moving objects
Proceedings of the British Machine Vision Conference (BMVC) 2006
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 73 Z.Yin and R.Collins, "On-the-fly Object Modeling while Tracking," IEEE Computer Vision and Pattern Recognition (CVPR '07), Minneapolis, MN, June 2007.
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 74
Motion
St Stanfor
19-Nov-2019 75
Example: A set of low quality images
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 76
Each of these images looks like this:
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 77
The recovery result:
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 78
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 79 Juan Carlos Niebles, Hongcheng Wang and Li Fei-Fei, Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words, (BMVC), Edinburgh, 2006.
Motion
St Stanfor
19-Nov-2019 80
Source: Silvio Savarese
Motion
St Stanfor
19-Nov-2019 81
People", 9th International Workshop on Visual Surveillance (VSWS09) in conjuction with ICCV 09
Motion
St Stanfor
19-Nov-2019 82
Motion
St Stanfor
19-Nov-2019 83
Motion
St Stanfor
19-Nov-2019 84
[Fleet & Weiss, 2005] http://www.cs.toronto.edu/pub/jepson/teaching/vision/2503/opticalFlow.pdf
Reading: [Szeliski] Chapters: 8.4, 8.5