Filters and other potions
- P. Perona - Caltech
MIT - 21 November 2013
Filters and other potions P. Perona - Caltech MIT - 21 November - - PowerPoint PPT Presentation
Filters and other potions P. Perona - Caltech MIT - 21 November 2013 what ? where Architectures Architecture 1 building train The vision black box Marble Ripe torso bananas Image(s) Grouping: image regions Surface shape, motor
Filters and other potions
MIT - 21 November 2013
?
what where
Architectures
Architecture 1
Image(s) The vision black box
Ripe bananas
Marble torsotrain building
Feature extraction: texture stereo disparity color contrast motion flow edgels …. Surface shape, scene depth, spatial relationships, 3D motion Grouping: image regions PerceptualImage processing Regions and surfaces Objects, verbs, categories…
motor cognition
[Marr ’82]
features?
Le Corbusier, Villa Savoye http://flickr.com/photos/ikura/1398271367/edges
http://www.iit.edu/~stawraf/perspx.jpg Le Corbusier, Villa Savoye[Fukushima ‘80]
Architecture 2
[DeValois ’85]
Column
Hypercolumn
Dense sampling
translation, rotation invariance
[LeCun et al. 1998]
scale invariance
[Lowe 2004]
[Hinton et al. ’12]
translation, rotation, scale invariance
96 filters 6 orientations 2 center-surround 14 scale samples over 2.2 binary octaves
Detection Performance
Caltech pedestrians: 1M frames, 250K hand-annotated
Detection Performance
Detection Performance
Dollar et al. ‘10 Dollar et al. ‘08 Viola & Jones ‘01 Dalal-Triggs ‘05 * Walk et al. ‘10filter technology
Scale, orientation, elongation…. lots of CPU cycles
how do we make computations efficient?
Separability
[Adelson & Bergen, ’85]
Cost = m x n Cost = m + n R(i, j) = X
h=1:M,k=1:Nk(h, k)I(i − h, j − k)
R(i, j) = X
h=1:MX
k=1:Nk(h)k0(k)I(i − h, j − k)
Separability and decomposition
[Adelson & Bergen, ’85]
Steerability
[Freeman & Adelson, ’91]
General decomposition
k(x, θ) =
D
X
i=1
bi(θ)fi(x)
k(x, y) =
D
X
i=1
fi(x)gi(y)
k(x, y; θ) =
D
X
i=1
bi(θ)fi(x)gi(y)
Design?
x
θ
=
k(x; θ)
D
bi(θ)
θ
x
σi,i
fi(x)
Approximation
K(x, y; θ) =
D
X
i=1
bi(θ)fi(x, y)
K(x, y; θ) ≈
R
X
i=1
bi(θ)fi(x, y)
R ⌧ D
[Perona ’95]
[Perona ’95]
[Perona ’95]
Tensor Factorization
k(x, y; θ) =
D
X
i=1
bi(θ)fi(x)gi(y)
[Shy, Perona ’96]
Including scale by resampling
[Manduchi et al. ’98] [cfr. Simoncelli et al]
Exploiting Image Statistics
upsampled
sampling the gradient
[Dollar et al. 2013]
Gradient histograms
[Dollar et al. 2013]
Power law feature scaling
Power law feature scaling
Individual images
[Dollar et al. 2013]
Fast computations
Fast computations
[Dollar et al. 2013]
Performance
[Dollar et al. 2013]
Conclusions