Object categorization: the constellation models
Li Fei-Fei
with many thanks to Rob Fergus with many thanks to Rob Fergus
Object categorization: the constellation models Li Fei-Fei with - - PowerPoint PPT Presentation
Object categorization: the constellation models Li Fei-Fei with many thanks to Rob Fergus with many thanks to Rob Fergus The People and slides credit Pietro Perona Andrew Zisserman Thomas Leung Mike Burl Markus Weber Max Welling Rob
Li Fei-Fei
with many thanks to Rob Fergus with many thanks to Rob Fergus
Pietro Perona Mike Burl Thomas Leung Markus Weber Rob Fergus Max Welling Li Fei-Fei Andrew Zisserman
Goal
Model: Parts and Structure
etc…
Parts and Structure Literature
Representation Detection Shape statistics – F&G ’95 Affine invariant shape – CVPR ‘98 CVPR ‘96 ECCV ‘98
Unsupervised Learning ECCV ‘00 Multiple views - F&G ’00 Discovering categories - CVPR ’00
Joint shape & appearance learning Generic feature detectors One-Shot Learning Incremental learning CVPR ’03 Polluted datasets - ECCV ‘04 ICCV ’03 CVPR ‘04
A B D C
Foreground model
Generative probabilistic model
Gaussian shape pdf
Clutter model
Uniform shape pdf
0.8 0.75 0.9
# detections
pPoisson(N2|λ2) pPoisson(N1|λ1) pPoisson(N3|λ3)
Assumptions: (a) Clutter independent of foreground detections (b) Clutter detections independent of each other
Example
N1 N2
N3
1. Run part detectors exhaustively over image
⎟ ⎟ ⎟ ⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ ⎛ = ⎟ ⎟ ⎟ ⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ ⎛ = 2 3 2 e.g.
4 3 2 1
h N N N N h K K K K
1 2 3 3 2 4 1 1 2 3 1 2
) , | ( ) , | ( Hyp Clutter Data p Hyp Object Data p
– Joint model of part locations – Ability to deal with background clutter and occlusions
– Manual construction of part detectors – Estimate parameters of shape density
– Run part detectors over image – Try combinations of features in model – Use efficient search techniques to make fast
Weber & Welling et. al.
Unsupervised detector training - 1
10 10
Unsupervised detector training - 2
“Pattern Space” (100+ dimensions)
Unsupervised detector training - 3
100-1000 images ~100 detectors
Estimation of model parameters
Learning
learn them and the model parameters
... Image 1 Image 2 Image i
Large P Small P
Large P x + Small P x pdf new estimate of μ + … =
Parameter Estimation
Choice 1 Choice 2
Parameter Estimation
Model 1 Model 2 Predict / measure model performance (validation set or directly from model) Detectors (≈100)
(Greedy search)
Pre-selected Parts Model Foreground pdf Sample Detection Parts in Model Test Error: 6% (4 Parts)
Preselected Parts Model Foreground pdf Sample Detection Parts in Model
Test Error: 13% (5 Parts)
3D Object recognition – Multiple mixture components
Frontal Profile
20 40 60 80 100 50 55 60 65 70 75 80 85 90 95 100
Orientation Tuning
angle in degrees % Correct
% Correct
– Multiple mixture components for different viewpoints
– Now semi-unsupervised – Automatic construction and selection of part detectors – Estimation of parameters using EM
– As before
– Slow (many combinations of detectors)
– Difficult to learn part that has stable location but variable appearance – Each detector is used as a cross-correlation filter, giving a hard definition of the part’s appearance
the object
Fergus et. al. CVPR ‘03
Detection & Representation of regions
Appearance Location Scale
(x,y) coords. of region centre Radius of region (pixels) 11x11 patch Normalize Projection onto PCA basis c1 c2 c15
………..
Gives representation of appearance in low-dimensional vector space
(Kadir & Brady 01)
Motorbikes example
Foreground model
Gaussian shape pdf Poission pdf on # detections Uniform shape pdf
Generative probabilistic model (2)
Clutter model
Gaussian part appearance pdf Gaussian background appearance pdf
0.8 0.75 0.9
Gaussian relative scale pdf
log(scale)
Uniform relative scale pdf
log(scale)
based on Burl, Weber et al. [ECCV ’98, ’00]
Motorbikes
Samples from appearance model
Recognized Motorbikes
Background images evaluated with motorbike model
Frontal faces
Spotted cats
Summary of results
10.0 10.0 Spotted cats 9.7 15.2 Cars (Rear) 7.0 9.8 Airplanes 4.6 4.6 Faces 6.7 7.5 Motorbikes
Scale invariant experiment Fixed scale experiment Dataset
% equal error rate Note: Within each series, same settings used for all datasets
Comparison to other methods
Agarwal Roth [ECCV ’02] 21.0 11.5 Cars (Side) Weber 32.0 9.8 Airplanes Weber 6.0 4.6 Faces Weber et al. [ECCV ‘00] 16.0 7.5 Motorbikes Others Ours Dataset
% equal error rate
needed
– Tried ICA, FLD, Oriented filter responses etc. – But PCA worked best
machine learning community
Fei-Fei et. al. ICCV ‘03
Faces, Cars ~2,000
Schneiderman, et al.
Faces ~500
Rowley et al.
Faces, Motorbikes, Spotted cats, Airplanes, Cars 200 ~ 400
Burl, et al. Weber, et al. Fergus, et al.
Faces ~10,000
Viola et al.
Categories Training Examples Algorithm
1 2 3 4 5 6 7 8 9 10 20 30 40 50 60
log2 (Training images) Classification error (%) Generalisation performance Test Train
Previously 6 part Motorbike model
Prior knowledge: means
Shape Appearance
likely unlikely
Bayesian framework
P(object | test, train) vs. P(clutter | test, train)
)
( ) train
| test ( p p
Bayes Rule
θ θ θ d p p
) train
| ( )
, | test (
Expansion by parametrization
Bayesian framework
ML
Previous Work: P(object | test, train) vs. P(clutter | test, train)
)
( ) train
| test ( p p
Bayes Rule
θ θ θ d p p
) train
| ( )
, | test (
Expansion by parametrization
Bayesian framework
One-Shot learning:
( ) ( )
θ θ p p
, train
P(object | test, train) vs. P(clutter | test, train)
)
( ) train
| test ( p p
Bayes Rule
θ θ θ d p p
) train
| ( )
, | test (
Expansion by parametrization
θ1 θ2 θn
model (θ) space
Each object model θ
Gaussian shape pdf Gaussian part appearance pdf
Model Structure
θ2 θn
model distribution: p(θ)
θ1
model (θ) space
Each object model θ
Gaussian shape pdf Gaussian part appearance pdf
Model Structure
Learning Model Distribution
Variational EM (Attias, Hinton, Minka, etc.)
( ) ( ) ( )
θ θ θ p p p
, train train
∝
Random initialization
Variational EM Variational EM
prior knowledge of p(θ)
new estimate
new θ’s
Experiments
Training: 1- 6 randomly drawn images Testing: 50 fg/ 50 bg images
Datasets
spotted cats airplanes motorbikes faces
[www.vision.caltech.edu]
Faces Airplanes Motorbikes Spotted cats
Experiments: obtaining priors
spotted cats airplanes motorbikes faces
Miller, et al. ‘00 model (θ) space
Experiments: obtaining priors
spotted cats faces airplanes motorbikes
model (θ) space
Number of training examples
Number of training examples
Number of training examples
Number of training examples
7.5 – 24.1% Faces ~500
Rowley et al.
5.6 – 17% Faces, Cars ~2,000
Schneiderman, et al.
8 – 15 %
Faces, Motorbikes, Spotted cats, Airplanes
1 ~ 5
Bayesian One-Shot
5.6 - 10 % Faces, Motorbikes, Spotted cats, Airplanes, Cars 200 ~ 400
Burl, et al. Weber, et al. Fergus, et al.
7-21% Faces ~10,000
Viola et al.
Results(e rror) Categories Training Examples Algorithm
(legs of camel, curve of wheels for motorbikes)
(b) commonly selected images