Fields of Parts & Friends
peter.gehler.net
Fields of Parts & Friends peter.gehler.net p i Detection + - - PowerPoint PPT Presentation
Fields of Parts & Friends peter.gehler.net p i Detection + Geometry p i Human Pose Estimation or Predict Predict Observation Observation Bounding Boxes Joint Locations Human Pose Estimation F (1) top X Y top F (2) top , head . . .
peter.gehler.net
pi
pi
Observation Predict Bounding Boxes Observation Predict Joint Locations
Observation Desired Output
. . .
Ytop Yhead Ytorso Yrarm Yrhnd Yrleg Yrfoot Ylfoot Ylleg Ylarm Ylhnd
X . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
F (1) top F (2) top,head
p(y|I, w) ∝ X
p
ψ(yp, I; w) + X
p∼p0
ψ(yp, yp0; w)
. . .
Ytop Yhead Ytorso Yrarm Yrhnd Yrleg Yrfoot Ylfoot Ylleg Ylarm Ylhnd
X . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
F (1) top F (2) top,head
[Johnson&Everingham, BMVC’10], [Yang&Ramanan, CVPR’11],[Eichner&Ferrari, ACCV’12], [Sapp et al., ECCV’10], [Tran&Forsyth, ECCV’10], [Wang et al., CVPR’11], [Agarwal&Triggs, PAMI’02], [Urtasun&Darrell, ICCV’09], [Ionescu et al., ICCV’11]
(∆x, ∆y)
θ
X
p
ψ(yp; I, w)
+ X
p∼p0
ψ(yp, yp0; I, w)
p(y|I, w) ∝
extensions are proposed:
[Johnson&Everingham, BMVC’10] [Yang&Ramanan, CVPR’11] [Eichner&Ferrari, ACCV’12] [Sapp et al., ECCV’10] [Tran&Forsyth, ECCV’10] [Wang et al., CVPR’11] [Agarwal&Triggs, PAMI’02] [Urtasun&Darrell, ICCV’09] [Ionescu et al., ICCV’11] …
result kinematic tree pairwise conditioning
II
IV position/rotation
...
50 100 150 200extra unary factors
III
appearance
50 100 150 200 50. . . I
poselets
Top detections Poselet cluster medoids
Possible pairwise factors
(∆x, ∆y)
θ
Yhead
X
ψ(yhead, x) Ytorso
X
ψ(ytorso, x) ψ(yhead, ytorso)
Possible body models
Top poselet detections Cluster medoids
Poselet Conditioned
Prediction Result
Baseline PS
Generic Tree Result
1000 training, 1000 testing images
Error: PCP percentage of correct parts
kinematic tree pairwise conditioing
II
result IV position/rotation
...
50 100 150 200unary factors
III
appearance
50 100 150 200 50. . . I
poselets
55.7
pairwise
60.9
Baseline PS unary
60.8
pairwise + unary 62.9
result kinematic tree pairwise conditioning
II
IV position/rotation
...
50 100 150 200unary factors
III
appearance
50 100 150 200 50. . . I
poselets
55.7
pairwise
60.9
Baseline PS unary
60.8
pairwise + unary 62.9
kinematic tree result IV position/rotation
...
50 100 150 200unary factors
III
appearance
50 100 150 200 50. . . I II
pairwise conditioning poselets
55.7
pairwise
60.9
Baseline PS unary
60.8
pairwise + unary 62.9
result kinematic tree pairwise conditioning
II
IV position/rotation
...
50 100 150 200unary factors
III
appearance
50 100 150 200 50. . . I
poselets
55.7
pairwise
60.9
Baseline PS unary
60.8
pairwise + unary 62.9
M A P
F u l l m
e l
P a r t M a r g i n a l s
P l a i n P i c t
i a l S t r u c t u r e s
M A P P a r t M a r g i n a l s
ICCV 2013
Joint model for body parts and body joints Mid-Level representation
ICCV 2013
rotation
Mixtures of DPM for local Appearance Rotation Dependent Part Detectors
(CNNs)
Setting PCP [%] model so far 62.9 Andriluka et al. CVPR 09 55.7 + flexible body model 56.9 + local mixtures 65.2 + Poselet conditioned unaries 68.5 + Poselet conditioned pairwise 69.0 Yang & Ramanan, CVPR 11 60.8 Eichner & Ferrari, ACCV 12 64.3 Ramakrishna et al. ECCV 14 67.6 Chen & Yuille arXiv 14 76.6
(Pose Inference Machines)
Rare poses Self-occlusion Strong foreshortening
Same color! Explain this then!
joint positions and occlusions 3D torso and head orientation part occlusions activity labels
xp
i ∈ {0, 1}, i = 1, . . . , |Yp|
. . , |Yp|
p = 1, . . . , P
Kiefel & Gehler, Human Pose Estimation with a Fields of Parts, ECCV 2014
Kiefel & Gehler, Human Pose Estimation with a Fields of Parts, ECCV 2014
(∆x, ∆y)
θ
space
p = 1, . . . , P
yp ∈ {1, . . . , M} × {1, . . . , N} = Yp
xp
i ∈ {0, 1}, i = 1, . . . , |Yp|
factors (bilateral, segmentation)
pictorial structures
Krähenbühl & Koltun, Efficient inference in fully connected CRFs with Gaussian edge potentials, NIPS 2011
ip = argmax
i∈Yp
Q10(xp
i = 1|I)
Q0(x|I, θ) → Q1(x|I, θ) → · · · → Q10(x|I, θ)
unaries (step 0)
Q5(x|I, θ) Q10(x|I, θ)
unaries (step 0)
Q5(x|I, θ) Q10(x|I, θ)
Q0(x|I, θ) → Q1(x|I, θ) → · · · → Q10(x|I, θ)
mean field inference
unaries (step 0)
Q5(x|I, θ) Q10(x|I, θ)
for performance
segmentation
Estimation, ICCV13
CVPR14
Bernt Schiele Martin Kiefel Micha Andriluka Leonid Pishchulin