Siyu Tang
Holis&c Vision Group Max Planck Ins&tute for Intelligent Systems 04 July 2018
People Tracking and Pose Es5ma5on by Graph Decomposi5on Siyu Tang - - PowerPoint PPT Presentation
People Tracking and Pose Es5ma5on by Graph Decomposi5on Siyu Tang Holis&c Vision Group Max Planck Ins&tute for Intelligent Systems 04 July 2018 Overview Graph Decomposition and Multicut Multi-person Tracking Minimum cost
Holis&c Vision Group Max Planck Ins&tute for Intelligent Systems 04 July 2018
2
3
4
y ∈ {0, 1}E
d ∈ RE
min
y∈{0,1}E
X
e∈E
deye subject to ∀C ∈ cycles(G) ∀e ∈ C : (1 − ye) ≤ X
e0∈C\{e}
(1 − ye0)
5
6
Input video Our tracking result People detec&on
7
[Andriluka et al@CVPR’08; Zhang et al @CVPR’08; Shitrit et al @ ICCV’11; Pirsiavash et al@CVPR’11; Kuo et al @CVPR’11; Chen et al @CVPR’14; Wang et al@ECCV’16; Schulter at al @CVPR’17]
8
9
10
Within-frame edges Long-range edges
11
Red dots: detec&on hypotheses
12
Red lines: linking hypotheses
13
14
15
x2{0,1}V y2{0,1}E
v2V
e2E
e02C\{e}
Consistency Transi5vity
16
x2{0,1}V y2{0,1}E
v2V
e2E
[Kernighan&Lin@Bell System Technical Journal’1970]
e02C\{e}
Consistency Transi5vity
17
x2{0,1}V y2{0,1}E
v2V
e2E
f (e)
e = hθ, f (e)i
e02C\{e}
Consistency Transi5vity
18
Time
19
20
y2{0,1}E0
e2E0
X
2
X
e02C\{e}
(1 − ye0) ∀vw ∈ E0 \ E ∀P ∈ vw-paths(G) : (1 − yvw) ≤ X
e2P
(1 − ye) ∀vw ∈ E0 \ E ∀C ∈ vw-cuts(G) : yvw ≤ X
e2C
ye
21
v1 v2 v3 v4
1
3
v1 v2 v3 v4
1
3
Transi5vity
3 3
3 3
22
CVPR 17 CVPR 17 arXiv’ 16 ECCVW' 16
23
24
25
26
min
x
X
d∈D
X
c∈C
αdc xdc
min
y
X
dd0∈E
βdd0 ydd0
27
Consistency Transi5vity Uniqueness
28
instance labels.
(Image from Zhang et al. 2015)
[Tang et al. CVPR 15, CVPR17] [Tang et al. CVPR 15] [Insafutdinov et al. CVPR17] [Kirillov et al. CVPR17]
29
30
min
x∈{0,1}E
X
e∈E
ce xe subject to ∀C ∈ cc(G) ∀e ∈ C : xe ≤ X
e0∈C\{e}
xe0 .
min
x∈{0,1}E
X
e∈E
ce xe + C X
C∈cc(G)
X
e∈C
xe Y
e0∈C\{e}
(1 − xe0) .
min
x∈{0,1}E
X
e∈E
ce xe + C X
{u,v,w}∈
V
3
xvw¯ xuw + ¯ xuvxvw¯ xuw + ¯ xuv¯ xvwxuw) .
31
min
x∈{0,1}E
X
e∈E
ce xe + C X
{u,v,w}∈
V
3
xvw¯ xuw + ¯ xuvxvw¯ xuw + ¯ xuv¯ xvwxuw) .
E(x) = X
i
ψU
i (xi) +
X
c
ψCycle
c
(xc)
ψpat
c
(xc) = ( γxc if xc ∈ Pc γmax
32
without CRF inference with CRF inference
33
without CRF inference with CRF inference
34
Head-Neck-Shou Shou-Elbo-Wris Neck-LHip-RHip Hip-Knee-Ankl Mean
1.68 3.40 1.41 3.83 2.60 Iter 1 1.12 2.79 1.06 3.17 2.04 Iter 2 1.01 2.58 0.89 2.82 1.81 Iter 3 0.96 2.47 0.87 2.79 1.76 Table 2: Ratio of non valid cycle. Numbers (%) represent the ratio of non valid cycle for four different types of cliques that are defined for adjacent body joints.
Method Head Shou Elbo Wris Hip Knee Ankl Mean Iqbal et al., [28] 58.4 53.9 44.5 35.0 42.2 36.7 31.1 43.1 pishchulin et al., [1] 89.4 84.5 70.4 59.3 68.9 62.7 54.6 70.0 Insafutdinov et al., [10] 78.4 72.5 60.2 51.0 57.2 52.0 45.4 59.5 Insafutdinov et al., [7] 88.8 87.0 75.9 64.9 74.2 68.8 60.5 74.3 Cao et al., [12] 91.2 87.6 77.7 66.8 75.4 68.9 61.7 75.6 Fang et al., [21] 88.4 86.5 78.6 70.4 74.4 73.0 65.8 76.7 Newell et al., [17] 92.1 89.3 78.9 69.8 76.2 71.6 64.7 77.5 Baseline 90.7 87.4 77.3 66.5 75.7 69.0 60.9 75.2 Our Method 90.8 87.7 78.2 67.2 76.0 69.4 61.5 75.9
Table 3: Comparison with the state-of-the-art on the MPII Human Pose dataset. 7
35
36
37