Human Shape and Pose Tracking Using Keyframes: Supplementary Material
Chun-Hao Huang§, Edmond Boyer†, Nassir Navab§, Slobodan Ilic§
§Department of Computer Science, Technische Universit¨
at M¨ unchen
†LJK-INRIA Grenoble Rhˆ
- ne-Alpes
{huangc,slobodan.ilic,navab}@in.tum.de, edmond.boyer@inria.fr
image clean silhouette annotated joints
- riginal silhouette
Figure 1. Example images, generated clean silhouettes, and anno- tated joint positions of WalkChair.
The supplementary material for the paper Human Shape and Pose Tracking Using Keyframes consists of this docu- ment and the accompanying video. It provides more details
- n the newly recorded sequences and more analysis on the
experiment results.
- 1. New recorded sequences
In Fig. 1, we show one example frame of the newly recorded sequences. The occluding object, i.e. the chair, is kept after background subtraction, and therefore remains in the subsequent reconstructed point cloud. The reference surfaces at t = 0 is the smoothed reconstructed visual hulls. There is no need to register the surface to the point cloud with a rigid transformation to initialize the tracking. We produce two different types of ground truth for eval- uating shapes and poses, respectively. For shape evaluation, we remove the silhouettes of irrelevant objects manually, if they are not connected to the subjects, as shown in Fig. 1. The associated metric is the standard silhouette overlap er- ror which measures the discrepancies between the contour
- f the projected surface and the contour in the observed
- silhouettes. To evaluate the estimated poses, we annotate
the positions of joints in five cameras, and see how close to them the estimated joints are (2D joint error). The se- quences and the associated ground truths will be publicly available upon publication.
t = 0
95 (5) 198 (24)
6715
97 (21) 211 (480)
7125
71 (2) 87 (5)
…
191 (191)
…
260 (189)
…
405 (344)
0.1 6983 0.31
(estimated)
0.5
t = 0
0.8 0.1
no other keyframes generated
3881
21 (5) 14 (0) 15 (4)
3593
19 (3) 20 (36)
3573
Bandwidth
0.44
(estimated)
Keyframes besides t = 0 Error
Ref
Figure 2. Generated keyframe pool of Skirt [2] (top) and Ham- merTable (bottom) in varying mean-shift bandwidths.
- 2. Supplementary results
Influence of mean-shift bandwidth. In Fig. 2 we visual- ize the generated keyframe pools of Skirt and HammerTable in different bandwidths. Two sequences are chosen because the subjects repeat the actions. With small bandwidths, we
- bserve many similar key poses, which however does not