Human Motion Tracking by Registering an Articulated Surface to 3-D - - PowerPoint PPT Presentation

human motion tracking by registering an
SMART_READER_LITE
LIVE PREVIEW

Human Motion Tracking by Registering an Articulated Surface to 3-D - - PowerPoint PPT Presentation

Human Motion Tracking by Registering an Articulated Surface to 3-D Points and Normals 1 Radu Horaud (Radu.Horaud@inrialpes.fr) Matti Niskanen, Guillaume Dewaele, Edmond Boyer. INRIA Grenoble Rhone-Alpes, France


slide-1
SLIDE 1

Human Motion Tracking by Registering an Articulated Surface to 3-D Points and Normals1

Radu Horaud (Radu.Horaud@inrialpes.fr) Matti Niskanen, Guillaume Dewaele, Edmond Boyer. INRIA Grenoble Rhone-Alpes, France

http://perception.inrialpes.fr/Publications/2009/HNDB09/ 1IEEE Trans. on PAMI, vol. 31, no. 1, pp 158–164, Jan. 2009 Horaud et al. Human Motion Tracking

slide-2
SLIDE 2

Introduction

We address human-motion capture using 3D data gathered with several videos; We fit an articulated implicit surface to 3D data, namely points and normals; We introduce a new data-to-ellipsoid distance, and We derive an expectation-maximization (EM) algorithm that performs the fitting in a reliable and robust maner.

Horaud et al. Human Motion Tracking

slide-3
SLIDE 3

Outline

Kinematic model Implicit surface 3D data Fitted model

Horaud et al. Human Motion Tracking

slide-4
SLIDE 4

The articulated model representation

Articulated implicit surface: f(y, Λ) =

P

  • p=1

exp

  • −d2

M(y, xp(Λ))

ν2

p

  • y ∈ S ⇔ f(y) = C,

xp ∈ Qp (an ellipsoid), dM is the data-to-ellipsoid Mahalanobis distance, Λ are the kinematic parameters.

Horaud et al. Human Motion Tracking

slide-5
SLIDE 5

Problem formulation

The kinematic parameters may be found by minimizing the following non-linear criterion: Λ∗ = arg min Λ  −ν2

I

  • i=1

log

P

  • p=1

exp

  • −d2

M(yi, xip(Λ))

ν2  

Horaud et al. Human Motion Tracking

slide-6
SLIDE 6

The data-to-ellipsoid distance

d2

M = (yi − xip(Λ, ni))⊤Σp−1(yi − xip(Λ, ni))

(See Horaud et al. PAMI’09 for details)

Horaud et al. Human Motion Tracking

slide-7
SLIDE 7

The advantage of the proposed distance

Horaud et al. Human Motion Tracking

slide-8
SLIDE 8

Probabilistic implicit surface model (1)

The likelihood of a data point (conditioned by its assignment to an ellipsoid) is drawn from a Gaussian (normal) distribution: P(yi|zi = p) = N(yi|xip(Λ), Σp) N(yi|xip(Λ), Σp) = 1 (2π)3/2|Σp)|1/2 exp

  • −1

2d2

M(yi, xp(Λ))

  • the notation zi = p means that the ith data point

(observation) is assigned to the pth ellipsoid. Z = {z1, . . . , zI} is the set of hidden variables also referred to as missing data as opposed to the observed data.

Horaud et al. Human Motion Tracking

slide-9
SLIDE 9

Probabilistic implicit surface model (2)

Apply the sum and product rules of probabilities: P(yi) =

P

  • p=1

P(yi|zi = p) P(zi = p)

  • πp

πp are the priors (the mixing parameters) with

p πp = 1

It is assumed that the yi are independent and identically distributed, or i.i.d.: P(y1, . . . , yI) =

I

  • i=1

P(yi)

Horaud et al. Human Motion Tracking

slide-10
SLIDE 10

The (negative) log-likelihood

The observed-data log-likelihood: log P(y1, . . . , yI) =

I

  • i=1

log P(yi) The negative observed-data log-likelihood becomes: −

I

  • i=1

log

P

  • p=1

πp (2π)3/2|Σp)|1/2 exp

  • −1

2d2

M(yi, xp(Λ))

  • Horaud et al.

Human Motion Tracking

slide-11
SLIDE 11

The two formulas, side by side

Articulated implicit surface: arg min  −ν2

I

  • i=1

log

P

  • p=1

exp

  • −d2

M(yi, xip(Λ))

ν2   Observed-data log-likelihood (probabilistic implicit surface): arg min  −

I

  • i=1

log

P

  • p=1

πp (2π)3/2|Σp)|1/2 exp

  • −1

2d2

M(y, xp(Λ))

  Both functions are difficult to minimize.

Horaud et al. Human Motion Tracking

slide-12
SLIDE 12

The expected complete-data log-likelihood

Replace the maximization of the observed-data log-likelihood with the maximization of the expected complete-data log-likelihood – Expectation-Maximization (EM). complete-data means the observed-data (y1, . . . , yI) and the missing-data (z1, . . . , zI): max E[log P(y1, . . . , yI, z1, . . . , zI|y1, . . . , yI)] EM was introduced by Dempster, Laird, and Rubin in 1977: ”Maximum likelihood from incomplete data via the EM algorithm” (Google scholar: Cited by 18071) We provide an EM algorithm that fits an articulated implicit surface to a set of 3D observed points and normals.

Horaud et al. Human Motion Tracking

slide-13
SLIDE 13

EM for point-to-surface fitting

  • Initialization. Provide the kinematic parameters Λ(q) the

covariance matrix Σ(q) common to all the ellipsoids, and set π(q)

1

= . . . = π(q)

P

= 1/P; E step. Evaluate the posterior probabilities t(q)

ip using the

current parameter values: t(q)

ip = π(q) p P(yi|zi = p)

P(yi) M step. Estimate new values for the kinematic parameters Λ(q+1): arg min Λ 1 2

I

  • i=1

P

  • p=1

t(q)

ip (yi − xip(Λ))⊤Σ(q)−1(yi − xip(Λ))

Horaud et al. Human Motion Tracking

slide-14
SLIDE 14

EM (continued)

M step (continued). Update the covariance matrix and the priors: Σ(q+1) = 1 I

i=1

P

p=1 t(q) ip I

  • i=1

P

  • p=1

t(q)

ip (yi − xip(Λ(q+1)))(yi − xip(Λ(q+1)))⊤

π(q+1)

p

= 1 I

I

  • i=1

t(q)

ip

Maximum likelihood. Evaluate the observed-data log-likelihood and check for convergence.

Horaud et al. Human Motion Tracking

slide-15
SLIDE 15

Practical considerations

We added a uniform-noise component to the Gaussian

  • mixture. This component captures the outliers such that they

do not have an impact on the estimated parameters. The M-step consists in a non-linear minimization procedure followed by closed-form solutions for the statistical parameters. We used a common covariance to all the Gaussian components in the mixture to reduce problems associated with small covariances. A good initial solution is necessary.

Horaud et al. Human Motion Tracking

slide-16
SLIDE 16

Results

We used a multiple-camera setup to gather several videos. At each time step we build a visual hull (Franco, Boyer, Lapierre 3DPVT’06, Franco & Boyer PAMI’09). We compared the results with the classical distance & model fitting.

Horaud et al. Human Motion Tracking

slide-17
SLIDE 17

Camera layout

Horaud et al. Human Motion Tracking

slide-18
SLIDE 18

Taekwendo (images and silhouettes)

Horaud et al. Human Motion Tracking

slide-19
SLIDE 19

Taekwendo (3D data, points and normals, points only)

Horaud et al. Human Motion Tracking

slide-20
SLIDE 20

Conclusions

A principled probabilistic framework for fitting an articulated implicit surface to 3D points (and normals). A modified EM algorithm that estimates simultaneously the kinematic and the statistical parameters of the model. Proper initializatin is crucial. In the future we plan to combine with spectral clustering/matching to solve for many unsolved problems (Diana Mateus, Avinash Sharma, David Knossow, Radu Horaud).

Horaud et al. Human Motion Tracking