Tow ards Understanding Articulated Objects Jrgen Sturm 1 Cyrill - - PowerPoint PPT Presentation

tow ards understanding articulated objects
SMART_READER_LITE
LIVE PREVIEW

Tow ards Understanding Articulated Objects Jrgen Sturm 1 Cyrill - - PowerPoint PPT Presentation

Tow ards Understanding Articulated Objects Jrgen Sturm 1 Cyrill Stachniss 1 Vijay Pradeep 2 Christian Plagemann 3 Kurt Konolige 2 Wolfram Burgard 1 1 University of Freiburg 2 Willow Garage 3 Stanford University SA-1 Motivation Domestic


slide-1
SLIDE 1

SA-1

Tow ards Understanding Articulated Objects

Jürgen Sturm 1 Cyrill Stachniss1 Vijay Pradeep2 Christian Plagemann3 Kurt Konolige2 Wolfram Burgard1

1University of Freiburg 2Willow Garage 3Stanford University

slide-2
SLIDE 2

Motivation

Domestic

environments

Articulated objects

Doors Drawers

Learning models of

  • bjects

Understanding the

spatial movements can improve

Perception Manipulation planning Navigation

slide-3
SLIDE 3

Problem Form ulation

x1 ∈ R6, . . . , xn ∈ R6

The goal of our approach is to

  • 1. learn models describing the relationship

between two object parts

  • 2. infer the kinematic topology of the scene

(which object parts are connected in which way)

Mij

M

Given: noisy 6D pose observations of n rigid object parts

slide-4
SLIDE 4

Training a Model

Find a model that describes the connection

between two object parts (i and j)

Maximize the likelihood of the observations given

the model

Mij

Noisy observations Consider a sequence of

T observed relative transforms of , denoted by

ˆ Mij = argmaxMij p(Dij | Mij) Dij = (z1

ij, . . . , zT ij)

xi ª xj

slide-5
SLIDE 5

Tem plate Models

Rigid model Prismatic model Rotational model Non-parametric model

(“LLE/ GP model”)

Mprismatic Mrigid Mrotational MLLE/GP

slide-6
SLIDE 6

Observation Likelihood ( 1 )

What is the likelihood of an observation given a

model?

Models for articulated objects typically have a

latent action variable a (e.g., door opening angle)

For computing the observation likelihood, we

integrate over a

p(zij | Mij) = R

a p(zij | a, Mij) p(a | Mij) da

p(zij | Mij) = ? p(zij | a, Mij) ∝ exp ¡ −||fMij(a) ª zij||2/l ¢

slide-7
SLIDE 7

Observation Likelihood ( 2 )

Assumption: all latent actions have the same

likelihood

Integration might be time consuming Efficiently estimate value of latent action variable! In our settings, this works well in practice for all

  • bserved models

p(zij | Mij) ∝ R

a p(zij | a, Mij) da

p(zij | Mij) ≈ maxa p(zij | a, Mij)

slide-8
SLIDE 8

Draw er: Moves on Line Segm ent

Estimate

axis e of movement

(using PCA)

Estimate latent action Transformation function:

ˆ at

ij = e · trans(zt ij ª z1 ij)

fMprismatic

ij

(a) = z1

ij ⊕ ae

slide-9
SLIDE 9

Param etric Model for a Door

Estimate:

axis of rotation n center of rotation c rigid transform r

fMrotational

ij

(a) = [c; n]T ⊕ rotZ(a) ⊕ r

Estimate latent action Transformation function:

slide-10
SLIDE 10

Garage door: A Tw o-link Joint

Garage door runs in a

vertical and a horizontal slider

Neither rotational, nor

prismatic motion

There are objects

which cannot be explained well by “standard” models

slide-11
SLIDE 11

LLE/ GP: Non-param etric Model

For a articulation model, we need

Estimate the latent action a Predict the expected transform

Assume that the data lies on

(or close to) a low dimensional manifold in

Find latent low dimensional coordinates on

the manifold dimensionality reduction using locally linear embedding (LLE)

Then learn a Gaussian process regression

for the transformation function

R6 f(a) = zij

slide-12
SLIDE 12

The LLE/ GP Model Sum m ary

Given: 6D obs. Find low-dim. manifold Non-linear regression

zij ∈ R6 zij

LLE

→ aij ∈ Rd (aij, zij) → GP a0 GP → z0

Result: a mapping from the manifold

coordinate system into the observations space

f(a) = zij

slide-13
SLIDE 13

Higher Dim ensional Manifolds

LLE can find the

embedding for any given dimensionality (d> 1)

GPs naturally take multi-

dimensional input data Approach directly applicable

slide-14
SLIDE 14

Finding the Topology of the Scene

Consider all possible models at the same time Then select the models that maximize

the data likelihood while minimizing the overall complexity

The model selection and inference of the topology

is done by computing a minimum spanning tree

1 kDtest

ij

k log p(Dtest ij

| Mtype

ij

) + C(Mtype

ij

) costMtype

ij

=

slide-15
SLIDE 15

Experim ents

Microwave door [ PhaseSpace] Cabinet with two drawers [ PhaseSpace] Garage door [ Simulated] Table [ ARToolkit]

slide-16
SLIDE 16

Microw ave Door: Observations

slide-17
SLIDE 17

Microw ave Door: Learned Model

slide-18
SLIDE 18

Microw ave Door: Error Analysis

slide-19
SLIDE 19

Cabinet w ith Tw o Draw ers

slide-20
SLIDE 20

Cabinet: Learned Models

slide-21
SLIDE 21

Garage Door: Error Analysis

slide-22
SLIDE 22

Table

slide-23
SLIDE 23

Table

slide-24
SLIDE 24

Table

slide-25
SLIDE 25

Novel approach for learning kinematic models for

articulated objects

Uses a candidate set of parametric and non-

parametric models

Parametric (rigid, prismatic, rotation) Non-parametric models (LLE/ GP)

Infers the connectivity of the object parts by

means of a minimum spanning tree Towards understanding space in domestic environments

Conclusions

slide-26
SLIDE 26

Future W ork

Closed kinematic chains Pose registration with natural features Plan trajectories for door/ drawer opening

slide-27
SLIDE 27

Thanks for Your Attention