Tow ards Understanding Articulated Objects Jrgen Sturm 1 Cyrill - - PowerPoint PPT Presentation

▶

Apr 30, 2023 152 likes •438 views

Tow ards Understanding Articulated Objects Jrgen Sturm 1 Cyrill Stachniss 1 Vijay Pradeep 2 Christian Plagemann 3 Kurt Konolige 2 Wolfram Burgard 1 1 University of Freiburg 2 Willow Garage 3 Stanford University SA-1 Motivation Domestic

SLIDE 1

SA-1

Tow ards Understanding Articulated Objects

Jürgen Sturm 1 Cyrill Stachniss1 Vijay Pradeep2 Christian Plagemann3 Kurt Konolige2 Wolfram Burgard1

1University of Freiburg 2Willow Garage 3Stanford University

SLIDE 2

Motivation

Domestic

environments

Articulated objects

Doors Drawers

Learning models of

bjects

Understanding the

spatial movements can improve

Perception Manipulation planning Navigation

SLIDE 3

Problem Form ulation

x1 ∈ R6, . . . , xn ∈ R6

The goal of our approach is to

1. learn models describing the relationship

between two object parts

2. infer the kinematic topology of the scene

(which object parts are connected in which way)

Mij

M

Given: noisy 6D pose observations of n rigid object parts

SLIDE 4

Training a Model

Find a model that describes the connection

between two object parts (i and j)

Maximize the likelihood of the observations given

the model

Mij

Noisy observations Consider a sequence of

T observed relative transforms of , denoted by

ˆ Mij = argmaxMij p(Dij | Mij) Dij = (z1

ij, . . . , zT ij)

xi ª xj

SLIDE 5

Tem plate Models

Rigid model Prismatic model Rotational model Non-parametric model

(“LLE/ GP model”)

Mprismatic Mrigid Mrotational MLLE/GP

SLIDE 6

Observation Likelihood ( 1 )

What is the likelihood of an observation given a

model?

Models for articulated objects typically have a

latent action variable a (e.g., door opening angle)

For computing the observation likelihood, we

integrate over a

p(zij | Mij) = R

a p(zij | a, Mij) p(a | Mij) da

p(zij | Mij) = ? p(zij | a, Mij) ∝ exp ¡ −||fMij(a) ª zij||2/l ¢

SLIDE 7

Observation Likelihood ( 2 )

Assumption: all latent actions have the same

likelihood

Integration might be time consuming Efficiently estimate value of latent action variable! In our settings, this works well in practice for all

bserved models

p(zij | Mij) ∝ R

a p(zij | a, Mij) da

p(zij | Mij) ≈ maxa p(zij | a, Mij)

SLIDE 8

Draw er: Moves on Line Segm ent

Estimate

axis e of movement

(using PCA)

Estimate latent action Transformation function:

ˆ at

ij = e · trans(zt ij ª z1 ij)

fMprismatic

(a) = z1

ij ⊕ ae

SLIDE 9

Param etric Model for a Door

Estimate:

axis of rotation n center of rotation c rigid transform r

fMrotational

(a) = [c; n]T ⊕ rotZ(a) ⊕ r

Estimate latent action Transformation function:

SLIDE 10

Garage door: A Tw o-link Joint

Garage door runs in a

vertical and a horizontal slider

Neither rotational, nor

prismatic motion

There are objects

which cannot be explained well by “standard” models

SLIDE 11

LLE/ GP: Non-param etric Model

For a articulation model, we need

Estimate the latent action a Predict the expected transform

Assume that the data lies on

(or close to) a low dimensional manifold in

Find latent low dimensional coordinates on

the manifold dimensionality reduction using locally linear embedding (LLE)

Then learn a Gaussian process regression

for the transformation function

R6 f(a) = zij

SLIDE 12

The LLE/ GP Model Sum m ary

Given: 6D obs. Find low-dim. manifold Non-linear regression

zij ∈ R6 zij

LLE

→ aij ∈ Rd (aij, zij) → GP a0 GP → z0

Result: a mapping from the manifold

coordinate system into the observations space

f(a) = zij

SLIDE 13

Higher Dim ensional Manifolds

LLE can find the

embedding for any given dimensionality (d> 1)

GPs naturally take multi-

dimensional input data Approach directly applicable

SLIDE 14

Finding the Topology of the Scene

Consider all possible models at the same time Then select the models that maximize

the data likelihood while minimizing the overall complexity

The model selection and inference of the topology

is done by computing a minimum spanning tree

−

1 kDtest

k log p(Dtest ij

| Mtype

ij

) + C(Mtype

ij

) costMtype

=

SLIDE 15

Experim ents

Microwave door [ PhaseSpace] Cabinet with two drawers [ PhaseSpace] Garage door [ Simulated] Table [ ARToolkit]

SLIDE 16

Microw ave Door: Observations

SLIDE 17

Microw ave Door: Learned Model

SLIDE 18

Microw ave Door: Error Analysis

SLIDE 19

Cabinet w ith Tw o Draw ers

SLIDE 20

Cabinet: Learned Models

SLIDE 21

Garage Door: Error Analysis

SLIDE 22

Table

SLIDE 23

Table

SLIDE 24

Table

SLIDE 25

Novel approach for learning kinematic models for

articulated objects

Uses a candidate set of parametric and non-

parametric models

Parametric (rigid, prismatic, rotation) Non-parametric models (LLE/ GP)

Infers the connectivity of the object parts by

means of a minimum spanning tree Towards understanding space in domestic environments

Conclusions

SLIDE 26

Future W ork

Closed kinematic chains Pose registration with natural features Plan trajectories for door/ drawer opening

SLIDE 27