Form2Fit: Learning Shape Priors for Generalizable Assembly from Disassembly
Kevin Zakka, Andy Zeng, Johnny Lee, Shuran Song
Google Research, Stanford University, Columbia University
includes video narration
kit assembly everyday interactions
state-of-the-art robo-kitting solution: Choi et al. (CAD Model)
can we endow them with generalization abilities?
Form2Fit
through Shape Matching & Self-Supervision
94% novel configurations
86% novel objects & kits (never before seen)
~12 hours training
Kit Assembly → Shape Matching
Assembly from Disassembly (64x)
<< rewind
grayscale-depth heightmaps are generated from 3D pointcloud data
Kit Heightmap Object Heightmap
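To make the heightmap step concrete, here is a minimal sketch of one common way to build a top-down heightmap from a point cloud: project the points onto a grid and keep the highest z value per cell. The function name, workspace bounds, and pixel size are assumptions for illustration and are not taken from the talk. Only the depth channel is shown; a grayscale channel could be projected the same way.

```python
import numpy as np

def pointcloud_to_heightmap(points, x_range, y_range, pixel_size):
    """Project an (N, 3) point cloud onto a top-down grid, keeping the max height per cell.

    Assumes heights are measured relative to the table, so empty cells stay at 0.
    """
    width = int(np.ceil((x_range[1] - x_range[0]) / pixel_size))
    height = int(np.ceil((y_range[1] - y_range[0]) / pixel_size))
    heightmap = np.zeros((height, width), dtype=np.float32)

    # Keep only the points that fall inside the workspace bounds.
    inside = (
        (points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1])
        & (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1])
    )
    pts = points[inside]

    # Convert metric x/y coordinates to integer pixel indices.
    cols = ((pts[:, 0] - x_range[0]) / pixel_size).astype(int)
    rows = ((pts[:, 1] - y_range[0]) / pixel_size).astype(int)
    cols = np.clip(cols, 0, width - 1)
    rows = np.clip(rows, 0, height - 1)

    # np.maximum.at handles several points landing in the same cell.
    np.maximum.at(heightmap, (rows, cols), pts[:, 2].astype(np.float32))
    return heightmap
```

The same function would be called twice, once on the points over the kit and once on the points over the loose objects, to obtain the two heightmaps.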
suction network ingests object heightmap and outputs suction heatmap
place network ingests kit heightmap and outputs place heatmap
corresponding pick and place candidates
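The talk does not say how candidates are read off the suction and place heatmaps. One simple option, sketched here as an assumption, is to take the k highest-scoring pixels of each heatmap. The helper name `top_k_candidates` and the value of k are hypothetical.

```python
import numpy as np

def top_k_candidates(heatmap, k):
    """Return the (row, col) coordinates of the k highest-scoring pixels, best first.

    Expects k >= 1; k is capped at the number of pixels.
    """
    flat = heatmap.ravel()
    k = min(k, flat.size)
    # argpartition finds the k largest entries without a full sort.
    idx = np.argpartition(flat, -k)[-k:]
    # Sort only those k entries by descending score.
    idx = idx[np.argsort(flat[idx])[::-1]]
    rows, cols = np.unravel_index(idx, heatmap.shape)
    return np.stack([rows, cols], axis=1)
```

Applied to the suction heatmap this yields pick candidates, and applied to the place heatmap it yields place candidates.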
matching network ingests heightmaps and outputs descriptor maps (pixel-wise descriptors)
closer descriptor distances indicate better object-to-placement correspondences
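A minimal sketch of this idea, assuming the descriptor maps are arrays of shape (H, W, D) and that "distance" means Euclidean (L2) distance: given a pick pixel on the object, find the kit pixel whose descriptor is closest. The function name and the choice of L2 are assumptions for illustration.

```python
import numpy as np

def best_match(object_desc, kit_desc, pick_pixel):
    """Find the kit pixel whose descriptor is closest (L2) to the one at pick_pixel.

    object_desc, kit_desc: (H, W, D) arrays of pixel-wise descriptors.
    Returns ((row, col), distance).
    """
    query = object_desc[pick_pixel[0], pick_pixel[1]]      # (D,)
    dists = np.linalg.norm(kit_desc - query, axis=-1)      # (H, W)
    row, col = np.unravel_index(np.argmin(dists), dists.shape)
    return (int(row), int(col)), float(dists[row, col])
```

A small distance suggests the pick pixel and the returned kit pixel correspond; a large minimum distance suggests no good placement was found for that pick.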
descriptors are rotation-sensitive (diagram label: × 20)
planner integrates information to produce suction/place poses & end-effector rotation
Planner outputs: p (Pick Position), q (Place Position), θ (end-effector rotation)
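The talk does not spell out the planner's search, so the following is only a sketch under stated assumptions: it exhaustively scores every (rotation, pick, place) triple by descriptor distance and returns the closest one. It assumes one object descriptor map per candidate rotation, all indexed in the same pixel frame as the pick candidates, and uses L2 distance. The function name and array layout are hypothetical.

```python
import numpy as np

def plan(pick_candidates, place_candidates, object_descs, kit_desc, angles):
    """Score all (theta, p, q) triples by descriptor distance and return the best.

    pick_candidates:  (P, 2) (row, col) pixels in the object heightmap.
    place_candidates: (Q, 2) (row, col) pixels in the kit heightmap.
    object_descs:     (R, H, W, D) descriptors, one map per candidate rotation.
    kit_desc:         (H, W, D) descriptors of the kit heightmap.
    angles:           (R,) rotation angles matching object_descs.
    """
    picks = np.asarray(pick_candidates)
    places = np.asarray(place_candidates)
    # Gather descriptors at the candidate pixels.
    obj = object_descs[:, picks[:, 0], picks[:, 1]]    # (R, P, D)
    kit = kit_desc[places[:, 0], places[:, 1]]         # (Q, D)
    # L2 distance for every rotation, pick, and place: shape (R, P, Q).
    dists = np.linalg.norm(obj[:, :, None, :] - kit[None, None, :, :], axis=-1)
    r, p, q = np.unravel_index(np.argmin(dists), dists.shape)
    pick = tuple(int(v) for v in picks[p])
    place = tuple(int(v) for v in places[q])
    return pick, place, float(angles[r])
```

With the × 20 from the slides, `angles` could be `np.arange(20) * 18.0` for evenly spaced rotations; the even 18-degree spacing is an assumption.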
500 disassembly sequences (~8 to 10 hours) for each kit (12x)
suction network predicts a suction candidate
place pose randomly generated (q, θ)
kit is secured to table to prevent accidental displacement from bad suction grasps
place point ground-truth obtained from suction
suction point ground-truth obtained from place
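The relabeling described in the last two captions can be sketched as a time reversal of each disassembly step: where the object was suctioned out of the kit becomes the place label, and where it was put down becomes the suction label. The class and field names below are hypothetical, and negating the rotation to undo the disassembly rotation is an assumption about the angle convention.

```python
from dataclasses import dataclass
from typing import List, Tuple

Pixel = Tuple[int, int]

@dataclass
class DisassemblyStep:
    suction_in_kit: Pixel    # where the object was picked out of the kit
    place_outside: Pixel     # randomly generated place position q
    place_rotation: float    # randomly generated rotation theta, in degrees

@dataclass
class AssemblyLabel:
    suction: Pixel           # ground-truth pick pixel on the loose object
    place: Pixel             # ground-truth place pixel inside the kit
    rotation: float          # rotation that undoes the disassembly rotation

def reverse_sequence(steps: List[DisassemblyStep]) -> List[AssemblyLabel]:
    """Rewind a disassembly sequence into assembly labels (last removed, first inserted)."""
    labels = []
    for step in reversed(steps):
        labels.append(AssemblyLabel(
            suction=step.place_outside,      # suction label comes from the place
            place=step.suction_in_kit,       # place label comes from the suction
            rotation=-step.place_rotation,
        ))
    return labels
```

No human annotation is involved: every label is a byproduct of an action the robot already executed.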
dense correspondence ground-truth obtained from robot motion
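One way to read this caption: because the robot's motion is known, every pixel the object covered inside the kit can be mapped to where it lands after the move, giving a pixel-to-pixel correspondence for free. The sketch below assumes a planar rigid motion (rotation about the suction pixel, then translation to the place pixel). The function name and the sign convention of the rotation in (row, col) coordinates are assumptions.

```python
import numpy as np

def dense_correspondences(kit_pixels, suction_px, place_px, theta_deg):
    """Map object pixels seen in the kit to their positions after the robot moves the object.

    kit_pixels: (N, 2) array of (row, col) pixels covered by the object while in the kit.
    Returns an (N, 2) float array of matching (row, col) positions after placement.
    """
    theta = np.deg2rad(theta_deg)
    rot = np.array([[np.cos(theta), -np.sin(theta)],
                    [np.sin(theta),  np.cos(theta)]])
    # Express each pixel relative to the suction point, rotate, then shift to the place point.
    offsets = np.asarray(kit_pixels, dtype=float) - np.asarray(suction_px, dtype=float)
    return offsets @ rot.T + np.asarray(place_px, dtype=float)
```

Each input pixel paired with its output position forms one ground-truth correspondence between the kit heightmap and the object heightmap.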
model trained and tested on each kit
12x 12x 12x 12x 12x
model trained on 2 kits: floss and tape
Individual, Multiple, Mixture (64x)
never before seen animals
4x
descriptors encode object orientation
same orientation / different rotation
descriptors encode spatial correspondence
same points share similar descriptors
descriptors encode object identity
unique descriptor for different objects
180º rotational flips
12x
quasi-static environment
For details, videos and code, visit: https://form2fit.github.io