Neurosymbolic 3D Models: Learning to Generate 3D Shape Programs - - PowerPoint PPT Presentation

neurosymbolic 3d models learning to generate 3d shape
SMART_READER_LITE
LIVE PREVIEW

Neurosymbolic 3D Models: Learning to Generate 3D Shape Programs - - PowerPoint PPT Presentation

Neurosymbolic 3D Models: Learning to Generate 3D Shape Programs Daniel Ritchie This guy! WHO AM I? Brown n Un Univer ersi sity ty Located in Providence, Rhode Island #14 University in the US (US News) Brown wn Comp mput uter


slide-1
SLIDE 1

Neurosymbolic 3D Models: Learning to Generate 3D Shape Programs

Daniel Ritchie

slide-2
SLIDE 2

WHO AM I?

This guy!

slide-3
SLIDE 3

Brown n Un Univer ersi sity ty

  • Located in Providence, Rhode Island
  • #14 University in the US (US News)
slide-4
SLIDE 4

Brown wn Comp mput uter Sci er Scien ence ce Dep Depar artme ment nt

  • 37 full-time faculty
  • 2-year Masters program
  • Fully-funded PhD program (5 years)
  • #25 for CS Graduate Study (US News)
slide-5
SLIDE 5

Brown Visual Computing

  • Nine (9) faculty
  • Active research in graphics,

vision, HCI, visualization, ...

  • Regularly publish in top visual

computing venues (SIGGRAPH, CVPR, ICCV, ...)

http://visual.cs.brown.edu/

slide-6
SLIDE 6

Brown Visual Computing

  • An

Andy y van Dam: co-founder of ACM SICGRAPH (pre-cursor to SIGGRAPH)

http://visual.cs.brown.edu/

slide-7
SLIDE 7

Brown Visual Computing

  • An

Andy y van Dam & Sp Spike ke Hughes: s: Authors of “Computer Graphics: Principles and Practice”

http://visual.cs.brown.edu/

slide-8
SLIDE 8

My Research (Broadly)

Computer er Graphics hics AI + I + ML

slide-9
SLIDE 9

My Research (Specifically)

Gene nera rate te Infer fer

3D Structures

  • Objects
  • Scenes
  • ...

Generative Models

  • Programs
  • Deep Networks
  • ...

What are ne neurosymbolic 3D models, and how do they relate to all of this?

slide-10
SLIDE 10

FIRST, A LITTLE BACKGROUND & MOTIVATION...

slide-11
SLIDE 11

Increasing Demand for 3D Content

Traditional driver: Entertainment (Games, VR, ...)

slide-12
SLIDE 12

Increasing Demand for 3D Content

12

E-Commerce (esp. furniture / interior design)

slide-13
SLIDE 13

Increasing Demand for 3D Content

New driver: Artificial Intelligence (“Graphics for AI”)

3D Scene Semantic Segments

slide-14
SLIDE 14

Increasing Demand for 3D Content

New driver: Artificial Intelligence (“Graphics for AI”)

slide-15
SLIDE 15

Increasing Demand for 3D Content

New driver: Artificial Intelligence (“Graphics for AI”)

Learning to Generalize Kinematic Models to Novel Objects, Abbatematteo et al. 2019

slide-16
SLIDE 16

Current Practice Can’t Meet Demand

Mannual 3D modeling: still slow, still hard to learn

Maya Solidworks

slide-17
SLIDE 17

Current Practice Can’t Meet Demand

Mannual 3D modeling: still slow, still hard to learn

Maya Solidworks

“The difficulty of generating images has been overwhelmed by a five-thousand-fold improvement in price/performance of computing. What remains hard is modeling…the grand challenges in three- dimensional graphics are to mak ake sim simple modeli ling eas asy and to mak ake complex modelin ling ac accessible le to far ar more re people.”

— Bob Sproull, 1990

slide-18
SLIDE 18

Generative Models to the Rescue!?

For the purposes of this talk: Generative model: a procedure which can be executed to generate novel instances of some 3D object class

slide-19
SLIDE 19

Benefits of Generative Models

3D content generation at scale

SpeedTree, Unreal Engine CityEngine

slide-20
SLIDE 20

Benefits of Generative Models

Explore modeling possibilities

Learning Implicit Fields for Generative Shape Modeling , Chen & Zhang 2019

slide-21
SLIDE 21

Benefits of Generative Models

Strong prior for vision systems

StructureNet: Hierarchical Graph Networks for 3D Shape Generation, Mo et al. 2019

slide-22
SLIDE 22

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction

Advanced Procedural Modeling of Architecture, Schwartz & Muller 2015

slide-23
SLIDE 23

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Advanced Procedural Modeling of Architecture, Schwartz & Muller 2015

slide-24
SLIDE 24

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Cons:

  • Difficult to author

Advanced Procedural Modeling of Architecture, Schwartz & Muller 2015

slide-25
SLIDE 25

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Cons:

  • Difficult to author
  • Limited output variety

Learning to Generalize Kinematic Models to Novel Objects, Abbatematteo et al. 2019

slide-26
SLIDE 26

Two Classes of Generative Model

De Deep ep Genera nerati tive ve Mod

  • dels

els Pros:

  • Variety (any class of shape)
  • Easy to author (“just add data”)

Learning Implicit Fields for Generative Shape Modeling , Chen & Zhang 2019

slide-27
SLIDE 27

Recent High-Profile Successes

3D-GAN Octree Generating Nets PointFlow AtlasNet Pixel2Mesh IM-Net

slide-28
SLIDE 28

Two Classes of Generative Model

De Deep ep Genera nerati tive ve Mod

  • dels

els Pros:

  • Variety (any class of shape)
  • Easy to author (“just add data”)

Cons:

  • Inconsistent output quality
  • Inscrutable representation
slide-29
SLIDE 29

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Cons:

  • Difficult to author
  • Limited output variety

De Deep ep Genera nerati tive ve Mod

  • dels

els Pros:

  • Variety (any class of shape)
  • Easy to author (“just add data”)

Cons:

  • Inconsistent output quality
  • Inscrutable representation

Ho How ca can we e ge get all all of

  • f the

these...

slide-30
SLIDE 30

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Cons:

  • Difficult to author
  • Limited output variety

De Deep ep Genera nerati tive ve Mod

  • dels

els Pros:

  • Variety (any class of shape)
  • Easy to author (“just add data”)

Cons:

  • Inconsistent output quality
  • Inscrutable representation

...with no none of

  • f th

these? Ho How ca can we e ge get all all of

  • f the

these...

slide-31
SLIDE 31

Generative Models Capture Variaton

Some modes can easily ily b be expressed symbolic licall lly:

  • Hierarchy

StructureNet: Hierarchical Graph Networks for 3D Shape Generation, Mo et al. 2019

slide-32
SLIDE 32

Generative Models Capture Variaton

Some modes can easily ily b be expressed symbolic licall lly:

  • Hierarchy
  • Connectivity

GRASS: Generative Recursive Autoencoders for Shape Structures, Li et al. 2018

slide-33
SLIDE 33

Generative Models Capture Variaton

Some modes can easily ily b be expressed symbolic licall lly:

  • Hierarchy
  • Connectivity
  • Symmetry
  • ...

GRASS: Generative Recursive Autoencoders for Shape Structures, Li et al. 2018

slide-34
SLIDE 34

Generative Models Capture Variaton

Some modes are hard to express symbolic licall lly:

  • Fine-detailed geometry

Learning Implicit Fields for Generative Shape Modeling , Chen & Zhang 2019

slide-35
SLIDE 35

Generative Models Capture Variaton

Some modes are hard to express symbolic licall lly:

  • Fine-detailed geometry
  • Complex inter-part correlations
  • ...

Learning Implicit Fields for Generative Shape Modeling , Chen & Zhang 2019

slide-36
SLIDE 36

Generative Models Capture Variaton

Some modes can easily ily b be expressed symbolic licall lly:

  • Hierarchy
  • Connectivity
  • Symmetry
  • ...

Some modes are hard to express symbolic licall lly:

  • Fine-detailed geometry
  • Complex inter-part correlations
  • ...

Design Philosophy: Use symbols where possible Use neural nets for everything else

slide-37
SLIDE 37

Neurosymboli lic 3D Model: A generative model of a class of 3D objects which models some modes of variability via explicit symbols and others via a neural latent space

slide-38
SLIDE 38

Neurosymbolic 3D Model Design Space

Neurosymbolic models of shape structure

This talk

slide-39
SLIDE 39

NEUROSYMBOLIC MODELS OF SHAPE STRUCTURE

slide-40
SLIDE 40

What Do I Mean by Shape Structure?

  • Parts (as oriented bounding boxes)
  • Relations
  • Hierarchy, connectivity, symmetry, ...
  • Useful despite low geometric detail
  • Ex: robot motion planning  infer all

parts + relations given point cloud

  • bservation
  • Focus on manufactured objects
  • E.g. chairs, tables, airplanes...

StructureNet: Hierarchical Graph Networks for 3D Shape Generation, Mo et al. 2019

slide-41
SLIDE 41

What Do I Mean by Shape Structure?

  • Parts (as oriented bounding boxes)
  • Relations
  • Hierarchy, connectivity, symmetry, ...
  • Useful despite low geometric detail
  • Ex: robot motion planning  infer all

parts + relations given point cloud

  • bservation
  • Focus on manufactured objects
  • E.g. chairs, tables, airplanes...
  • Can extend to organic objects via e.g.

generalized cylinder decomposition

Generalized Cylinder Decomposition, Zhou et al. 2015

slide-42
SLIDE 42

The “Holy Grail” of Structure Modeling

A single, interpretable procedural model that generates the structures of every object in a given shape class (e.g. chairs, airplanes) But...

slide-43
SLIDE 43

Two Classes of Generative Model

Proc

  • ced

edural ural Mod

  • dels

els Pros:

  • High quality output by construction
  • Interpretable & editable

Cons:

  • Difficult to author
  • Limited output variety

Can an a a str strategic use use of

  • f neur

neural ne nets elim elimin inate the these?

slide-44
SLIDE 44

Eliminating Procedural Cons

Prob

  • blem

em: : Hard to author Sol

  • lution:

tion: Train a neural net to write them for us Prob

  • blem

em: Limited output variety Sol

  • lution:

tion: Latent space of neural net will capture the variability that the symbolic program does not

slide-45
SLIDE 45

[SIGGRAPH Asia 2020]

slide-46
SLIDE 46

A Neurosymbolic 3D Modeling Pipeline

slide-47
SLIDE 47

ShapeAssembly

An “assembly language” for part-based shapes

Low-level instructions Operates by assembling parts

slide-48
SLIDE 48

Anatomy of a ShapeAssembly Program

slide-49
SLIDE 49

Execution Semantics

slide-50
SLIDE 50

Execution Semantics

Semantics of attach

slide-51
SLIDE 51

Execution Semantics

slide-52
SLIDE 52

Execution Semantics

slide-53
SLIDE 53

Execution Semantics

Macros:

  • s:

squeeze, reflect, translate expand into multiple Cuboid + attach statements

slide-54
SLIDE 54

Execution Semantics

Diff fferen erentiab tiable le execut ution ion: Output geometry is differentiable with respect to continuous parameters of input program

slide-55
SLIDE 55

A Neurosymbolic 3D Modeling Pipeline

slide-56
SLIDE 56

A Neurosymbolic 3D Modeling Pipeline

slide-57
SLIDE 57

Extracting Programs from Shapes

Local region of an input hierarchical part graph “Chair back” “Chair back side bars” “Chair back center slats”

slide-58
SLIDE 58

Extracting Programs from Shapes

Locally flattening the hierarchy to make interacting leaf parts siblings

slide-59
SLIDE 59

Extracting Programs from Shapes

Shortening leaf parts that intersect

  • ther leaf parts
slide-60
SLIDE 60

Extracting Programs from Shapes

Locating attachment points between parts

slide-61
SLIDE 61

Extracting Programs from Shapes

Forming leaf parts into symmetry groups

slide-62
SLIDE 62

Extracting Programs from Shapes

Or Ordering ng At Attachment ents

  • Due to imperative semantics,

attach order matters

  • Heuristics to prune possible
  • rders, then check which one

produces output that best fits the shape

slide-63
SLIDE 63

A Neurosymbolic 3D Modeling Pipeline

slide-64
SLIDE 64

A Neurosymbolic 3D Modeling Pipeline

slide-65
SLIDE 65

Learning to Write ShapeAssembly Programs

slide-66
SLIDE 66

Learning to Write ShapeAssembly Programs

slide-67
SLIDE 67

WHAT CAN YOU DO WITH IT?

slide-68
SLIDE 68

Novel Shape Generation

slide-69
SLIDE 69

Novel Shape Generation

slide-70
SLIDE 70

Novel Shape Generation

slide-71
SLIDE 71

Editing Generated Programs

slide-72
SLIDE 72

Editing Generated Programs

slide-73
SLIDE 73

Comparison Conditions

  • 3D PR

D PRNN NN: Sequence of boxes, but no hierarchy or relations

  • Str

tructureNet ctureNet: Hierarchy of boxes w/ symmetry relations, but no explicit parametric attachments

slide-74
SLIDE 74

Ours Generates Better Novel Shapes

slide-75
SLIDE 75

Ours Generates Better Novel Shapes

slide-76
SLIDE 76

Ours Generates Better Novel Shapes

Ours are also quantifiably more compact, physically stable, and distributionally similar to a held-out validation set

slide-77
SLIDE 77

Ours Produces Better Interpolation

Our interpolations are quantifiably smoother, in terms of both structure and geometry

slide-78
SLIDE 78

Point Cloud “Parsing”

slide-79
SLIDE 79

Point Cloud “Parsing”

slide-80
SLIDE 80

Point Cloud “Parsing”

slide-81
SLIDE 81

WHAT’S NEXT?

slide-82
SLIDE 82

Neurosymbolic 3D Model Design Space

Neurosymbolic models of shape structure

This talk

slide-83
SLIDE 83

Neurosymbolic 3D Model Design Space

Neurosymbolic models of shape structure Neurosymbolic models of part geometry Neurosymbolic models

  • f surface appearance

(e.g. texture) Neurosymbolic models of part functionalities

Joint generat rative ve models dels which couple ple all o ll of these ese

slide-84
SLIDE 84

THANKS!