Improving Generative Imagination in Object-Centric World Models - - PowerPoint PPT Presentation

improving generative imagination in object centric world
SMART_READER_LITE
LIVE PREVIEW

Improving Generative Imagination in Object-Centric World Models - - PowerPoint PPT Presentation

Improving Generative Imagination in Object-Centric World Models Zhixuan Lin, Yi-Fu Wu, Skand, Bofeng Fu, Jindong Jiang, Sungjin Ahn Object-Centric Temporal Generative Models STOVE SCALOR SILOT OP3 (Kossen et al., 2019) (Jiang et al., 2019)


slide-1
SLIDE 1

Improving Generative Imagination in Object-Centric World Models

Zhixuan Lin, Yi-Fu Wu, Skand, Bofeng Fu, Jindong Jiang, Sungjin Ahn

slide-2
SLIDE 2

Object-Centric Temporal Generative Models

STOVE (Kossen et al., 2019) SCALOR (Jiang et al., 2019) SILOT (Crawford & Pineau, 2020) OP3 (Veerapaneni et al., 2019)

slide-3
SLIDE 3

What’s Missing

  • Interaction
  • Occlusion
  • Scalability
  • Multimodal Uncertainty
  • Situation Awareness
slide-4
SLIDE 4

G-SWM: Generative Structured World Models

Context Objects Time

slide-5
SLIDE 5

G-SWM: Generative Process

Object Dynamics (Versatile Propagation) Context Dynamics Rendering

slide-6
SLIDE 6

Dynamics latent: multimodality Attribute latents: position, size, appearance Object attributes: position, size, appearance, ... Object-state RNN Versatile propagation: the core

Versatile Propagation

slide-7
SLIDE 7

Versatile Propagation

Interaction Situation Awareness Multimodality

slide-8
SLIDE 8

Object Interaction: Graph Neural Network

GNN:

slide-9
SLIDE 9

Situation Awareness: Attention on Environment

?

AOE: Attention on Environment

slide-10
SLIDE 10

Multimodal Uncertainty: Hierarchical Dynamics

Explicit representation, e.g. position (x, y)

slide-11
SLIDE 11

Summary of Versatile Propagation

slide-12
SLIDE 12

Inference: Scalable Discovery and Propagation

slide-13
SLIDE 13

Scalability, Occlusion and Interaction

slide-14
SLIDE 14

Situation Awareness and Multimodal Uncertainty

slide-15
SLIDE 15

Ablation Study

slide-16
SLIDE 16

3D Interactions

slide-17
SLIDE 17

Summary

  • G-SWM:

○ Object-centric ○ Interaction ○ Multimodal uncertainty ○ Situation awareness ○ Occlusion Handling ○ Scalability

  • Limitations

○ Quite complex ○ Many assumptions and priors

slide-18
SLIDE 18

Thank you!