Visual Grounding of Learned Physical Models, ICML 2020, Yunzhu Li. PowerPoint PPT Presentation



SLIDE 1

Visual Grounding of Learned Physical Models

http://visual-physics-grounding.csail.mit.edu/ (* indicates equal contribution)

Yunzhu Li, Toru Lin*, Kexin Yi*, Daniel M. Bear, Daniel L.K. Yamins, Jiajun Wu, Joshua B. Tenenbaum, Antonio Torralba

ICML 2020

SLIDE 2

Intuitive Physics

(1) Distinguish between different instances (2) Recognize objects’ physical properties (3) Predict future movements

(Wu et al., Learning to See Physics via Visual De-animation)


SLIDE 4

For example

Different physical parameters lead to different motions. We estimate the physical parameters by comparing mental simulation with observation.

(Video panels: larger vs. smaller stiffness; larger vs. smaller gravity)
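The idea of estimating a physical parameter by comparing mental simulation with observation can be sketched in miniature. Everything below (the 1-D spring simulator, the candidate grid, the function names) is a hypothetical illustration, not the paper's actual model:

```python
import numpy as np

def simulate(stiffness, steps=50, dt=0.05):
    """Toy 1-D mass-on-a-spring rollout: a stand-in for a full
    mental simulation of the scene."""
    x, v = 1.0, 0.0
    traj = []
    for _ in range(steps):
        v += -stiffness * x * dt   # Hooke's law, unit mass
        x += v * dt
        traj.append(x)
    return np.array(traj)

def estimate_stiffness(observed, candidates):
    """Pick the stiffness whose simulated trajectory best matches
    the observed one (mean squared error)."""
    errors = [np.mean((simulate(k) - observed) ** 2) for k in candidates]
    return candidates[int(np.argmin(errors))]

observed = simulate(4.0)  # pretend this trajectory came from vision
best = estimate_stiffness(observed, [1.0, 2.0, 4.0, 8.0])
```

A larger stiffness produces faster oscillations, so only the matching candidate drives the trajectory error toward zero.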

SLIDE 5

Physical reasoning of deformable objects is challenging. A particle-based representation is general and flexible. We propose a model that jointly (1) estimates the physical properties and (2) refines the particle locations, using (a) a learned visual prior and (b) a learned dynamics prior.
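A particle-based scene state can be held in a small container like the following; the field names and array shapes are illustrative assumptions, not the paper's data format:

```python
import numpy as np
from dataclasses import dataclass

@dataclass
class ParticleState:
    positions: np.ndarray  # (N, 3) particle locations
    groups: np.ndarray     # (N,) instance id of each particle
    rigid: np.ndarray      # (K,) rigidness flag per instance

# Two instances: a rigid body and a deformable blob, 3 particles each.
state = ParticleState(
    positions=np.zeros((6, 3)),
    groups=np.array([0, 0, 0, 1, 1, 1]),
    rigid=np.array([True, False]),
)
num_instances = len(np.unique(state.groups))
```

The same container covers rigid bodies, elastic materials, and fluids; only the grouping and rigidness flags change.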

SLIDE 8

Visually Grounded Physics Learner (VGPL)

Visual Grounding

SLIDE 12

We evaluate our model in environments involving interactions between rigid objects, elastic materials, and fluids. Within a few observation steps, our model is able to (1) refine the state estimation and reason about the physical properties, and (2) make predictions into the future.

SLIDE 14

Related Work: Learning-based particle dynamics

• Li, Wu, Tedrake, Tenenbaum, Torralba. ICLR’19
• Mrowca, Zhuang, Wang, Haber, Fei-Fei, Tenenbaum, Yamins. NeurIPS’18
• Sanchez-Gonzalez, Godwin, Pfaff, Ying, Leskovec, Battaglia. ICML’20
• Ummenhofer, Prantl, Thuerey, Koltun. ICLR’20
• Battaglia, Pascanu, Lai, Rezende, Kavukcuoglu. NeurIPS’16

Questions remain: (1) How well do these models handle visual inputs? (2) How can they adapt to scenarios with unknown physical parameters?

SLIDE 16

Related Work: Differentiating through physics-based simulators

• Hu, Liu, Spielberg, Tenenbaum, Freeman, Wu, Rus, Matusik. ICRA’19
• Schenck, Fox. CoRL’18
• Degrave, Hermans, Dambre, Wyffels. Frontiers in Neurorobotics 2019
• Belbute-Peres, Smith, Allen, Tenenbaum, Kolter. NeurIPS’18
• Liang, Lin, Koltun. NeurIPS’19

Limitations remain: (1) these methods make strong assumptions about the structure of the system, (2) they are usually time-consuming, (3) they are prone to local optima, and (4) they lack ways to handle visual inputs.

SLIDE 18

Our Work

We propose the Visually Grounded Physics Learner (VGPL) to (1) bridge the perception gap, (2) enable physical reasoning from visual perception, and (3) perform dynamics-guided inference that directly predicts the optimization results, allowing quick adaptation to environments with unknown physical properties.

SLIDE 19

Problem Formulation

Consider a system that contains objects and particles. The formulation couples a visual prior, a dynamics prior, an inference module, and an objective function, defined over the following quantities:

• Visual observations
• Particle positions
• Instance grouping
• Rigidness of each instance
• Physical parameters
• Position refinement
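The objective couples the two priors: roll the dynamics prior forward from refined initial positions and penalize disagreement with the visual prior's per-step estimates. The sketch below uses a made-up one-parameter dynamics model purely to make that structure concrete; none of these functions are the paper's actual networks:

```python
import numpy as np

def dynamics_prior(x, param):
    """Hypothetical one-step dynamics: damped drift controlled by a
    single physical parameter (stand-in for the learned model)."""
    return x * (1.0 - 0.1 * param)

def objective(param, refinement, visual_estimates):
    """Discrepancy between the dynamics rollout (from refined initial
    positions) and the visual prior's estimate at every later step."""
    x = visual_estimates[0] + refinement
    loss = 0.0
    for target in visual_estimates[1:]:
        x = dynamics_prior(x, param)
        loss += np.mean((x - target) ** 2)
    return loss

# Synthetic "visual" estimates generated with true parameter 0.5:
est = [np.ones(4)]
for _ in range(5):
    est.append(dynamics_prior(est[-1], 0.5))
```

The true parameter minimizes the objective, while a wrong guess leaves a residual discrepancy.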

SLIDE 28

Visual Prior

From the visual observations, the visual prior predicts the particle locations and the instance grouping; an objective function measures the quality of these predictions.

SLIDE 31

Results of the Visual Prior

(Video panels: visual inputs alongside the model's predictions)

SLIDE 32

Dynamics Prior

Given the particle positions, the instance grouping, the rigidness of each instance, and the physical parameters, the dynamics prior predicts how the particles move.

Li, Wu, Tedrake, Tenenbaum, Torralba, “Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids,” ICLR’19
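Learned particle dynamics models of this family propagate pairwise "messages" between nearby particles. Here is a hand-coded toy analogue (simple repulsion between neighbors inside a connectivity radius); a real model would replace the hand-coded interaction with learned neural message functions:

```python
import numpy as np

def particle_step(pos, vel, dt=0.01, radius=0.5):
    """One toy dynamics step: each particle accumulates a repulsive
    contribution from every neighbor inside `radius`."""
    force = np.zeros_like(pos)
    for i in range(len(pos)):
        for j in range(len(pos)):
            if i == j:
                continue
            d = pos[j] - pos[i]
            r = np.linalg.norm(d)
            if r < radius:
                force[i] -= d / (r + 1e-6)  # push away from neighbor j
    vel = vel + dt * force
    return pos + dt * vel, vel

pos = np.array([[0.0, 0.0], [0.1, 0.0]])
vel = np.zeros_like(pos)
new_pos, new_vel = particle_step(pos, vel)
```

The two nearby particles repel, so their separation grows after the step; distant particles exchange no messages at all.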

SLIDE 35

Results of the Dynamics Prior

SLIDE 36

Dynamics-Guided Inference

Given the particle positions and the instance grouping, the inference module directly predicts the rigidness of each instance, the physical parameters, and a position refinement.
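Dynamics-guided inference can be pictured as a loop: roll out the dynamics prior with the current parameter estimate, measure the discrepancy against the observed steps, and update the estimate. Below, a finite-difference update stands in for the learned inference module (which, in the actual method, predicts the update directly rather than differentiating through a simulator); the decay model and all names are illustrative:

```python
import numpy as np

def rollout(x0, param, steps=5):
    """Toy dynamics prior: exponential decay with rate `param`."""
    return np.array([x0 * (1.0 - param) ** t for t in range(steps)])

def inference_step(param, observed, x0, lr=0.05, eps=1e-4):
    """Update the parameter estimate to shrink the rollout-vs-observation
    discrepancy (finite differences stand in for a learned predictor)."""
    loss = np.mean((rollout(x0, param) - observed) ** 2)
    loss_eps = np.mean((rollout(x0, param + eps) - observed) ** 2)
    return param - lr * (loss_eps - loss) / eps

observed = rollout(1.0, 0.3)   # generated with true parameter 0.3
param = 0.1                    # wrong initial guess
for _ in range(200):
    param = inference_step(param, observed, 1.0)
```

After a few observation steps' worth of discrepancy signal, the estimate converges to the true parameter, which is what enables quick adaptation to unseen scenarios.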

SLIDE 40

Results

We mainly investigate how accurate the following estimations are and whether they help with future prediction: (1) rigidness estimation, (2) parameter estimation, and (3) position refinement.

SLIDE 41

Qualitative results on Rigidness Estimation

SLIDE 42

Quantitative results on Rigidness Estimation

(Charts: mean accuracy)

SLIDE 43

Qualitative results on Parameter Estimation

SLIDE 44

Quantitative results on Parameter Estimation

SLIDE 45

Qualitative results on Position Refinement

SLIDE 46

Quantitative results on Position Refinement

SLIDE 47

Quantitative results on Future Prediction

SLIDE 48

In summary

We propose the Visually Grounded Physics Learner (VGPL), which (1) simultaneously reasons about physics and makes future predictions based on visual and dynamics priors, (2) employs a particle-based representation to handle rigid bodies, deformable objects, and fluids, and (3) infers the physical properties within a few observations, which allows the model to quickly adapt to unseen scenarios and make accurate predictions into the future.

SLIDE 51

Thank you for watching!