3D Dynamic Scene Graphs Actionable Spatial Perception with Places, - PowerPoint PPT Presentation

3D Dynamic Scene Graphs Actionable Spatial Perception with Places, Objects, and Humans Antoni Rosinol* , Arjun Gupta, Marcus Abate, Jingnan Shi, Luca Carlone *arosinol@mit.edu

5/19/20 2 Motivation Fully autonomous systems should operate given high-level tasks and figure out the necessary low-level tasks. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 3 Bottleneck: 3D Scene Understanding What does a robot need to accomplish high-level tasks? 3D Scene Understanding Semantics L3 L2 Mapping L1 Localization Metric-Semantic SLAM Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 4 Kimera: Real-Time Metric-Semantic SLAM [1] • Accurate and Robust State Estimation: state-of-the-art VIO • Faithfull metric-semantic reconstruction • Real-Time 100ms per frame (CPU-only) Estimated Ground-Truth [1] Rosinol, Antoni and Abate, Marcus and Chang, Yun and Carlone, Luca. “Kimera: an Open-Source Library for Real-Time Metric-Semantic Localization and Mapping”, ICRA 2020 Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 5 Problem • Raw 3D semantic mesh is not actionable: • Obstacle Avoidance and Planning: • Not readily usable for path planning: `go to the kitchen` • Human-Robot Interaction: • 3D model readable for both humans and robots • Difficult to answer queries: `how many chairs are there?` • Long-term Autonomy: • Compact representation • Different levels of Abstractions • Forget/retain relevant information Ground-Truth Estimated Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 6 3D Dynamic Scene-Graphs Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 7 3D Dynamic Scene-Graphs (DSGs) Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 8 Layers • Layer 1: Metric-Semantic 3D Mesh • Layer 2: Objects and Agents • Layer 3: Places and Structures • Layer 4: Rooms • Layer 5: Buildings Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 9 Layers • Layer 1: Metric-Semantic 3D Mesh (Kimera) • Layer 2: Objects and Agents • Layer 3: Places and Structures • Layer 4: Rooms • Layer 5: Buildings Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 11 Layer 2: Objects and Agents • Object Attributes: • 3D Centroid, bounding box, semantic label, and instance id. • Object instance extraction: 1. Extract portions of the mesh with a semantic label. 2. Clustering to extract instances (assumes 3D objects’ instances are not touching!) 3. Calculate centroid and bounding-box. • We distinguish between: • Known objects: for which we have a CAD model, and • Unknown objects: no prior 3D model • Known object instance fitting: 1. Extract 3D keypoints (spheres in blue) 2. Match all 3D keypoints from estimate and CAD model (=> outliers) 3. Use TEASER++[1] to remove outliers and fit CAD model. [1] Yang, Heng and Shi, Jingnan and Carlone, Luca. Teaser: Fast and certifiable point cloud registration. https://arxiv.org/abs/2001.07715 Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 12 Layer 2: Objects and Agents • Agents: dynamic entities in the environment: vehicles, humans, robots... • We model Agents by: i. 3D Pose Graph*: describing their trajectory over time ii. 3D Mesh Model: describing their (non-rigid) shape iii. Semantic class: human, robot, ... • Human Agents: 1. Detection: 1. Extract bounding box of image from semantic segmentation 2. Estimate 3D mesh model (SMPL) of human using [1]. 2. Tracking: 1. Incrementally build pose-graph with motion model 2. Remove outliers and/or incorrect data associations by enforcing joint consistency (blue segments in (c)) * A pose graph is a collection of time-stamped 3D poses where edges model pairwise relative measurements [1] Kolotouros, Nikos and Pavlakos, Georgios and Daniilidis, Kostas . Convolutional mesh regression for single-image human shape reconstruction. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 13 Layer 2: Objects and Agents • Human Agent Tracking: • Blue trajectory: corresponds to the built pose-graph • Rainbow human mesh: associated detections with pose-graph vertices. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 14 Layer 2: Objects and Agents • Dynamic Masking: • Non-static agents can corrupt 3D reconstruction: we avoid integrating dynamic agents in 3D metric-semantic mesh. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 15 Layer 2: Objects and Agents • Localization: KLT-IMU + 2-point RANSAC • Mapping: Dynamic Masking • Avoid integrating dynamic agents in 3D metric-semantic mesh. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 17 Layer 3: Places and Structures • Places: free-space locations, edges represent traversability. • Modelled as a topological map (readily usable for path-planning!) • Each object and agent in Layer 2 is connected to the nearest place • Structures: • Walls, floor, ceiling, pillars… [1] H Oleynikova, Z Taylor, R Siegwart, J Nieto. Sparse 3d topological graphs for micro-aerial vehicle planning, IROS 2018. Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 19 Layer 4: Rooms • Rooms: as well as corridors, halls … • Attributes: i. 3D pose ii. Bounding box iii. Semantic class (kitchen, corridor, bedroom…) • Connectivity between rooms represents traversability • Elements in Layer 3 (places, structures) are connected to their containing room nodes (Layer 4). Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 20 Layer 4: Rooms • Rooms detection: 1. A 2D slice of the 3D ESDF (Euclidean Signed Distance Function) below the detected ceiling is constant almost everywhere except near walls. Fig. (a). 2. Truncate 2D ESDF to obtain disconnected sections corresponding to rooms. Fig. (b). 3. Label nodes that fall inside a disconnected ESDF section with one room label (this only labels a subset of all nodes) 4. Using topology of the Places graph, infer the rest of room labels using majority voting. Fig. (b) Truncated 2D ESDF Fig. (a) 2D slice of 3D ESDF Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 22 Layer 5: Buildings • Buildings • Attributes: i. 3D pose ii. Bounding box iii. Semantic class (office building, residential house ) • Elements in Layer 4 (rooms) are connected to their containing building (Layer 5). Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 23 3D Dynamic Scene-Graphs Antoni Rosinol 3D Dynamic Scene Graphs

5/19/20 24 Thank you! Antoni Rosinol 3D Dynamic Scene Graphs

3D Dynamic Scene Graphs Actionable Spatial Perception with Places, - PowerPoint PPT Presentation

3D Dynamic Scene Graphs Actionable Spatial Perception with Places, Objects, and Humans Antoni Rosinol* , Arjun Gupta, Marcus Abate, Jingnan Shi, Luca Carlone *arosinol@mit.edu 5/19/20 2 Motivation Fully autonomous systems should operate given

Scene Graphs Scene Representation How does one describe the objects in a 3D scene? Scene

Scene Representation How does one describe the objects in a Scene Graphs 3D scene? Scene

CMSC427 Scene graphs Credit: slides from Dr. Zwicker Today Scene graphs & hierarchies

Episode 42: I Made Slides 10 February 2019 The Three-Act, Seven Scene Structure Act I:

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

a better and faster way Shu Kong CS, ICS, UCI Image Understanding --> Scene Parsing Scene

Volumetric Scene Reconstruction Volumetric Scene Reconstruction Goal Goal from Multiple

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Lecture 7: Image Sources, Convolution, Scene Graphs COMPSCI/MATH 290-04 Chris Tralie, Duke

c cientific The Hessian Module omputing Current status and future perspectives

PTCL Teresa Palomero, PhD Institute for Cancer Genetics Department of Pathology and Cell Biology

Basic Text Processing Regular Expressions Regular expressions A formal

GCF 26th Meeting of the Board (B.26) Margaret-Ann Splawn Active Private Sector Observer to the

May a City Ordinance . . . 1. Limit the number of cats per residence? 2. Set the rates charged by

NLG, Wrap up Surface realizer Linearization SimpleNLG Lexicon Scott Farrar Design ideas

Climate Change Economics Dr. Shana McDermott Department of Economics Climate Changed FYE- Fall

Proportional Lumpability Andrea Marin 1 Carla Piazza 2 Sabina Rossi 1 1 Universit` a Ca Foscari

3D Dynamic Scene Graphs Actionable Spatial Perception with Places, - PowerPoint PPT Presentation

3D Dynamic Scene Graphs Actionable Spatial Perception with Places, Objects, and Humans Antoni Rosinol* , Arjun Gupta, Marcus Abate, Jingnan Shi, Luca Carlone *arosinol@mit.edu 5/19/20 2 Motivation Fully autonomous systems should operate given

Scene Graphs Scene Representation How does one describe the objects in a 3D scene? Scene

Scene Representation How does one describe the objects in a Scene Graphs 3D scene? Scene

CMSC427 Scene graphs Credit: slides from Dr. Zwicker Today Scene graphs &amp; hierarchies

Episode 42: I Made Slides 10 February 2019 The Three-Act, Seven Scene Structure Act I:

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

a better and faster way Shu Kong CS, ICS, UCI Image Understanding --&gt; Scene Parsing Scene

Volumetric Scene Reconstruction Volumetric Scene Reconstruction Goal Goal from Multiple

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Lecture 7: Image Sources, Convolution, Scene Graphs COMPSCI/MATH 290-04 Chris Tralie, Duke

c cientific The Hessian Module omputing Current status and future perspectives

PTCL Teresa Palomero, PhD Institute for Cancer Genetics Department of Pathology and Cell Biology

Basic Text Processing Regular Expressions Regular expressions A formal

GCF 26th Meeting of the Board (B.26) Margaret-Ann Splawn Active Private Sector Observer to the

May a City Ordinance . . . 1. Limit the number of cats per residence? 2. Set the rates charged by

NLG, Wrap up Surface realizer Linearization SimpleNLG Lexicon Scott Farrar Design ideas

Climate Change Economics Dr. Shana McDermott Department of Economics Climate Changed FYE- Fall

Proportional Lumpability Andrea Marin 1 Carla Piazza 2 Sabina Rossi 1 1 Universit` a Ca Foscari

CMSC427 Scene graphs Credit: slides from Dr. Zwicker Today Scene graphs & hierarchies

a better and faster way Shu Kong CS, ICS, UCI Image Understanding --> Scene Parsing Scene