Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: - PowerPoint PPT Presentation

Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM Abhinav   Ruslan   Devendra Singh Saurabh   Gupta Salakhutdinov Gupta Chaplot

Semantic Priors and Common-Sense • Humans use semantic priors and common-sense to explore and navigate everyday • Most navigation algorithms struggle to do so 2

Semantic Priors and Common-Sense • Humans use semantic Target Image priors and common-sense to explore and navigate everyday • Most navigation algorithms 3 2 struggle to do so 1 2

Image Goal Task ( I S ) ( I G ) Source Image Goal Image 3

Image Goal Task ( I S ) ( I G ) Source Image Goal Image • Agent observations are panoramic images 3

Image Goal Task ( I S ) ( I G ) Source Image Goal Image navigate • Agent observations are panoramic images • Take actions to navigate to the goal location 3

Image Goal Task ( I S ) ( I G ) Source Image Goal Image navigate • Agent observations are panoramic images • Take actions to navigate to the goal location • Take the `stop’ action at the goal location 3

Image Goal Task ( I S ) ( I G ) Source Image Goal Image navigate • Agent observations are panoramic images • Take actions to navigate to the goal location • Take the `stop’ action at the goal location • Sequential goals 3

Prior work 4

Prior work End-to-end Reinforcement or Imitation Learning End-to-end Learning Reward + • High sample complexity • Ine ff ective in large environments Reward − Neural Network Observations Actions 4

Prior work End-to-end Reinforcement or Imitation Learning End-to-end Learning Reward + • High sample complexity • Ine ff ective in large environments Reward − Neural Network Observations Actions Modular Metric Maps Modular Metric Maps • Can not learn semantic priors • Pose error accumulation 4

Topological Maps 5

Topological Maps Dining Room Kitchen Living Room Stairway Hallway Master Bedroom Office Children’s Room Entrance 5

Topological Graph Representation 6

Topological Graph Representation • Nodes : areas • Regular nodes : Explored areas • Ghost nodes : Unexplored areas Agent’s Current Node Regular Nodes Ghost Nodes 6

Topological Graph Representation • Nodes : areas • Regular nodes : Explored areas • Ghost nodes : Unexplored areas Agent’s Current Node Regular Nodes Ghost Nodes Selected Ghost Node 6

Topological Graph Representation • Nodes : areas • Regular nodes : Explored areas • Ghost nodes : Unexplored areas Agent’s Current Node Regular Nodes Ghost Nodes 7

Topological Graph Representation • Nodes : areas • Regular nodes : Explored areas • Ghost nodes : Unexplored areas Agent’s Current Node Regular Nodes Ghost Nodes Selected Ghost Node 7

Topological Graph Representation • Nodes : areas • Regular nodes : Explored areas • Ghost nodes : Unexplored areas • Edges : Spatial relationship Relative Position Agent’s Current Node between nodes Regular Nodes Ghost Nodes Selected Ghost Node 7

Four learnable functions 8

Four learnable functions = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) 8

Geometric Prediction 9

Geometric Prediction = Geometric Prediction: Free directions ℱ G ( I 1 ) 9

Semantic Prediction 10

Semantic Prediction = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) 10

Localization 11

Localization = Localization ℱ L ( I 1 , I 2 ) Localization ( ℱ L ) 1 Localization ( ℱ L ) 0 11

Relative Pose Prediction 12 12

Relative Pose Prediction = Relative Pose ℱ R ( I 1 , I 2 ) Angle Direction label ℱ R Relative Pose 0 0 0 1 0 0 0 0 0 0 0 0 Prediction ( ℱ R ) 0.0 0.0 0.0 0.87 0.0 0.0 0.0 0.0 0.0 0.0 0 0 Score predictions ℱ R Distance 12 12

Neural Topological SLAM 13

Neural Topological SLAM = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) 13

Neural Topological SLAM = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) 14

Neural Topological SLAM = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) ℱ G ( I 1 ) ℱ S ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) 15

Neural Topological SLAM = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) ℱ G ( I 1 ) ℱ S ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) 16

Neural Topological SLAM = Geometric Prediction: Free directions ℱ G ( I 1 ) = Semantic Prediction: Closeness to target ℱ S ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) ℱ L ( I 1 , I 2 ) = Localization ℱ L ( I 1 , I 2 ) ℱ G ( I 1 ) ℱ S ( I 1 , I 2 ) = Relative Pose Prediction ℱ R ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) ℱ R ( I 1 , I 2 ) Δ p Δ p 17

Single supervised learning model 22

Single supervised learning model • No reinforcement learning, no interaction needed • Can be trained completely with static data 22

Demo video Goal Location Ghost nodes Node Locations Selected Ghost node 23

Demo video Goal Location Ghost nodes Node Locations Selected Ghost node 0.09 0.08 0.73 0.23 0.07 0.17 23

Demo video Goal Location Ghost nodes Node Locations Selected Ghost node 24

Learning Semantic Priors Goal Location Ghost nodes Node Locations Selected Ghost node 0.09 0.08 0.73 0.23 0.07 0.17 25

Learning Semantic Priors Goal Location Ghost nodes Node Locations Selected Ghost node 0.20 0.56 0.17 0.13 0.76 0.27 26

Learning Semantic Priors Goal Location Ghost nodes Node Locations Selected Ghost node 27

Results RGBD RGBD RGB RGBD (No Noise) (No Stop) End-to-end   LSTM + Imitation 0.10 0.14 0.15 0.18 Learning LSTM + RL 0.10 0.13 0.14 0.17 Occupancy Maps + Modular   N/A 0.26 0.31 0.24 Metric Maps FBE + RL Active Neural SLAM 0.23 0.29 0.35 0.39 Neural Topological Topological 0.38 0.43 0.45 0.60 SLAM Maps

Results Robustness to Pose Noise RGBD RGBD RGB RGBD (No Noise) (No Stop) End-to-end   LSTM + Imitation 0.10 0.14 0.15 0.18 Learning LSTM + RL 0.10 0.13 0.14 0.17 Occupancy Maps + Modular   N/A 0.26 0.31 0.24 Metric Maps FBE + RL NTS is better than occupancy map models, Active Neural SLAM 0.23 0.29 0.35 0.39 captures and uses semantic Neural Topological Topological 0.38 0.43 0.45 0.60 priors. SLAM Maps

Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: - PowerPoint PPT Presentation

Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM Abhinav Ruslan Devendra Singh Saurabh Gupta Salakhutdinov Gupta Chaplot Semantic Priors and

SLAM: COMPARATIVE APPROACH Khooshal Saurty 1 OUTLINE Introduction - What is SLAM? EKF SLAM

SLAM@NVIDIA Kari Pulli| Senior Director of Research Overview Keyframe-based SlAM 3D

Direct Visual SLAM Instructor - Simon Lucey 16-623 - Designing Computer Vision Apps Reminder:

Visual SLAM for Mobile Instructor - Simon Lucey 16-623 - Designing Computer Vision Apps Example

Visual SLAM for Mobile Instructor - Simon Lucey 16-423 - Designing Computer Vision Apps Example

Visual SLAM An Overview L. Freda ALCOR Lab DIAG University of Rome La Sapienza May 3,

I ntroduction to Mobile Robotics SLAM Grid-based FastSLAM Wolfram Burgard 1 The SLAM

Navigation, Gravitation and Navigation, Gravitation and Navigation, Gravitation and Navigation,

Monocular Visual-Inertial SLAM for ISMAR SLAM Challenge Jie PAN Shaozu CAO, Jie PAN, Jieqi SHI,

Haptic Navigation in Mobile Contexts Agenda What is Haptic Navigation? Advantages of

React Native Navigation Screens, moving, parameters React Navigation React Navigation is not

React Native Navigation: Tabs 1 Tab Navigation the most common style of navigation in

Visual SLAM with Multi-Fisheye Camera Systems Stefan Hinz, Steffen Urban Institute of

Visual SLAM with an Event-based Camera Hanme Kim Supervisor: Prof. Andrew Davison Robot Vision

Agribot Sprayer, SLAM, and Robust Navigation Andrey Kurenkov, Troy ONeal, Pavel Komarov

Spatial navigation in humans Recap: navigation strategies and spatial representations Spatial

L AB N OTES : O DOMETRY , ROS R EFERENCE F RAMES I NSTRUCTOR : G IANNI A. D I C ARO A T Y P I C A

Neural Nonnegative Matrix Factorization for Hierarchical Multilayer Topic Modeling Jamie Haddock

14.1 Physically Based Simulation II Mass-Spring Systems Hao Li http://cs420.hao-li.com 1

OpenSky: A Swiss Army Knife for Air Traffic Security Research Martin Strohmeier 1 Matthias

Types of Correspondence Problems and Data Sets 1 1 Correspondence Registration 2

Hybrid Systems Modeling, Analysis and Control Radu Grosu Vienna University of Technology Lecture

Verifying Object Construction How to use the builder pattern with the type safety of constructors

Circular Security for Symmetric Key Bit Encryption from LWE Rishab Goyal Venkata Koppula Brent