Neural Topological SLAM for Visual Navigation
Devendra Singh Chaplot Saurabh Gupta Abhinav Gupta Ruslan Salakhutdinov
CVPR-2020
Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM
Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: - - PowerPoint PPT Presentation
Neural Topological SLAM for Visual Navigation CVPR-2020 Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM Abhinav Ruslan Devendra Singh Saurabh Gupta Salakhutdinov Gupta Chaplot Semantic Priors and
Devendra Singh Chaplot Saurabh Gupta Abhinav Gupta Ruslan Salakhutdinov
CVPR-2020
Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM
priors and common-sense to explore and navigate everyday
struggle to do so
2
priors and common-sense to explore and navigate everyday
struggle to do so
Target Image
2 3 1
2
(IS) Source Image (IG) Goal Image
3
(IS) Source Image (IG) Goal Image
3
(IS) Source Image (IG) Goal Image
navigate
3
(IS) Source Image (IG) Goal Image
navigate
3
(IS) Source Image (IG) Goal Image
navigate
3
4
4
Observations Neural Network Actions
Reward
+
Reward
−
End-to-end Reinforcement or Imitation Learning
End-to-end Learning
4
Observations Neural Network Actions
Reward
+
Reward
−
End-to-end Reinforcement or Imitation Learning Modular Metric Maps
End-to-end Learning
Modular Metric Maps
5
5
Entrance Children’s Room Living Room Stairway Dining Room Office Kitchen Master Bedroom Hallway
6
6
Agent’s Current Node Regular Nodes Ghost Nodes
6
Selected Ghost Node Agent’s Current Node Regular Nodes Ghost Nodes
7
Agent’s Current Node Regular Nodes Ghost Nodes
7
Agent’s Current Node Regular Nodes Ghost Nodes
7
Selected Ghost Node Agent’s Current Node Regular Nodes Ghost Nodes
7
Selected Ghost Node Agent’s Current Node Regular Nodes Ghost Nodes
Relative Position
between nodes
8
8
9
9
10
10
11
11
Localization(ℱL) Localization(ℱL) 1
12 12
12
Relative Pose Prediction(ℱR)
12
ℱR
0.0 0.0 0.0 0.87 0.0 0.0 0.0 0.0 0.0 0.0 1
Direction label
ℱR
Score predictions Angle Distance
13
13
14
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
15
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
16
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
17
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2)
Δp Δp
ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
18
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
19
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
20
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
21
= Geometric Prediction: Free directions = Semantic Prediction: Closeness to target = Localization = Relative Pose Prediction
ℱG(I1) ℱS(I1, I2) ℱL(I1, I2) ℱR(I1, I2)
ℱL(I1, I2) ℱS(I1, I2) ℱR(I1, I2) ℱL(I1, I2) ℱG(I1) ℱR(I1, I2)
22
22
23 Goal Location Node Locations Ghost nodes Selected Ghost node
23
0.73 0.09 0.08 0.07 0.17 0.23
Goal Location Node Locations Ghost nodes Selected Ghost node
Goal Location Node Locations Ghost nodes Selected Ghost node 24
Goal Location Node Locations Ghost nodes Selected Ghost node 24
25
0.73 0.09 0.08 0.07 0.17 0.23
Goal Location Node Locations Ghost nodes Selected Ghost node
26
0.76 0.20 0.56 0.17 0.27 0.13
Goal Location Node Locations Ghost nodes Selected Ghost node
Goal Location Node Locations Ghost nodes Selected Ghost node 27
Goal Location Node Locations Ghost nodes Selected Ghost node 27
RGB RGBD RGBD (No Noise) RGBD (No Stop) LSTM + Imitation 0.10 0.14 0.15 0.18 LSTM + RL 0.10 0.13 0.14 0.17 Occupancy Maps + FBE + RL N/A 0.26 0.31 0.24 Active Neural SLAM 0.23 0.29 0.35 0.39 Neural Topological SLAM 0.38 0.43 0.45 0.60
End-to-end Learning Modular Metric Maps Topological Maps
RGB RGBD RGBD (No Noise) RGBD (No Stop) LSTM + Imitation 0.10 0.14 0.15 0.18 LSTM + RL 0.10 0.13 0.14 0.17 Occupancy Maps + FBE + RL N/A 0.26 0.31 0.24 Active Neural SLAM 0.23 0.29 0.35 0.39 Neural Topological SLAM 0.38 0.43 0.45 0.60
Robustness to Pose Noise NTS is better than occupancy map models, captures and uses semantic priors.
End-to-end Learning Modular Metric Maps Topological Maps
But, at the same time, importance of the topological representation increases Semantic score function improves efficiency when no prior experience with environment is available. As experience in environment increases, utility of semantic function decreases
30
Neural Topological SLAM for Visual Navigation
Devendra Singh Chaplot, Ruslan Salakhutdinov, Abhinav Gupta, Saurabh Gupta CVPR 2020
Webpage: https://devendrachaplot.github.io/projects/Neural-Topological-SLAM
Devendra Singh Chaplot
Webpage: http://devendrachaplot.github.io/ Email: chaplot@cs.cmu.edu Twitter: @dchaplot