GraphNav A Behavioral Approach to Visual Navigation with Graph - PowerPoint PPT Presentation

MIN Faculty Department of Informatics GraphNav A Behavioral Approach to Visual Navigation with Graph Localization Networks Paul Hölzen University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Department of Informatics Technical Aspects of Multimodal Systems 20. January 2020 P. Hölzen – GraphNav 1 / 18

Outline Motivation Method Results Conclusion References 1. Motivation 2. Method Graph Neural Network Graph Localization Network Particle Filtering for GLN Behavior Networks 3. Results Baselines Evaluation 4. Conclusion 5. References P. Hölzen – GraphNav 2 / 18

Motivation Motivation Method Results Conclusion References ◮ Navigating cluttered spaces is difficult for robots ◮ Humans are really good at it ◮ Behavioral approach founded in psychology ◮ proposed by Chen et al. [1] ◮ Cognitive Maps → graph-like structure Environment and corresponding topological map [1] P. Hölzen – GraphNav 3 / 18

Motivation Motivation Method Results Conclusion References ◮ Benefits of a graph-like map ◮ Coarse/Sparse topological information ◮ Navigation planning on a graph ◮ High-level abstraction Navigation examples on topological map [1] P. Hölzen – GraphNav 4 / 18

Graph Neural Network (GNN) Motivation Method Results Conclusion References ◮ Neural Network performing on graph-like structures ◮ Captures relational inductive biases ◮ Graph G = ( u , V , E ) ◮ u global feature ◮ V = { v i } i = 1 : n node features ◮ E = { ( e k , r k , s k ) } k = 1 : m edge features ◮ Edge features correspond to behaviors ◮ corridor follow ◮ find door ◮ turn left ◮ turn right ◮ straight (into room) P. Hölzen – GraphNav 5 / 18

Graph Neural Network (GNN) Motivation Method Results Conclusion References ◮ Graph network blocks 1. φ e ( · ) update edge features 2. ρ e → v ( · ) aggregate edge features 3. φ v ( · ) update node features 4. ρ e → u ( · ) , ρ v → u ( · ) aggregate edge and node features 5. φ u ( · ) update global feature Info The update functions φ e ( · ) , φ v ( · ) , φ u ( · ) were implemented using multilayer perceptrons (MLPs), the aggregation functions ρ e → v ( · ) , ρ e → u ( · ) , ρ v → u ( · ) use elementwise summation to ensure symmetry of the function (permutation agnostic) P. Hölzen – GraphNav 6 / 18

Graph Localization Network (GLN) Motivation Method Results Conclusion References ◮ Predicts location of the agent in the topological map ◮ Inputs ◮ Current visual observation ◮ Last predicted location ◮ Graph with edge and node features Graph localization network overview [1] P. Hölzen – GraphNav 7 / 18

Graph Localization Network (GLN) Motivation Method Results Conclusion References ◮ Topological map is cropped to region around last location ◮ Edge/Node features from embedding lookup table ◮ Global feature from CNN processing visual observation ◮ GNN predicts the current node/edge GLN architecture in detail [1] P. Hölzen – GraphNav 8 / 18

Particle Filtering for GLN Motivation Method Results Conclusion References ◮ used to improve GLN predictions ◮ based on statistical model ◮ p ( x t | u t , x t − 1 ) ◮ x t current state at time step t ◮ u t control input ◮ p ( z t | x t ) ◮ z t observation/measurement at time step t P. Hölzen – GraphNav 9 / 18

Particle Filtering for GLN Motivation Method Results Conclusion References ◮ Assumption 1: Two time steps t − 1 and t don’t differ a lot in topological location ◮ p ( x t | u t , x t − 1 ) = p ( x t | x t − 1 ) ◮ Chen et al. use p ( x t = x t − 1 | x t − 1 ) = 0 . 8 ◮ Assumption 2: p ( z t ) and p ( x t ) are uniform distributions for all time steps ◮ γ = p ( z t ) p ( x t ) = const . ◮ Bayes rule: p ( z t | x t ) = γ · p ( x t | z t ) ∝ p ( x t | z t ) ◮ Approximate p ( x t | z t ) by aggregating edge probabilities from the GLN P. Hölzen – GraphNav 10 / 18

Behavior Networks Motivation Method Results Conclusion References ◮ Separate networks for each behavior ◮ Correspond to edge features ◮ CNNs and LSTMs used to implement Overall architecture of GraphNav including behavior networks [1] P. Hölzen – GraphNav 11 / 18

Behavior Networks Motivation Method Results Conclusion References ◮ CNN-based behavior networks ◮ corridor follow ◮ find door ◮ LSTM-based behavior networks ◮ turn left ◮ turn right ◮ straight (into room) Architecture of LSTM-based behavior networks [1] P. Hölzen – GraphNav 12 / 18

Baselines Motivation Method Results Conclusion References ◮ Evaluation of the results by comparing to baselines ◮ PhaseNet [2]: LSTM-based, predicts temporal progress of behavior and when to switch to a new one ◮ BehavRNN [3]: Sequence-to-sequence deep learning model, behavior classification from visual input ◮ GTL : Ground Truth Localization, used to evaluate behavior networks independently P. Hölzen – GraphNav 13 / 18

Evaluation Motivation Method Results Conclusion References ◮ GraphNavPF (with Particle Filtering) has highest performance compared to baselines ◮ Per-behavior success ( 90%) and path completion rate ( 70%) are resonable ◮ PhaseNet and BehavRNN perform significantly worse on seen and unseen environments ◮ GTL baseline shows that behavior networks work well, struggles in open spaces Output of the localization network [1] P. Hölzen – GraphNav 14 / 18

Video Motivation Method Results Conclusion References Video example of the GraphNav approach working [4] P. Hölzen – GraphNav 15 / 18

Future Research Motivation Method Results Conclusion References ◮ Topological map has to be created and annotated by hand ◮ Set of behaviors has to be pre-defined ◮ Chen et al. propose data-driven approach to automate this ◮ Simulation-to-reality has to be tested P. Hölzen – GraphNav 16 / 18

Conclusion Motivation Method Results Conclusion References ◮ Navigation approach that uses topological map and visual information as input ◮ Graph neural networks for localization ◮ Separate behavior networks with behavior selection ◮ Outperforms several baselines Overall architecture of GraphNav including behavior networks [1] P. Hölzen – GraphNav 17 / 18

References Motivation Method Results Conclusion References [1] Chen, Kevin, et al. "A behavioral approach to visual navigation with graph localization networks." arXiv preprint arXiv:1903.00445 (2019). [2] Yu, Tianhe, et al. "One-shot hierarchical imitation learning of compound visuomotor tasks." arXiv preprint arXiv:1810.11043 (2018). [3] Sutskever, Ilya, et al. "Sequence to sequence learning with neural networks." In Advances in neural information processing systems, pages 3104–3112 (2014). [4] Chen, Kevin, et al. "GraphNav: A behavioral approach to visual navigation with graph localization networks." March 2019, URL: www.youtube.com/watch?v=nN3B1F90CFM, Acessed 17.01.2020. P. Hölzen – GraphNav 18 / 18

GraphNav A Behavioral Approach to Visual Navigation with Graph - PowerPoint PPT Presentation

MIN Faculty Department of Informatics GraphNav A Behavioral Approach to Visual Navigation with Graph Localization Networks Paul Hlzen University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Department of Informatics

Spherical and toroidal Schubert Varieties Reuven Hodges (joint with V. Lakshmibai, M.B. Can)

A Descriptive View of Unitary Group Representations Simon Thomas Rutgers University

Problem: Understand (classify?) the irreducible, complex representations of GL n ( Z / p Z ).

Nested Bethe ansatz for orthogonal and symplectic open spin chains Allan Gerrard in

Geometric actions of classical groups Raffaele Rainone School of Mathematics University of

Clinical trials in T2DM & CVD: Review of key outcomes with GLP-1 RA and SGLT2i Eduard

Imaginaries in valued fields with operators Silvain Rideau UC Berkeley Mai 25 2016 1 / 8

Group cohomology and Levi decompositions of linear groups George McNinch Department of

Ramanujan Graphs, Ramanujan Complexes and Zeta Functions Emerging Applications of Finite Fields

4.6 General Linear Groups 4.7 Steinbergs Unipotent Characters Farid Aliniaeifard York

A comparative study of extrapolation methods, sequence transformations and steepest descent

A theorem in representation theory for DG algebras, with an application to a question of

Catalan Combinatorics of Borel Ideals and Generalizations Eric S. Egge Carleton College

Imaginaries in pseudo- p -adically closed fields Joint with Samaria Montenegro Silvain Rideau UC

A Little Lie (Theory) Never Hurt Anyone Siddharth Taneja in collaboration with Arghya Sadhukhan

Local L -values and geometric harmonic analysis on spherical varieties Jonathan Wang (joint w/

Example Let K = C ( x ) and t C . Consider y ( x ) = t xy ( x ) . The solution looks like x

Sieve Methods in Group Theory Alex Lubotzky Hebrew University Jerusalem, Israel joint with:

Bowling ball representation of virtual string links Zhiyun Cheng Beijing Normal University

Counting versus Integration Anton Gerasimov (ITEP/TCD/HMI) SCGP Workshop November 2012 Examples

The sl 3 web algebra Marco Mackaay CAMGSD and Universidade do Algarve September, 2012 1

Duality for integrable systems associated to quantum toroidal algebras Evgeny Mukhin Indiana

The theory of essential dimension was born in 1997 with the publication of On the essential

The functor category F quad associated to quadratic spaces over F 2 Christine VESPA University

GraphNav A Behavioral Approach to Visual Navigation with Graph - PowerPoint PPT Presentation

MIN Faculty Department of Informatics GraphNav A Behavioral Approach to Visual Navigation with Graph Localization Networks Paul Hlzen University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Department of Informatics

Spherical and toroidal Schubert Varieties Reuven Hodges (joint with V. Lakshmibai, M.B. Can)

A Descriptive View of Unitary Group Representations Simon Thomas Rutgers University

Problem: Understand (classify?) the irreducible, complex representations of GL n ( Z / p Z ).

Nested Bethe ansatz for orthogonal and symplectic open spin chains Allan Gerrard in

Geometric actions of classical groups Raffaele Rainone School of Mathematics University of

Clinical trials in T2DM &amp; CVD: Review of key outcomes with GLP-1 RA and SGLT2i Eduard

Imaginaries in valued fields with operators Silvain Rideau UC Berkeley Mai 25 2016 1 / 8

Group cohomology and Levi decompositions of linear groups George McNinch Department of

Ramanujan Graphs, Ramanujan Complexes and Zeta Functions Emerging Applications of Finite Fields

4.6 General Linear Groups 4.7 Steinbergs Unipotent Characters Farid Aliniaeifard York

A comparative study of extrapolation methods, sequence transformations and steepest descent

A theorem in representation theory for DG algebras, with an application to a question of

Catalan Combinatorics of Borel Ideals and Generalizations Eric S. Egge Carleton College

Imaginaries in pseudo- p -adically closed fields Joint with Samaria Montenegro Silvain Rideau UC

A Little Lie (Theory) Never Hurt Anyone Siddharth Taneja in collaboration with Arghya Sadhukhan

Local L -values and geometric harmonic analysis on spherical varieties Jonathan Wang (joint w/

Example Let K = C ( x ) and t C . Consider y ( x ) = t xy ( x ) . The solution looks like x

Sieve Methods in Group Theory Alex Lubotzky Hebrew University Jerusalem, Israel joint with:

Bowling ball representation of virtual string links Zhiyun Cheng Beijing Normal University

Counting versus Integration Anton Gerasimov (ITEP/TCD/HMI) SCGP Workshop November 2012 Examples

The sl 3 web algebra Marco Mackaay CAMGSD and Universidade do Algarve September, 2012 1

Duality for integrable systems associated to quantum toroidal algebras Evgeny Mukhin Indiana

The theory of essential dimension was born in 1997 with the publication of On the essential

The functor category F quad associated to quadratic spaces over F 2 Christine VESPA University

Clinical trials in T2DM & CVD: Review of key outcomes with GLP-1 RA and SGLT2i Eduard