
SLIDE 1

Teaching a robot to interpret natural language navigation instructions

Ryan Eloff

Supervisor: Dr. H. Kamper Department of Electrical & Electronic Engineering Maties Machine Learning Stellenbosch University

23 February 2018

UNIVERSITEIT • STELLENBOSCH • UNIVERSITY (jou kennisvennoot / your knowledge partner)
SLIDE 2

Natural language for human-robot interaction

The Gambit platform, which learns to identify and pick up objects from a user's speech and gestures.1

1 C. Matuszek, L. Bo, L. Zettlemoyer, et al., “Learning from unscripted deictic gesture and language for human-robot interactions,” in Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014.

SLIDE 3

Natural language for human-robot interaction

A robotic wheelchair that learns about the environment from a narrated tour.2

2 M. R. Walter, S. Hemachandra, B. Homberg, et al., “A framework for learning semantic maps from grounded natural language descriptions,” The International Journal of Robotics Research, 2014.

SLIDE 4

Natural language for human-robot interaction

◮ We still face challenges in building robots that can understand our written natural language instructions.
◮ Most existing systems are available only in the most commonly used languages: English, German, Spanish, ...
◮ These technologies are not available for low-resource languages.
◮ This is a problem in South Africa!

SLIDE 7

Natural language instructions for route direction

[Map of the virtual environment, with labelled landmarks including a sofa and a hatrack.]

◮ “Walk forward once”
◮ “Move to the sofa”
◮ “Go down till the chair”
◮ “Go forward one segment to the intersection with a bare concrete floor”

Can we build a language-independent system for interpreting written natural language navigation instructions that is also less dependent on data?

SLIDE 10

Evaluation, virtual environments, & web application

SLIDE 11

Example instruction data

◮ Example 1: “Walk two steps forward”

SLIDE 12

Example instruction data

◮ Example 2: “Move forward and turn right”

SLIDE 13

Example instruction data

◮ Example 3: “Move to the easel”

SLIDE 14

Unseen instruction problem

◮ Unseen: “Walk three steps to the easel”

SLIDE 15

General model

[Diagram: during training, each instruction x(i) is mapped to a sentence embedding and, together with the world state w(i) and the observed action sequence a(i), is fed to a navigation plan constructor (a “black box” learning algorithm) that produces a navigation plan p(i). At test time, an unseen instruction x(j) is embedded and, with the world state w(j), passed to a navigation plan executor that outputs an action sequence a(j).]

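The train/test flow in the general-model diagram can be sketched as a toy pipeline. Everything here is a stand-in for illustration: the bag-of-words embedding, the memorising "plan constructor", and the lookup-style "executor" are not the components used in the project.

```python
from collections import Counter
import math

def embed(sentence):
    """Bag-of-words sentence embedding (stand-in for the real embeddings)."""
    return Counter(sentence.lower().split())

def distance(a, b):
    """Euclidean distance between two bag-of-words vectors."""
    words = set(a) | set(b)
    return math.sqrt(sum((a[w] - b[w]) ** 2 for w in words))

class NavigationModel:
    def train(self, examples):
        # "Navigation plan constructor": memorise (embedding, plan) pairs.
        self.memory = [(embed(x), plan) for x, plan in examples]

    def predict(self, instruction):
        # "Navigation plan executor": run the plan of the nearest instruction.
        q = embed(instruction)
        _, plan = min(self.memory, key=lambda m: distance(q, m[0]))
        return plan

model = NavigationModel()
model.train([("walk two steps forward", ["FORWARD", "FORWARD"]),
             ("turn right", ["RIGHT"])])
print(model.predict("walk forward"))  # nearest stored instruction wins
```

The point of the sketch is the interface: training consumes (instruction, world state, actions) observations, while testing maps an unseen instruction to an executable action sequence.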

SLIDE 22

Proposed models

k-nearest neighbours (kNN) models:

[Diagram: query instructions q1, q2, q3 plotted in a feature space (f1, f2) among training points with class labels Turn(LEFT) and Turn(RIGHT); the nearest neighbours under the chosen distance metric determine the predicted class.]

Proposed distance metrics:
◮ Levenshtein distance
◮ Euclidean distance

Basic one-vs-rest (OVR) logistic regression model:

[Diagram: the encoded instruction x(i) and world state w(i)_{n−1} form the input features; OVR parameter vectors θ(0), ..., θ(m) produce output class probabilities y0, y1, ..., ym, ..., yk, and the index k of the maximum probability gives the output plan label p(i).]
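The kNN model with Levenshtein distance on this slide can be sketched directly over raw instruction strings. The value of k and the toy training set below are illustrative only, not the project's data.

```python
def levenshtein(s, t):
    """Edit distance between strings s and t via dynamic programming."""
    n = len(t)
    prev = list(range(n + 1))
    for i in range(1, len(s) + 1):
        curr = [i] + [0] * n
        for j in range(1, n + 1):
            cost = 0 if s[i - 1] == t[j - 1] else 1
            curr[j] = min(prev[j] + 1,        # deletion
                          curr[j - 1] + 1,    # insertion
                          prev[j - 1] + cost) # substitution / match
        prev = curr
    return prev[n]

def knn_predict(query, training, k=3):
    """Majority vote among the k training instructions closest to the query."""
    nearest = sorted(training, key=lambda ex: levenshtein(query, ex[0]))[:k]
    labels = [label for _, label in nearest]
    return max(set(labels), key=labels.count)

training = [("turn left", "TURN_LEFT"), ("go left", "TURN_LEFT"),
            ("turn right", "TURN_RIGHT"), ("go right", "TURN_RIGHT"),
            ("rotate left", "TURN_LEFT")]
print(knn_predict("take a left", training, k=3))
```

Swapping in Euclidean distance simply means replacing `levenshtein` with a vector distance computed over sentence embeddings of the instructions.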

SLIDE 24

Results: Overall model performance

◮ Overall performance of the proposed models and that of previous work:

  Model                          F1     Accuracy
  kNN with Levenshtein distance  77.10  55.25
  kNN with Euclidean distance    75.44  51.65
  Basic logistic regression      78.18  57.32
  Stepwise logistic regression   72.58  44.36
  Chen and Mooney (2011)         68.37  54.40
  Chen (2012)                    69.43  57.28
  Mei et al. (2016)              n/a    69.98

SLIDE 25

Results: Performance on small datasets

◮ Average strict accuracy of the proposed models when trained on different sample sizes n:

[Plot: accuracy (%) versus number of training samples n (64 to 640) for kNN with Euclidean distance, kNN with Levenshtein distance, basic logistic regression, and stepwise logistic regression.]

SLIDE 26

Results: Performance on an Afrikaans corpus

◮ Proposed models evaluated on an Afrikaans translation of the benchmark English corpus*:

[Bar charts: F1-score (%) and strict accuracy (%) on the English versus Afrikaans corpora for kNN with Levenshtein distance, kNN with Euclidean distance, basic logistic regression, and stepwise logistic regression. Performance drops only slightly on the Afrikaans corpus in each case, e.g. F1 falls from 77.1 to 75.05 for kNN with Levenshtein distance, and accuracy from 57.32 to 54.9 for basic logistic regression.]

* English corpus translated to Afrikaans using Google’s state-of-the-art Neural Machine Translation system.

SLIDE 27

Contributions

◮ Simple machine learning models applied to the novel purpose of learning from limited datasets, shown to be competitive with the complex systems of previous work.
◮ A web application with data collection capabilities, ideal for future research on human-robot interaction.
◮ These models are a first step and a foundation for future research on more complex models for limited data (and speech!).
◮ The first Afrikaans system for robotic instruction following. The system may be trained in any language, so it is ideal for low-resource languages.

SLIDE 31

My current work

◮ One-shot learning on small multimodal datasets consisting of images paired with spoken captions.
◮ One-shot learning example:3

[Figure: “This is a ‘dax’.” with an example image, followed by three candidate images (1), (2), (3) and the question “Which of these is a ‘dax’?”]

◮ Goal: develop systems that learn and think like people.

3 R. Feinman and B. M. Lake, “Learning inductive biases with simple neural networks,” arXiv:1802.02745, 2018.


SLIDE 34

Some other slides ...

SLIDE 35

Word embeddings with latent semantic analysis

Example instructions:
◮ d1: Move forward two steps
◮ d2: Travel forward
◮ d3: Travel two steps
◮ d4: Turn left
◮ d5: Turn right and move to the chair

[Plot: 2-D word embeddings (move, forward, two, steps, travel, turn, left, right, chair) and sentence embeddings for d1–d5, obtained with latent semantic analysis.]

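Latent semantic analysis on these five example instructions can be sketched with a truncated SVD of the word-document count matrix. This is a simplification for illustration (raw counts, no TF-IDF weighting, rank fixed at 2, and no stop-word removal, so the vocabulary is slightly larger than the nine words shown on the slide).

```python
import numpy as np

docs = ["move forward two steps", "travel forward", "travel two steps",
        "turn left", "turn right and move to the chair"]
vocab = sorted({w for d in docs for w in d.split()})

# Count matrix C: rows = words, columns = documents (instructions).
C = np.zeros((len(vocab), len(docs)))
for j, d in enumerate(docs):
    for w in d.split():
        C[vocab.index(w), j] += 1

# Truncated SVD: C ~ U_k S_k V_k^T with k = 2 latent dimensions.
U, S, Vt = np.linalg.svd(C, full_matrices=False)
k = 2
word_emb = U[:, :k] * S[:k]      # one 2-D embedding per word
sent_emb = Vt[:k, :].T * S[:k]   # one 2-D embedding per instruction

print(word_emb.shape, sent_emb.shape)
```

Words that co-occur in the same instructions (e.g. "two" and "steps") end up close together in the latent space, which is what the scatter plot on the slide visualises.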

SLIDE 37

Stepwise logistic regression

Stepwise one-vs-rest (OVR) logistic regression model:

[Diagram: at each step n, the input features are the encoded instruction x(i), the world state w(i)_{n−1}, and the previous plan step s(i)_{n−1}. The OVR parameter vectors θ(0), ..., θ(m) yield output class probabilities y0, y1, ..., ym, ..., yk, and the index k of the maximum gives the next plan step s(i)_n. Starting from s(i)_0 = Start action, the predicted step is executed, the world state is updated after each prediction, and prediction repeats until the Stop action, giving the output sequence p(i) = {s(i)_1, ..., s(i)_N}.]
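The stepwise decoding loop in the diagram can be sketched as follows. The classifier and world model here are hypothetical lookup tables standing in for the trained OVR logistic regression and the simulated environment; only the loop structure mirrors the slide.

```python
START, STOP = "START", "STOP"

def predict_step(instruction, world_state, prev_step):
    """Stand-in for the OVR logistic regression classifier."""
    table = {(START, "at_door"): "FORWARD",
             ("FORWARD", "at_sofa"): "TURN_LEFT",
             ("TURN_LEFT", "at_sofa"): STOP}
    return table.get((prev_step, world_state), STOP)

def execute(world_state, step):
    """Stand-in world model: executing a step updates the observed state."""
    transitions = {("at_door", "FORWARD"): "at_sofa"}
    return transitions.get((world_state, step), world_state)

def stepwise_decode(instruction, world_state, max_steps=20):
    """Predict, execute, and repeat until the Stop action is predicted."""
    plan, prev = [], START
    for _ in range(max_steps):  # guard against non-termination
        step = predict_step(instruction, world_state, prev)
        if step == STOP:
            break
        plan.append(step)
        world_state = execute(world_state, step)
        prev = step
    return plan

print(stepwise_decode("move to the sofa and turn left", "at_door"))
```

Conditioning each prediction on the previous step and the updated world state is what distinguishes this stepwise model from the basic OVR model, which predicts the whole plan label in one shot.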