Research Conference 2018
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
Xin (Eric) Wang*, Vihan Jain*, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi
Natural Language Grounded Navigation
2
- Command embodied agents to navigate in the 3D world with natural language, such as coarse-/fine-grained instructions, questions, and dialog
Person:
Can you grab the plant for me?
- Sure. Where is it?
- Gotcha. Get out of the room and go towards the kitchen. The plant is on the window near the kitchen.
Vision-and-Language Navigation (VLN)
- Given a fine-grained instruction and a starting location
- Agent must reach the target location by following the natural language instruction
- Room-to-Room (R2R) Dataset
Anderson et al., CVPR 2018
3
Cooperative Vision-and-Dialog Navigation (CVDN)
- Both Navigator and Oracle are given a hint (e.g., the goal room contains a mat)
- Navigator: goes towards the goal room and can stop anytime to ask a question
- Oracle: foresees the next best steps and answers the questions
Thomason et al., CoRL 2019
4
Sub-task: Navigation from Dialog History (NDH)
- Given the dialog history, predict the navigation actions that bring the agent closer to the goal room
5
Thomason et al., CoRL 2019
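As a concrete illustration of the NDH setup, the sketch below flattens a dialog history into a single conditioning sequence for the navigation policy. The function name, field layout, and `<SEP>` separator are hypothetical choices for this example, not the exact format used in the paper.

```python
# Hypothetical sketch of assembling an NDH (Navigation from Dialog History)
# input: the target hint plus all dialog turns, flattened into one sequence.
def build_ndh_input(target_hint, dialog, separator=" <SEP> "):
    """dialog is a list of (speaker, utterance) tuples, oldest first."""
    turns = [f"{speaker}: {utterance}" for speaker, utterance in dialog]
    # The hint (e.g., "the goal room contains a mat") is prepended.
    return separator.join([f"hint: {target_hint}"] + turns)

example = build_ndh_input(
    "the goal room contains a mat",
    [("navigator", "Should I head down the hallway?"),
     ("oracle", "Yes, then turn left at the kitchen.")],
)
```

The flattened sequence then plays the same role for NDH that the instruction plays for VLN: one token stream consumed by the language encoder.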
Challenge
6
Poor Generalization Issue
- Navigation models tend to overfit seen environments and perform poorly on unseen environments
7
[Figure: training vs. evaluation environments; evaluation on Seen environments matches training (=), evaluation on Unseen environments does not (!=)]
Data Scarcity Is a Big Problem
- Real-world experiments are NOT scalable
- Data collection is prohibitively expensive and time-consuming
- Models break under distribution shift
8
Environment-agnostic Multitask Navigation
9
- Multitask learning: transfer knowledge across tasks
- Environment-agnostic learning: learn invariant representations that generalize better to unseen environments
Towards Generalizable Navigation
10
A Strong Baseline for VLN: RCM
[Architecture: VLN instruction → Word Embedding → Language Encoder; paired demo path → Panoramic Features → Trajectory Encoder; cross-modal attention (CM-ATT) → Action Predictor]
Wang et al., CVPR 2019
11
Leave the living room. Go through the hallway with paintings on the wall and head to the kitchen. Stop next to the wooden dining table.
Multitask RCM
[Architecture, shared across tasks: NDH dialog or VLN instruction → Joint Word Embedding → Language Encoder; paired demo path → Panoramic Features → Trajectory Encoder; CM-ATT → Action Predictor]
12
Interleaved Multitask Data Sampling
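A minimal sketch of what interleaved sampling can look like: mini-batches are drawn from the two task datasets in round-robin order, so each training step alternates between VLN and NDH. The batch size, round-robin order, and stop-when-exhausted behavior are simplifying assumptions for this example.

```python
import itertools
import random

def interleaved_batches(vln_data, ndh_data, batch_size=2, seed=0):
    """Yield (task_name, batch) pairs, alternating between the two tasks.

    A simplified sketch of interleaved multitask data sampling: each
    task's examples are shuffled independently, then mini-batches are
    drawn from the tasks round-robin until one task runs out.
    """
    rng = random.Random(seed)
    streams = {"VLN": list(vln_data), "NDH": list(ndh_data)}
    for data in streams.values():
        rng.shuffle(data)
    iters = {task: iter(data) for task, data in streams.items()}
    for task in itertools.cycle(["VLN", "NDH"]):
        batch = list(itertools.islice(iters[task], batch_size))
        if not batch:
            return
        yield task, batch

# With four examples per task and batch_size=2, training alternates
# VLN, NDH, VLN, NDH before the data is exhausted.
example = list(interleaved_batches([1, 2, 3, 4], ["a", "b", "c", "d"]))
```

Because both tasks share the word embedding, language encoder, and trajectory encoder, alternating batches lets gradients from each task update the shared parameters in turn.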
Multitask Reinforcement Learning
- Reward shaping:
○ VLN: Distance to Goal
○ NDH: Distance to Room
13
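The distance-based reward shaping above can be sketched as follows: at each step, the shaped reward is the reduction in remaining distance, measured to the goal location for VLN and to the goal room for NDH.

```python
def shaped_reward(prev_distance, new_distance):
    """Reward = progress made toward the target this step.

    For VLN the distance is to the goal location; for NDH it is to the
    goal room. Positive when the agent moves closer, negative when it
    moves away. (A simplified sketch of distance-based reward shaping.)
    """
    return prev_distance - new_distance

# A step that closes 2.5m of remaining distance earns +2.5 reward...
step_forward = shaped_reward(10.0, 7.5)
# ...while backtracking is penalized.
step_back = shaped_reward(7.5, 9.0)
```

Shaping the reward this way gives the agent a dense learning signal at every step instead of a single sparse success/failure signal at the end of the episode.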
- Navigation Loss: Reinforcement Learning + Supervised Learning
- NDH benefits from VLN
- VLN benefits from NDH with more fine-grained information about paths
○ Extending visual paths alone is NOT helpful
- Multitask RL improves generalization
○ Seen-unseen gap is narrowed
Effect of Multitask RL
14
Effect of Multitask RL
15
Multitask learning benefits from
- More appearances of underrepresented words
- Shared semantic encoding of whole sentences
Environment-agnostic Representation Learning
[Architecture: NDH dialog or VLN instruction → Word Embedding → Language Encoder; Trajectory Encoder and CM-ATT → Action Predictor; the learned representation also feeds a Gradient Reversal Layer → Environment Classifier predicting the house label y]
16
- A classifier is trained to predict the environment identity from the learned representation; the gradient reversal layer between them pushes the encoder toward environment-invariant features
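The gradient reversal layer is the identity in the forward pass and multiplies the incoming gradient by -λ in the backward pass, so minimizing the environment classifier's loss trains the encoder to *fool* that classifier. A minimal framework-free sketch (real implementations hook into an autodiff framework's custom-gradient mechanism):

```python
class GradientReversal:
    """Identity on the forward pass; scales gradients by -lambda on the
    backward pass. A minimal sketch of a gradient reversal layer,
    written without an autodiff framework for clarity."""

    def __init__(self, lam=1.0):
        self.lam = lam  # reversal strength (lambda)

    def forward(self, x):
        # Representations reach the environment classifier unchanged.
        return x

    def backward(self, grad_from_classifier):
        # The encoder receives the *reversed* gradient, pushing it toward
        # representations the environment classifier cannot exploit.
        return [-self.lam * g for g in grad_from_classifier]

grl = GradientReversal(lam=0.5)
fwd = grl.forward([1.0, 2.0])
bwd = grl.backward([4.0, -2.0])
```

With this layer in place, the environment classifier and the navigation encoder play an adversarial game: the classifier tries to recover the house label while the encoder is driven toward features that carry no environment identity.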
Environment-Aware versus Environment-Agnostic
17
NDH (Progress):
              Seen   Unseen
RCM           6.49    2.64
EnvAware      8.38    1.81
EnvAgnostic   6.07    3.15

VLN (Success Rate):
              Seen   Unseen
RCM          52.39   42.93
EnvAware     57.59   38.83
EnvAgnostic  52.79   44.40
- Env-aware learning tends to overfit seen environments
- Env-agnostic learning generalizes better on unseen environments
- (Potential) Combining env-aware & env-agnostic learning via meta-learning may get the best of both worlds
Environment-Aware versus Environment-Agnostic
18
[Figure: learned representations for Seen and Unseen environments under EnvAware vs. EnvAgnostic learning]
Environment-agnostic Multitask Learning Framework
19
Effect of Environment-agnostic Multitask Learning
20
Ranking 1st on CVDN Leaderboard
21
https://evalai.cloudcv.org/web/challenges/challenge-page/463/leaderboard/1292
Future Work
22
Generalized Navigation on Street View
TouchDown (Chen et al., 2019), StreetLearn (Mirowski et al., 2018), TalkTheWalk (de Vries et al., 2018)
23
Thanks!
Paper: https://arxiv.org/abs/2003.00443 Code: https://github.com/google-research/valan
24