InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations - PowerPoint PPT Presentation



SLIDE 1

InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations

Chih-Hui Ho, Chun Hu, Po-Jung Lai

SLIDE 2

Outline

1. Introduction 2. Related work

○ Generative adversarial imitation learning (GAIL)

3. Proposed method 4. Experiment results 5. Conclusion

SLIDE 3

Introduction

  • A reward function is important in RL tasks
  • It is hard to design a reward function in some scenarios (e.g. autonomous driving)
  • Imitation learning allows agents to learn how to perform a task like an expert

○ Generative Adversarial Imitation Learning (GAIL, [12])
○ Generative adversarial nets (GANs, [13])

  • Expert demonstrations vary significantly

○ Multiple experts might have multiple policies
○ Need external latent factors to better represent the observed behavior

  • Goal: to develop an imitation learning framework that can automatically discover and disentangle the latent factors of variation underlying expert demonstrations

SLIDE 4

GAN for imitation learning (GAIL)

https://www.youtube.com/watch?v=rOho-2oJFeA

SLIDE 5

GAN for imitation learning (GAIL)

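The GAIL objective that this slide's diagram illustrates does not survive extraction, so it is restated here for reference, following the formulation in the GAIL paper [12]:

```latex
\min_{\pi} \max_{D} \;
  \mathbb{E}_{\pi}\!\left[\log D(s, a)\right]
  + \mathbb{E}_{\pi_E}\!\left[\log\bigl(1 - D(s, a)\bigr)\right]
  - \lambda H(\pi)
```

where $\pi_E$ is the expert policy, $D$ is a discriminator trained to distinguish policy state-action pairs from expert ones, and $H(\pi)$ is the causal entropy of the learned policy.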
SLIDE 6
Proposed method

  • Introduce a latent factor c to represent the variation underlying expert demonstrations
  • In GAIL, the action is chosen as a ~ π(a | s)
  • The proposed method chooses the action as a ~ π(a | s, c)
  • Maximize the mutual information between the latent code c and the {state, action} trajectory
  • The trajectory is a function of the latent code c through the policy

[Figure: GAIL vs. InfoGAIL architecture diagrams]
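As a concrete illustration of the latent-conditioned policy above, here is a minimal sketch of π(a | s, c) versus GAIL's π(a | s). The network sizes and weights are made up for illustration and are not from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

STATE_DIM, CODE_DIM, ACTION_DIM, HIDDEN = 8, 3, 2, 16

# Hypothetical weights for a tiny two-layer policy network (illustration only).
W1 = rng.normal(0, 0.1, (STATE_DIM + CODE_DIM, HIDDEN))
W2 = rng.normal(0, 0.1, (HIDDEN, ACTION_DIM))

def sample_code():
    """Sample a discrete latent code c ~ p(c) from a uniform prior, one-hot encoded."""
    c = np.zeros(CODE_DIM)
    c[rng.integers(CODE_DIM)] = 1.0
    return c

def policy(state, code):
    """InfoGAIL-style policy pi(a | s, c): the latent code is concatenated to the
    state, so different codes can produce different behaviors in the same state.
    A GAIL policy pi(a | s) would take only `state` as input."""
    x = np.concatenate([state, code])
    h = np.tanh(x @ W1)
    return h @ W2  # mean action; a Gaussian head would add noise around this

s = rng.normal(size=STATE_DIM)
a0 = policy(s, np.array([1.0, 0.0, 0.0]))
a1 = policy(s, np.array([0.0, 1.0, 0.0]))
# Different latent codes yield different actions for the same state,
# which is what lets the codes capture distinct driving behaviors.
print(a0, a1)
```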

SLIDE 7

Proposed method

  • The discriminator maximizes the adversarial classification objective
  • The policy and the posterior Q minimize the objective, which includes the negative mutual-information lower bound
  • The policy is updated with TRPO [2]
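The bullets above refer to equations that were images in the original slides. Following the InfoGAIL paper, the full objective (restated here, not recovered from the slide) is:

```latex
\min_{\pi, Q} \max_{D} \;
  \mathbb{E}_{\pi}\!\left[\log D(s, a)\right]
  + \mathbb{E}_{\pi_E}\!\left[\log\bigl(1 - D(s, a)\bigr)\right]
  - \lambda_1 L_I(\pi, Q)
  - \lambda_2 H(\pi)
```

where $L_I(\pi, Q) = \mathbb{E}_{c \sim p(c),\, \tau \sim \pi(\cdot \mid \cdot,\, c)}\left[\log Q(c \mid \tau)\right] + H(c)$ is a variational lower bound on the mutual information $I(c; \tau)$. The discriminator $D$ maximizes the first two terms; the policy $\pi$ and the posterior approximation $Q$ minimize the whole objective, and the policy step itself uses TRPO [2].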

SLIDE 8

Proposed method

  • Reward augmentation

○ Helps when the expert performs sub-optimally
○ A hybrid between RL and imitation learning

  • Replace the vanilla GAN with WGAN [26]

○ More stable and easier to train
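A minimal sketch of the two ideas above, with made-up numbers: the WGAN critic loss that replaces the vanilla GAN loss, and a reward-augmentation blend. The mixing weight `lam0` and the bonus `eta` are hypothetical, not from the slides:

```python
import numpy as np

def wgan_critic_loss(d_expert, d_policy):
    """WGAN critic loss: push critic scores up on expert (s, a) pairs and down
    on policy pairs (weight clipping / gradient penalty omitted for brevity)."""
    return np.mean(d_policy) - np.mean(d_expert)

def augmented_reward(d_policy, eta, lam0=0.5):
    """Reward augmentation: blend the imitation surrogate reward (the critic
    score under WGAN) with a hand-designed reward eta, e.g. forward velocity.
    lam0 is a hypothetical mixing weight, not from the original slides."""
    return d_policy + lam0 * eta

d_expert = np.array([1.2, 0.9, 1.1])    # critic scores on expert pairs
d_policy = np.array([-0.3, 0.1, -0.2])  # critic scores on policy pairs
loss = wgan_critic_loss(d_expert, d_policy)
r = augmented_reward(d_policy, eta=np.array([0.5, 0.4, 0.6]))
print(loss, r)  # lower loss = better critic; r rewards both imitation and eta
```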

SLIDE 9

Experiment Result - Learning to Distinguish Trajectories

  • The driving experiments are conducted in TORCS (The Open Racing Car Simulator)
  • Each color denotes one specific latent code

○ Different experts have different trajectories

SLIDE 10

Experiment Result - Interpretable Imitation Learning

  • Blue and red indicate policies under different latent codes
  • They correspond to “turning from inner lane” and “turning from outer lane” respectively

SLIDE 11

Experiment Result - Interpretable Imitation Learning

  • Different latent codes correspond to passing from right or left

[Figure: passing trajectories, InfoGAIL vs. GAIL]

SLIDE 12

Experiment

SLIDE 13

Conclusion

  • Automatically distinguishes certain driving behaviors by introducing latent factors
  • Discovers the latent factors without direct supervision
  • Performs imitation learning using only visual inputs
  • Learns a policy that can imitate and even outperform the human experts

SLIDE 14

Demo Video
