Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse - PowerPoint PPT Presentation

5/25/20 Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse Reinforcement Learning) Many slides by Drew Bagnell Carnegie Mellon University 1 2 Optimal Control Solution X X Learning Y Y Cost Map (Sensor Data) (Input) (Output) (Path to goal) 2-D Learning Planner Y (Path to goal) 3 4 1

5/25/20 Mode 1: Training example Mode 1: Training example 5 6 Mode 1: Learned behavior Mode 1: Learned behavior 7 8 2

5/25/20 Mode 1: Learned cost map Mode 2: Training example 9 10 Mode 2: Training example Mode 2: Learned behavior 11 12 3

5/25/20 Mode 2: Learned behavior Mode 2: Learned cost map 13 14 w=[], F=[] ( , High Cost) Learn F 1 ( , Low Cost) Feature vector Cost = w' F Weighting vector Ratliff, Bagnell, Zinkevich 2005 Ratliff, Bagnell, Zinkevich, ICML 2006 Ratliff, Bradley, Bagnell, Chestnutt, NIPS 2006 Ratliff, Bradley, Bagnell, Chestnutt, NIPS 2006 Silver, Bagnell, Stentz, RSS 2008 Silver, Bagnell, Stentz, RSS 2008 15 16 4

5/25/20 w=[w 1 ], F=[F 1 ] ( , High Cost) Learn F 2 ( , Low Cost) Ratliff, Bagnell, Zinkevich, ICML 2006 Ratliff, Bradley, Bagnell, Chestnutt, NIPS 2006 Silver, Bagnell, Stentz, RSS 2008 17 18 Learned Cost Function Examples Ratliff, Bradley, Chesnutt, Bagnell 06 Zucker, Ratliff, Stolle, Chesnutt, Bagnell, Atkeson, Kuffner 09 19 20 5

5/25/20 Learned Cost Function Examples Pedestrian Trajectory Prediction 21 22 Learning Manipulation Preferences Staying out of People’s Path • Input: Human demonstrations of preferred behavior (e.g., moving a cup of water upright without spilling) • Output: Learned cost function that results in trajectories satisfying user preferences 24 23 24 6

5/25/20 Demonstration(s) Demonstration(s) Graph 25 26 25 26 Demonstration(s) Graph Demonstration(s) Graph Projection 27 28 27 28 7

5/25/20 Demonstration(s) Graph Projection Demonstration(s) Graph Projection Learned cost 29 30 29 30 Demonstration(s) Graph Projection Demonstration(s) Graph Projection Discrete sampled Output Discrete sampled Learned cost Learned cost paths trajectories paths 31 32 31 32 8

5/25/20 Demonstration(s) Graph Projection Demonstration(s) Graph Projection Discrete Local Trajectory MaxEnt IOC Optimization Output Discrete sampled Output Discrete sampled Learned cost Learned cost trajectories paths trajectories paths 33 34 33 34 Laptop task: Demonstration Setup ( Not part of training set) • Binary state-dependent features (~95) • Histograms of distances to objects • Histograms of end-effector orientation • Object specific features (electronic vs non-electronic) • Approach direction w.r.t goal • Task • Hold cup upright while not moving above electronics 35 36 35 36 9

5/25/20 Laptop task: LTO + Discrete graph path Laptop task: LTO + Smooth random path 37 38 37 38 Readings Max-Ent IRL (Ziebart, Bagnell): • http://www.cs.cmu.edu/~bziebart/ CIOC (Levine) • http://graphics.stanford.edu/projects/cioc/cioc.pdf Manipulation (Byravan/Fox): https://rse- • lab.cs.washington.edu/papers/graph-based-IOC-ijcai-2015.pdf Imitation learning (Ermon): https://cs.stanford.edu/~ermon/ • Human/manipulation (Dragan): • https://people.eecs.berkeley.edu/~anca/research.html 39 39 10

Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse - PowerPoint PPT Presentation

5/25/20 Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse Reinforcement Learning) Many slides by Drew Bagnell Carnegie Mellon University 1 2 Optimal Control Solution X X Learning Y Y Cost Map (Sensor Data) (Input)

Navigation, Gravitation and Navigation, Gravitation and Navigation, Gravitation and Navigation,

Haptic Navigation in Mobile Contexts Agenda What is Haptic Navigation? Advantages of

React Native Navigation Screens, moving, parameters React Navigation React Navigation is not

React Native Navigation: Tabs 1 Tab Navigation the most common style of navigation in

AUTONOMOUS DRONE NAVIGATION WITH DEEP LEARNING Nikolai Smolyanskiy, Alexey Kamenev, Jeffrey Smith

OFDM Signal Navigation NAV 2008 2 OFDM Signal Navigation NAV 2008 3 OFDM Signal Navigation

HOUSING NAVIGATION CENTER https://www.hayward-ca.gov/content/hayward- housing-navigation-center

Spatial navigation in humans Recap: navigation strategies and spatial representations Spatial

MULTI-AGENT NAVIGATION MULTI-AGENT NAVIGATION Why do it? Autonomous cars Robot

LMIs and autonomous work 1 From autonomous work to discontinuous career paths Autonomous

Math 211 Math 211 Lecture #2 2 Autonomous Equations Autonomous Equations General equation:

Advanced Land Navigation LEADERS KNOW THE WAY 5/22/2017 Advanced Land Navigation Agenda

AquaWear WLN20 Wireless Navigation Data Gateway Enabling Next Generation Wearable Navigation The

Community Navigation in Brighton & Hove 16 th November 2017 Mel Pickett (Navigation

Navigation Reporting Pane Management Activity v4.1 MA Navigation Reporting Pane What is the

E-Navigation and PPUs Photo courtesy of Flinders Ports What is e-Navigation? People Systems

VAB: Visual Audit Browsing James A. Hoagland, Christopher Wee, Karl Levitt Computer Security

Slide 1 _ _

Labor Standards Enforcement Webinar Investigations June 6, 2017 Lindsay Moore, Dept. of

Law Chapter 12 Criminal Justice Process: The Investigation Typical Timeline: Crime is committed

Optimal Control Solution Mode 1: Training example Cost Map 2-D Learning Planner Y (Path to

Introduction to Symbolic Dynamics Part 3: Sofic shifts Silvio Capobianco Institute of

EPICS Base R3.14.11, Whats Next? Andrew Johnson Argonne National Laboratory Base R3.14.11

Jasper Bongertz, Airbus CyberSecurity @packetjay Scanning for network IoCs is relatively easy:

Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse - PowerPoint PPT Presentation

5/25/20 Autonomous Navigation CSE 571 Inverse Optimal Control (Inverse Reinforcement Learning) Many slides by Drew Bagnell Carnegie Mellon University 1 2 Optimal Control Solution X X Learning Y Y Cost Map (Sensor Data) (Input)

Navigation, Gravitation and Navigation, Gravitation and Navigation, Gravitation and Navigation,

Haptic Navigation in Mobile Contexts Agenda What is Haptic Navigation? Advantages of

React Native Navigation Screens, moving, parameters React Navigation React Navigation is not

React Native Navigation: Tabs 1 Tab Navigation the most common style of navigation in

AUTONOMOUS DRONE NAVIGATION WITH DEEP LEARNING Nikolai Smolyanskiy, Alexey Kamenev, Jeffrey Smith

OFDM Signal Navigation NAV 2008 2 OFDM Signal Navigation NAV 2008 3 OFDM Signal Navigation

HOUSING NAVIGATION CENTER https://www.hayward-ca.gov/content/hayward- housing-navigation-center

Spatial navigation in humans Recap: navigation strategies and spatial representations Spatial

MULTI-AGENT NAVIGATION MULTI-AGENT NAVIGATION Why do it? Autonomous cars Robot

LMIs and autonomous work 1 From autonomous work to discontinuous career paths Autonomous

Math 211 Math 211 Lecture #2 2 Autonomous Equations Autonomous Equations General equation:

Advanced Land Navigation LEADERS KNOW THE WAY 5/22/2017 Advanced Land Navigation Agenda

AquaWear WLN20 Wireless Navigation Data Gateway Enabling Next Generation Wearable Navigation The

Community Navigation in Brighton &amp; Hove 16 th November 2017 Mel Pickett (Navigation

Navigation Reporting Pane Management Activity v4.1 MA Navigation Reporting Pane What is the

E-Navigation and PPUs Photo courtesy of Flinders Ports What is e-Navigation? People Systems

VAB: Visual Audit Browsing James A. Hoagland, Christopher Wee, Karl Levitt Computer Security

Slide 1 ___________________________________ ___________________________________

Labor Standards Enforcement Webinar Investigations June 6, 2017 Lindsay Moore, Dept. of

Law Chapter 12 Criminal Justice Process: The Investigation Typical Timeline: Crime is committed

Optimal Control Solution Mode 1: Training example Cost Map 2-D Learning Planner Y (Path to

Introduction to Symbolic Dynamics Part 3: Sofic shifts Silvio Capobianco Institute of

EPICS Base R3.14.11, Whats Next? Andrew Johnson Argonne National Laboratory Base R3.14.11

Jasper Bongertz, Airbus CyberSecurity @packetjay Scanning for network IoCs is relatively easy:

Community Navigation in Brighton & Hove 16 th November 2017 Mel Pickett (Navigation

Slide 1 _ _