Path following with reinforcement learning for autonomous cars - PowerPoint PPT Presentation



SLIDE 1

Path following with reinforcement learning for autonomous cars

  • Mozzam Motiwala (IAS)
SLIDE 2

Index

  • Basics of Reinforcement Learning
  • Model Based vs Model Free Reinforcement Learning
  • Autonomous Car Collision Avoidance
SLIDE 3

What is Reinforcement Learning?

  • Learning by trial and error, based only on a reward signal [1]

Exploration vs Exploitation?

https://towardsdatascience.com/solving-the-multi-armed-bandit-problem-b72de40db97c
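The exploration–exploitation trade-off from the multi-armed bandit article above can be sketched with an epsilon-greedy agent. This is a minimal illustration, not code from the talk; the arm means and parameters are made up:

```python
import random

def epsilon_greedy_bandit(true_means, steps=10000, epsilon=0.1, seed=0):
    """Balance exploration (random arm) and exploitation (best arm so far)."""
    rng = random.Random(seed)
    n = len(true_means)
    counts = [0] * n          # pulls per arm
    estimates = [0.0] * n     # running mean reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n)                            # explore
        else:
            arm = max(range(n), key=lambda a: estimates[a])   # exploit
        reward = true_means[arm] + rng.gauss(0, 1)            # noisy reward signal
        counts[arm] += 1
        # incremental mean update
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return estimates, counts

estimates, counts = epsilon_greedy_bandit([0.2, 0.5, 0.9])
```

With enough pulls, the estimate of the best arm converges to its true mean and the agent pulls it most of the time.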

SLIDE 4

Markov Decision Process

Reward Function? Transition Function? Policy? Optimal Policy?

[1]

SLIDE 5

Some terminology

  • Value Function:
  • Action Value Function:

Why Discounting Factor?
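The value function is built on the discounted return G = r₀ + γr₁ + γ²r₂ + …; the discount factor γ < 1 keeps this sum finite over long horizons and weights near-term reward more heavily. A small sketch (illustration only, not from the slides):

```python
def discounted_return(rewards, gamma=0.9):
    """G = r_0 + gamma*r_1 + gamma^2*r_2 + ..., computed backwards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g

# three rewards of 1: 1 + 0.9 + 0.81 = 2.71
discounted_return([1, 1, 1], gamma=0.9)
```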

SLIDE 6

Gridworld

[1]

SLIDE 7

Finding Optimal Policy

[1]
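Finding the optimal policy on a gridworld can be done with value iteration. The sketch below uses a 4×4 grid with one terminal corner and −1 reward per step (an assumed setup in the spirit of [1], not necessarily the exact grid on the slide); the optimal value of a cell is then minus its shortest-path distance to the terminal:

```python
# 4x4 gridworld, terminal at (0,0), reward -1 per step, no discounting
N = 4
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

def clamp(x):
    """Moves off the grid leave the agent in place."""
    return min(max(x, 0), N - 1)

def value_iteration(theta=1e-6):
    V = [[0.0] * N for _ in range(N)]
    while True:
        delta = 0.0
        for i in range(N):
            for j in range(N):
                if (i, j) == (0, 0):
                    continue  # terminal state keeps value 0
                best = max(-1 + V[clamp(i + di)][clamp(j + dj)]
                           for di, dj in ACTIONS)
                delta = max(delta, abs(best - V[i][j]))
                V[i][j] = best
        if delta < theta:
            return V

V = value_iteration()
```

The greedy policy with respect to the converged V (move to the neighbour with the highest value) is optimal: it heads straight for the terminal corner.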

SLIDE 8

Cart Pole Balancing Problem

https://www.youtube.com/watch?v=Lt-KLtkDlh8

https://towardsdatascience.com/cartpole-introduction-to-reinforcement-learning-ed0eb5b58288

SLIDE 9

Index

  • Basics of Reinforcement Learning
  • Model Based vs Model Free Reinforcement Learning
  • Autonomous Car Collision Avoidance
SLIDE 10

Model-based

By a model of the environment we mean anything that an agent can use to predict how the environment will respond to its actions [2].

https://towardsdatascience.com/model-based-reinforcement-learning-cb9e41ff1f0d

SLIDE 11

Example

What's next? Now let's sample from the model to adjust the policy.

https://towardsdatascience.com/model-based-reinforcement-learning-cb9e41ff1f0d
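One simple way to "predict how the environment will respond" is a tabular model that remembers observed transitions and samples from them. This is an illustrative sketch (class name and structure are my own, not from the slides):

```python
import random
from collections import defaultdict

class TabularModel:
    """Learned model: for each (state, action), count observed (next_state, reward)
    outcomes and sample from the empirical distribution."""
    def __init__(self):
        self.counts = defaultdict(lambda: defaultdict(int))

    def update(self, s, a, s_next, r):
        """Record one real transition."""
        self.counts[(s, a)][(s_next, r)] += 1

    def sample(self, s, a, rng=random):
        """Sample an imagined transition for planning."""
        outcomes = self.counts[(s, a)]
        pick = rng.randrange(sum(outcomes.values()))
        for (s_next, r), c in outcomes.items():
            pick -= c
            if pick < 0:
                return s_next, r

model = TabularModel()
model.update(0, "forward", 1, 1.0)   # one real transition
model.sample(0, "forward")           # imagined transition for policy updates
```

Imagined transitions drawn this way can feed the same update rule the agent would apply to real experience.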

SLIDE 12

Why model-based RL?

Reduced number of interactions with the real environment while learning.

Model types: Neural Network Model, Gaussian Process Model, etc.

Advantages?

  • Fast
  • Need less data

Problems?

  • What if the model is wrong?
SLIDE 13

Model-Based + Model-Free

[2]
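The combination of [2] is Dyna: every real step drives a model-free Q-learning update and also updates a learned model, which then generates extra planning updates. A minimal Dyna-Q sketch on a toy chain environment (the environment and hyperparameters are assumptions for illustration, not from the slides):

```python
import random

def dyna_q(step_fn, n_states, n_actions, episodes=100, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Dyna-Q [2]: Q-learning on real experience plus planning on a learned model."""
    rng = random.Random(seed)
    Q = [[1.0] * n_actions for _ in range(n_states)]  # optimistic init aids exploration
    model = {}  # deterministic model: (s, a) -> (r, s', done)
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # cap episode length
            a = rng.randrange(n_actions) if rng.random() < epsilon \
                else max(range(n_actions), key=lambda x: Q[s][x])
            r, s2, done = step_fn(s, a)
            target = r if done else r + gamma * max(Q[s2])
            Q[s][a] += alpha * (target - Q[s][a])     # model-free update (real step)
            model[(s, a)] = (r, s2, done)             # model learning
            for _ in range(planning_steps):           # planning (imagined steps)
                ps, pa = rng.choice(list(model))
                pr, ps2, pdone = model[(ps, pa)]
                ptarget = pr if pdone else pr + gamma * max(Q[ps2])
                Q[ps][pa] += alpha * (ptarget - Q[ps][pa])
            if done:
                break
            s = s2
    return Q

# Toy chain: action 1 moves right, action 0 moves left; reward on reaching state 4.
def chain_step(s, a):
    s2 = min(s + 1, 4) if a == 1 else max(s - 1, 0)
    return (1.0, s2, True) if s2 == 4 else (0.0, s2, False)

Q = dyna_q(chain_step, n_states=5, n_actions=2)
```

The planning loop is what cuts real-environment interactions: each real step is amplified into `planning_steps` additional value updates from the model.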

SLIDE 14

Results

[1]

SLIDE 15

Why better results?

[1]

SLIDE 16

Index

  • Basics of Reinforcement Learning
  • Model Based vs Model Free Reinforcement Learning
  • Autonomous Car Collision Avoidance
SLIDE 17

Application: Autonomous Car

Why Reinforcement Learning?

Problems with traditional methods

  • Slow
  • Assumptions

Learning in RL

  • Adapting to environment
  • Learning from mistakes
SLIDE 18

Generalized Computation Graph

Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs (GCG) for Robot Navigation [3]

  • H = 1: Model-Free
  • H = N (length of episode): Model-Based

[3]

SLIDE 19

Model Details

  • Deep RNN as model
  • Model output 1 = current reward ŷ: robot's speed
  • Model output 2 = future value-to-go (value of the state) b̂: distance travelled before collision
  • Policy Evaluation Function:
  • Policy evaluation by sampling K random action sequences and selecting the one with maximum predicted reward
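The "sample K random action sequences, keep the best" step is random-shooting planning. A minimal sketch (function name and the toy scoring function are illustrative, not from [3]; in GCG the score would come from the learned RNN's predicted reward/value):

```python
import random

def plan_random_shooting(predict_fn, state, action_space, horizon=5, k=100, seed=0):
    """Sample k random action sequences, score each with the model's predicted
    cumulative reward, and return the first action of the best sequence."""
    rng = random.Random(seed)
    best_seq, best_score = None, float("-inf")
    for _ in range(k):
        seq = [rng.choice(action_space) for _ in range(horizon)]
        score = predict_fn(state, seq)   # model's predicted total reward
        if score > best_score:
            best_seq, best_score = seq, score
    return best_seq[0], best_score

# Toy model that prefers sequences starting with action 1:
action, score = plan_random_shooting(lambda s, seq: float(seq[0] == 1),
                                     state=None, action_space=[0, 1])
```

Only the first action of the winning sequence is executed; replanning at every step makes this a simple model-predictive controller.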

SLIDE 20

GCG : Algorithm

[3]

SLIDE 21

Evaluation and Results

https://www.youtube.com/watch?v=NlFbLVG6LpA

[3]

SLIDE 22

Summary

  • Benefits of Reinforcement Learning
  • Model-Free vs Model-Based
  • Combined approach that subsumes Model-Free and Model-Based

SLIDE 23

References

  • 1. R. Sutton and A. Barto, Reinforcement Learning: An Introduction.
  • 2. R. Sutton, “Dyna, an Integrated Architecture for Learning, Planning, and Reacting,” in AAAI, 1991.
  • 3. G. Kahn, A. Villaflor, B. Ding, P. Abbeel, and S. Levine, “Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation,” in IEEE International Conference on Robotics and Automation, 2018.

SLIDE 24

Questions?