Deep Reinforcement Learning Prof. Kuan-Ting Lai 2020/3/5 Course - PowerPoint PPT Presentation

Jul 09, 2023 •105 likes •210 views

Course Requirements of Deep Reinforcement Learning Prof. Kuan-Ting Lai 2020/3/5 Course Requirements Kaggle-style homework (60%) TBD VizDoom Microsoft AirSim Final Project (40%) Team members (1 ~ 4) Final report + Demo

Course Requirements of Deep Reinforcement Learning Prof. Kuan-Ting Lai 2020/3/5
Course Requirements • Kaggle-style homework (60%) − TBD − VizDoom − Microsoft AirSim • Final Project (40%) − Team members (1 ~ 4) − Final report + Demo + Source code • Attendance (5%) − Roll call − Answering questions 2
Textbooks & References • Maxim Lapan , “Deep Reinforcement Learning Hands - on,” Packt, 2018 • Richard S. Sutton and Andrew G. Barto, “Reinforcement Learning, An Introduction, 2 nd Edition” The MIT Press, 2018 • Latest publications on Nature, CVPR, NIPS, ICML, AAAI, ICLR https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On 3
Schedule Date Syllabus 3/6 Introduction to Deep Reinforcement Learning (Sutton (2018), Chapter 1, 2) 3/13 Finite Markov Decision Processes and Dynamic Programming (Sutton (2018), Chapter 3, 4) HW1 TBD 3/20 PyTorch & OpenAI Gym (Lapan (2018), Chapter 2, 3) 3/27 Dynamic Programming & Monte Carlo Methods (Sutton (2018), Chapter 4, 5) 4/3 Temporal-Difference Learning (SARSA, Q-learning) (Sutton (2018), Chapter 6) HW2 TBD 4/10 Deep Q-Networks (Lapan (2018), Chapter 6, 7) 4/17 Policy Gradients (Lapan (2018), Chapter 9) 4/24 Actor-Critic Method (Lapan (2018), Chapter 10) HW3 Stocks Trading using RL 4
Schedule (cont.) Date 4/28 No Midterm, No Class 5/1 Final Project Proposal Due 5/8 A3C and A2C (Lapan (2018), Chapter 11 and OpenAI paper) 5/15 Continuous Action Space (Lapan (2018), Chapter 14) 5/22 Trust Regions – TRPO, PPO, and ACKTR (Lapan (2018), Chapter 15) HW4 Playing a Shooting Game (VizDoom) (Due 12/15) 5/29 Black-Box Optimization in RL ((Lapan (2018), Chapter 16) 6/5 Beyond Model-free (Lapan (2018), Chapter 17) 6/12 AlphaGo Zero (Lapan (2018), Chapter 18) 6/19 Final Project Demo 1 (20 mins, talk + demo, in English) 6/26 Final Project Demo 2 (20 mins, talk + demo, in English) 5
Grading Policy of Homework Kaggle Ranking Grade Description Grade Top 5% Excellent A+ 5% ~ 20% A 20 ~ 50% A- Others Very Good B+ < Random Guess C No submission F Top 3 students get one free cup of Bubble Tea! 6
7
Facebook Group (NTUT Deep RL Learning) 8
Teaching Assistants • 蔡榮成 : John Tsai (john0952270878@gmail.com) 9

Recommend

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Deep Neural Networks and Deep Reinforcement Learning Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and Courville [chapt. 6,7,8]; AIMA [sect. 21.1-21.3]; Sutton and Barto, Reinforcement Learning: an

528 views • 35 slides

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Reinforcement Learning Q-Learning Deep Q-Learning on Atari Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement Learning Q-Learning Deep Q-Learning on Atari Table of Contents Reinforcement Learning

939 views • 63 slides

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Reinforcement Learning Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning: an Introduction, 2nd Edition: Chapters 6 (6.1 6.5) Outline Reinforcement Learning Reinforcement Learning: the

589 views • 27 slides

1 Deep Reinforcement Learning Qianqian Li, Nayeon Koong, Langtian He What is deep reinforcement

1 Deep Reinforcement Learning Qianqian Li, Nayeon Koong, Langtian He What is deep reinforcement learning? Agent/Actor + Action + Environment + State + Reward How does reinforcement learning work?

793 views • 31 slides

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Introduction to Reinforcement Learning RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem Inside an RL agent Temporal difference learning Many faces of Reinforcement Learning What is

552 views • 35 slides

Deep Reinforcement Learning [Mastering the Game of Go with Deep Reinforcement Learning and Tree

Deep Reinforcement Learning [Mastering the Game of Go with Deep Reinforcement Learning and Tree Search, Nature 2016] CS 486/686 University of Waterloo Lecture 21: July 12, 2017 Outline AlphaGo Supervised Learning of Policy Networks

541 views • 15 slides

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning? Spring 2019 Created:

371 views • 15 slides

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning and Simulation-Based Search Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and Simulation-Based Search Outline 1 Reinforcement Learning 2 Simulation-Based Search 3 Planning Under

425 views • 20 slides

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine playing a new game whose rules you dont know; after a hundred or so moves your don t know; after a hundred or so moves, your opponent announces, You

512 views • 30 slides

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest Lecture May 24, 2017 Lecture overview What makes a reinforcement learning algorithm safe ? Notation Creating a safe reinforcement learning

1.42k views • 88 slides

Deep Reinforcement Learning [Human-Level Control through deep reinforcement learning, Nature

Deep Reinforcement Learning [Human-Level Control through deep reinforcement learning, Nature 2015] CS 486/686 University of Waterloo Lecture 20: July 10, 2017 Outline Value Function Approximation Linear approximation Neural

706 views • 19 slides

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Deep learning Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December 25, 2018 Hamid Beigy | Sharif university of technology | December 25, 2018 1 / 65 Deep learning Table of contents 1 Introduction 2

836 views • 65 slides

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning Haarnoja, Tang et al. (2017) Reinforcement Learning with Deep Energy Based Policies, ICML . Haarnoja, Zhou et al. (2018) Soft Actor-Critic: Off-Policy

684 views • 24 slides

Deep Reinforcement Learning Philipp Koehn 21 April 2020 Philipp Koehn Artificial Intelligence:

Deep Reinforcement Learning Philipp Koehn 21 April 2020 Philipp Koehn Artificial Intelligence: Deep Reinforcement Learning 21 April 2020 Reinforcement Learning 1 Sequence of actions moves in chess driving controls in car

815 views • 63 slides

Deep Reinforcement Learning Philipp Koehn 18 April 2019 Philipp Koehn Artificial Intelligence:

Deep Reinforcement Learning Philipp Koehn 18 April 2019 Philipp Koehn Artificial Intelligence: Deep Reinforcement Learning 18 April 2019 Reinforcement Learning 1 Sequence of actions moves in chess driving controls in car

861 views • 63 slides

Deep he(a)p, big feat arXiv:1707.06887 A Distributional Perspective on Reinforcement Learning

Deep he(a)p, big feat arXiv:1707.06887 A Distributional Perspective on Reinforcement Learning arXiv:1702.08165 Reinforcement Learning with Deep Energy-Based Policies 1 / 25 Reinforcement Learning Environment Action Reward Interpreter State

531 views • 25 slides

CS885 Reinforcement Learning Lecture 1a: May 2, 2018 Course Introduction [SutBar] Chapter 1,

CS885 Reinforcement Learning Lecture 1a: May 2, 2018 Course Introduction [SutBar] Chapter 1, [Sze] Chapter 1 University of Waterloo CS885 Spring 2018 Pascal Poupart 1 Outline Introduction to Reinforcement Learning Course website and

424 views • 14 slides

$\ Task Scheduling in High-Performance Computing Thomas McSweeney School of Mathematics The$

\ Task Scheduling in High-Performance Computing Thomas McSweeney School of Mathematics The

\ Task Scheduling in High-Performance Computing Thomas McSweeney School of Mathematics The University of Manchester thomas.mcsweeney@postgrad.manchester.ac.uk Numerical Linear Algebra Group Meeting October 16, 2018 Outline 1 The task

315 views • 28 slides

SDRL: Interpretable and Data-efficient Deep Liu Reinforcement Learning Introduction Background

SDRL: Symbolic Deep Reinforcement Learning SDRL: Interpretable and Data-efficient Deep Liu Reinforcement Learning Introduction Background Leveraging Symbolic Planning Method Experiment Conclusion Bo Liu and Future Work Auburn

1.01k views • 20 slides

Reinforcement Learning: Basic models and algorithms Optimal decisions, Part VII Christos

Reinforcement Learning: Basic models and algorithms Optimal decisions, Part VII Christos Dimitrakakis Chalmers November 20, 2013 Christos Dimitrakakis (Chalmers) Reinforcement Learning: Basic models and algorithms November 20, 2013 1 / 28

405 views • 29 slides

55% Didactic Instruction Ongoing Training/PM 71% Lectures Monthly Feedback 1 10/6/2017

10/6/2017 Evidence Based Performance Management: Applying Behavioral Science to Support Practitioners Florence D. DiGennaro Reed, PhD, BCBA D Preservice Training 55% Didactic Instruction Ongoing Training/PM 71% Lectures Monthly Feedback

549 views • 29 slides

Presentation of WebCT usage in deploying quiz assignments Ivica.Matotek@CARNet.hr

Presentation of WebCT usage in deploying quiz assignments Ivica.Matotek@CARNet.hr Ivica.Matotek@CARNet.hr Edupoint Introduction WebCT (Web Courseware Tool) Developed at University of British Columbia, Canada, in 1995 Used in over 80

351 views • 20 slides

Mat MattNet tNet: : Modu Modular Atten lar Attention tion Network for Referring Network for

March 2020 Mat MattNet tNet: : Modu Modular Atten lar Attention tion Network for Referring Network for Referring Expres Expression Comp sion Comprehe rehension nsion Tong Gao Background Referring expressions are natural language

433 views • 26 slides

Synchronization CS 416: Operating Systems Design Department of Computer Science Rutgers

Synchronization CS 416: Operating Systems Design Department of Computer Science Rutgers University http://www.cs.rutgers.edu/~vinodg/teaching/416 Synchronization Problem Threads may share data Data consistency must be maintained Example

1.81k views • 99 slides