Towards TempoRL: Learning When to Act
André Biedenkapp, Raghu Rajan, Frank Hutter & Marius Lindauer
BIG@ICML 2020
In a Nutshell
1. We propose a proactive way of doing RL
2. We introduce skip-connections into MDPs
   ○ through action repetition
   ○ allows for faster propagation of rewards
3. We propose a novel algorithm using skip-connections
   ○ learn what action to take & when to make new decisions
   ○ condition when on what
4. We evaluate our approach with tabular Q-learning on small grid worlds
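The skip-connection idea can be illustrated with a small sketch: repeating a chosen action for j steps collapses j one-step transitions into a single temporally extended transition, so the accumulated discounted reward backs up over the whole skip at once. The `env.step` interface and the discount value below are assumptions for illustration, not from the slides:

```python
def skip_transition(env, state, action, skip_length, gamma=0.99):
    """Execute `action` for up to `skip_length` steps, accumulating
    discounted reward.

    Returns the accumulated reward, the landing state and a done flag,
    i.e. one 'skip' transition connecting `state` directly to the state
    reached after repeating `action` `skip_length` times.
    """
    total_reward, discount = 0.0, 1.0
    done = False
    for _ in range(skip_length):
        state, reward, done = env.step(action)  # assumed gym-like interface
        total_reward += discount * reward
        discount *= gamma
        if done:
            break
    return total_reward, state, done
```

Propagating `total_reward` in a single update is what allows rewards to travel faster through the state space than with one-step backups.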
Motivation
[Grid-world figure with rewards r = 0 and r = 1]
Motivation
[Grid-world figure with rewards r = 0 and r = -1]
Optimal Policies
Optimal Policies: When do we need to act?
The same 16-step optimal policy can be executed with far fewer decision points:
- # Steps: 16, # Decisions: 16
- # Steps: 16, # Decisions: 5
- # Steps: 16, # Decisions: 4
- # Steps: 16, # Decisions: 3
Proactive Decision Making
- Reactive: # Steps: 16, # Decisions: 16
- Proactive: # Steps: 16, # Decisions: 3
→ ~80% fewer decision points (1 − 3/16 ≈ 81%)
Skip MDPs
Flat Hierarchy
1. Use standard Q-learning to determine the behaviour action a.
2. Condition skips on the chosen action: select a skip length j given (s, a).
3. Play action a for the next j steps.

The action Q-function Q(s, a) is learned with standard Q-learning; the skip Q-function Q(s, a, j) can be learned using n-step updates.
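A tabular sketch of the three steps above, under assumed environment and hyperparameter choices (the slides only specify the two Q-functions): the action Q-function receives a standard one-step update at every intermediate step of a skip, while the skip Q-function receives one n-step update over the whole skip.

```python
import random
from collections import defaultdict

def temporl_q_learning(env, n_actions, max_skip, episodes=500,
                       alpha=0.1, gamma=0.99, eps=0.1):
    """Tabular TempoRL-style Q-learning sketch (gym-like interface assumed).

    q_action[s][a]    : action Q-function Q(s, a)
    q_skip[(s, a)][j] : skip Q-function Q(s, a, j) for skip lengths 1..max_skip
    """
    q_action = defaultdict(lambda: [0.0] * n_actions)
    q_skip = defaultdict(lambda: [0.0] * max_skip)

    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            # 1. choose the behaviour action with epsilon-greedy Q-learning
            if random.random() < eps:
                action = random.randrange(n_actions)
            else:
                action = max(range(n_actions), key=lambda a: q_action[state][a])
            # 2. condition the skip length on the chosen action
            if random.random() < eps:
                skip = random.randrange(max_skip) + 1
            else:
                skip = max(range(max_skip),
                           key=lambda j: q_skip[(state, action)][j]) + 1

            # 3. play the action for the next `skip` steps
            start, ret, discount = state, 0.0, 1.0
            for _ in range(skip):
                next_state, reward, done = env.step(action)
                # one-step update of the action Q-function
                target = reward if done else reward + gamma * max(q_action[next_state])
                q_action[state][action] += alpha * (target - q_action[state][action])
                ret += discount * reward
                discount *= gamma
                state = next_state
                if done:
                    break
            # n-step update of the skip Q-function over the whole skip
            target = ret if done else ret + discount * max(q_action[state])
            q_skip[(start, action)][skip - 1] += (
                alpha * (target - q_skip[(start, action)][skip - 1]))
    return q_action, q_skip
```

Bootstrapping the skip update from max Q(s', a') is one plausible target choice; it lets a single skip back up the reward over several steps at once.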
Experimental Evaluation
Wrap-Up
Code & Data available:
https://github.com/automl/TabularTempoRL
Future work:
- Use deep function approximation
- Different exploration mechanisms