Per-Decision Option Discounting Anna Harutyunyan, Peter Vrancx, - - PowerPoint PPT Presentation

▶

May 12, 2023 34 likes •226 views

Per-Decision Option Discounting Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe, Doina Precup Motivation: Agents that reason over long temporal horizons Motivation: Agents that reason over long temporal horizons Horizon depends on

SLIDE 1

Per-Decision Option Discounting

Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe, Doina Precup

SLIDE 2

Motivation: Agents that reason over long temporal horizons

SLIDE 3

Horizon depends on discount γ Motivation: Agents that reason over long temporal horizons

SLIDE 4

Motivation: Agents that reason over long temporal horizons Horizon depends on discount γ

SLIDE 5

Horizon depends on discount γ Larger grid requires a larger γ Motivation: Agents that reason over long temporal horizons

SLIDE 6

Horizon depends on discount γ Larger grid requires a larger γ Large γ-s are inefficient in practice :( Motivation: Agents that reason over long temporal horizons

SLIDE 7

Horizon depends on discount γ Larger grid requires a larger γ Temporal abstraction? Motivation: Agents that reason over long temporal horizons

SLIDE 8

Motivation: Agents that reason over long temporal horizons Horizon depends on discount γ Larger grid requires a larger γ Temporal abstraction? Options still tied to γ!

SLIDE 9

Motivation: Agents that reason over long temporal horizons Horizon depends on discount γ Larger grid requires a larger γ Temporal abstraction? Options still tied to γ! Contribution: Generalize the options framework to let it extend the agent’s horizon.

SLIDE 10