Reinforcement learning
Advanced Econometrics 2, Hilary term 2021
Maximilian Kasy
Department of Economics, Oxford University
Agenda
Markov decision problems: goal-oriented interactions
Markov decision problems
Expected updates - dynamic programming
Sample updates
Approximation
Eligibility traces
k-step updates: sums over t′ = t, …, t+k−1, with an error term ending in − Q^π(A_t, S_t; θ)
λ-weighted combination over k = 1, 2, …, with leading factor (1−λ)