Data
xt-1 State st State st-1 Action at-1 mt
Data
xt State st+1 mt+1
Data
xt+1 Action at Action at+!
…
mt-1
E x t e r n a l E n v i r
- n
m e n t I n t e r n a l E n v i r
- n
m e n t P l a n n e r
Option KB Critic State Repr.
O p t i
- n
O b s e r v a t i
- n
/ A c t i
- n
A Case Against Generative Models for Reinforcement Learning?
Generative models for RL workshop DALI 2018 @shakir_za shakir@deepmind.com