SLIDE 1
DeepMDP
Learning Continuous Latent Space Models for Representation Learning
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare
Simple Representations for RL
DeepMDP Latent Space Model: Neural networks MDP:
Ferns, N., Panangaden, P., and Precup, D. Metrics for Finite Markov Decision Processes. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ’04, pp. 162–169, 2004.
2D latent space + DeepMDP losses
Visualization of latent distance
Base C51 agent + DeepMDP losses