
DeepMDP: Learning Continuous Latent Space Models for Representation Learning

Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare


  1. DeepMDP: Learning Continuous Latent Space Models for Representation Learning. Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare

  2. Simple Representations for RL

  3. DeepMDP: a latent space model of the MDP, parameterized by neural networks and trained via the following two losses:

  4. Reward Loss

  5. Transition Loss

  6. Tractable Losses

  7. Deep Policies

  8. Representation Quality

  9. Only Discards: Ferns, N., Panangaden, P., and Precup, D. Metrics for Finite Markov Decision Processes. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ’04, pp. 162–169, 2004.

  10. Phi as a Representation

  11. Donut World

  12. DeepMDP on Donut World: 2D latent space + DeepMDP losses

  13. DeepMDP on Donut World: visualization of latent distance

  14. DeepMDP Auxiliary Task: base C51 agent + DeepMDP losses

  15. DeepMDP Auxiliary Task: base C51 agent + DeepMDP losses

  16. DeepMDPs as Models of the Environment; Norm-MMD Metrics and their Associated Smoothness

  17. Thanks for listening. Poster #108
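
The formulas for the reward loss and transition loss (slides 4 and 5) did not survive in this outline. As a rough illustration only, the two losses could be sketched as below. The sketch assumes a deterministic latent transition model, so that the distance between the embedded next state and the predicted next latent state is a plain Euclidean norm. The function names `reward_loss` and `transition_loss` and the toy arrays are hypothetical and are not taken from the slides.

```python
import numpy as np


def reward_loss(r_true, r_pred):
    """Mean absolute error between observed rewards and latent-model rewards.

    r_true: rewards observed in the environment, shape (batch,)
    r_pred: rewards predicted by the latent reward model, shape (batch,)
    """
    r_true = np.asarray(r_true, dtype=float)
    r_pred = np.asarray(r_pred, dtype=float)
    return float(np.mean(np.abs(r_true - r_pred)))


def transition_loss(z_next_true, z_next_pred):
    """Mean Euclidean distance between embedded and predicted next latents.

    z_next_true: embedding of the observed next state, shape (batch, dim)
    z_next_pred: next latent predicted by the latent transition model,
                 shape (batch, dim)
    Assumption: the latent transition model is deterministic.
    """
    diff = np.asarray(z_next_true, dtype=float) - np.asarray(z_next_pred, dtype=float)
    return float(np.mean(np.linalg.norm(diff, axis=-1)))


# Toy batch (made-up numbers, for illustration only).
rewards_observed = [1.0, 0.0]
rewards_predicted = [0.0, 0.0]
latents_observed = [[3.0, 4.0]]
latents_predicted = [[0.0, 0.0]]

total = reward_loss(rewards_observed, rewards_predicted) + transition_loss(
    latents_observed, latents_predicted
)
```

In an auxiliary-task setup such as the C51 experiments on slides 14 and 15, a sum like `total` would be added to the agent's own loss, typically with a weighting coefficient that is not specified here.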
