DeepMDP: Learning Continuous Latent Space Models for Representation Learning



SLIDE 1

DeepMDP

Learning Continuous Latent Space Models for Representation Learning

Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare

SLIDE 2

Simple Representations for RL

SLIDE 3

DeepMDP

A latent space model of the MDP, parameterized by neural networks and trained via the following two losses:

SLIDE 4

Reward Loss
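
For reference, a sketch of the reward loss in the notation of the accompanying DeepMDP paper. The symbols are assumptions not shown in this transcript: \phi is the embedding from states to latent states, R is the environment reward, \bar{R} is the latent reward model, and \xi is a state-action distribution. The first line is the global (worst-case) form, the second the local (expected) form.

```latex
L_{\bar{R}}^{\infty} = \sup_{s \in \mathcal{S},\, a \in \mathcal{A}}
    \left| R(s, a) - \bar{R}(\phi(s), a) \right|

L_{\bar{R}}^{\xi} = \mathbb{E}_{(s, a) \sim \xi}
    \left| R(s, a) - \bar{R}(\phi(s), a) \right|
```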

SLIDE 5

Transition Loss
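
For reference, a sketch of the transition loss in the notation of the accompanying DeepMDP paper. The symbols are assumptions not shown in this transcript: \phi is the embedding, P is the environment transition distribution, \bar{P} is the latent transition model, \phi P(\cdot \mid s, a) is the distribution of \phi(s') for s' \sim P(\cdot \mid s, a), W is the Wasserstein-1 distance, and \xi is a state-action distribution.

```latex
L_{\bar{P}}^{\infty} = \sup_{s \in \mathcal{S},\, a \in \mathcal{A}}
    W\!\left( \phi P(\cdot \mid s, a),\; \bar{P}(\cdot \mid \phi(s), a) \right)

L_{\bar{P}}^{\xi} = \mathbb{E}_{(s, a) \sim \xi}
    W\!\left( \phi P(\cdot \mid s, a),\; \bar{P}(\cdot \mid \phi(s), a) \right)
```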

SLIDE 6

Tractable Losses
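
One way the two losses can become tractable is to estimate them from sampled transitions. The sketch below assumes a deterministic latent transition function, in which case the Wasserstein distance to a point mass reduces to an expected distance between the embedded next state and the predicted next latent. It is an illustration under these assumptions, not the authors' implementation: all names (`encode`, `latent_reward`, `latent_transition`, `deepmdp_losses`) are hypothetical, and random linear maps stand in for the neural networks.

```python
import numpy as np

rng = np.random.default_rng(0)

OBS_DIM, LATENT_DIM, NUM_ACTIONS = 4, 2, 3

# Hypothetical linear stand-ins for the neural networks.
W_phi = rng.normal(size=(OBS_DIM, LATENT_DIM))                     # embedding phi
W_rew = rng.normal(size=(NUM_ACTIONS, LATENT_DIM))                 # latent reward, one row per action
W_trans = rng.normal(size=(NUM_ACTIONS, LATENT_DIM, LATENT_DIM))   # latent transition, one matrix per action


def encode(states):
    """phi: map a batch of states (B, OBS_DIM) to latents (B, LATENT_DIM)."""
    return states @ W_phi


def latent_reward(latents, actions):
    """Predicted reward for each (latent, action) pair, shape (B,)."""
    return np.sum(latents * W_rew[actions], axis=1)


def latent_transition(latents, actions):
    """Deterministic predicted next latent, shape (B, LATENT_DIM)."""
    return np.einsum("bi,bij->bj", latents, W_trans[actions])


def deepmdp_losses(states, actions, rewards, next_states):
    """Sample-based estimates of the reward and transition losses."""
    z, z_next = encode(states), encode(next_states)
    # Reward loss: mean absolute error between observed and predicted reward.
    reward_loss = np.mean(np.abs(rewards - latent_reward(z, actions)))
    # Transition loss: mean Euclidean distance between phi(s') and the predicted latent.
    transition_loss = np.mean(
        np.linalg.norm(z_next - latent_transition(z, actions), axis=1)
    )
    return reward_loss, transition_loss


# Random batch of transitions (s, a, r, s'), purely for illustration.
batch = 8
states = rng.normal(size=(batch, OBS_DIM))
next_states = rng.normal(size=(batch, OBS_DIM))
actions = rng.integers(0, NUM_ACTIONS, size=batch)
rewards = rng.normal(size=batch)

reward_loss, transition_loss = deepmdp_losses(states, actions, rewards, next_states)
```

In an actual agent, both quantities would be minimized by gradient descent with respect to the embedding and the latent model parameters; the NumPy version here only evaluates them.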

SLIDE 7

Deep Policies

SLIDE 8

Representation Quality

SLIDE 9

Only Discards:

  • Ferns, N., Panangaden, P., and Precup, D. Metrics for Finite Markov Decision Processes. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ’04, pp. 162–169, 2004.

SLIDE 10

Phi as a Representation

SLIDE 11

Donut World

SLIDE 12

DeepMDP on Donut World

2D latent space + DeepMDP losses

SLIDE 13

DeepMDP on Donut World

Visualization of latent distance

SLIDE 14

DeepMDP Auxiliary Task

Base C51 agent + DeepMDP losses

SLIDE 15

DeepMDP Auxiliary Task

Base C51 agent + DeepMDP losses

SLIDE 16
  • DeepMDPs as Models of the Environment
  • Norm-MMD Metrics and their Associated Smoothness
SLIDE 17

Thanks For Listening

Poster #108