SLIDE 1
ICML19
Projections for Approximate Policy Iteration Algorithms
Riad Akrour, Joni Pajarinen, Gerhard Neumann, Jan Peters
IAS, TU Darmstadt, Germany
Entropy Regularization in RL
– Widespread with actor-critic methods
Hard vs Soft
– Harder to optimize, easier to interpret and tune
[Equation labels: policy return, entropy regularization]
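To make the hard-constraint side of the hard vs soft comparison concrete, here is a minimal sketch of enforcing an entropy lower bound by projection. It assumes a diagonal Gaussian policy parameterised by per-dimension log standard deviations and raises them uniformly when the entropy falls below a bound `beta`. The function names, the uniform shift, and the numbers are illustrative assumptions, not necessarily the projection used in the talk.

```python
import math

def gaussian_entropy(log_std):
    """Entropy of a diagonal Gaussian given per-dimension log standard deviations."""
    d = len(log_std)
    return 0.5 * d * math.log(2 * math.pi * math.e) + sum(log_std)

def project_entropy(log_std, beta):
    """Return log-stds whose entropy is at least beta.

    Hypothetical helper: shifts every log-std by the same amount,
    which is one simple way to meet the bound exactly.
    """
    gap = beta - gaussian_entropy(log_std)
    if gap <= 0:
        return list(log_std)  # constraint already satisfied, leave unchanged
    shift = gap / len(log_std)
    return [value + shift for value in log_std]

# Illustrative values only: a narrow 2-D policy and an entropy bound of 0.
log_std = [-2.0, -1.5]
beta = 0.0
projected = project_entropy(log_std, beta)
```

Because the projected parameters always satisfy the bound, `beta` can be read directly as the minimum allowed entropy, whereas a soft regularisation weight only trades entropy off against return.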
– Deep RL
– Projected gradient
– Direct policy search