Action Robust Reinforcement Learning and Applications in Continuous Control
Chen Tessler*, Yonathan Efroni* and Shie Mannor
*equal contribution
Poster #272
Robust MDPs
Robust MDPs (RMDPs) are an important model, yet they are not feasible in practical applications.
Action Robust MDPs (AR-MDPs) are a special case of RMDPs that consider uncertainty in the performed action.
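One way to picture uncertainty in the performed action is the minimal sketch below. It assumes a probabilistic form in which, with some probability, an adversary's action is executed instead of the agent's. The function name `executed_action` and the parameter `alpha` are hypothetical and do not appear in the slides.

```python
import random


def executed_action(agent_action, adversary_action, alpha, rng=random):
    """Return the action that actually reaches the environment.

    Assumption (not from the slides): with probability `alpha` the
    adversary's action replaces the agent's chosen action.
    """
    if rng.random() < alpha:
        return adversary_action
    return agent_action
```

With `alpha = 0` the agent's action is always executed; with `alpha = 1` the adversary is always in control.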
Theorem 1. This procedure converges to the Nash equilibrium.
Update adversary towards the 1-step greedy policy
Find optimal actor policy
Evaluate joint policy
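The three steps named on this slide can be sketched for a small tabular MDP as follows. This is an illustration under stated assumptions, not the authors' implementation. It assumes the joint policy is the mixture `(1 - alpha) * pi + alpha * pi_bar`, that the adversary minimizes the return, and that the adversary moves a step of size `eta` toward its greedy policy. The names `solve_actor`, `evaluate`, `action_robust_pi`, `alpha`, and `eta` are hypothetical, and the ordering of the steps inside the loop is one plausible reading of the slide.

```python
import numpy as np


def solve_actor(P, R, pi_bar, alpha, gamma, sweeps=500):
    """Find the actor's optimal policy against a fixed adversary."""
    # Reward and transitions when the adversary's policy is in control.
    R_adv = (pi_bar * R).sum(axis=1)                      # shape (S,)
    P_adv = np.einsum("sa,sat->st", pi_bar, P)            # shape (S, S)
    # Effective model seen by the actor under the assumed mixture.
    R_eff = (1 - alpha) * R + alpha * R_adv[:, None]      # shape (S, A)
    P_eff = (1 - alpha) * P + alpha * P_adv[:, None, :]   # shape (S, A, S)
    v = np.zeros(R.shape[0])
    for _ in range(sweeps):                               # value iteration
        v = (R_eff + gamma * P_eff @ v).max(axis=1)
    q = R_eff + gamma * P_eff @ v
    return np.eye(R.shape[1])[q.argmax(axis=1)]           # one-hot policy


def evaluate(P, R, pi_mix, gamma):
    """Evaluate the joint policy by solving the linear Bellman equation."""
    r = (pi_mix * R).sum(axis=1)
    P_pi = np.einsum("sa,sat->st", pi_mix, P)
    return np.linalg.solve(np.eye(len(r)) - gamma * P_pi, r)


def action_robust_pi(P, R, alpha, gamma=0.9, eta=0.5, iters=50):
    """P has shape (S, A, S); R has shape (S, A)."""
    S, A = R.shape
    pi_bar = np.full((S, A), 1.0 / A)                     # uniform adversary
    for _ in range(iters):
        pi = solve_actor(P, R, pi_bar, alpha, gamma)
        v = evaluate(P, R, (1 - alpha) * pi + alpha * pi_bar, gamma)
        # 1-step greedy adversary: pick the action with the lowest value.
        q = R + gamma * P @ v
        greedy = np.eye(A)[q.argmin(axis=1)]
        pi_bar = (1 - eta) * pi_bar + eta * greedy        # soft update
    return pi, pi_bar, v
```

On a one-state problem with rewards 1 and 0 for the two actions, the sketch lets the actor settle on the rewarding action while the adversary drifts toward the unrewarding one, so the returned value shrinks as `alpha` grows.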
[Figure legend: Ours (𝛽=1), Baseline]