Learning Action Representations for Reinforcement Learning
Yash Chandak Georgios Theocharous James Kostas Scott Jordan Philip Thomas
Learning Action Representations for Reinforcement Learning Georgios - - PowerPoint PPT Presentation
Learning Action Representations for Reinforcement Learning Georgios Scott Yash James Philip Theocharous Jordan Chandak Kostas Thomas Reinforcement Learning Problem Statement Thousands of possible actions! Problem Statement Thousands of
Yash Chandak Georgios Theocharous James Kostas Scott Jordan Philip Thomas
Thousands of possible actions!
Thousands of possible actions!
Thousands of possible actions!
Thousands of possible actions!
prescription
Thousands of possible actions!
prescription
Thousands of possible actions!
prescription
Thousands of possible actions!
prescription
Thousands of possible actions!
prescription
quantities.
quantities.
underlying their behavior pattern.
quantities.
underlying their behavior pattern.
independent of the reward.
quantities.
underlying their behavior pattern.
independent of the reward.
this space of behavior and feedback can be generalized to similar actions.
(a) Supervised learning of action representations.
(a) Supervised learning of action representations. (b) Learning internal policy with policy gradients.
Maze domain Actual behavior of 212 actions Learned representations of 212 actions
performance, independent of the mapping function.
earlier assumption:
Toy Maze:
Adobe Datasets: