Kei Ota1, Tomoaki Oiki1, Devesh K. Jha2, Toshisada Mariyama1, and Daniel Nikovski2
- 1. Mitsubishi Electrics, Kanagawa, JP
- 2. Mitsubishi Electric Research Labs, MA, US.
Can Increasing Input Dimensionality Improve Deep Reinforcement - - PowerPoint PPT Presentation
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? Kei Ota 1 , Tomoaki Oiki 1 , Devesh K. Jha 2 , Toshisada Mariyama 1 , and Daniel Nikovski 2 1. Mitsubishi Electrics, Kanagawa, JP 2. Mitsubishi Electric Research Labs, MA,
https://www.youtube.com/watch?v=rQIShnTz1kU [Akkaya, 2019]
Policy !
Feature Extractor
Policy !
Standard RL SRL + RL
Policy !
Feature Extractor
Policy !
Standard RL SRL + RL
State-Action Feature Extractor
Value Function Networks
+
Policy Network
State Feature Extractor
State-Action Feature Extractor
State Feature Extractor
/012
Linear Network
/012 !"#,%# − 34=> ?
Policy '
FC concat concat
Value Func +
FC
7
FC
FC FC
FC
concat FC concat FC MLP Net MLP ResNet MLP DenseNet
better better
Policy !
OFENet
Policy !
Raw observation OFENet representation Feature Extractor
Feature Extractor
OFENet ML-DDPG
OFE (OURS) Original ML-SAC (1/3) ML-SAC (OFE like) OFE (OURS) Original OFE (OURS) Original SAC TD3 PPO