PUBLIC
Deep Reinforcement Learning for Robotics Using DIANNE
Tim Verbelen, Steven Bohez, Elias De Coninck, Sam Leroux, Pieter Van Molle Bert VanKeirsbilck, Pieter Simoens, Bart Dhoedt
sam.leroux@ugent.be
Deep Reinforcement Learning for Robotics Using DIANNE Tim Verbelen, - - PowerPoint PPT Presentation
Deep Reinforcement Learning for Robotics Using DIANNE Tim Verbelen, Steven Bohez, Elias De Coninck, Sam Leroux, Pieter Van Molle Bert VanKeirsbilck, Pieter Simoens, Bart Dhoedt sam.leroux@ugent.be PUBLIC How can we build robots that are able to
PUBLIC
sam.leroux@ugent.be
3
5 axis arm Length: 66 cm Gripper Omnidirectional wheels Max speed: 0.8 m/s Battery operated Embedded PC
5
Environment Agent
6
Environment Observation Agent
7
Environment Action Agent
8
Environment Reward Agent
9
Observation Action
11
12
13
14
15
Base Sensor Arm
17
19
20
21
Deployed agent Deployed agent
22
Experience Pool Deployed agent Deployed agent
23
Experience Pool Repository Training Deployed agent Deployed agent
24
Experience Pool Repository Training Deployed agent Deployed agent
26
“Playing Atari with Deep Reinforcement Learning” (Mnih et al, 2013) Expected future return for each possible action raw laser scanner measurements (512 values) Q Values
27
28
Continuous control with Deep Reinforcement Learning (Lillicrap, et al. 2015) Actor network Critic network raw laser scanner measurements (512 values) Continuous action Expected future return
29
30
PUBLIC