Deep Reinforcement Learning for Robotics Using DIANNE Tim Verbelen, - - PowerPoint PPT Presentation

deep reinforcement learning for robotics using dianne
SMART_READER_LITE
LIVE PREVIEW

Deep Reinforcement Learning for Robotics Using DIANNE Tim Verbelen, - - PowerPoint PPT Presentation

Deep Reinforcement Learning for Robotics Using DIANNE Tim Verbelen, Steven Bohez, Elias De Coninck, Sam Leroux, Pieter Van Molle Bert VanKeirsbilck, Pieter Simoens, Bart Dhoedt sam.leroux@ugent.be PUBLIC How can we build robots that are able to


slide-1
SLIDE 1

PUBLIC

Deep Reinforcement Learning for Robotics Using DIANNE

Tim Verbelen, Steven Bohez, Elias De Coninck, Sam Leroux, Pieter Van Molle Bert VanKeirsbilck, Pieter Simoens, Bart Dhoedt

sam.leroux@ugent.be

slide-2
SLIDE 2

How can we build robots that are able to execute complex tasks without programming them explicitly ?

slide-3
SLIDE 3

Kuka Youbot

3

5 axis arm Length: 66 cm Gripper Omnidirectional wheels Max speed: 0.8 m/s Battery operated Embedded PC

slide-4
SLIDE 4

Hokuyo Laser rangefinder Kuka soft gripper

slide-5
SLIDE 5

Reinforcement learning

5

Environment Agent

slide-6
SLIDE 6

Reinforcement learning

6

Environment Observation Agent

slide-7
SLIDE 7

Reinforcement learning

7

Environment Action Agent

slide-8
SLIDE 8

Reinforcement learning

8

Environment Reward Agent

slide-9
SLIDE 9

Deep Reinforcement learning

9

  • The actor needs to process high dimensional observations to determine the next action.
  • Our favorite processing block: deep neural networks

Observation Action

slide-10
SLIDE 10

How can we train without destroying our robot ?

slide-11
SLIDE 11

11

slide-12
SLIDE 12

V-REP simulator

12

slide-13
SLIDE 13

13

Multiple simulator instances gathering experience on CPU

slide-14
SLIDE 14

14

Multiple simulator instances gathering experience on CPU GPU system training the model

slide-15
SLIDE 15

15

Abstraction layer with ROS

Base Sensor Arm

slide-16
SLIDE 16

How can we evaluate our models on the robot ?

slide-17
SLIDE 17

17

Brain transplantation !

slide-18
SLIDE 18

How can we connect the different components ?

slide-19
SLIDE 19

19

slide-20
SLIDE 20

20

Dianne

  • Modular software framework for designing, training and evaluating neural networks.
  • Distributed training and evaluation
  • Java based
  • Easy integration (service based architecture)
  • GUI
  • Open source (AGPL 3)
slide-21
SLIDE 21

21

Deployed agent Deployed agent

slide-22
SLIDE 22

22

Experience Pool Deployed agent Deployed agent

slide-23
SLIDE 23

23

Experience Pool Repository Training Deployed agent Deployed agent

slide-24
SLIDE 24

24

Experience Pool Repository Training Deployed agent Deployed agent

slide-25
SLIDE 25

Deep Reinforcement learning algorithms

slide-26
SLIDE 26

DQN

26

“Playing Atari with Deep Reinforcement Learning” (Mnih et al, 2013) Expected future return for each possible action raw laser scanner measurements (512 values) Q Values

slide-27
SLIDE 27

27

slide-28
SLIDE 28

DDPG

28

Continuous control with Deep Reinforcement Learning (Lillicrap, et al. 2015) Actor network Critic network raw laser scanner measurements (512 values) Continuous action Expected future return

slide-29
SLIDE 29

29

slide-30
SLIDE 30

30

Visit dianne.intec.ugent.be for more information

slide-31
SLIDE 31

PUBLIC