Safe Reinforcement Learning in Robotics with Bayesian Models

Dec 19, 2023

SLIDE 1

Safe Reinforcement Learning in Robotics with Bayesian Models

Felix Berkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause

@Workshop on Reliable AI, October 2017

SLIDE 2

A new era of autonomy

Images: Rethink Robotics, Waymo, iRobot

SLIDE 3

Policy

Reinforcement learning

Image: Plainicon, https://flaticon.com

Exploration
Policy update

SLIDE 4

Dangers of autonomous learning

Image: Freepik, https://flaticon.com

Safety despite uncertainty
Safe exploration

SLIDE 5

Policy

Safe reinforcement learning

Image: Plainicon, https://flaticon.com

Exploration
Policy update
Bayesian models for safety
Model-free
Model-based

SLIDE 6

Model-free reinforcement learning

Tracking performance
Safety constraint
Few experiments
Safety for all experiments

SLIDE 7

Gaussian process
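
The slide carries only its title. As a rough illustration of the kind of Bayesian model it names, the sketch below computes a Gaussian process posterior mean and variance for a one-dimensional input. The squared-exponential kernel, its hyperparameters, the noise variance, and every function name are assumptions made for illustration; none of them are taken from the talk.

```python
import math


def rbf(a, b, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel (an assumed choice, not from the slides)."""
    return variance * math.exp(-0.5 * ((a - b) / lengthscale) ** 2)


def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L


def solve_lower(L, b):
    """Forward substitution: solve L y = b."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y


def solve_upper_t(L, y):
    """Back substitution: solve L^T x = y."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x


def gp_posterior(x_train, y_train, x_test, noise=1e-2):
    """Posterior mean and variance of the latent function at one test input.

    `noise` is the assumed observation-noise variance.
    """
    n = len(x_train)
    K = [[rbf(x_train[i], x_train[j]) + (noise if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    L = cholesky(K)
    alpha = solve_upper_t(L, solve_lower(L, y_train))
    k_star = [rbf(x_test, xi) for xi in x_train]
    mean = sum(k * a for k, a in zip(k_star, alpha))
    v = solve_lower(L, k_star)
    var = rbf(x_test, x_test) - sum(vi * vi for vi in v)
    return mean, max(var, 0.0)


# Toy data: three noisy-free samples of sin(x), purely illustrative.
x_train = [-1.0, 0.0, 1.0]
y_train = [math.sin(x) for x in x_train]
for x in (0.0, 0.5, 5.0):
    mean, var = gp_posterior(x_train, y_train, x)
    std = math.sqrt(var)
    print(f"x={x}: mean={mean:.3f}, interval=[{mean - 2 * std:.3f}, {mean + 2 * std:.3f}]")
```

The interval widens away from the data, which is the property a safety argument can exploit: where the model is uncertain, the bounds are correspondingly cautious.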

SLIDE 8

Constrained Bayesian optimization
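
The slide gives only the title, so the following is a simplified sketch of one way a safety constraint can restrict Bayesian optimization; it is not necessarily the acquisition rule used in the talk. It assumes that posterior means and standard deviations for a performance measure and a safety measure are already available at a finite set of candidate parameters. A candidate counts as safe when the lower confidence bound of its safety measure stays above a threshold, and the next experiment is the safe candidate with the highest upper confidence bound on performance. The function names, the scaling factor `beta`, and the numbers are hypothetical.

```python
def safe_set(safety_mean, safety_std, threshold, beta=2.0):
    """Indices whose pessimistic safety estimate still satisfies the constraint."""
    return [i for i, (m, s) in enumerate(zip(safety_mean, safety_std))
            if m - beta * s >= threshold]


def next_experiment(perf_mean, perf_std, safety_mean, safety_std,
                    threshold, beta=2.0):
    """Pick the safe candidate with the most optimistic performance estimate.

    Returns None when no candidate can be certified safe.
    """
    safe = safe_set(safety_mean, safety_std, threshold, beta)
    if not safe:
        return None
    return max(safe, key=lambda i: perf_mean[i] + beta * perf_std[i])


# Hypothetical posterior statistics for three candidate controller settings.
perf_mean = [0.2, 0.5, 0.9]
perf_std = [0.1, 0.3, 0.4]
safety_mean = [1.0, 0.8, 0.3]
safety_std = [0.1, 0.2, 0.4]

choice = next_experiment(perf_mean, perf_std, safety_mean, safety_std, threshold=0.0)
print("safe candidates:", safe_set(safety_mean, safety_std, threshold=0.0))
print("next experiment:", choice)
```

The third candidate looks best on performance but is excluded because its safety lower bound falls below the threshold; only candidates that are safe under the pessimistic estimate are ever evaluated.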

SLIDE 9

Video available at http://tiny.cc/icra16_video

SLIDE 10

SLIDE 11

Policy

Safe reinforcement learning

Image: Plainicon, https://flaticon.com

Exploration
Policy update
Bayesian models for safety
Model-free
Model-based

SLIDE 12

Model-based reinforcement learning

Model Modelling Implement Control Theory

SLIDE 13

Policy update

Approximate dynamic programming

Dynamics Expected cost

SLIDE 14

Uncertain dynamics

Dynamics model

Safety-critical

SLIDE 15

Approximate dynamic programming

Dynamics

SLIDE 16

Policy

Reinforcement learning

Image: Plainicon, https://flaticon.com

Exploration
Policy update
Safe exploration
Safe policy update

SLIDE 17

Region of attraction

SLIDE 18

Lyapunov functions

[A.M. Lyapunov 1892]
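
The slides on the region of attraction and Lyapunov functions carry only titles. As a hedged illustration of how the two ideas connect, the sketch below estimates a region of attraction for a known one-dimensional discrete-time system: it finds the largest level set of a candidate Lyapunov function V on which V strictly decreases along the dynamics, checked on a grid. The dynamics, the choice V(x) = x², the grid, and all names are assumptions for illustration. The dynamics here are known exactly, whereas a learned model would presumably require checking the decrease condition against a confidence bound instead.

```python
def dynamics(x, dt=0.1):
    """Assumed toy system: Euler step of dx/dt = -x + x^3 (stable near 0)."""
    return x + dt * (-x + x ** 3)


def lyapunov(x):
    """Candidate Lyapunov function V(x) = x^2 (an assumed choice)."""
    return x ** 2


def region_of_attraction_level(states):
    """Largest c such that V decreases at every sampled state with V(x) <= c.

    States are visited in order of increasing V; the search stops at the
    first state where the decrease condition V(f(x)) < V(x) fails.
    """
    level = 0.0
    for x in sorted(states, key=lyapunov):
        if x == 0.0:
            continue  # the equilibrium itself is exempt from the condition
        if lyapunov(dynamics(x)) >= lyapunov(x):
            break
        level = lyapunov(x)
    return level


# Grid over [-2, 2] with spacing 0.01 (resolution is an arbitrary choice).
grid = [i / 100 for i in range(-200, 201)]
c = region_of_attraction_level(grid)
print(f"estimated level: V(x) <= {c:.4f}")
```

A grid check of this kind only certifies the sampled points; a rigorous guarantee additionally needs a continuity argument between grid points, which is outside the scope of this sketch.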

SLIDE 19

Safe policy optimization (NIPS 2017)

Optimize policy for performance
Determine safe region
Policy update

SLIDE 20

Policy optimization

Policy

SLIDE 21

Policy optimization

Need to explore!

SLIDE 22

Obtaining data

SLIDE 23

Experimental results

SLIDE 24

Policy performance

SLIDE 25

Conclusion

Safe reinforcement learning!
Can use statistical models to give high-probability safety guarantees
Theoretical guarantees in the paper
Code at github.com/befelix
More safe learning at http://berkenkamp.me