Learning based Robust Control: Guaranteeing Stability while - - PowerPoint PPT Presentation

learning based robust control guaranteeing stability
SMART_READER_LITE
LIVE PREVIEW

Learning based Robust Control: Guaranteeing Stability while - - PowerPoint PPT Presentation

Learning based Robust Control: Guaranteeing Stability while Improving Performance Felix Berkenkamp and Angela P. Schoellig IROS 2014 Machine Learning in Planning and Control of Robot Motion Workshop (14. Sep.) Why is model based control


slide-1
SLIDE 1

Learning‐based Robust Control: Guaranteeing Stability while Improving Performance

Felix Berkenkamp and Angela P. Schoellig

IROS 2014 – Machine Learning in Planning and Control of Robot Motion Workshop (14. Sep.)

slide-2
SLIDE 2

Why is model‐based control not always sufficient?

  • Model inaccuracies limit achievable performance!
  • Surface Material
  • Topography
  • Unknown dynamics

2 Felix Berkenkamp

slide-3
SLIDE 3

Robust Robust control control

  • Specify prior uncertainty

in model

  • Guarantee stability and

performance for all possible models

Onl Online ne le learning

  • Learn from online data
  • Improve the model

True model Nominal model Set of possible models True model Nominal model Online learning

3

How these problems have been tackled so far

Felix Berkenkamp

slide-4
SLIDE 4

The missing link

4

Robust Control Online learning Learning- based Robust Control

Models uncertainty

  

Guarantees stability

  

Improves

  • nline

  

True model Nominal model Set of possible models Online learning True model Nominal model Set of possible models

Felix Berkenkamp

  • S. Schaal and C. G. Atkeson, “Learning control in robotics,”

IEEE Robotics & Automation Magazine, vol. 17, no. 2, pp. 20–29, 2010.

slide-5
SLIDE 5

Model for our approach

  • Stabilization of an operating point
  • Robust Control: Guaranteed stability / performance
  • Gaussian Process: Online learning

5 Felix Berkenkamp

slide-6
SLIDE 6

Framework: linear Robust Control

  • Extended linear plant
  • Output
  • Uncertainty
  • Disturbance
  • Error signal

Goal: Find K such that

  • Stab

ability ility: stable for all allowed

  • Perform

Performance: minimized

Convex optimization problem

6 Felix Berkenkamp

slide-7
SLIDE 7

Gaussian Process (GP) learning

  • Objective: learn the model error with uncertainties from input/output data
  • Assumption: Correlation of data given by a kernel function

Similar inputs will lead to similar outputs

  • Hyperparameters (noise and scaling): learned from data

7 Felix Berkenkamp

slide-8
SLIDE 8

Gaussian Process learning

8 Felix Berkenkamp

slide-9
SLIDE 9

Gaussian Process in our context

  • Model:
  • GP inputs:
  • GP outputs:

True model Nominal model Set of possible models

9 Felix Berkenkamp

slide-10
SLIDE 10

Example: simple linear/affine system

  • True model:
  • Without prior knowledge, use GP to learn system dynamics

10 Felix Berkenkamp

slide-11
SLIDE 11

Parameters estimated using GP

11 Felix Berkenkamp

slide-12
SLIDE 12

Bringing GPs and Robust Control together

12

This generalizes to non-linear, non-scalar, MIMO systems using the same process for element- wise uncertainty

Felix Berkenkamp

slide-13
SLIDE 13

The complete process

13 Felix Berkenkamp

True model Nominal model Set of possible models True model Nominal model Set of possible models

slide-14
SLIDE 14

2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] x position [m] step response 2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] y position [m] 800 1000 desired 800 1000 desired 2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] x position [m] step response 2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] y position [m] 800 1000 2000 desired 800 1000 2000 desired 2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] x position [m] step response 2 4 6 8 10 12 14

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 time [s] y position [m] 800 1000 2000 3000 desired 800 1000 2000 3000 desired

Experimental results: https://youtu.be/YqhLnCm0KXY?list=PLC12E387419CEAFF2

14 Felix Berkenkamp

slide-15
SLIDE 15

Summary

  • Combined

Combined Gaussian Process lear arni ning ng with Linear Robust Robust Cont Control rol

  • Enables contro

controlle ller perform performance to to im improve onlin

  • nline while

providing stab stability ility guaran guarantees

15

True model Nominal model Set of possible models True model Nominal model Set of possible models

Felix Berkenkamp

slide-16
SLIDE 16

Thank you!

befelix@ethz.ch Felix Berkenkamp