Optimal Control, LQR, Trajectory Optimization Lecture 13 What will - - PowerPoint PPT Presentation

optimal control lqr trajectory optimization
SMART_READER_LITE
LIVE PREVIEW

Optimal Control, LQR, Trajectory Optimization Lecture 13 What will - - PowerPoint PPT Presentation

Optimal Control, LQR, Trajectory Optimization Lecture 13 What will you take home today? Intro Optimal Control Principle of Optimality Bellman Equation Deriving LQR Trajectory Optimization Paper Optimal Control and Reinforcement Learning


slide-1
SLIDE 1

Optimal Control, LQR, Trajectory Optimization

Lecture 13

slide-2
SLIDE 2

What will you take home today?

Intro Optimal Control Principle of Optimality Bellman Equation Deriving LQR Trajectory Optimization Paper

slide-3
SLIDE 3

Optimal Control and Reinforcement Learning from a unified point of view

Optimal Control Problem

slide-4
SLIDE 4

Principle of Optimality – Example: Graph Search problem

slide-5
SLIDE 5

Quiz

slide-6
SLIDE 6

Forward Search

slide-7
SLIDE 7

Backward Search

slide-8
SLIDE 8

Principle of Optimality

slide-9
SLIDE 9

Problem setup

System Dynamics Cost function

slide-10
SLIDE 10

Goal

slide-11
SLIDE 11

Formalize Cost-to-Go / Value function

slide-12
SLIDE 12

Optimal Value function = V with lowest cost

slide-13
SLIDE 13

Deriving the Bellman Equation

slide-14
SLIDE 14

Comparing Optimal Bellman and Value Function

slide-15
SLIDE 15

Infinite time horizon, deterministic system

slide-16
SLIDE 16

Infinite time horizon, deterministic system

slide-17
SLIDE 17

Infinite time horizon, deterministic system

slide-18
SLIDE 18

Finite Horizon, Stochastic system

Stochastic System Dynamics Cost function

slide-19
SLIDE 19

Finite Horizon, Stochastic system

Value function Optimal Value function Optimal Policy

slide-20
SLIDE 20

Finite Horizon, Stochastic system

Bellman Equation Optimal Bellman Equation

slide-21
SLIDE 21

Infinite Horizon, Stochastic system

  • Combining formulation from infinite horizon - discrete system with stochastic system derivation
  • See lecture notes on webpage
slide-22
SLIDE 22

Continuous time systems

Hamilton-Jacobi-Bellman Equation

slide-23
SLIDE 23

How do you solve these equations?

slide-24
SLIDE 24

Sequential Quadratic Programming

slide-25
SLIDE 25

Sequential Quadratic Programming

slide-26
SLIDE 26

Example – Newton-Raphson Method

slide-27
SLIDE 27

Sequential Linear Quadratic Programming

slide-28
SLIDE 28

SLQ Algorithm

slide-29
SLIDE 29

Iterative Linear Quadratic Controller: ILQC

slide-30
SLIDE 30

ILQC – Linearizing Dynamics

slide-31
SLIDE 31

ILQC - Quadratizing the cost

slide-32
SLIDE 32

Deriving the Value function and Bellman Equation

slide-33
SLIDE 33

Incremental Updates

slide-34
SLIDE 34

Linear Dynamical Systems, Quadratic cost – Linear Quadratic Regulator (LQR)

slide-35
SLIDE 35

Linear Dynamical Systems, Quadratic cost – Linear Quadratic Regulator (LQR)

slide-36
SLIDE 36

Trajectory Optimization

slide-37
SLIDE 37
slide-38
SLIDE 38