Optimal Control using Iterative Dynamic Programming Daniel M. Webb, - - PowerPoint PPT Presentation

optimal control using iterative dynamic programming
SMART_READER_LITE
LIVE PREVIEW

Optimal Control using Iterative Dynamic Programming Daniel M. Webb, - - PowerPoint PPT Presentation

Optimal Control using Iterative Dynamic Programming Daniel M. Webb, W. Fred Ramirez advising Department of Chemical and Biological Engineering University of Colorado, Boulder May 16, 2007 Problem Introduction Park-Ramirez Model Optimal


slide-1
SLIDE 1

Optimal Control using Iterative Dynamic Programming

Daniel M. Webb, W. Fred Ramirez advising

Department of Chemical and Biological Engineering University of Colorado, Boulder

May 16, 2007

slide-2
SLIDE 2

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))
slide-3
SLIDE 3

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0

slide-4
SLIDE 4

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield

slide-5
SLIDE 5

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products

slide-6
SLIDE 6

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation

slide-7
SLIDE 7

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control

slide-8
SLIDE 8

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control Some basic modeling steps: Formulate

slide-9
SLIDE 9

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control Some basic modeling steps: Formulate Train (identify model parameters)

slide-10
SLIDE 10

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control Some basic modeling steps: Formulate Train (identify model parameters) Offline optimization

slide-11
SLIDE 11

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control Some basic modeling steps: Formulate Train (identify model parameters) Offline optimization Online optimization (real-time optimal control)

slide-12
SLIDE 12

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Problem Introduction

ODE models:

  • X = f(X,ν(X,ω))

X(t = 0) = X0 Model uses (especially batch systems): Optimize yield Minimize unwanted products Reduce run-to-run variation Real-time optimal control Some basic modeling steps: Formulate Train (identify model parameters) Offline optimization Online optimization (real-time optimal control)

slide-13
SLIDE 13

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =
  • X =
  • S =
  • PT =
  • PM =
slide-14
SLIDE 14

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =
  • S =
  • PT =
  • PM =
slide-15
SLIDE 15

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =
  • S =
  • PT =
  • PM =
slide-16
SLIDE 16

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =
  • PT =
  • PM =
slide-17
SLIDE 17

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, S ( g

L)

µ ( 1

hr)

10 9 8 7 6 5 4 3 2 1 0.4 0.3 0.2 0.1

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =
  • PT =
  • PM =
slide-18
SLIDE 18

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, S ( g

L)

µ ( 1

hr)

10 9 8 7 6 5 4 3 2 1 0.4 0.3 0.2 0.1

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =
  • PT =
  • PM =
slide-19
SLIDE 19

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, S ( g

L)

µ ( 1

hr)

10 9 8 7 6 5 4 3 2 1 0.4 0.3 0.2 0.1

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =
  • PM =
slide-20
SLIDE 20

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =
  • PM =
slide-21
SLIDE 21

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM =
slide-22
SLIDE 22

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, S ( g

L)

fP ( 1

hr)

10 1 0.1 0.01 0.001 0.3 0.2 0.1

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM =
slide-23
SLIDE 23

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, and foreign protein secretion. A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM =
slide-24
SLIDE 24

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, and foreign protein secretion. A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM = φ(PT − PM)
  • q

V PM

slide-25
SLIDE 25

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, and foreign protein secretion. S ( g

L)

φ ( 1

hr)

10 9 8 7 6 5 4 3 2 1 5 4 3 2 1

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM = φ(PT − PM)
  • q

V PM

slide-26
SLIDE 26

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, and foreign protein secretion. The optimal control problem: MAX(Φ)

Φ = PM(tf)· V(tf)

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM = φ(PT − PM)
  • q

V PM

slide-27
SLIDE 27

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Park-Ramirez Bioreactor Model

For genetically modified yeast in a fed- batch reactor, predict: cell growth, substrate consumption, foreign protein production, and foreign protein secretion.

Time (hr)

q ( L

hr)

14 12 10 8 6 4 2 3 2.5 2 1.5 1 0.5

A c c u m u l a t i

  • n

G e n e r a t i

  • n

I n p u t D i l u t i

  • n
  • V =

q

  • X =

µX

  • q

V X

  • S =

−YµX

+

q V Sf - q V S

  • PT =

fPX

  • q

V PT

  • PM = φ(PT − PM)
  • q

V PM

slide-28
SLIDE 28

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

slide-29
SLIDE 29

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

slide-30
SLIDE 30

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors.

slide-31
SLIDE 31

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial).

slide-32
SLIDE 32

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

slide-33
SLIDE 33

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

slide-34
SLIDE 34

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

slide-35
SLIDE 35

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too.

slide-36
SLIDE 36

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

slide-37
SLIDE 37

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

3

Necessary conditions and Pontryagin’s Maximum Principle

slide-38
SLIDE 38

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

3

Necessary conditions and Pontryagin’s Maximum Principle

Can provide much more insight into the problem solution.

slide-39
SLIDE 39

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

3

Necessary conditions and Pontryagin’s Maximum Principle

Can provide much more insight into the problem solution. Requires more skill/education than parametrization methods.

slide-40
SLIDE 40

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

3

Necessary conditions and Pontryagin’s Maximum Principle

Can provide much more insight into the problem solution. Requires more skill/education than parametrization methods. Difficult to apply to singular control problems.

slide-41
SLIDE 41

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Optimal Control Solution Methods

The four primary ways currently used to solve optimal control problems:

1

Control vector parametrization (CVP)

Break the control u(t) into piecewise vectors. Each piecewise vector is a function approximation ν(ω) (ie. constant, linear, polynomial). Find the sensitivities ∂Φ

∂ω.

Solve for ω using a NLP solver.

2

Collocation

Similar to control vector parametrization except discretizing states too. Seems to be much less common than control vector parametrization.

3

Necessary conditions and Pontryagin’s Maximum Principle

Can provide much more insight into the problem solution. Requires more skill/education than parametrization methods. Difficult to apply to singular control problems.

4

Iterative dynamic programming (IDP)

slide-42
SLIDE 42

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Dynamic Programming

What is dynamic programming? A way to solve discrete multi-stage decision problem by:

slide-43
SLIDE 43

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Dynamic Programming

What is dynamic programming? A way to solve discrete multi-stage decision problem by: Breaking the problem into smaller subproblems.

slide-44
SLIDE 44

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Dynamic Programming

What is dynamic programming? A way to solve discrete multi-stage decision problem by: Breaking the problem into smaller subproblems. Remembering the best solutions to the subproblems.

slide-45
SLIDE 45

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Dynamic Programming

What is dynamic programming? A way to solve discrete multi-stage decision problem by: Breaking the problem into smaller subproblems. Remembering the best solutions to the subproblems. Combining the solutions to the subproblems to get the overall solution.

slide-46
SLIDE 46

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by:

slide-47
SLIDE 47

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by: Breaking the control u(t) into k stages.

slide-48
SLIDE 48

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by: Breaking the control u(t) into k stages. Guess several profiles for the stagewise constant controls uk.

slide-49
SLIDE 49

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by: Breaking the control u(t) into k stages. Guess several profiles for the stagewise constant controls uk. Use the guessed controls uk to calculate a guessed continuous state profiles X(t), S(t), etc.

slide-50
SLIDE 50

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by: Breaking the control u(t) into k stages. Guess several profiles for the stagewise constant controls uk. Use the guessed controls uk to calculate a guessed continuous state profiles X(t), S(t), etc. Starting at the final time and working backward, find out which of the guessed controls was the best and remember it for the next iteration.

slide-51
SLIDE 51

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

Iterative Dynamic Programming (IDP)

What is iterative dynamic programming (IDP)? A type of dynamic programming to solve the optimal control problem by: Breaking the control u(t) into k stages. Guess several profiles for the stagewise constant controls uk. Use the guessed controls uk to calculate a guessed continuous state profiles X(t), S(t), etc. Starting at the final time and working backward, find out which of the guessed controls was the best and remember it for the next iteration. Animation 1: Basic IDP algorithm

slide-52
SLIDE 52

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why?

slide-53
SLIDE 53

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster.

slide-54
SLIDE 54

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc).

slide-55
SLIDE 55

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution.

slide-56
SLIDE 56

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls.

slide-57
SLIDE 57

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls. What’s good about IDP though? Stochastic method: if sufficient samples are taken, likely to find global

  • ptimum.
slide-58
SLIDE 58

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls. What’s good about IDP though? Stochastic method: if sufficient samples are taken, likely to find global

  • ptimum.

No sensitivity derivatives needed.

slide-59
SLIDE 59

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls. What’s good about IDP though? Stochastic method: if sufficient samples are taken, likely to find global

  • ptimum.

No sensitivity derivatives needed. Obtains an approximate solution very quickly.

slide-60
SLIDE 60

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls. What’s good about IDP though? Stochastic method: if sufficient samples are taken, likely to find global

  • ptimum.

No sensitivity derivatives needed. Obtains an approximate solution very quickly. Very simple algorithm

slide-61
SLIDE 61

Problem Introduction Park-Ramirez Model Optimal Control Solution Methods

CVP vs. IDP

CVP seems to be more popular than IDP . Why? Fast! At least 4x faster than IDP , often 20x faster. Easy (Matlab optimization toolbox, etc). Gradient method: fast in general, very fast near the solution. IDP sometimes has noisy controls. What’s good about IDP though? Stochastic method: if sufficient samples are taken, likely to find global

  • ptimum.

No sensitivity derivatives needed. Obtains an approximate solution very quickly. Very simple algorithm (although maybe not after I’m done with it).

slide-62
SLIDE 62

Speeding up IDP Smoothing IDP

Speeding up IDP

IDP is slow, how can we speed it up?

slide-63
SLIDE 63

Speeding up IDP Smoothing IDP

Speeding up IDP

IDP is slow, how can we speed it up? Reduce integrator relative tolerance.

slide-64
SLIDE 64

Speeding up IDP Smoothing IDP

Speeding up IDP

IDP is slow, how can we speed it up? Reduce integrator relative tolerance.

Integrator Relative Tolerance CPU time (seconds)

10−1 10−3 10−5 10−7 100 10 1

slide-65
SLIDE 65

Speeding up IDP Smoothing IDP

Speeding up IDP

IDP is slow, how can we speed it up? Reduce integrator relative tolerance. Use my new adaptive region size update methods.

slide-66
SLIDE 66

Speeding up IDP Smoothing IDP

Speeding up IDP

IDP is slow, how can we speed it up? Reduce integrator relative tolerance. Use my new adaptive region size update methods.

slide-67
SLIDE 67

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

First-order Control Filter

IDP often leads to noisy controls: Animation 3: Basic IDP with 50 stages

slide-68
SLIDE 68

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

First-order Control Filter

IDP often leads to noisy controls: Animation 3: Basic IDP with 50 stages Try a first-order control filter after every iteration.

slide-69
SLIDE 69

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

First-order Control Filter

IDP often leads to noisy controls: Animation 3: Basic IDP with 50 stages Try a first-order control filter after every iteration. Animation 4: First-order control filter

slide-70
SLIDE 70

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter.

slide-71
SLIDE 71

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

slide-72
SLIDE 72

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then

slide-73
SLIDE 73

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

slide-74
SLIDE 74

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

slide-75
SLIDE 75

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

4:

γs = exp

  • −∆Φ

T·MΦ

  • − R(0,1)

5:

end if

slide-76
SLIDE 76

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

4:

γs = exp

  • −∆Φ

T·MΦ

  • − R(0,1)

5:

end if

6:

end for

7:

Choose the test control with the largest γs

8: end if

slide-77
SLIDE 77

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

4:

γs = exp

  • −∆Φ

T·MΦ

  • − R(0,1)

5:

end if

6:

end for

7:

Choose the test control with the largest γs

8: end if

Temperature cools with control region size and number of iterations.

slide-78
SLIDE 78

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

4:

γs = exp

  • −∆Φ

T·MΦ

  • − R(0,1)

5:

end if

6:

end for

7:

Choose the test control with the largest γs

8: end if

Temperature cools with control region size and number of iterations. Animation 5: Simulated annealing filter

slide-79
SLIDE 79

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Simulated Annealing Control Filter

First-order control filter can over-filter. How about filtering more where Φ is hurt less?

1: if no test controls improve Φ then 2:

for each test control do

3:

if test control leads to a smoother control profile then

4:

γs = exp

  • −∆Φ

T·MΦ

  • − R(0,1)

5:

end if

6:

end for

7:

Choose the test control with the largest γs

8: end if

Temperature cools with control region size and number of iterations. Animation 5: Simulated annealing filter Less likely to find local minima for this problem.

slide-80
SLIDE 80

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

slide-81
SLIDE 81

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

Φ∗ = Φ−Φd

slide-82
SLIDE 82

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

Φ∗ = Φ−Φd Φd = discrete second derivative of controls.

slide-83
SLIDE 83

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

Φ∗ = Φ−Φd Φd = discrete second derivative of controls.

Animation 6: Damping

slide-84
SLIDE 84

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

Φ∗ = Φ−Φd Φd = discrete second derivative of controls.

Animation 6: Damping Changes the problem!

slide-85
SLIDE 85

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Control Damping

How about punishing control activity directly?

Φ∗ = Φ−Φd Φd = discrete second derivative of controls.

Animation 6: Damping Changes the problem! Was often harmful to solution if large enough to filter well. Good in small doses in combination with other methods.

slide-86
SLIDE 86

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Pivot Point Test Controls

Regular test controls work backwards one stage at a time.

slide-87
SLIDE 87

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Pivot Point Test Controls

Regular test controls work backwards one stage at a time. Why not change two stage controls at a time?

slide-88
SLIDE 88

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Pivot Point Test Controls

Regular test controls work backwards one stage at a time. Why not change two stage controls at a time? Animation 7: Pivot points (show every test control)

slide-89
SLIDE 89

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Pivot Point Test Controls

Regular test controls work backwards one stage at a time. Why not change two stage controls at a time? Animation 7: Pivot points (show every test control) Animation 8: Pivot points (show every stage)

slide-90
SLIDE 90

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Pivot Point Test Controls

Regular test controls work backwards one stage at a time. Why not change two stage controls at a time? Animation 7: Pivot points (show every test control) Animation 8: Pivot points (show every stage) Very fast convergence (to local minima).

slide-91
SLIDE 91

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found.

slide-92
SLIDE 92

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found.

slide-93
SLIDE 93

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found. Solve using two-step process:

slide-94
SLIDE 94

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found. Solve using two-step process:

1

Solve most of the way using basic IDP .

slide-95
SLIDE 95

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found. Solve using two-step process:

1

Solve most of the way using basic IDP .

2

Solve some more with smoothed IDP .

slide-96
SLIDE 96

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found. Solve using two-step process:

1

Solve most of the way using basic IDP .

2

Solve some more with smoothed IDP . The two-step results?

slide-97
SLIDE 97

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Big Problem with Smoothing

All the smoothing techniques caused local minima to be found. Solve using two-step process:

1

Solve most of the way using basic IDP .

2

Solve some more with smoothed IDP . The two-step results?

slide-98
SLIDE 98

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Stagewise Linear Continuous Controls

Why not try a stagewise linear discretization of controls?

slide-99
SLIDE 99

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Stagewise Linear Continuous Controls

Why not try a stagewise linear discretization of controls? Animation 9: Continuous linear controls (show every stage)

slide-100
SLIDE 100

Speeding up IDP Smoothing IDP First-order Filter Simulated Annealing Filter Damping Pivot Point Test Controls Linear Controls

Stagewise Linear Continuous Controls

Why not try a stagewise linear discretization of controls? Animation 9: Continuous linear controls (show every stage) Useful; seems to obtain equivalent Φ but smoother.

slide-101
SLIDE 101

Conclusions Acknowledgments

Conclusions

Several smoothing methods were developed for IDP which greatly increase convergence speed. If used with the two-step procedure, you get the best of both worlds:

slide-102
SLIDE 102

Conclusions Acknowledgments

Conclusions

Several smoothing methods were developed for IDP which greatly increase convergence speed. If used with the two-step procedure, you get the best of both worlds:

Resistance to local minima.

slide-103
SLIDE 103

Conclusions Acknowledgments

Conclusions

Several smoothing methods were developed for IDP which greatly increase convergence speed. If used with the two-step procedure, you get the best of both worlds:

Resistance to local minima. Fast convergence near the solution.

slide-104
SLIDE 104

Conclusions Acknowledgments

Acknowledgments

  • Dr. W. Fred Ramirez
slide-105
SLIDE 105

Conclusions Acknowledgments

Acknowledgments

  • Dr. W. Fred Ramirez

National Science Foundation (funding)

slide-106
SLIDE 106

Conclusions Acknowledgments

Acknowledgments

  • Dr. W. Fred Ramirez

National Science Foundation (funding) The Free software community for the software tools I used.

slide-107
SLIDE 107

Conclusions Acknowledgments

Acknowledgments

  • Dr. W. Fred Ramirez

National Science Foundation (funding) The Free software community for the software tools I used. My wife Janet and daughter Ani for being patient with me.

slide-108
SLIDE 108

Conclusions Acknowledgments

Acknowledgments

  • Dr. W. Fred Ramirez

National Science Foundation (funding) The Free software community for the software tools I used. My wife Janet and daughter Ani for being patient with me.