

SLIDE 1


November 27, 2006

Massachusetts Institute of Technology

Optimal, Robust Predictive Control using Particles

Lars Blackmore

2

Context

  • Optimal predictive control of dynamic systems

– “What is the best sequence of control inputs that takes the system state from the start to the goal?”

  • Constrained optimization approaches have been successful

(Schouwenaars01,Maciejowski02,Dunbar02,Richards03)

  • Non-convex feasible region
  • Non-holonomic dynamics
  • Discrete-time

[Figure: initial state and goal region]

LB43 3

Optimal Controls are Not Robust to Uncertainty

  • Uncertainty arises due to:

– Disturbances – Uncertain state estimation – Inaccurate modeling

These are typically described probabilistically. [Figure: planned path vs. true path to the goal]

LB5 LB18 4

Why Probabilistic Uncertainty?

  • Two principal ways to represent uncertainty:
  • 1. Set-bounded uncertainty

(Zhou88,Gossner97,Bemporad99,Kerrigan01)

  • 2. Probabilistic uncertainty

(Bertsekas78, Ma 2005)

x ∈ S

[Figure: set S in the (x, y) plane]

p(x) = N(x̂, P)

[Figure: p.d.f. p(x, y)]
LB44 5

Why Probabilistic Uncertainty?

  • Probabilistic representations much richer

– Set-bounded representation subsumed by p.d.f.

  • Probabilistic representations often more realistic

– What is the absolute maximum possible wind velocity?

  • Probabilistic representations readily available in many cases

– Disturbances (e.g. Dryden turbulence model)
– Uncertain state estimation (e.g. particle filter, Kalman filter, SLAM)
– Inaccurate modeling (e.g. parameter estimation)

We deal with probabilistic uncertainty

6

Robust Control of Probabilistic Dynamic Systems

  • Given probabilistic uncertainty, we want to plan distribution of

future state in optimal, robust manner

  • Chance-constrained formulation: Find optimal sequence of

control inputs such that p(failure) ≤ δ

Expected path p(failure) ≤ δ

LB42 L1

SLIDE 2

Slide 2 LB43 Mention: Example: UAV path planning using Mixed Integer Linear Programming

Lars Blackmore, 8/18/2006

Slide 3 LB5 Relate to UAV example

Lars Blackmore, 8/14/2006

LB18 stress optimal paths particularly bad

Lars Blackmore, 8/15/2006

Slide 4 LB44 Spend a little more time on this

Lars Blackmore, 8/18/2006

Slide 6 LB42 stress the trade of performance vs conservatism

Lars Blackmore, 8/18/2006

L1 mention the 3 challenges

Lars, 8/20/2006

SLIDE 3


7

Problem Statement

  • Design a finite, optimal sequence of control inputs u_{0:T−1} such that the system state trajectory leaves the feasible region with probability at most δ

  • System dynamics:

x_t = f(x_{t−1}, u_{t−1}, ν_{t−1})

  • Cost function (we minimize its expectation):

h(u_{0:T−1}, x_{1:T}),   E[h(u_{0:T−1}, x_{1:T})]

  • Feasible region: F

  • x_0 and ν_t are random variables with known p.d.f.s (at least approximately); the state trajectory x_{1:T} is a random variable with p.d.f. to be optimized

LB37 8

Chance-constrained Control: Prior Work

  • Prior work developed chance-constrained Model

Predictive Control (Li00, VanHessem04)

– Restricted to case of Gaussian uncertainty – Restricted to control within convex regions

  • We extend this work to arbitrary uncertainty distributions and non-convex regions: chance-constrained particle control

9

Chance-constrained Particle Control: Intuition

  • In estimation, Kalman Filters have been very successful for

linear systems, Gaussian noise

– State distribution given model and observations can be calculated analytically

  • More recently, Particle Filters have been successful for

nonlinear systems, with non-Gaussian noise

– State distribution is approximated by a number of particles – Convergence of approximation to true distribution as number of particles tends to infinity – Number of particles used determined by available resources

  • Idea: Use particles for anytime robust control

– Control the distribution of particles to achieve a probabilistic goal

LB45 10

Particles: Probabilistic Properties

  • Particles can approximate arbitrary distributions:

– Draw N samples x^{(i)} from a r.v. X with distribution p(x)
– Distribution approximated with delta functions at the samples:

p(x) ≈ (1/N) Σ_{i=1..N} δ(x − x^{(i)})

[Figure: samples drawn from X, p.d.f. p(x)]

  • Convergence results:

(1/N) Σ_{i=1..N} f(x^{(i)}) → E[f(X)] almost surely as N → ∞

fraction of particles in set S → P(X ∈ S) almost surely as N → ∞

since P(X ∈ S) = ∫_S p(x) dx ≈ ∫_S (1/N) Σ_{i=1..N} δ(x − x^{(i)}) dx = fraction of particles in S
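These convergence properties are easy to check numerically. A minimal sketch (not from the slides), assuming X ~ N(0, 1), f(x) = x², and S = [1, ∞):

```python
# Sketch: particle (Monte Carlo) approximation of an expectation and of a
# set probability. Distribution, f, and S are illustrative assumptions.
import math
import random

random.seed(0)
N = 100_000
samples = [random.gauss(0.0, 1.0) for _ in range(N)]

# Sample mean of f over the particles approximates E[f(X)] = 1.
mean_f = sum(x * x for x in samples) / N

# Fraction of particles in S approximates P(X in S) = 1 - Phi(1).
frac_in_S = sum(x >= 1.0 for x in samples) / N
true_p = 0.5 * math.erfc(1.0 / math.sqrt(2.0))  # exact Gaussian tail

print(mean_f, frac_in_S, true_p)
```

Both estimates land within a few thousandths of the true values at N = 100,000, illustrating the almost-sure convergence the slide states.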

LB20 11

Technical Approach

  • Question: How can particles be used to solve chance-

constrained probabilistic control problem?

  • Chance-constrained particle control:

1. Use particles to sample random variables (noise, initial position, disturbances)
2. Calculate the future state trajectory for each particle, leaving explicit dependence on control inputs u_{0:T−1}
3. Express the probabilistic optimization problem approximately in terms of particles
4. Solve the approximate deterministic optimization problem for u_{0:T−1}

  • Approximate optimization goal tends to true goal as number of

particles tends to infinity
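Steps 1–3 can be sketched for a toy scalar system (all numbers and the dynamics x_{t+1} = x_t + u_t + ν_t are illustrative assumptions, not from the talk):

```python
# Sketch: sample particles, propagate each under a candidate control
# sequence, then estimate p(failure) and the expected cost from the
# particle set. Feasible region: x >= 0. Cost: sum_t |x_t|.
import random

random.seed(1)
N, T = 1000, 4
u = [0.5, 0.5, 0.0, 0.0]          # candidate control sequence (step 2 input)

def propagate(x0, noise, u):
    """Step 2: one particle's trajectory, explicit in u."""
    traj, x = [], x0
    for t in range(len(u)):
        x = x + u[t] + noise[t]
        traj.append(x)
    return traj

# Step 1: sample initial states and disturbance sequences.
particles = []
for _ in range(N):
    x0 = random.gauss(0.2, 0.1)
    noise = [random.gauss(0.0, 0.05) for _ in range(T)]
    particles.append(propagate(x0, noise, u))

# Step 3: fraction of failing particles approximates p(failure);
# sample mean over particles approximates the expected cost.
p_fail = sum(any(x < 0.0 for x in traj) for traj in particles) / N
expected_cost = sum(abs(x) for traj in particles for x in traj) / N

print(p_fail, expected_cost)
```

Step 4 would then search over u for the cheapest sequence whose estimated p_fail stays below δ.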

12

Technical Approach

1. Use particles to sample random variables

Obstacle 1 Obstacle 2 Goal Region

Initial state distribution Particles approximating initial state distribution

ν_t^{(i)} ~ p(ν_t),   x_0^{(i)} ~ p(x_0),   i = 1, …, N,   t = 0, …, T−1

SLIDE 4

Slide 7 LB37 "state depends on x_0 and v which are... because they are random variables, x_t is also."

Lars Blackmore, 8/16/2006

Slide 9 LB45 note anytime

Lars Blackmore, 8/18/2006

Slide 10 LB20 these are used in filtering --> I will use for control

Lars Blackmore, 8/15/2006

SLIDE 5


13

Technical Approach

2. Calculate future state trajectory for each particle leaving explicit dependence on control inputs u0:T-1

Obstacle 1 Obstacle 2 Goal Region

x_{1:T}^{(i)} = [x_1^{(i)}, …, x_T^{(i)}]

x_t^{(i)} = f(x_{t−1}^{(i)}, u_{t−1}, ν_{t−1}^{(i)})

[Figure: trajectory of particle 1 for u = uA vs. u = uB, t = 0…4]

LB21 14

Technical Approach

2. Calculate future state trajectory for each particle leaving explicit dependence on control inputs u0:T-1

Obstacle 1 Obstacle 2 Goal Region

x_t^{(i)} = f(x_{t−1}^{(i)}, u_{t−1}, ν_{t−1}^{(i)})

x_{1:T}^{(i)} = [x_1^{(i)}, …, x_T^{(i)}]

[Figure: trajectories of particles 1…N for u = uA vs. u = uB]

LB24 LB38 15

Technical Approach

3. Express probabilistic optimization problem approximately in terms of particles

Fraction of particles failing approximates probability of failure Sample mean approximates state mean Sample mean of cost function approximates true expectation:

[Figure: N particle trajectories among obstacles toward the goal region, t = 0…4]

E[h(u_{0:T−1}, x_{1:T})] ≈ (1/N) Σ_{i=1..N} h(u_{0:T−1}, x_{1:T}^{(i)})

LB46 16

Technical Approach

4. Solve approximate deterministic optimization problem for u0:T-1

Obstacle 1 Obstacle 2 Goal Region

t=0 t=1 t=2 t=3 t=4

10% of particles fail in optimal solution

δ = 0.1

17

Convergence

– As N → ∞, the approximation becomes exact

Obstacle 1 Obstacle 2 Goal Region 18

Convergence

– As N → ∞, the approximation becomes exact

Obstacle 1 Obstacle 2 Goal Region

10% probability of failure

SLIDE 6

Slide 13 LB21 This is easy to do because all uncertainty has been removed

Lars Blackmore, 8/15/2006

Slide 14 LB24 highlight every particle has same control input

Lars Blackmore, 8/15/2006

LB38 stress a particle is a _trajectory_

Lars Blackmore, 8/16/2006

Slide 15 LB46 MAX failure rate

Lars Blackmore, 8/18/2006

SLIDE 7


19

Particle Control for Linear Systems

  • For nonlinear systems, resulting optimization too complex

for real-time operation

  • In the case of:

1. Linear systems
2. Polygonal feasible regions
3. Piecewise-linear cost function

the global optimum can be found extremely efficiently using Mixed Integer Linear Programming (MILP)

  • Important problems using linear systems, cost function:

– Aircraft control about trim state – UAV path planning – Satellite control

LB33 20

Particle Control for Linear Systems

  • Future state for each particle:
  • Approximate expected state:
  • Approximate expected cost:
  • All of these are linear functions of control inputs

x_t^{(i)} = A^t x_0^{(i)} + Σ_{j=0..t−1} A^{t−1−j} (B u_j + ν_j^{(i)})

E[x_T] ≈ (1/N) Σ_{i=1..N} x_T^{(i)}

E[h(u_{0:T−1}, x_{1:T})] ≈ (1/N) Σ_{i=1..N} h(u_{0:T−1}, x_{1:T}^{(i)})

where the underlying dynamics are x_{t+1} = A x_t + B u_t + ν_t
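As a sanity check on the explicit particle expression, a scalar stand-in (A, B, u, and the sampled disturbances are made-up numbers; the deck's A and B are matrices):

```python
# Sketch: verify that the closed form A^t x0 + sum_j A^(t-1-j)(B u_j + v_j)
# matches the recursion x_{t+1} = A x_t + B u_t + v_t for one particle.
A, B = 0.9, 0.5
u = [1.0, -0.5, 0.2]      # control inputs u_0..u_2
v = [0.05, -0.02, 0.01]   # one particle's sampled disturbances
x0 = 2.0                  # this particle's sampled initial state

# Recursive propagation to t = 3.
x = x0
for t in range(3):
    x = A * x + B * u[t] + v[t]

# Closed form at t = 3: linear in the control inputs u_j.
t = 3
x_closed = A**t * x0 + sum(A**(t - 1 - j) * (B * u[j] + v[j]) for j in range(t))

print(x, x_closed)
```

The closed form is what makes the particle states, and hence the approximate mean and cost above, linear functions of the control inputs.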

21

Particle Control for Linear Systems

  • Approximate chance constraints:

– Express feasible region in terms of constraints – Introduce binary variables indicating particle success – Constrain sum of binary variables

a_j^T x^{(i)} − b_j ≤ C z_i   ∀j,    z_i ∈ {0, 1},    C large and positive

z_i = 0 implies all constraints satisfied by particle i

[Figure: feasible region bounded by constraints a_j^T x = b_j]

(1/N) Σ_i z_i ≤ δ   (at most a fraction δ of particles fail)
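The big-M encoding above can be illustrated with a toy polygonal feasible region (the unit box; a, b, and C are illustrative values, not from the slides):

```python
# Sketch: the big-M trick for one particle. If z = 0 every constraint
# a_j^T x <= b_j must hold; z = 1 relaxes them all by the large constant C.
def bigM_ok(x, a_rows, b, z, C=1e4):
    # a_j^T x - b_j <= C * z for every constraint j
    return all(
        sum(aj_k * x_k for aj_k, x_k in zip(aj, x)) - bj <= C * z
        for aj, bj in zip(a_rows, b)
    )

# Unit-box feasible region: x <= 1, -x <= 0 in each coordinate.
a_rows = [(1, 0), (-1, 0), (0, 1), (0, -1)]
b = [1, 0, 1, 0]

inside, outside = (0.5, 0.5), (2.0, 0.5)
print(bigM_ok(inside, a_rows, b, z=0))   # feasible with z = 0
print(bigM_ok(outside, a_rows, b, z=0))  # infeasible: solver must set z = 1
print(bigM_ok(outside, a_rows, b, z=1))  # relaxed constraint holds
```

In the MILP, the solver chooses the z_i; the chance constraint (1/N) Σ z_i ≤ δ then caps how many particles may take the relaxed branch.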

LB34 22

Particle Control for Linear Systems

  • Chance-constrained problem is a MILP
  • Decision variables:

– Control inputs u_{0:T−1} (continuous)
– State trajectory x_{1:T}^{(i)} for each particle (continuous)
– Success indicator z_i for each particle (binary)

  • Can be solved to global optimality
  • Extremely efficient MILP solvers available
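A full MILP is beyond a short snippet, but the same chance-constrained selection can be sketched by exhaustive search over a small discrete input set (a stand-in for the MILP solver; the scalar dynamics, feasible region x ≥ 0, and all numbers are toy assumptions):

```python
# Sketch: pick the cheapest control sequence whose particle failure
# fraction satisfies the chance constraint (1/N) sum_i z_i <= delta.
import itertools
import random

random.seed(2)
N, T, delta = 200, 3, 0.1
U = [-1.0, 0.0, 1.0]                          # discrete input alphabet

# Pre-sample particles: initial states and disturbance sequences.
x0s   = [random.gauss(0.5, 0.2) for _ in range(N)]
noise = [[random.gauss(0.0, 0.1) for _ in range(T)] for _ in range(N)]

def fails(i, u):
    """z_i = 1 iff particle i leaves the feasible region x >= 0."""
    x = x0s[i]
    for t in range(T):
        x = x + u[t] + noise[i][t]
        if x < 0.0:
            return 1
    return 0

best_u, best_cost = None, float("inf")
for u in itertools.product(U, repeat=T):
    z = [fails(i, u) for i in range(N)]       # success indicators
    if sum(z) / N <= delta:                   # chance constraint
        cost = sum(abs(ut) for ut in u)       # fuel-style cost
        if cost < best_cost:
            best_u, best_cost = u, cost

print(best_u, best_cost)
```

The MILP reaches the same kind of optimum over continuous inputs without enumerating candidates, which is what makes the linear case tractable.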

23

Simulation Results

  • Task A:

– Control of Boeing 747 in heavy turbulence – Aircraft must remain within defined flight envelope – Longitudinal dynamics linearized about trim state – Assume inner-loop altitude hold controller – Minimize fuel use (elevator deflection)

  • Uncertainty sources:

– Disturbances due to heavy turbulence (MIL-F-8785C) – Sensor noise gives rise to attitude uncertainty Highly non-Gaussian distributions

LB47 24

Simulation Results

10% of particles break the flight envelope; initial particles generated using a particle filter

Num particles = 100, planning horizon = 20 steps, dt = 2 s, δ = 0.1

[Figure: altitude (ft) vs. time (s), with flight envelope] LB48

SLIDE 8

Slide 19 LB33 Mention min time, min fuel

Lars Blackmore, 8/16/2006

Slide 21 LB34 say more complex for nonconvex but won't go into details

Lars Blackmore, 8/16/2006

Slide 23 LB47 go faster over this

Lars Blackmore, 8/18/2006

Slide 24 LB48 Mention closed loop with particle filter

Lars Blackmore, 8/15/2006

SLIDE 9


25

Simulation Results

  • Convergence of failure probability:

[Plot: true probability of error vs. number of particles]

26

Simulation Results

  • Conservatism vs. performance

[Plot: fuel cost vs. maximum probability of failure]

27

Simulation Results

  • Solution time vs. number of particles

[Plot: solution time (s) vs. number of particles]

28

Simulation Results

  • Task B:

– 2-D Control of Unmanned Air Vehicle – Environment contains obstacles to be avoided – Linear dynamics model (Schouwenaars01) – Minimize fuel use

  • Uncertainty sources:

– Wind disturbances – Initial state uncertainty Highly non-Gaussian distributions

29

Simulation Results

[Plots: planned particle trajectories, x (m) vs. y (m), full view and close-up]

  • Maximum probability of failure: 0.04

Num particles = 50, planning horizon = 10 steps, dt = 1s, δ = 0.04

30

Simulation Results

[Plot: planned particle trajectories, x (m) vs. y (m)]

  • Maximum probability of failure: 0.02

Num particles = 50, planning horizon = 10 steps, dt = 1s, δ = 0.02

LB29

SLIDE 10

Slide 30 LB29 maybe show a box here as well

Lars Blackmore, 8/16/2006

SLIDE 11


31

Future Work

  • Applications

– Production, operations – Landing site selection

  • Efficient solutions to more general systems

– Switching (hybrid) systems (e.g. failure-tolerant control) – Nonlinear systems

  • Receding horizon results

– Robust probabilistic feasibility?

  • Bias reduction/elimination

LB23 32

Conclusion

  • Novel any-time approach to robust probabilistic

control with arbitrary probability distributions

– Very general formulation, challenge is tractable optimization – For linear systems, global optimum can be found efficiently

33

Questions?

34

Backup

35

Related Work Using Particles

  • Particles have previously been used in decision making,

variously referred to as:

– Particles (Doucet01,Greenfield03) – Simulations (Singh06) – Scenarios (Ng00, Yu03, Tarim06)

  • (Ng00) converts a stochastic MDP into deterministic MDP by

representing all randomness in initial state

  • (Greenfield03) proposed finite horizon control with expected

cost approximated using particles

  • (Doucet01, Singh06) use samples to approximate cost

function value and gradient in local optimizer

  • Key contributions of chance-constrained particle control:

– Use particles to approximate the probability of failure

  • Optimization with constraints on probabilities is a novel and powerful

tool (e.g. chance-constrained planning)

– Efficient solution using MILP

36

Complexity

  • Problem is worst-case exponential in:

– Length of planning horizon – System order – Number of particles – Number of constraints

  • MILP solution means that worst-case complexity is almost never realized
  • With MILP optimization, typical solution time difficult to characterize
  • Empirical results show for convex F, relatively large problems can be

solved in less than a minute

  • For non-convex F, even with heuristic pruning approach, medium-sized

problems take many minutes

  • MILP can use a large amount of time proving that the solution is the global optimum

– Good feasible solutions can typically be found much more quickly

SLIDE 12

Slide 31 LB23 Highlight generality, appealing chance-constrained approach, only need to find ways to make optimisation tractable

Lars Blackmore, 8/15/2006

SLIDE 13


37

Bias

Num particles = 2, δ = 0.5; expected position of the furthest-right particle = 0.7

[Figure: two particles sampled from p(x_0), optimal direction u*]