Solving High Dimensional Hamilton- Jacobi-Bellman Equations Using Low Rank Tensor Decomposition
Yoke Peng Leong California Institute of Technology
Joint work with Elis Stefansson, Matanya Horowitz, Joel Burdick
Solving High Dimensional Hamilton- Jacobi-Bellman Equations Using - - PowerPoint PPT Presentation
Solving High Dimensional Hamilton- Jacobi-Bellman Equations Using Low Rank Tensor Decomposition Yoke Peng Leong California Institute of Technology Joint work with Elis Stefansson, Matanya Horowitz, Joel Burdick 1 Motivation Synthesize
Joint work with Elis Stefansson, Matanya Horowitz, Joel Burdick
1 > 40 degree of freedoms > 12 degree of freedoms
2
Brownian noise 3
PDE Nonlinear
4
Log transformation:
Condition:
Any system of the following form can satisfy the condition
Desirability function
5
Suboptimal Stabilizing Controllers for Linearly Solvable Systems, Y. P. Leong, M. B. Horowitz, J. W. Burdick, CDC 2015 Linearly Solvable Stochastic Control Lyapunov Functions, Y. P. Leong, M. B. Horowitz, J. W. Burdick, SIAM Journal on Control and Optimization, Accepted
Upper bound solution Relaxation Sum of squares program
6
Theorem: The approximate solution is a stochastic control Lyapunov function (SCLF). Corollary: The suboptimal controller is stabilizing in probability.
Proof: Relaxed HJB and satisfies the definition of SCLF.
The approximate solution gives a suboptimal stabilizing controller. Theorem: Given the controller , then
Expected cost of a system using the given controller
The approximate solution gives an upper bound to the actual cost when using the suboptimal controller.
Proof: Manipulate the relaxed HJB and the error bound of approximate value function. 7
Suboptimal Stabilizing Controllers for Linearly Solvable Systems, Y. P. Leong, M. B. Horowitz, J. W. Burdick, CDC 2015 Linearly Solvable Stochastic Control Lyapunov Functions, Y. P. Leong, M. B. Horowitz, J. W. Burdick, SIAM Journal on Control and Optimization, Accepted
Upper bound solution Relaxation Sum of squares program
8
9
10 > 40 degree of freedoms > 12 degree of freedoms
11
12
13
Tensor term Separation rank Basis function Normalization constant
14
Function Operator
Fokker-Planck equations. ACC, 2014
15
16
Rewrite Tensor form
17
18 x =
Boundary Solution Operator 1 1 1
Rewrite Tensor form Benefit: Memory and operations scale linearly with dimension
19
20
21
regularization 𝛽 ~ 𝜈 approximately rounding error
22
Normal equation
Complexity Accuracy Rounding error Number
23
24 Residual Error
Normal equation
25
26 x =
Boundary Solution Operator 1 1 1
27 Residual Error
MATLAB code is available at http://www.cds.caltech.edu/~yleong/
28 Residual Error
29
Canonical Tensor Decompositions: Application To A PDE With Random Data, SIAM J. Scientific Computing, 2016
Random matrix
30
31 Linux machine with 3 GHz i7 processor and 64 GB RAM MATLAB 2014a Solution rank, rF = 328
problems,” J. Dynamics and Differential Equations, 2006.
Accuracy
Number of grid = 201
32
33
control design for slightly non-minimum phase systems: application to V/STOL
Number of grid = 100
34 Number of grid = 100
35
mini-rotorcraft. In Quad Rotorcraft Control 2013. Springer London.
Number of grid = 100
36
37
38