A discrete time DP approach on a tree structure for finite horizon - PowerPoint PPT Presentation

A discrete time DP approach on a tree structure for finite horizon optimal control problems Maurizio Falcone joint works with A. Alla (PUC, Rio) and L. Saluzzi (GSSI, L ’Aquila) ICODE Workshop "Numerical Solution of HJB Equations" Paris VII, January 9, 2020

Outline 2 / 1

Outline 3 / 1

HJB equation for the finite horizon problem Controlled Dynamics and Cost Functional � ˙ y ( s , u ) = f ( y ( s ) , u ( s ) , s ) s ∈ ( t , T ] y ( t ) = x u ( t ) ∈ U = { u : [ t , T ] → U ⊂ R m compact , measurable } , � T L ( y ( s , u ) , u ( s ) , s ) e − λ ( s − t ) ds + g ( y ( T )) e − λ ( T − t ) J x , t ( u ) = t 4 / 1

HJB equation for the finite horizon problem Controlled Dynamics and Cost Functional � ˙ y ( s , u ) = f ( y ( s ) , u ( s ) , s ) s ∈ ( t , T ] y ( t ) = x u ( t ) ∈ U = { u : [ t , T ] → U ⊂ R m compact , measurable } , � T L ( y ( s , u ) , u ( s ) , s ) e − λ ( s − t ) ds + g ( y ( T )) e − λ ( T − t ) J x , t ( u ) = t Value Function v ( x , t ) := u ( · ) ∈U J x , t ( u ) inf 4 / 1

HJB equation for the finite horizon problem Dynamic Programming Principle �� τ � e − λ ( s − t ) L ( y ( s ) , u ( s ) , s ) ds + v ( y ( τ ) , τ ) e − λ ( τ − t ) v ( x , t ) = min u ∈U t 5 / 1

HJB equation for the finite horizon problem Dynamic Programming Principle �� τ � e − λ ( s − t ) L ( y ( s ) , u ( s ) , s ) ds + v ( y ( τ ) , τ ) e − λ ( τ − t ) v ( x , t ) = min u ∈U t HJB equation  − ∂ v ∂ t ( x , t ) + λ v ( x , t ) = min u ∈ U { L ( x , u , t ) + ∇ v ( x , t ) · f ( x , u , t ) }  v ( x , T ) = g ( x ) , x ∈ R d  5 / 1

HJB equation for the finite horizon problem Dynamic Programming Principle �� τ � e − λ ( s − t ) L ( y ( s ) , u ( s ) , s ) ds + v ( y ( τ ) , τ ) e − λ ( τ − t ) v ( x , t ) = min u ∈U t HJB equation  − ∂ v ∂ t ( x , t ) + λ v ( x , t ) = min u ∈ U { L ( x , u , t ) + ∇ v ( x , t ) · f ( x , u , t ) }  v ( x , T ) = g ( x ) , x ∈ R d  Optimal Feedback Map u ∗ ( x , t ) = arg min u ∈ U { L ( x , u , t ) + ∇ v ( x , t ) · f ( x , u , t ) } 5 / 1

Classical approach Semi-Lagrangian scheme ( λ = 0)  V n − 1 u ∈ U [∆ t L ( x i , u , t n ) + V n ( x i + ∆ t f ( x i , u , t n ))] , n = N , . . . , 1 = min i   x i ∈ Ω ∆ x .  V N = g ( x i ) ,  i 6 / 1

Classical approach Semi-Lagrangian scheme ( λ = 0)  V n − 1 u ∈ U [∆ t L ( x i , u , t n ) + V n ( x i + ∆ t f ( x i , u , t n ))] , n = N , . . . , 1 = min i   x i ∈ Ω ∆ x .  V N = g ( x i ) ,  i Cons of the approach V n ( x i + ∆ t f ( x i , u , t n )) is computed by interpolation operator. We need a numerical domain (not always given in the problem) Selection of boundary conditions (not always given in the problem) The curse of dimensionality makes the problem difficult to solve in high dimension (need e.g. model order reduction). 6 / 1

Other approaches and acceleration techniques Several methods have been developed to accelerate the computation and/or mitigate the curse of dimensionality Domain decomposition (static or dynamic): F .-Lanucara-Seghini (1994-...), Krener-Navasca (2007-...), Cacace-Cristiani-F .-Picarelli (2012) Iteration in policy space: Bellman (1957), Howard (1960), Bokanowski- Maroso-Zidani (2009), Alla-F .-Kalise (2015), Bokanowki–Desilles-Zidani (2018) Max-plus algebra and Galerkin approximation: Akian- Gaubert-Lakhoua (2008), McEneaney (2009-...), Dower (2017) 7 / 1

Other approaches and acceleration techniques Model Order Reduction: Kunisch-Volkwein-Xie (2004), Alla-F-Volkwein (2017) Sparse grids: Bokanowski-Garke-Griebel-Klompmaker (2013), Garke-Kroner (2016) Spectral Methods and Tensor Calculus: Kalise-Kundu-Kunisch (2019), Dolgov-Kalise-Kunisch (2019) Hopf formulas: Osher-Darbon (2016- ...), Yegorov-Dower-Grüne (2018) DNN/DGM: Pham-Warin (2019) 8 / 1

Outline 9 / 1

Tree Structure Algorithm (Alla, F. , Saluzzi ’18) We start with an initial condition x ∈ R d forming the first level T 0 . x 10 / 1

Tree Structure Algorithm (Alla, F. , Saluzzi ’18) We start with an initial condition x ∈ R d forming the first level T 0 . x Discretization : constant ∆ t for time and N u discrete controls. 10 / 1

Tree Structure Algorithm (Alla, F. , Saluzzi ’18) We start with an initial condition x ∈ R d forming the first level T 0 . x Discretization : constant ∆ t for time and N u discrete controls. Starting with x, we follow the dynamics given by the discrete controls T 1 = { ζ 1 i } i = { x + ∆ t f ( x , u i , t 0 ) } i , i = 1 , ..., N u ζ 1 1 x ζ 1 N u 10 / 1

Tree Structure Algorithm Given the nodes in the previous level, we construct the following one T n = { ζ n − 1 , u j , t n − 1 ) } N u + ∆ t f ( ζ n − 1 i = 1 , . . . , N n u . i i j = 1 ζ N 1 ... ζ 1 1 x ζ 1 N u ... ζ N N uN 11 / 1

Approximation of the value function Computation of the value function on the tree The tree structure defines T = {T r } N r = 0 , where we can compute the numerical value function:  V n ( ζ n u ∈ U ∆ u { V n + 1 ( ζ n i + ∆ t f ( ζ n i , u , t n )) + ∆ t L ( ζ n ζ n i ∈ T n i ) = min i , u , t n ) }  V N ( ζ N i ) = g ( ζ N ζ N ∈ T N i )  i 12 / 1

Approximation of the value function Computation of the value function on the tree The tree structure defines T = {T r } N r = 0 , where we can compute the numerical value function:  V n ( ζ n u ∈ U ∆ u { V n + 1 ( ζ n i + ∆ t f ( ζ n i , u , t n )) + ∆ t L ( ζ n ζ n i ∈ T n i ) = min i , u , t n ) }  V N ( ζ N i ) = g ( ζ N ζ N ∈ T N i )  i Pros No need for interpolation since the nodes x i + ∆ t f ( x i , u , t n ) belong to the tree by construction. Mitigation of the curse of dimensionality (e.g. , d ≫ 10). 12 / 1

Approximation of the value function Computation of the value function on the tree The tree structure defines T = {T r } N r = 0 , where we can compute the numerical value function:  V n ( ζ n u ∈ U ∆ u { V n + 1 ( ζ n i + ∆ t f ( ζ n i , u , t n )) + ∆ t L ( ζ n ζ n i ∈ T n i ) = min i , u , t n ) }  V N ( ζ N i ) = g ( ζ N ζ N ∈ T N i )  i Pros No need for interpolation since the nodes x i + ∆ t f ( x i , u , t n ) belong to the tree by construction. Mitigation of the curse of dimensionality (e.g. , d ≫ 10). Cons Dimensionality problem. In fact, given N u controls and N time steps, the cardinality of the tree is O ( N N + 1 ) . u 12 / 1

Solution : Pruning the tree 13 / 1

Solution : Pruning the tree ζ m-1 ζ m-1 ζ jm ζ jm T ε ζ in ζ in Pruning rule Given a threshold ε T , two nodes ζ n i and ζ n j will be merged if � ζ n i − ζ n j � ≤ ε T 14 / 1

The case of an autonomous dynamics The pruning rule and the computation of value function can be simplified, since we can extend the computation to the all previous tree levels 15 / 1

The case of an autonomous dynamics The pruning rule and the computation of value function can be simplified, since we can extend the computation to the all previous tree levels Pruning rule Given a threshold ε T , two nodes ζ n i and ζ m will be merged if j � ζ n i − ζ m j � ≤ ε T 15 / 1

The case of an autonomous dynamics The pruning rule and the computation of value function can be simplified, since we can extend the computation to the all previous tree levels Pruning rule Given a threshold ε T , two nodes ζ n i and ζ m will be merged if j � ζ n i − ζ m j � ≤ ε T Computation of the value function on the tree  u ∈ U ∆ u { V n + 1 ( ζ + ∆ t f ( ζ, u )) + ∆ t L ( ζ, u , t n ) } V n ( ζ ) = min ζ ∈ ∪ n k = 0 T k  V N ( ζ ) = g ( ζ ) ζ ∈ T  15 / 1

The case of an autonomous dynamics The pruning rule and the computation of value function can be simplified, since we can extend the computation to the all previous tree levels Pruning rule Given a threshold ε T , two nodes ζ n i and ζ m will be merged if j � ζ n i − ζ m j � ≤ ε T Computation of the value function on the tree  u ∈ U ∆ u { V n + 1 ( ζ + ∆ t f ( ζ, u )) + ∆ t L ( ζ, u , t n ) } V n ( ζ ) = min ζ ∈ ∪ n k = 0 T k  V N ( ζ ) = g ( ζ ) ζ ∈ T  Important reduction of the cardinality, we can get more information on V and this can be useful for the feedback reconstruction. 15 / 1

Efficient pruning Problem The computation of the distances among all the nodes would be very expensive, especially for high dimensional problems. 16 / 1

Efficient pruning Problem The computation of the distances among all the nodes would be very expensive, especially for high dimensional problems. One possible solution We project the data onto a lower dimensional linear space such that the variance of the projected data is maximized. This can be done e.g. computing the Singular Value Decomposition of the data matrix and taking the first basis. 16 / 1

A discrete time DP approach on a tree structure for finite horizon - PowerPoint PPT Presentation

A discrete time DP approach on a tree structure for finite horizon optimal control problems Maurizio Falcone joint works with A. Alla (PUC, Rio) and L. Saluzzi (GSSI, L Aquila) ICODE Workshop "Numerical Solution of HJB Equations"

Discrete-time Systems in the Time Domain Chaiwoot Boonyasiriwat August 21, 2020 Discrete-time

Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 ? C O L C O L B+ tree

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Discrete Buffer and Wire Sizing for Discrete Buffer and Wire Sizing for Link-Based Non-Tree Clock

Discrete time Markov chains Today: Discrete Time Markov Chains, Limiting Discrete time Markov

Tree-sitter @maxbrunsfeld What is Tree-sitter? Why I wrote Tree-sitter What were

Final Examples Announcements Trees Tree-Structured Data def tree(label, branches=[]): A tree

Discrete Mathematics Jeremy Siek Spring 2010 Jeremy Siek Discrete Mathematics 1 / 118 Jeremy

Cyber-Physical Systems Discrete Dynamics IECE 553/453 Fall 2019 Prof. Dola Saha 1 Discrete

CMSC 222: Discrete Mathematics Prof S Fall 2018 What is Discrete Mathematics? Discrete

Cyber-Physical Systems Discrete Dynamics ICEN 553/453 Fall 2018 Prof. Dola Saha 1 Discrete

Plan Discrete paths as Heyting algebras Discrete paths as categories Discrete paths as quantales

Evidence evaluation for discrete data Evidence evaluation for discrete data Evidence evaluation

The R-Tree Yufei Tao ITEE University of Queensland INFS4205/7205, Uni of Queensland The R-Tree

Simulation of Discrete-Time Markov Chains Discrete-Time Markov Chains (DTMCs) Numerical Solution

Discrete-Time Processing Overview Introduction Review of key discrete-time concepts

Circuits (Eulerian and Hamiltonian) Ioan Despi despi@turing.une.edu.au University of New

Discrete analytic functions. Integrable structure Alexander Bobenko Technical University Berlin

Final Exam Review CMPS/MATH 2170: Discrete Mathematics Overview Final Exam Format:

Unsupervised and Semi-supervised Learning of Structure Graham Neubig Site

Classification Relative to Hierarchical Order and Extension Property Luciano Vianna F elix

Fuzzy geometry via noncommutative frames: fuzzy de Sitter space Maja Buri c University of

MAT2345 Discrete Math The Course Propositional Logic Dr. Van Cleave Propositional

Semidefinite Approximations of Reachable Sets for Discrete-time Polynomial Systems Victor Magron ,