A Fokker-Planck control framework for multidimensional stochastic - PowerPoint PPT Presentation

A Fokker-Planck control framework for multidimensional stochastic processes Alfio Borz` ı Institute for Mathematics, Universit¨ at W¨ urzburg, Germany Joint work with Mario Annunziato (U Salerno) Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A multidimensional stochastic process We consider continuous-time stochastic processes described by the following multidimensional model � dX t = b ( X t , t ; u ) dt + σ ( X t , t ) dW t X t 0 = X 0 , where X t ∈ R n is the state variable and dW t ∈ R m is a multi-dimensional Wiener process, with stochastically independent components. We consider the action of a time-dependent control u ( t ) ∈ R ℓ in the drift term b ( X t , t ; u ) that allows to drive the vector random process. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The average objective Since X t is random, a deterministic objective will result into a random variable for which an averaging step is required. Therefore, the following objective is usually considered � T J ( X , u ) = E [ L ( t , X t , u ( t )) dt + Ψ[ X T ]] . 0 With this formulation it is supposed that the controller knows (all) the state of the system at each instant of time! The average E [ · ] of functionals of X t is omnipresent in almost all stochastic optimal control problems considered in the literature. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

Alternative approaches with deterministic objective The state of a stochastic process can be characterized by the shape of its statistical distribution represented by the probability density function (PDF). In some works, control schemes were proposed, where the deterministic objective depends on the PDF of the stochastic state variable and no average is needed. Examples are objectives defined by the Kullback-Leibler distance or the square distance between the state PDF and a desired one. Nevertheless, stochastic governing models are used and the state PDF is obtained by averaging or by an interpolation strategy. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The Fokker-Planck (-Kolmogorov) equation (step 1) Consider a particle at x at time t . Let π (+) ∆ x ( x ) and π ( − ) ∆ x ( x ) be the probabilities that the particle will be at x + ∆ x and x − ∆ x , at t + ∆ t . Let p ( x 0 , x ; t )∆ x be the conditional probability that the particle arrives at x at time t starting from x 0 at t = 0 following a random path. We have p ( x 0 , x − ∆ x ; t − ∆ t ) π (+) p ( x 0 , x ; t )∆ x = ∆ x ( x − ∆ x )∆ x p ( x 0 , x + ∆ x ; t − ∆ t ) π ( − ) + ∆ x ( x + ∆ x )∆ x p ( x 0 , x ; t − ∆ t )(1 − π (+) ∆ x ( x ) − π ( − ) + ∆ x ( x ))∆ x . From this discrete model of a stochastic process, we build one with infinitesimal increments for ∆ x , ∆ t → 0. For a meaningful statistical limiting process, the probabilities π (+) ∆ x and π ( − ) ∆ x must be subject to some constraints. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The Fokker-Planck (-Kolmogorov) equation (step 2) Consider the mean of change of particle position X ( t ), conditional on X ( t ) = x , E [ X ( t + ∆ t ) − X ( t ) | X ( t ) = x ] β ( x ) = lim ∆ t ∆ t → 0 and the corresponding variance is given by V [ X ( t + ∆ t ) − X ( t ) | X ( t ) = x ] α ( x ) = lim . ∆ t ∆ t → 0 On the other hand, given the particle at x at time t , then at time t + ∆ t the mean value of change of position is as follows ∆ x ( π (+) ∆ x ( x ) − π ( − ) ∆ x ( x )) and the corresponding variance is given by ∆ x 2 ( π (+) ∆ x ( x ) + π ( − ) ∆ x ( x ) − ( π (+) ∆ x ( x ) − π ( − ) ∆ x ( x )) 2 ) . Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The Fokker-Planck (-Kolmogorov) equation (step 3) For the limiting process, we require ∆ x ( x ))∆ x ∆ x , ∆ t → 0 ( π (+) ∆ x ( x ) − π ( − ) β ( x ) = lim ∆ t and ∆ x ( x )) 2 )∆ x 2 ∆ x , ∆ t → 0 ( π (+) ∆ x ( x ) − π ( − ) ∆ x ( x ) − ( π (+) ∆ x ( x ) − π ( − ) α ( x ) = lim ∆ t . These provide constraints for the form of π (+) ∆ x ( x ) and π ( − ) ∆ x ( x ). We suppose the scale law (∆ x ) 2 = A ∆ t (Wiener or Gaussian white noise). The choices ∆ x ( x ) = 1 π (+) 2 A ( α ( x ) + β ( x )∆ x ) and ∆ x ( x ) = 1 π ( − ) 2 A ( α ( x ) − β ( x )∆ x ) satisfy the above constraints. We require α ( x ) ≥ β ( x )∆ x . Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The Fokker-Planck (-Kolmogorov) equation (step 4) By expanding in Taylor series (step 1) up to second order, we obtain ′ ∆ x + 1 ′′ ∆ x 2 ) 2 p xx ∆ x 2 − p t ∆ t )( π (+) ∆ x − π (+) 2 π (+) ( p − p x ∆ x + 1 p ≃ ∆ x ∆ x ′ ∆ x + 1 ′′ ∆ x 2 ) 2 p xx ∆ x 2 − p t ∆ t )( π (+) ∆ x + π (+) 2 π (+) ( p + p x ∆ x + 1 + ∆ x ∆ x ( p − p t ∆ t )(1 − π (+) ∆ x − π ( − ) + ∆ x ) . Finally, by using the constraints for α and β , and the scale law, we obtain the Fokker-Planck equation ∂ t p ( x 0 , x ; t ) = 1 2 ∂ 2 xx ( α ( x ) p ( x 0 , x ; t )) − ∂ x ( β ( x ) p ( x 0 , x ; t )) . Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A new approach based on the Fokker-Planck equation The evolution of the PDF given by f = f ( x , t ), x ∈ Ω ⊂ R n , associated to the stochastic process is modelled by the Fokker-Planck (FP) equation. n n ∂ t f − 1 � � ∂ 2 x i x j ( a ij f ) + ∂ x i ( b i ( u ) f ) = 0 2 i , j =1 i =1 f ( t 0 ) = ρ This is a partial differential equation of parabolic type with Cauchy data given by the initial PDF distribution. The formulation of objectives with the PDF and the Fokker-Planck equation provide a consistent framework to the optimal control of stochastic processes. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The transition density probability and the PDF Denote with f ( x , t ) the probability density to find the process at x = ( x 1 , . . . , x n ) at time t . Let ˆ f ( x , t ; y , s ) denotes the transition density probability distribution function for the stochastic process to move from y ∈ R n at time s to x ∈ R n at time t . Both f ( x , t ) and ˆ f ( x , t ; y , s ) are nonnegative functions and the following holds � ˆ ˆ f ( x , t | y , s ) ≥ 0 , f ( x , t | y , s ) dx = 1 for all t ≥ s . Ω Given an initial PDF ρ ( y , s ) at time s , we have the following � ˆ f ( x , t ) = f ( x , t | y , s ) ρ ( y , s ) dy , t > s . Ω � Also ρ should be nonnegative and Ω ρ ( y , s ) dy = 1. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A tracking objective We consider the control problem formulated in the time window ( t k , t k +1 ) with known initial value at time t k . We formulate the problem to determine a piecewise constant control u ( t ) ∈ R ℓ such that the process evolves towards a desired target probability density f d ( x , t ) at time t = t k +1 . This objective can be formulated by the the following tracking functional J ( f , u ) := 1 L 2 (Ω) + ν 2 � f ( · , t k +1 ) − f d ( · , t k +1 ) � 2 2 | u | 2 . where | u | 2 = u 2 1 + . . . + u 2 ℓ . Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A Fokker-Planck optimal control problem The optimal control problem to find u that minimizes the objective J subject to the constraint given by the FP equation is formulated by the following min J ( f , u ) := 1 L 2 (Ω) + ν 2 � f ( · , t k +1 ) − f d ( · , t k +1 ) � 2 2 | u | 2 n n ∂ t f − 1 � ∂ 2 � x i x j ( a ij f ) + ∂ x i ( b i ( u ) f ) = 0 2 i , j =1 i =1 f ( t k ) = ρ. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A Fokker-Planck optimality system This first-order necessary optimality condition is characterized as the solution of the following optimality system ∂ t f − 1 � n x i x j ( a ij f ) + � n i , j =1 ∂ 2 i =1 ∂ x i ( b i ( u ) f ) = 0 in Q k 2 f ( x , t k ) = ρ ( x ) in Ω � n x i x j p − � n − ∂ t p − 1 i , j =1 a ij ∂ 2 i =1 b i ( u ) ∂ x i p = 0 in Q k 2 p ( x , t k +1 ) = f ( x , t k +1 ) − f d ( x , t k +1 ) in Ω f = 0 , p = 0 on Σ k �� n � i =1 ∂ x i ( ∂ b i ν u l + ∂ u l f ) , p = 0 in Q k l = 1 , . . . , ℓ where Q k = Ω × ( t k , t k +1 ) and Σ k = ∂ Ω × ( t k , t k +1 ). Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

The reduced gradient In the optimality equation, we have used the following inner product � t k +1 � ( φ, ψ ) = φ ( x , t ) ψ ( x , t ) dx dt . t k Ω The l th component of the reduced gradient ∇ ˆ J is given by � n � � ∂ b i � ( ∇ ˆ � J ) l = ν u l + ∂ x i f , p , l = 1 , . . . , ℓ, ∂ u l i =1 where p = p ( u ) is the solution of the adjoint equation for given f ( u ). Notice that we are discussing a nonlinear control mechanism and thus the optimization problem is nonconvex. Alfio Borz` ı A Fokker-Planck control framework for multidimensional stochastic

A Fokker-Planck control framework for multidimensional stochastic - PowerPoint PPT Presentation

A Fokker-Planck control framework for multidimensional stochastic processes Alfio Borz` Institute for Mathematics, Universit at W urzburg, Germany Joint work with Mario Annunziato (U Salerno) Alfio Borz` A Fokker-Planck control

An operator splitting method for solving a class of Fokker-Planck equations Beatrice Gaviraghi

Outline Outline Itos Equation Itos Equation Fokker Fokker- -Planck

Outline Outline Stationary Solution to Fokker Stationary Solution to Fokker- - Planck

GenFoo: a general Fokker-Planck solver with applications in fusion plasma physics L. J. Hk

Geometric Kramers-Fokker-Planck operators with boundary Francis Nier, conditions IRMAR, Univ.

Experience with Stress Workflow Automation at Fokker Aerostructures Fokker Aerostructures Ronald

EE 355 Unit 5 Multidimensional Arrays Mark Redekopp 2 MULTIDIMENSIONAL ARRAYS 3

Multidimensional SAR SAR Imaging Imaging: : Multidimensional Studies in the in the Framework

TECHNISCHE UNIVERSIT AT WIEN Entropy method for hypocoercive & non-symmetric

The Kramers- Fokker-Planck equation with a short-range potential Xue Ping WANG Universit de

Outline Outline Markov Processes Markov Processes Important Properties

A Fokker-Planck Model in for Two Interacting Populations of Neurons J.A. Carrillo 1 , S. Cordier 2

Global Hypoellipticity and Compactness of Resolvent for Fokker-Planck Operator Wei-Xi Li

Quasineutral Limits of the Vlasov- Introduction VPFP system Poisson/Maxwell-Fokker-Planck System

A Semi-Lagrangian discretization of non linear fokker Planck equations E. Carlini Universit` a

High Capability Multidimensional Data High Capability Multidimensional Data Compression on GPUs

AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs Gabriele Abbati *1 , Philippe

An Efficient Algorithm for Computing the Time Resolved Full Sky Cross Power in an Interferometer

Dessin de triangulations: algorithmes, combinatoire, et analyse Eric Fusy Projet ALGO,

Space-time models with dust and cosmological constant, that allow integrating the

Optimal sampling of multiple linear processes over a shared medium Sebin Mathew a , Karl H.

Rigidity in Markovian maximal couplings. Sayan Banerjee (Joint work with Wilfrid S. Kendall).

Stochastic areas and windings Fabrice Baudoin (Joint with J. Wang) ICTP, Trieste September 17,

Neurons (nerve cells) Faculty of Science The Morris Lecar neuron model gives rise to the

Sambuz

Useful Links

Newsletter

Mail Us

A Fokker-Planck control framework for multidimensional stochastic - PowerPoint PPT Presentation

A Fokker-Planck control framework for multidimensional stochastic processes Alfio Borz` Institute for Mathematics, Universit at W urzburg, Germany Joint work with Mario Annunziato (U Salerno) Alfio Borz` A Fokker-Planck control

An operator splitting method for solving a class of Fokker-Planck equations Beatrice Gaviraghi

Outline Outline Itos Equation Itos Equation Fokker Fokker- -Planck

Outline Outline Stationary Solution to Fokker Stationary Solution to Fokker- - Planck

GenFoo: a general Fokker-Planck solver with applications in fusion plasma physics L. J. Hk

Geometric Kramers-Fokker-Planck operators with boundary Francis Nier, conditions IRMAR, Univ.

Experience with Stress Workflow Automation at Fokker Aerostructures Fokker Aerostructures Ronald

EE 355 Unit 5 Multidimensional Arrays Mark Redekopp 2 MULTIDIMENSIONAL ARRAYS 3

Multidimensional SAR SAR Imaging Imaging: : Multidimensional Studies in the in the Framework

TECHNISCHE UNIVERSIT AT WIEN Entropy method for hypocoercive &amp; non-symmetric

The Kramers- Fokker-Planck equation with a short-range potential Xue Ping WANG Universit de

Outline Outline Markov Processes Markov Processes Important Properties

A Fokker-Planck Model in for Two Interacting Populations of Neurons J.A. Carrillo 1 , S. Cordier 2

Global Hypoellipticity and Compactness of Resolvent for Fokker-Planck Operator Wei-Xi Li

Quasineutral Limits of the Vlasov- Introduction VPFP system Poisson/Maxwell-Fokker-Planck System

A Semi-Lagrangian discretization of non linear fokker Planck equations E. Carlini Universit` a

High Capability Multidimensional Data High Capability Multidimensional Data Compression on GPUs

AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs Gabriele Abbati *1 , Philippe

An Efficient Algorithm for Computing the Time Resolved Full Sky Cross Power in an Interferometer

Dessin de triangulations: algorithmes, combinatoire, et analyse Eric Fusy Projet ALGO,

Space-time models with dust and cosmological constant, that allow integrating the

Optimal sampling of multiple linear processes over a shared medium Sebin Mathew a , Karl H.

Rigidity in Markovian maximal couplings. Sayan Banerjee (Joint work with Wilfrid S. Kendall).

Stochastic areas and windings Fabrice Baudoin (Joint with J. Wang) ICTP, Trieste September 17,

Neurons (nerve cells) Faculty of Science The Morris Lecar neuron model gives rise to the

Sambuz

Useful Links

Newsletter

Mail Us

TECHNISCHE UNIVERSIT AT WIEN Entropy method for hypocoercive & non-symmetric