A numerical method for mean-field type problems Laurent Pfeiffer - PowerPoint PPT Presentation

Mean-field type problems Algorithm and application A numerical method for mean-field type problems Laurent Pfeiffer Institute for Mathematics and Scientific Computing, University of Graz Numerical methods for HJB equations in optimal control and related fields Ricam, Linz, November 22nd, 2016

Mean-field type problems Algorithm and application Introduction Goal: analysing and solving stochastic optimal control problems Specificity: the cost function is a function of the probability distribution of the state variable at the final time. Method: kind of gradient method. Application: risk-averse optimization.

Mean-field type problems Algorithm and application 1 Mean-field type problems Fokker-Planck equation Problem formulation Optimality conditions 2 Algorithm and numerical example Algorithm Results

Mean-field type problems Algorithm and application Fokker-Planck equation Consider the stochastic differential equation (SDE) : d X t = f ( X t ) d t + σ ( X t ) d W t , X 0 = x 0 . with f : R n → R , σ : R n → R , ( W t ) t ≥ 0 a Brownian motion, and x 0 a random variable in R n with probability distribution m 0 . Let m ( t , · ) ∈ P ( R ) be the probability distribution of X t : � � � P X t ∈ Ω = 1 d m ( t , x ) , ∀ Ω ⊂ R . Ω Under assumptions: weak solution to the Fokker-Planck equation (FP) : n n ∂ x i ( mf i ) + 1 � � ∂ t m = − ∂ x i x j ( m σ i σ j ) . 2 i =1 i , j =1

Mean-field type problems Algorithm and application Problem formulation Let U be the set of adapted control processes taking values in a given compact U . For all t ∈ [0 , T ], x ∈ R n , u ∈ U , let ( X t , x , u ) s ∈ [ t , T ] be solution to: s d X s = f ( X s , u s ) d s + σ ( X s , u s ) d W s , X t = x , where f : R n × U → R n and σ : R n × U → R n are given. Assumptions: ∃ L > 0, ∀ x , y ∈ R n , ∀ u , v ∈ U , | f ( x , u ) | + | σ ( x , u ) | ≤ L (1 + | x | + | u | ) , | f ( x , u ) − f ( y , v ) | + | σ ( x , u ) − σ ( y , v ) | ≤ L ( | y − x | + | v − u | ) . | X 0 | 2 � � Let X 0 be a random variable such that E < + ∞ .

Mean-field type problems Algorithm and application Problem formulation For all u ∈ U , we denote by m u the probability distribution of X 0 , X 0 , u , for a fixed initial state X 0 . We aim at solving: T � m u � min u ∈U χ ( P ) where the cost χ : P ( R n ) → R is given. Remark: attempt of a PDE-constrained problem formulation: u :[0 , T ] × R n → U χ ( m ( T , · )) , min subject to:  ∂ t m ( t , · ) = − � n � � m ( t , · ) f i ( · , u ( t , · ))  i =1   � n + 1 � � m ( t , · ) σ i σ j ( · , u ( t , · )) i , j =1 ∂ x i x j 2   m (0 , · ) = L ( X 0 ) .  But well-posedness of the Fokker-Planck equation is not ensured.

Mean-field type problems Algorithm and application Problem formulation Possible application: risk-averse optimization ( n = 1). Penalization of the variance: � � � � 2 � χ ( m ) = x d m ( x ) + ε x − y d m ( y ) d m ( x ) . R R R Conditional Value at Risk: 1 � CVaR β = x 1 x ≥ VaR β d m ( x ) 1 − β R � � � where: VaR β = sup z ∈ R | 1 x ≤ z d m ( x ) ≤ β . R

Mean-field type problems Algorithm and application Optimality conditions Specific case: standard problems. Assume ∃ φ : R n → R s.t.: � φ ( X 0 , x 0 , u χ ( m u ) = R n φ ( x )d m u ( x ) = E � � ) . T The corresponding problem is solved by dynamic programming. φ ( X 0 , x 0 , u � � min u ∈U E ) . ( P ( φ )) T Theorem φ ( X t , x , u � � The value function: V ( t , x ) = min u ∈U E ) is the T solution to the Hamilton-Jacobi-Bellman (HJB) equation: ∇ V ( t , x ) ⊤ f ( x , u ) + 1 ∇ 2 V ( t , x ) σσ ⊤ ( x , u ) � � �� − ∂ t V ( t , x ) =min 2 tr u ∈ U V ( T , x ) = φ ( x ) . → Provides a characterization of the optimal control.

Mean-field type problems Algorithm and application Optimality conditions General case. Theorem Assume the following: 1 χ is continuous for the Wasserstein d 1 -distance 2 χ is diff.: ∀ m 1 , m 2 ∈ P ( R n ) , ∃ D χ ( m 1 , · ) ∈ C ( R n , R ) s.t.: � � χ (1 − θ ) m 1 + θ m 2 − χ ( m 1 ) � � � − → D χ ( m 1 , x ) d m 2 ( x ) − m 1 ( x ) . θ θ → 0 We also assume: ∃ K > 0 , ∀ x ∈ R n , D χ ( m 1 , x ) ≤ K (1 + | x | 2 ) . u is a solution to P ( D χ ( m ¯ u )) . If ¯ u ∈ U is a solution to ( P ) , then ¯ Remark: The associated value function V ( t , x ) may be seen as a Lagrange multiplier for the Fokker-Planck equation.

Mean-field type problems Algorithm and application Optimality conditions Let R be the set of reachable prob. distributions: { m u | u ∈ U} . Lemma The closure of R (for the d 1 -distance), cl ( R ) is convex. m = m ¯ u . By continuity of χ , Proof of the theorem. Let ¯ χ ( ¯ m ) = m ∈ cl( R ) χ ( m ) . inf By convexity of cl( R ), for all u ∈ U , for all θ ∈ [0 , 1], 0 ≤ χ ( θ m u + (1 − θ ) ¯ m ) − χ ( ¯ m ) � − → R n D χ ( ¯ m , x )d( m ( x ) − ¯ m ( x )) . θ θ → 0 m , X 0 , X 0 , ¯ m , X 0 , X 0 , u u � � � � Thus: E D χ ( ¯ ) ≤ E D χ ( ¯ ) . T T

Mean-field type problems Algorithm and application 1 Mean-field type problems Fokker-Planck equation Problem formulation Optimality conditions 2 Algorithm and numerical example Algorithm Results

Mean-field type problems Algorithm and application Algorithm Set k = 0, choose m 0 ∈ R , fix δ > 0. While ε ( m k ) > δ , do: 1 Backward phase (HJB): solve P ( D χ ( m k )), optimal sol.: u k . 2 Forward phase (FP): compute m = m u k . 3 Solve: min θ ∈ [0 , 1] χ ( θ m k + (1 − θ ) m ), solution: θ k . Set: m k +1 = θ k m k + (1 − θ k ) m . 4 Set k = k + 1. The criterion ε ( m ) is defined by: � R n D χ ( m , x )d( m ′ ( x ) − m ( x )) ≥ 0 . ε ( m ) = − inf m ′ ∈ cl( R ) u satisfies the optimality conditions iff ε ( m u ) = 0. Note that ¯ Remark: does not provide a feedback optimal solution.

Mean-field type problems Algorithm and application Algorithm Theorem Assume that: ∃ K > 0 , ∀ m 1 , m 2 , m 3 , m 4 ∈ cl ( R ) , � � � d ( m 2 ( x ) − m 1 ( x )) ≤ Kd 1 ( m 1 , m 2 ) 2 D χ ( m 2 , x ) − D χ ( m 1 , x ) R n � � � D χ ( m 2 , x ) − D χ ( m 1 , x ) d ( m 4 ( x ) − m 3 ( x )) ≤ Kd 1 ( m 1 , m 2 ) . R n Then, the sequence ( m k ) k ∈ N generated by the method (without stopping criterion) possesses a limit point ¯ m such that ε ( ¯ m ) = 0 . Moreover, χ ( m k ) → χ ( ¯ m ) . Idea of proof. Inspired from gradient descent methods. There exist A > 0 and B > 0 such that: χ ( m k +1 ) − χ ( m k ) ≤ − min A ε ( m k ) , B ε ( m k ) 2 � � .

Mean-field type problems Algorithm and application Algorithm Given φ 1 ,..., φ N : R n → R , and Ψ : R N → R , define: � � � χ ( m u ) =Ψ R n φ 1 ( x )d m u ( x ) , ..., � R n φ N ( x )d m u ( x ) � �� φ 1 ( X 0 , X 0 , u φ N ( X 0 , X 0 , u � � � = Ψ ) , ..., E ) . E T T Lemma Assume that Ψ is differentiable with a Lipschitz-derivative, assume that for some p ≥ 2 : | φ i ( x ) | (1 + | x | ) − p − | X 0 | p � � | x |→∞ 0 , → < + ∞ . E Then, the assumptions of the previous theorem are satisfied, with: � � � D χ ( m , x ) = � N R n φ 1 ( x ) dm u ( x ) , ... i =1 ∂ y i Ψ φ i ( x ) .

Mean-field type problems Algorithm and application Algorithm Backward phase: Discretization of the SDE (Semi-Lagrangian scheme) with a controlled Markov chain Resolution of the HJB equation (discrete dynamic programming principle) Forward phase: Resolution of the FP equation (adjoint equation to the Markov chain → Chapman-Kolmogorov equation.) Remarks: Curse of dimensionality Computational effort in the backward phase.

Mean-field type problems Algorithm and application Results Example considered: SDE: d X s = u s d s + d W s , X 0 = 0, with final time 1. Controls: u s ∈ U = [ − 1 , 1] Cost: χ ( m ) = d 2 ( m , m ref ), with: m ref = 1 3 ( δ − 2 + δ 0 + δ 2 ) . Discretization: Semi-Lagrangian scheme 100 × 5000 points in [0 , 1] × [ − 5 , 5], 20 points for the control Convergence: Iterations 0 10 20 30 40 50 χ ( m k ) 0 . 874 0 . 551 0 . 536 0 . 531 0 . 528 0 . 526 ε ( m k ) 0 . 43 0 . 043 0 . 030 0 . 020 0 . 030 0 . 025

Mean-field type problems Algorithm and application Results 0.2 0.1 0 0.2 0.4 0.6 4 0.8 2 0 −2 Time 1 −4 Space Figure: Distribution along time

Mean-field type problems Algorithm and application Results 1 0 −1 0 0.2 0.4 0.6 5 0.8 0 Time 1 −5 Space Figure: Control

Mean-field type problems Algorithm and application Results 4 2 0 −2 0 0.5 4 2 0 −2 1 Time −4 Space Figure: Value function

Mean-field type problems Algorithm and application Bibliography References: A. Bensoussan, J. Frehse, and P. Yam. Mean-field games and mean-field type control theory. Springer, 2013. L. Pfeiffer. Optimality conditions for mean-field type control problems. Preprint. L. Pfeiffer. Numerical methods for mean-field type optimal control problems. Pure and Applied Functional Analysis , 1(4):629-655, 2016. Thank you for you attention.

A numerical method for mean-field type problems Laurent Pfeiffer - PowerPoint PPT Presentation

Mean-field type problems Algorithm and application A numerical method for mean-field type problems Laurent Pfeiffer Institute for Mathematics and Scientific Computing, University of Graz Numerical methods for HJB equations in optimal control

Type Checking Grammar Rule Semantic Rule var-decl id : type-exp Insert (id.name, type-exp .

Mean Field Games problems for linear control system and ergodic behavior of Mean Field Games

Overview of mean-field and beyond mean-field theoretical studies on giant resonances G. Col

Mean Field Games: Numerical Methods Y. Achdou October 24, 2011 Content A partial review on

Type Reconstruction and Polymorphism 1 Type Checking and Type Reconstruction We now come to the

JUST THE MATHS SLIDES NUMBER 17.7 NUMERICAL MATHEMATICS 7 (Numerical solution) of

JUST THE MATHS SLIDES NUMBER 17.8 NUMERICAL MATHEMATICS 8 (Numerical solution) of

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Statistics in Biology The Mean Mean ( x ) is a measure of the central tendency of a set of data

Notion of mean point in the data Why bother about mean point? Defining mean point can be

JUST THE MATHS SLIDES NUMBER 13.2 INTEGRATION APPLICATIONS 2 (Mean values) & (Root

As a prelude to the back-analysis intended for the full MAE Center report that is currently under

ACTIVE AND EPHEMERAL REGIONS IN THE SOLAR MEAN MAGNETIC FIELD EDDIE ROSS W.J. CHAPLIN, G.R.

A Tutorial on Mean Field and Refined Mean Field Approximation Nicolas Gast Inria, Grenoble,

Mean Field Equilibria of Dynamic Auctions Ramesh Johari Stanford University June 7, 2012 1 / 99

DETERMINISTIC MEAN FIELD GAMES Italo Capuzzo Dolcetta Sapienza Universit` a di Roma and GNAMPA

The Master Equation in a Bounded Domain with Neumann Conditions Michele Ricciardi Universit di

From the master equation to mean field game asymptotics Daniel Lacker Division of Applied

Variational estimates for martingale transforms Pavel Zorin-Kranich University of Bonn

Facility for Antiproton & Ion Research A World-Wide Unique Accelerator Lab Peter Senger, GSI

Advanced File Systems, Advanced File Systems, ZFS ZFS http://d3s.mff.cuni.cz/aosy

Deriving Enforcement Mechanisms from H. Janicke, et.al. Policies Motivation ITL Policy Rules

Air Quality Sensing in Denmark Sebastian Bttrich ICTP March 2017 IT

Probability, Control and Finance In honor for Ioannis Karatzas Columbia University, June 6, 2012

A numerical method for mean-field type problems Laurent Pfeiffer - PowerPoint PPT Presentation

Mean-field type problems Algorithm and application A numerical method for mean-field type problems Laurent Pfeiffer Institute for Mathematics and Scientific Computing, University of Graz Numerical methods for HJB equations in optimal control

Type Checking Grammar Rule Semantic Rule var-decl id : type-exp Insert (id.name, type-exp .

Mean Field Games problems for linear control system and ergodic behavior of Mean Field Games

Overview of mean-field and beyond mean-field theoretical studies on giant resonances G. Col

Mean Field Games: Numerical Methods Y. Achdou October 24, 2011 Content A partial review on

Type Reconstruction and Polymorphism 1 Type Checking and Type Reconstruction We now come to the

JUST THE MATHS SLIDES NUMBER 17.7 NUMERICAL MATHEMATICS 7 (Numerical solution) of

JUST THE MATHS SLIDES NUMBER 17.8 NUMERICAL MATHEMATICS 8 (Numerical solution) of

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Statistics in Biology The Mean Mean ( x ) is a measure of the central tendency of a set of data

Notion of mean point in the data Why bother about mean point? Defining mean point can be

JUST THE MATHS SLIDES NUMBER 13.2 INTEGRATION APPLICATIONS 2 (Mean values) &amp; (Root

As a prelude to the back-analysis intended for the full MAE Center report that is currently under

ACTIVE AND EPHEMERAL REGIONS IN THE SOLAR MEAN MAGNETIC FIELD EDDIE ROSS W.J. CHAPLIN, G.R.

A Tutorial on Mean Field and Refined Mean Field Approximation Nicolas Gast Inria, Grenoble,

Mean Field Equilibria of Dynamic Auctions Ramesh Johari Stanford University June 7, 2012 1 / 99

DETERMINISTIC MEAN FIELD GAMES Italo Capuzzo Dolcetta Sapienza Universit` a di Roma and GNAMPA

The Master Equation in a Bounded Domain with Neumann Conditions Michele Ricciardi Universit di

From the master equation to mean field game asymptotics Daniel Lacker Division of Applied

Variational estimates for martingale transforms Pavel Zorin-Kranich University of Bonn

Facility for Antiproton &amp; Ion Research A World-Wide Unique Accelerator Lab Peter Senger, GSI

Advanced File Systems, Advanced File Systems, ZFS ZFS http://d3s.mff.cuni.cz/aosy

Deriving Enforcement Mechanisms from H. Janicke, et.al. Policies Motivation ITL Policy Rules

Air Quality Sensing in Denmark Sebastian Bttrich ICTP March 2017 IT

Probability, Control and Finance In honor for Ioannis Karatzas Columbia University, June 6, 2012

JUST THE MATHS SLIDES NUMBER 13.2 INTEGRATION APPLICATIONS 2 (Mean values) & (Root

Facility for Antiproton & Ion Research A World-Wide Unique Accelerator Lab Peter Senger, GSI