convergence of some deterministic mean field games to
play

Convergence of some deterministic Mean Field Games to aggregation - PowerPoint PPT Presentation

Convergence of some deterministic Mean Field Games to aggregation and flocking models Martino Bardi joint work with Pierre Cardaliaguet Dipartimento di Matematica "Tullio Levi-Civita" Universit di Padova "Crowds: models and


  1. Convergence of some deterministic Mean Field Games to aggregation and flocking models Martino Bardi joint work with Pierre Cardaliaguet Dipartimento di Matematica "Tullio Levi-Civita" Università di Padova "Crowds: models and control" CIRM Marseille, June 3–7, 2019 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 1 / 26

  2. Plan Connecting MFGs to kinetic models ? 1 ◮ Mean Field Games and their system of PDEs ◮ Agent-based models ◮ Large interest rate limit for stochastic MFG: Bertucci-Lasry-Lions ◮ The setting of Degond-Herty-Liu Convergence of MFGs to nonlocal continuity equations 2 ◮ a) MFG with controlled velocity → aggregation equation ◮ b) MFG with controlled acceleration → kinetic equations of flocking type ◮ Outline of the proof for controlled velocity ◮ Outline of the proof for controlled acceleration Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 2 / 26

  3. 1. Mean Field Games PDEs − ∂ u in ( 0 , T ) × R d  ∂ t − ν ∆ u + H ( ∇ u ) = F ( x , m )      ∂ m in ( 0 , T ) × R d (MFE) ∂ t − ν ∆ m − div ( m ∇ H ( ∇ u )) = 0      u ( T , x ) = g ( x ) , m ( 0 , x ) = m o ( x ) , m ( x , t ) = equilibrium distribution of the agents at time t ; u ( x , t ) = value function of the representative agent Data: ν ≥ 0 , H = L ∗ , e.g., H ( p ) = | p | 2 2 , F : R d × P 1 ( R d ) → R = running cost , g = terminal cost, � m o ≥ 0 = initial distribution of the agents, R d m o ( x ) dx = 1. 1st equation is backward H-J-B, 2nd equation is forward K-F-P eq. Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 3 / 26

  4. Control interpretation of the MFE � T u ( x , t ) = inf E [ L ( α ( s )) + F ( y ( s ) , m ( s )) ds + g ( y ( T ))] t over controls α and trajectories of √ dy ( s ) = α ( s ) ds + 2 ν dW ( s ) , y ( t ) = x √ dy ( s ) = −∇ H ( ∇ u ( y ( s ) , s )) ds + 2 ν dW ( s ) = optimal trajectory of the representative agent m ( x , t ) = distribution of particles moving along optimal trajectories In particular, for ν = 0 and H ( p ) = | p | 2 / 2 the dynamics with ˙ optimal feedback is y ( s ) = −∇ u ( y ( s ) , s ) and the KFP equation becomes ∂ m ∂ t − div ( m ∇ u ) = 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 4 / 26

  5. Control interpretation of the MFE � T u ( x , t ) = inf E [ L ( α ( s )) + F ( y ( s ) , m ( s )) ds + g ( y ( T ))] t over controls α and trajectories of √ dy ( s ) = α ( s ) ds + 2 ν dW ( s ) , y ( t ) = x √ dy ( s ) = −∇ H ( ∇ u ( y ( s ) , s )) ds + 2 ν dW ( s ) = optimal trajectory of the representative agent m ( x , t ) = distribution of particles moving along optimal trajectories In particular, for ν = 0 and H ( p ) = | p | 2 / 2 the dynamics with ˙ optimal feedback is y ( s ) = −∇ u ( y ( s ) , s ) and the KFP equation becomes ∂ m ∂ t − div ( m ∇ u ) = 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 4 / 26

  6. Control interpretation of the MFE � T u ( x , t ) = inf E [ L ( α ( s )) + F ( y ( s ) , m ( s )) ds + g ( y ( T ))] t over controls α and trajectories of √ dy ( s ) = α ( s ) ds + 2 ν dW ( s ) , y ( t ) = x √ dy ( s ) = −∇ H ( ∇ u ( y ( s ) , s )) ds + 2 ν dW ( s ) = optimal trajectory of the representative agent m ( x , t ) = distribution of particles moving along optimal trajectories In particular, for ν = 0 and H ( p ) = | p | 2 / 2 the dynamics with ˙ optimal feedback is y ( s ) = −∇ u ( y ( s ) , s ) and the KFP equation becomes ∂ m ∂ t − div ( m ∇ u ) = 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 4 / 26

  7. Agent-based models They typically are nonlocal continuity equations of the form Q : P p ( R d ) → C 1 ( R d , R d ) ∂ t m − div ( m Q [ m ]) = 0 The aggregation equation (Bertozzi, Carrillo, Laurent and many others): � Q [ m ]( x , t ) = ∇ R d k ( x − y ) dm ( y ) k ( x ) = −| x | e − a | x | , a > 0 , ◮ k ( x ) = e −| x | − Fe −| x | / L , 0 < F < 1 , L > 1 ◮ Nonlinear friction equation of granular flows (Toscani et al.): same form with k ( x ) = | x | α /α, α > 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 5 / 26

  8. Agent-based models They typically are nonlocal continuity equations of the form Q : P p ( R d ) → C 1 ( R d , R d ) ∂ t m − div ( m Q [ m ]) = 0 The aggregation equation (Bertozzi, Carrillo, Laurent and many others): � Q [ m ]( x , t ) = ∇ R d k ( x − y ) dm ( y ) k ( x ) = −| x | e − a | x | , a > 0 , ◮ k ( x ) = e −| x | − Fe −| x | / L , 0 < F < 1 , L > 1 ◮ Nonlinear friction equation of granular flows (Toscani et al.): same form with k ( x ) = | x | α /α, α > 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 5 / 26

  9. Models of crowd dynamics (Cristiani-Piccoli-Tosin) � ◮ ∂ t m − div ( m ( v + Q [ m ])) = 0 , v = v ( x ) , Q [ m ] = ∇ R d k ( x − y ) dm ( y ) k = φ ( | x | ) with compact support, φ decreasing for small | x | , then increasing ◮ models with "social forces", or mesoscopic, or kinetic: state variables: position and velocity ( x , v ) ∈ R 2 d in ( 0 , T ) × R 2 d ∂ t m + v · D x m − div v ( mQ [ m ]) = 0 � Q [ m ]( x , v ) = ∇ v R 2 d k ( x − y , v − v ∗ ) m ( y , v ∗ , t ) dydv ∗ Flocking models: as the last one with different k , e.g. | v | 2 Cucker-Smale : k ( x , v ) = ( α + | x | 2 ) β , α > 0 , β ≥ 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 6 / 26

  10. Models of crowd dynamics (Cristiani-Piccoli-Tosin) � ◮ ∂ t m − div ( m ( v + Q [ m ])) = 0 , v = v ( x ) , Q [ m ] = ∇ R d k ( x − y ) dm ( y ) k = φ ( | x | ) with compact support, φ decreasing for small | x | , then increasing ◮ models with "social forces", or mesoscopic, or kinetic: state variables: position and velocity ( x , v ) ∈ R 2 d in ( 0 , T ) × R 2 d ∂ t m + v · D x m − div v ( mQ [ m ]) = 0 � Q [ m ]( x , v ) = ∇ v R 2 d k ( x − y , v − v ∗ ) m ( y , v ∗ , t ) dydv ∗ Flocking models: as the last one with different k , e.g. | v | 2 Cucker-Smale : k ( x , v ) = ( α + | x | 2 ) β , α > 0 , β ≥ 0 Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 6 / 26

  11. Question: connection among MFGs and ABMs? For MFG with dynamics ˙ y = α the equation for the density m is ∂ m ∂ t − div ( m ∇ H ( ∇ u )) = 0 which is a continuity equation with Q [ m ] = ∇ H ( ∇ u ) and u depends on m in a non-local way via the HJB equation, so the dependence is not explicit. For MFG with dynamics ÿ = α the density m solves ∂ t m + v · D x m − div v ( m ∇ v H ( ∇ u )) = 0 which is a kinetic equation with Q [ m ] = ∇ v H ( ∇ u ) and u depends on m via the HJB equation. Q.: can one connect in a rigorous way the classical ABMs to some MFGs ? Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 7 / 26

  12. Stochastic case: Bertucci-Lasry-Lions 2018 − ∂ t u λ + λ u λ − ∆ u λ + Q [ m λ ] · Du λ + | Du λ | 2  = F ( x ) ,    2     (MF0) in R d × R + ∂ t m λ − ∆ m λ − div ( m λ ( Du λ + Q [ m λ ])) = 0       in R d m λ ( 0 ) = m 0 ,  λ = the discount factor in the cost functional associated to the HJB equation = "inter temporal preference parameter that measures the weight of anticipation for a given agent", the dynamics of an agent in the MFG is √ dy ( s ) = ( α ( s ) − Q [ m λ ]) ds + 2 ν dW ( s ) , y ( t ) = x Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 8 / 26

  13. The limit λ → ∞ Theorem (Bertucci-Lasry-Lions) Q : P 1 ( R d ) → Lip ( R d ) , � Q [ m ] � ∞ ≤ C , ∀ m = ⇒ any solution ( u λ , m λ ) of (MF0) is bounded uniformly in λ and for any λ n → ∞ such that m λ n → m the limit m is a solution of the continuity equation ∂ t m − ∆ m − div ( m Q [ m ]) = 0 . So "any" ABM model (with diffusion), defined by Q , has at least one solution that is the limit of the solution of a MFG. Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 9 / 26

  14. The setting of Degond-Herty-Liu 2017 Here MPC = Model Predictive Control We address the horizontal ? = ⇒ ? with a different approach. Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 10 / 26

  15. The control problem for a single agent is � T [ | α | 2 ˙ y = v ( y ) + α, y ( t ) = x , inf + F ( y ( s ) , m ( s ))] ds 2 α ( · ) t MPC approximation: ∆ t | α | 2 � � y ( t + ∆ t ) = x + ∆ t ( v ( x ) + α ) , min + F ( y ( t + ∆ t ) , m ( t )) 2 α Note that the scaling with ∆ t means that the control is cheap. Taking the derivative w.r.t. α we get the optimal control ¯ α if ∆ t [¯ α + DF ( x , m ( t ))] = 0 . This suggests that, for short horizon T and cheap control, the optimal feedback should be approximated by the α ≈ − DF ( x , m ( t )) . ¯ steepest decent of the running cost Martino Bardi (Università di Padova) MFGs and aggregation models Marseille, June 4, 2019 11 / 26

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend