Mean Field Games on Unbounded Networks and the Graphon MFG Equations Peter E. Caines McGill University Work with Shuang Gao and Minyi Huang CROWDS models and control
CIRM, Marseille, France, June, 2019
Work supported by NSERC and ARL
1 / 55
Mean Field Games on Unbounded Networks and the Graphon MFG Equations - - PowerPoint PPT Presentation
Mean Field Games on Unbounded Networks and the Graphon MFG Equations Peter E. Caines McGill University Work with Shuang Gao and Minyi Huang CROWDS models and control CIRM, Marseille, France, June, 2019 Work supported by NSERC and ARL 1 / 55
1 / 55
2 / 55
0 (t) and zN i (t).
0 (t) = 1
N
0 (t), uN 0 (t), zN j (t))dt
N
0 (t), zN j (t))dw0(t),
0 (0) = z0(0),
i (t) = 1
N
i (t), zN 0 (t), uN i (t), zN j (t))dt
N
i (t), zN j (t))dwi(t),
i (0) = zi(0),
3 / 55
0 (uN 0 ; uN −0) := E
N
0 (t), uN 0 (t), zN j (t)]
i (uN i ; uN −i) := E
N
i (t), zN 0 (t), uN i (t), zN j (t)]
t≥0, P): a complete filtered probability space
t
t
4 / 55
u∈U J(u, µ)
u∈U E
5 / 55
i ; ua i adapted to Uloc,i, 1 ≤ i ≤ N}
i , ua −i) = inf ui∈U Ji(ui, ua −i)
6 / 55
−2 −1 1 2 −2 −1 1 2 −4 −3 −2 −1 1 2 3 4 x y
7 / 55
i ; 1 ≤ i ≤ N} generates
i , u0 −i) − ε ≤ inf ui∈U Ji(ui, u0 −i) ≤ Ji(u0 i , u0 −i)
8 / 55
u∈U
9 / 55
i = ϕ(t, x|µt), 1 ≤ i ≤ N.
i (u0 i , u0 −i) − ǫ ≤ inf ui∈U JN i (ui, u0 −i) ≤ JN i (u0 i , u0 −i),
10 / 55
11 / 55
12 / 55
13 / 55
14 / 55
15 / 55
16 / 55
17 / 55
Graph Adjacency Matrix Pixel Picture − ! B B @ 1 1 1 1 1 1 1 1 1 C C A − !
Graph, Adjacency Matrix, Pixel Picture
18 / 55
Graph Sequence Converging to its Limit
0 := {W : [0, 1]2 ! [0, 1]}
1 := {W : [0, 1]2 ! [−1, 1]}
R := {W : [0, 1]2 ! R}
19 / 55
− ! B B @ 1 1 1 1 1 1 1 1 1 C C A − !
M,T⊂[0,1]
φ Wφ − V
φ Wφ − V2
20 / 55
0 , d), and hence the closed subsets of
R , d), are compact.
21 / 55
1 as an operator:
1
22 / 55
1 is the infinitesimal generator of the uniformly continuous
∞
23 / 55
24 / 55
1
= Neighborhood
Xi Xl Xk Xn Xm Xj
ail aim aij ain aik ajn alm amj alk akn
+
1
t = 1
N
t + 1
N
t
t ∈ R1: state
t ∈ R1 : control
25 / 55
1
=
1
+
1 1
=
1 1
Graphon Graphon
Vectors and Matrices functions and Step Functions functions and Graphons +
26 / 55
1 , B ∈ GAI
27 / 55
28 / 55
29 / 55
30 / 55
Finite Dim
Network System
Infinite Dim
Network System
s
s
Converge
N → ∞
Infinite Dim
Limit System
(Min-Energy and LQR) Control Law u for (A; B)
Approximate
Control Law u[N] for (A[N]
s
; B[N]
s
)
Control Law uN for (AN ; BN )
Control Design Procedure for Network Systems via Graphon Limits
31 / 55
s
s
s
s
s
s
xT(v) − xN
T(v[N])2 ≤AN ∆2B2
T eT −τ(T − τ) · vτ2dτ + BN
∆2
T e(T −τ)A[N]
s
2 · vτ2dτ,
(9)
∆ = A − A[N] s
∆ = B − B[N] s
N→∞ xT(v) − xN T(v[N])2 = 0.
32 / 55
s
s
xT(u) − xN
T(u[N])2 ≤AN ∆2
T eT −τ(T − τ)uτ2dτ + T [uτ − uN
τ ]dτ2,
(10)
∆ = A − A[N] s
u[N]
t
(α) = N
ut(β)dβ, ∀α ∈ Pi, (11)
33 / 55
2dτ =
34 / 55
τ = BT eAT (t−τ)Wt−1(xt − eAtx0),
35 / 55
N
i=1 from [0, 1]
1 1 pi pj
U(pi, pj)
36 / 55
1 2 3 4 5 6 7 8 9 10
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
Weighted Graph Generated from U, its Stepfunction and Graphon Limit
37 / 55
10 20 30 40 50 Agents 0.1 0.2 0.3 0.4 0.5 State Value Target Terminal State (50 Nodes) 10 20 30 40 50 Agents 0.1 0.2 0.3 0.4 0.5 State Value Achieved Terminal State (50 Nodes) 10 20 30 40 50 Agents
5 State Value 10-3Terminal State Error (50 Nodes)
Minimum Energy Target State Control on Network with 50 Nodes
38 / 55
39 / 55
40 / 55
State Evolution under Graphon Control Control Input of Graphon Control Network of 160 Nodes State Evolution under Optimal LQR Control Input of Optimal LQR
0.5 0.5 1 1 1
Graphon Limit
41 / 55
42 / 55
43 / 55
44 / 55
u∈U
u H(xα, u, µG; gα),
45 / 55
46 / 55
47 / 55
48 / 55
i (t) = ϕ(t, xt|µG; gα) yields an ǫ-Nash equilibrium
i (u0 i , u0 −i) − ǫ ≤ inf ui∈U JN i (ui, u0 −i) ≤ JN i (u0 i , u0 −i),
49 / 55
50 / 55
51 / 55
i Rui]dt
52 / 55
53 / 55
αRuα]dt
54 / 55
|Cβ|→∞
55 / 55