Network Games: On the Extension of Nash’s Theory to Networks
Lacra Pavel*
Systems Control Group University of Toronto
∗based on joint work with Dian Gadjov, Peng Yi
Farzad Salehisadaghiani NecSys 2018, Groningen, 27-28 August, 2018
1 / 56
Network Games: On the Extension of Nashs Theory to Networks Lacra - - PowerPoint PPT Presentation
Network Games: On the Extension of Nashs Theory to Networks Lacra Pavel* Systems Control Group University of Toronto based on joint work with Dian Gadjov, Peng Yi Farzad Salehisadaghiani NecSys 2018, Groningen, 27-28 August, 2018 1 /
∗based on joint work with Dian Gadjov, Peng Yi
1 / 56
2 / 56
3 / 56
4 / 56
5 / 56
i such that
i , x−i) ≤ Ji(xi, x−i),
i , x∗ −i) ≤ Ji(xi, x∗ −i),
i , x∗ −i) (action N-tuple or profile)
6 / 56
7 / 56
8 / 56
9 / 56
10 / 56
Srikant’02, Alpcan & Basar’05, Yin, Shanbhag & Mehta’11].
& Pavel’09, Menache & Ozdaglar’11].
Marden & Shamma’07].
Hu’17].
fase
11 / 56
12 / 56
3 1 N 2 3 1 2 GI GC N Interference Graph Communication Graph
13 / 56
3 1 N 2 3 1 2 GI GC N Interference Graph Communication Graph
14 / 56
3 1 N 2 3 1 2 GI GC N Interference Graph Communication Graph
Pavel’16,’18, Gadjov Pavel’18, Yi & Pavel’18] 15 / 56
3 1 4 2 J1(x1) := J1(x1, . . . , x4) 3 1 4 2 J1(x1) := J1(x1, x2) 3 1 4 2 J1(x1) := J1(x1, . . . , x4) 3 1 4 2 J1(x1) := J1(x1, x3) GI GI GI GI Complete Interference Graph Partial Interference Graph Partial Interference Digraph Complete Interference Digraph
16 / 56
17 / 56
xi
2-Player Game.
i , x∗ −i) lies at the intersection of all best-response maps,
i ∈ arg min xi∈Ωi Ji(xi, x∗ −i),
18 / 56
xi
i = PΩi(x∗ i − αi∇xiJi(x∗ i , x∗ −i)), ∀i ∈ V.
19 / 56
20 / 56
21 / 56
−i. 4 2 1 3 4 2 1 3 (x1
1(k), x1 −1(k))
4 2 1 3 4 2 1 3 (x2
2(k), x2 −2(k))
(x3
3(k), x3 −3(k))
(x4
4(k), x4 −4(k))
22 / 56
23 / 56
i, xi −i) that estimates all others’
24 / 56
1
−ik = xik
−ik +xjk −ik
2
−jk = xjk
−jk +xik −jk
2
3 1 4 2 3 1 4 2 3 1 4 2 3 1 4 2 ik jk GC : GI : 2
j |j∈NI(i)
[Salehi & Pavel, Automatica’16, Automatica’18, Lecture Notes LNICST-Springer (GameNets’17)] 25 / 56
26 / 56
1
2
3
27 / 56
i
−i
i, xi −i) − j∈Ni
i − xj i)
j∈Ni
−i − xj −i)
28 / 56
29 / 56
N = {1N ⊗ x |x ∈ Rn} and its
N)⊥.
N, x⊥ ∈ (Cn N)⊥.
N, due to F(1N ⊗ x) = F(x).
N, balance the shortage of passivity of F by Lipschitz and excess passivity
N)⊥. 30 / 56
µ + θ, any solution of (4) will converge asymptotically
[Gadjov & Pavel,TAC’18]
31 / 56
32 / 56
xi
i, xi −i) local copy (estimate) for each player i,
xi
i∈Ωi
i, xi −i)
33 / 56
i, xi −i) a local copy of x,
xi
i ∈R
i, xi −i) + IΩi(xi i)
i (min Ji) and xi −i (0 obj. fcn.):
i
i, xi −i) + IΩi(xi i)
34 / 56
i, xi −i), where xi i is his action and xi −i is the estimate of
i(k + 1) = TΩi
i(k) − αi∇iJi(xi(k))
i(k + 1) + c
i(k) − xj i (k)
−i(k + 1) =
−i(k) + cαi
−i(k)
−i(k + 1)
35 / 56
[Salehi, Shi & Pavel, IFAC’17, under review Automatica]
x∗ x Ω ΩN C O x(t)
36 / 56
37 / 56
38 / 56
50 100 150 200 250 300 50 100 150 200 250 time action
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
39 / 56
50 100 150 200 250 300 50 100 150 200 250 300 350 400 450 500 time action
40 / 56
41 / 56
r:Lj∈Rr xr ≤ Cj
R4 L15 L13 L12 L14 1 2 16 14 13 15 12 7 6 8 3 4 5 11 10 9 L16 L4 L5 L6 L2 L1 L3 L11 L10 L9 L8 L7 R3 R2 R13 R1 R11 R15 R10 R9 R8 R7 R6 R14 R5 R12
Wireless Link Player’s Path
w:Lj∈Rw xw
12 3 6 7 8 5 9 10 4 11 1 14 15 13 2 GC 43 / 56
44 / 56
45 / 56
1 3 2 4 5 1 3 2 4 5 GI GC 46 / 56
i (x) − f 2 i (x)
i (x) [Payoff from users whom i follows]: A payoff that user i obtains from
i (x) := Li
C(i)
i (x) [Attention from the network]: An incremental payoff that user i obtains
i (x) =
C (i)
C(l)
including from i
C(l)\{i}
except from i
47 / 56
1000 2000 3000 4000 5000 6000 7000 8000 −0.5 0.5 1 1.5 2 2.5 3 Iteration Unit of information that each user produces x1 x2 x3 x4 x5
1 3 2 4 5 1 3 2 4 5 GI GC
4 ≥ x∗ 3 ≥ x∗ 1,2,5.
5 ≥ x∗ 1,2. 48 / 56