CSC2556 Lecture 9
Noncooperative Games 1: Nash Equilibria, Price of Anarchy, Cost-Sharing Games
CSC2556 - Nisarg Shah 1
CSC2556 Lecture 9 Noncooperative Games 1: Nash Equilibria, Price - - PowerPoint PPT Presentation
CSC2556 Lecture 9 Noncooperative Games 1: Nash Equilibria, Price of Anarchy, Cost-Sharing Games CSC2556 - Nisarg Shah 1 Game Theory How do rational, self-interested agents act in a given environment? Each agent has a set of possible
CSC2556 - Nisarg Shah 1
CSC2556 - Nisarg Shah 2
➢ Rewards for the agents as a function of the actions taken
➢ No external trusted agency, no legal agreements
CSC2556 - Nisarg Shah 3
➢ Given the action profile Ԧ
CSC2556 - Nisarg Shah 4
Sam’s Actions John’s Actions Stay Silent Betray Stay Silent (-1 , -1) (-3 , 0) Betray (0 , -3) (-2 , -2)
CSC2556 - Nisarg Shah 5
➢ Deterministic choice of an action, e.g., “Betray”
➢ Randomized choice of an action, e.g., “Betray with
CSC2556 - Nisarg Shah 6
′ if 𝑡𝑗 is “better than”
′, irrespective of other players’ strategies.
➢ 𝑣𝑗 𝑡𝑗, Ԧ
′, Ԧ
➢ Strict inequality for some Ԧ
➢ Strict inequality for all Ԧ
CSC2556 - Nisarg Shah 7
CSC2556 - Nisarg Shah 8
Sam’s Actions John’s Actions Stay Silent Betray Stay Silent (-1 , -1) (-3 , 0) Betray (0 , -3) (-2 , -2)
➢ Betray if the other player will stay silent ➢ Betray if the other player will betray
CSC2556 - Nisarg Shah 9
➢ No single strategy dominates every other strategy ➢ But some strategies might still be dominated
➢ Can remove their dominated strategies ➢ Might reveal a newly dominant strategy
CSC2556 - Nisarg Shah 10
➢ Microsoft vs Startup ➢ Enter the market or stay out?
Microsoft Startup Enter Stay Out Enter (2 , -2) (4 , 0) Stay Out (0 , 4) (0 , 0)
CSC2556 - Nisarg Shah 11
➢ Each student guesses a real number between 0 and 100
➢ The student whose number is the closest to 2/3 of the
CSC2556 - Nisarg Shah 12
Students Professor Attend Be Absent Attend (3 , 1) (-1 , -3) Be Absent (-1 , -1) (0 , 0)
CSC2556 - Nisarg Shah 13
➢ A strategy profile Ԧ
′, Ԧ
′
CSC2556 - Nisarg Shah 14
Sam’s Actions John’s Actions Stay Silent Betray Stay Silent (-1 , -1) (-3 , 0) Betray (0 , -3) (-2 , -2)
CSC2556 - Nisarg Shah 15
Microsoft Startup Enter Stay Out Enter (2 , -2) (4 , 0) Stay Out (0 , 4) (0 , 0)
CSC2556 - Nisarg Shah 16
Students Professor Attend Be Absent Attend (3 , 1) (-1 , -3) Be Absent (-1 , -1) (0 , 0)
CSC2556 - Nisarg Shah 17
P2 P1 Rock Paper Scissor Rock (0 , 0) (-1 , 1) (1 , -1) Paper (1 , -1) (0 , 0) (-1 , 1) Scissor (-1 , 1) (1 , -1) (0 , 0)
CSC2556 - Nisarg Shah 18
P2 P1 Rock Paper Scissor Rock (0 , 0) (-1 , 1) (1 , -1) Paper (1 , -1) (0 , 0) (-1 , 1) Scissor (-1 , 1) (1 , -1) (0 , 0)
CSC2556 - Nisarg Shah 19
CSC2556 - Nisarg Shah 20
➢ Stag requires both hunters, food is good for 4 days for
➢ Hare requires a single hunter, food is good for 2 days ➢ If they both catch the same hare, they share.
Hunter 2 Hunter 1 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
CSC2556 - Nisarg Shah 21
➢ Other hunter plays “Stag” → “Stag” is best response ➢ Other hunter plays “Hare” → “Hare” is best reponse
Hunter 2 Hunter 1 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
CSC2556 - Nisarg Shah 22
➢ Given the other hunter plays 𝑡, equal 𝔽[reward] for Stag
➢ 𝔽 Stag = 𝑞 ∗ 4 + 1 − 𝑞 ∗ 0 ➢ 𝔽 Hare = 𝑞 ∗ 2 + 1 − 𝑞 ∗1 ➢ Equate the two ⇒ 𝑞 = 1/3
Hunter 2 Hunter 1 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
CSC2556 - Nisarg Shah 23
➢ If both report the same number, each gets this value. ➢ If one reports a lower number (𝑡) than the other (𝑢), the
100 99 98 97 96
. . . . . . . . . . . 95
CSC2556 - Nisarg Shah 24
➢ The brother at 𝑡 gets 0,
𝑡+𝑢 2 , the other gets 𝑡+𝑢 2 , 1 1 s t
CSC2556 - Nisarg Shah 25
CSC2556 - Nisarg Shah 26
➢ Rationality is common knowledge.
➢ Rationality is perfect = “infinite wisdom”
➢ Full information about what other players are doing.
CSC2556 - Nisarg Shah 27
➢ No binding contracts.
➢ No player can commit first.
➢ No external help.
➢ Humans reason about randomization using expectations.
CSC2556 - Nisarg Shah 28
➢ Cannot expect humans to find it if your computer cannot.
CSC2556 - Nisarg Shah 29
➢ For human agents, take it with a grain of salt. ➢ For AI agents playing against AI agents, perfect!
CSC2556 - Nisarg Shah 30
CSC2556 - Nisarg Shah 31
Costs → flip: Nash equilibrium divided by optimum
CSC2556 - Nisarg Shah 32
➢ (Stag, Stag) : Social utility = 8 ➢ (Hare, Hare) : Social utility = 2 ➢ (Stag:1/3 - Hare:2/3, Stag:1/3 - Hare:2/3)
Hunter 2 Hunter 1 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
CSC2556 - Nisarg Shah 33
➢ Wants to go from 𝑡𝑗 to 𝑢𝑗 ➢ Strategy set 𝑇𝑗 = {directed 𝑡𝑗 → 𝑢𝑗 paths} ➢ Denote his chosen path by 𝑄𝑗 ∈ 𝑇𝑗
➢ Cost is split among all players taking edge 𝑓 ➢ That is, among all players 𝑗 with 𝑓 ∈ 𝑄𝑗
CSC2556 - Nisarg Shah 34
➢ Note that 𝐷 𝑄 = σ𝑓∈𝐹 𝑄 𝑑𝑓, where
𝐹(𝑄)={edges taken in 𝑄 by at least one player}
➢ What if both players take the direct paths? ➢ What if both take the middle paths? ➢ What if only one player takes the middle path while
the other takes the direct path?
CSC2556 - Nisarg Shah 35
➢ All taking the n-edge: social cost = 𝑜 ➢ All taking the 1-edge: social cost = 1
CSC2556 - Nisarg Shah 36
➢ Suppose the social optimum is (𝑄
1 ∗, 𝑄2 ∗, … , 𝑄 𝑜 ∗), in which
∗.
➢ Take any NE with cost 𝑑𝑗 to player 𝑗. ➢ Let 𝑑𝑗
′ be his cost if he switches to 𝑄𝑗 ∗.
➢ NE ⇒ 𝑑𝑗
′ ≥ 𝑑𝑗
➢ But : 𝑑𝑗
′ ≤ 𝑜 ⋅ 𝑑𝑗 ∗ (Why?)
➢ 𝑑𝑗 ≤ 𝑜 ⋅ 𝑑𝑗
∗ for each 𝑗 ⇒ no worse than 𝑜 × optimum
CSC2556 - Nisarg Shah 37
➢ All cost-sharing games: PoA ≤ 𝑜 ➢ ∃ example where PoA = 𝑜
➢ What about more complex
7
60 12 32 10 20
CSC2556 - Nisarg Shah 38
➢ Via a “potential function” argument.
CSC2556 - Nisarg Shah 39
➢ For all pure strategy profiles 𝑄 = 𝑄
1, … , 𝑄 𝑜 ∈ ς𝑗 𝑇𝑗, …
➢ all players 𝑗, and … ➢ all alternative strategies 𝑄𝑗
′ ∈ 𝑇𝑗 for player 𝑗…
′, 𝑄−𝑗 − 𝑑𝑗 𝑄 = Φ 𝑄𝑗 ′, 𝑄−𝑗 − Φ 𝑄
➢ Do not care about the changes in the costs to others
CSC2556 - Nisarg Shah 40
➢ Think about 𝑄 that minimizes the potential function. ➢ What happens when a player deviates?
decrease.
CSC2556 - Nisarg Shah 41
𝑓∈𝐹(𝑄)
𝑙=1 𝑜𝑓(𝑄) 𝑑𝑓
CSC2556 - Nisarg Shah 42
𝑓∈𝐹(𝑄)
𝑙=1 𝑜𝑓(𝑄) 𝑑𝑓
➢ If a player changes path, he pays
𝑑𝑓 𝑜𝑓 𝑄 +1 for each new
𝑑𝑔 𝑜𝑔 𝑄 for each old edge 𝑔.
➢ This is precisely the change in the potential function too. ➢ So Δ𝑑𝑗 = ΔΦ.
CSC2556 - Nisarg Shah 43
➢ Pure Nash equilibria are “local minima” of the potential
➢ A single player deviating should not decrease the
CSC2556 - Nisarg Shah 44
𝑓∈𝐹(𝑄)
𝑑𝑓 ≤ Φ 𝑄 =
𝑓∈𝐹(𝑄)
𝑙=1 𝑜𝑓(𝑄) 𝑑𝑓
𝑙 ≤
𝑓∈𝐹(𝑄)
𝑑𝑓 ∗
𝑙=1 𝑜 1
𝑙
Social cost
∀𝑄, 𝐷 𝑄 ≤ Φ 𝑄 ≤ 𝐷 𝑄 ∗ 𝐼 𝑜 𝐷 𝑄∗ ≤ Φ 𝑄∗ ≤ Φ 𝑃𝑄𝑈 ≤ 𝐷 𝑃𝑄𝑈 ∗ 𝐼(𝑜)
Harmonic function 𝐼(𝑜) = σ𝑙=1
𝑜
1/𝑙 = 𝑃(log 𝑜) Potential minimizing eq. Social optimum
CSC2556 - Nisarg Shah 45
➢ Price of stability is 𝑃(log 𝑜)
➢ Compare to the price of anarchy, which can be 𝑜
CSC2556 - Nisarg Shah 46
𝑘(𝑜𝑘)
CSC2556 - Nisarg Shah 47
𝑘∈𝐹(𝑄)
𝑙=1 𝑜𝑘 𝑄
𝑘 𝑙
CSC2556 - Nisarg Shah 48
➢ E.g., used for analyzing amortized complexity of
CSC2556 - Nisarg Shah 49
𝑘 is decreasing
➢ The more people use a resource, the less the cost to each.
𝑘 can also be increasing
➢ Road network, each player going from home to work ➢ Uses a sequence of roads ➢ The more people on a road, the greater the congestion,
CSC2556 - Nisarg Shah 50
➢ 2000 players want to go from 1 to 4 ➢ 1 → 2 and 3 → 4 are “congestible” roads ➢ 1 → 3 and 2 → 4 are “constant delay” roads
CSC2556 - Nisarg Shah 51
➢ 1000 take 1 → 2 → 4, 1000 take 1 → 3 → 4 ➢ Each player has cost 10 + 25 = 35 ➢ Anyone switching to the other creates a greater
CSC2556 - Nisarg Shah 52
➢ Intuitively, adding more roads should only be helpful ➢ In reality, it leads to a greater delay for everyone in the
𝑑23 𝑜23 = 0
CSC2556 - Nisarg Shah 53
𝑑23 𝑜23 = 0
CSC2556 - Nisarg Shah 54
➢ In the new game, 1 → 2 → 3 → 4 is a strictly dominant
𝑑23 𝑜23 = 0