a game in normal form consists of
play

A game in normal form consists of: Set of players = {1, , } - PowerPoint PPT Presentation

T RUTH J USTICE A LGOS Game Theory I: Basic Concepts Teachers: Ariel Procaccia (this time) and Alex Psomas NORMAL-FORM GAME A game in normal form consists of: Set of players = {1, , } Strategy set For each


  1. T RUTH J USTICE A LGOS Game Theory I: Basic Concepts Teachers: Ariel Procaccia (this time) and Alex Psomas

  2. NORMAL-FORM GAME β€’ A game in normal form consists of: β—¦ Set of players 𝑂 = {1, … , π‘œ} β—¦ Strategy set 𝑇 β—¦ For each 𝑗 ∈ 𝑂 , utility function 𝑣 𝑗 : 𝑇 π‘œ β†’ ℝ : if each j ∈ 𝑂 plays the strategy 𝑑 π‘˜ ∈ 𝑇 , the utility of player 𝑗 is 𝑣 𝑗 (𝑑 1 , … , 𝑑 π‘œ )

  3. THE PRISONER’S DILEMMA β€’ Two men are charged with a crime β€’ They are told that: β—¦ If one rats out and the other does not, the rat will be freed, other jailed for nine years β—¦ If both rat out, both will be jailed for six years β€’ They also know that if neither rats out, both will be jailed for one year

  4. THE PRISONER’S DILEMMA Cooperate Defect -1,-1 -9,0 Cooperate Defect 0,-9 -6,-6 What would you do?

  5. ON TV http://youtu.be/S0qjK3TWZE8

  6. THE PROFESSOR’S DILEMMA Class Listen Sleep 10 6 ,10 6 -10,0 Professor Make effort Slack off 0,-10 0,0 Dominant strategies?

  7. NASH EQUILIBRIUM β€’ In a Nash equilibrium, no player wants to unilaterally deviate β€’ E ach player’s strategy is a best response to strategies of others β€’ Formally, a Nash equilibrium is a vector of strategies 𝒕 = 𝑑 1 … , 𝑑 π‘œ ∈ 𝑇 π‘œ such β€² ∈ 𝑇, that for all 𝑗 ∈ 𝑂, 𝑑 𝑗 β€² , 𝑑 𝑗+1 , … , 𝑑 π‘œ ) 𝑣 𝑗 𝒕 β‰₯ 𝑣 𝑗 (𝑑 1 , … , 𝑑 π‘—βˆ’1 , 𝑑 𝑗

  8. THE PROFESSOR’S DILEMMA Class Listen Sleep 10 6 ,10 6 -10,0 Professor Make effort Slack off 0,-10 0,0 Nash equilibria?

  9. ROCK-PAPER-SCISSORS R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 Nash equilibria?

  10. MIXED STRATEGIES β€’ A mixed strategy is a probability distribution over (pure) strategies β€’ The mixed strategy of player 𝑗 ∈ 𝑂 is 𝑦 𝑗 , where 𝑦 𝑗 (𝑑 𝑗 ) = Pr[𝑗 plays 𝑑 𝑗 ] β€’ The utility of player 𝑗 ∈ 𝑂 is π‘œ 𝑣 𝑗 𝑦 1 , … , 𝑦 π‘œ = ෍ 𝑣 𝑗 𝑑 1 , … , 𝑑 π‘œ β‹… ΰ·‘ 𝑦 π‘˜ (𝑑 π‘˜ ) (𝑑 1 ,…,𝑑 π‘œ )βˆˆπ‘‡ π‘œ π‘˜=1

  11. EXERCISE: MIXED NE 1 1 β€’ Exercise: player 1 plays 2 , 2 , 0 , player 2 1 1 plays 0, 2 , 2 . What is 𝑣 1 ? 1 1 1 β€’ Exercise: Both players play 3 , 3 , 3 . What is 𝑣 1 ? R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

  12. EXERCISE: MIXED NE Poll 1 ? Which is a NE? 1 1 1 1 1 1 1 1 1 1 1. 2 , 2 , 0 , 2 , 2 , 0 3. 3 , 3 , 3 , 3 , 3 , 3 1 1 1 1 1 2 2 1 2. 2 , 2 , 0 , 2 , 0, 4. 3 , 3 , 0 , 3 , 0, 2 3 R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

  13. NASH’S THEOREM β€’ Theorem [Nash, 1950]: In any (finite) game there exists at least one (possibly mixed) Nash equilibrium β€’ What about computing a Nash equilibrium? Stay tuned…

  14. DOES NE MAKE SENSE? β€’ Two players, strategies are {2, … , 100} β€’ If both choose the same number, that is what they get β€’ If one chooses 𝑑 , the other 𝑒 , and 𝑑 < 𝑒 , the former player gets 𝑑 + 2 , and the latter gets 𝑑 βˆ’ 2 β€’ Poll 2: What would you choose? 95 96 97 98 99 100

  15. CORRELATED EQUILIBRIUM β€’ Let 𝑂 = {1,2} for simplicity β€’ A mediator chooses a pair of strategies (𝑑 1 , 𝑑 2 ) according to a distribution π‘ž over 𝑇 2 β€’ Reveals 𝑑 1 to player 1 and 𝑑 2 to player 2 β€’ When player 1 gets 𝑑 1 ∈ 𝑇 , he knows the distribution over strategies of 2 is Pr 𝑑 2 𝑑 1 = Pr 𝑑 1 ∧ 𝑑 2 = π‘ž 𝑑 1 , 𝑑 2 Pr 𝑑 1 Pr[𝑑 1 ]

  16. CORRELATED EQUILIBRIUM β€² ∈ 𝑇 β€’ Player 1 is best responding if for all 𝑑 1 β€² , 𝑑 2 ) ෍ Pr 𝑑 2 𝑑 1 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ Pr 𝑑 2 𝑑 1 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€’ Equivalently, β€² , 𝑑 2 ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€’ π‘ž is a correlated equilibrium (CE) if both players are best responding β€’ Every Nash equilibrium is a correlated equilibrium, but not vice versa

  17. GAME OF CHICKEN http://youtu.be/u7hZ9jKrwvo

  18. GAME OF CHICKEN β€’ Social welfare is the sum of utilities Dare Chicken β€’ Pure NE: (C,D) and 0,0 4,1 Dare (D,C), social welfare = 5 β€’ Mixed NE: both 1,4 3,3 (1/2,1/2), social Chicken welfare = 4 β€’ Optimal social welfare = 6

  19. GAME OF CHICKEN β€’ Correlated equilibrium: β—¦ (D,D): 0 Dare Chicken 1 β—¦ (D,C): 0,0 4,1 Dare 3 1 β—¦ (C,D): 3 1,4 3,3 Chicken 1 β—¦ (C,C): 3 16 β€’ Social welfare of CE = 3

  20. IMPLEMENTATION OF CE β€’ Instead of a mediator, use a hat! β€’ Balls in hat are labeled with β€œchicken” or β€œdare”, each blindfolded player takes a ball Poll 3 ? Which balls implement the distribution of the previous slide? 1. 1 chicken, 1 dare 3. 2 chicken, 1 dare 2. 1 chicken, 2 dare 4. 2 chicken, 2 dare

  21. CE AS LP β€’ Can compute CE via linear programming in polynomial time! find βˆ€π‘‘ 1 , 𝑑 2 ∈ 𝑇, π‘ž 𝑑 1 , 𝑑 2 β€² ∈ 𝑇, s.t .t. βˆ€π‘‘ 1 , 𝑑 1 β€² , 𝑑 2 ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€² ∈ 𝑇, βˆ€π‘‘ 2 , 𝑑 2 β€² ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 2 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 2 (𝑑 1 , 𝑑 2 𝑑 1 βˆˆπ‘‡ 𝑑 1 βˆˆπ‘‡ ෍ π‘ž 𝑑 1 , 𝑑 2 = 1 𝑑 1 ,𝑑 2 βˆˆπ‘‡ βˆ€π‘‘ 1 , 𝑑 2 ∈ 𝑇, π‘ž 𝑑 1 , 𝑑 2 ∈ [0,1]

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend