CS 440/ECE448 Lecture 9: Game Theory
Slides by Svetlana Lazebnik, 9/2016 Modified by Mark Hasegawa-Johnson, 2/2019 https://en.wikipedia.org/wiki/Prisoner’s_dilemma
CS 440/ECE448 Lecture 9: Game Theory Slides by Svetlana Lazebnik, - - PowerPoint PPT Presentation
CS 440/ECE448 Lecture 9: Game Theory Slides by Svetlana Lazebnik, 9/2016 Modified by Mark Hasegawa-Johnson, 2/2019 https://en.wikipedia.org/wiki/Prisoners_dilemma Game theory Game theory deals with systems of interacting agents where the
Slides by Svetlana Lazebnik, 9/2016 Modified by Mark Hasegawa-Johnson, 2/2019 https://en.wikipedia.org/wiki/Prisoner’s_dilemma
given game
desirable outcome
http://www.economist.com/node/21527025
http://www.spliddit.org
http://www.wired.com/2015/09/facebook-doesnt-make-much-money-couldon-purpose/
4,3,2 7,4,1 4,3,2 1,5,2 7,7,1 1,5,2 4,3,2
sequence.
makes the move that is best for them, when it’s their turn to move.
knowing what the others will do
0,0 1,-1
0,0 1,-1 1,-1
0,0
Player 2 Player 1
Payoff matrix (Player 1’s utility is listed first) Is this a zero-sum game? Normal form representation:
and the police visit them separately
sentence
5-year sentence
get a 1-year sentence
Alice: Testify Alice: Refuse Bob: Testify
Bob: Refuse 0,-10
5 years if I testify and 10 years if I refuse. So I should testify.
testify, and get 1 year if I refuse. So I should testify.
strategies such that no player can get a bigger payoff by switching strategies, provided the other player sticks with the same strategy
Alice: Testify Alice: Refuse Bob: Testify
Bob: Refuse 0,-10
regardless of the strategy chosen by the other player.
impossible to make one of the players better off without making another one worse off.
which there is no dominant strategy – we’ll see some later
Alice: Testify Alice: Refuse Bob: Testify
Bob: Refuse 0,-10
http://en.wikipedia.org/wiki/Prisoner’s_dilemma Defect Cooperate Defect Lose – lose Lose big – win big Cooperate Win big – lose big Win – win
the same for both players
identical strategies
equilibrium strategy is still to defect
become an equilibrium strategy
to take home half a stag
gets to take home a rabbit
the cooperator gets nothing at all
Hunter 1: Stag Hunter 1: Hare Hunter 2: Stag 2,2 1,0 Hunter 2: Hare 0,1 1,1
incomplete information (the issue: trust)
Hunter 1: Stag Hunter 1: Hare Hunter 2: Stag 2,2 1,0 Hunter 2: Hare 0,1 1,1
Cooperate Defect Cooperate Win – win Win big – lose big Defect Lose big – win big Lose – lose Cooperate Defect Cooperate Win big – win big Win – lose Defect Lose – win Win – win
Prisoner’ dilemma Stag hunt Players improve their winnings by defecting unilaterally Players reduce their winnings by defecting unilaterally
chicken out
$10,000 (the cost of the car)
S C S
C 1, -1 0, 0
Straight Chicken Straight Chicken
Player 1 Player 2 http://en.wikipedia.org/wiki/Game_of_chicken
Cooperate Defect Cooperate Win – win Win big – Lose big Defect Lose big – Win big Lose – Lose Chicken Straight Chicken Nil – Nil Win – Lose Straight Lose – Win Lose big – Lose big
Prisoner’ dilemma Chicken Players can’t improve their winnings by unilaterally cooperating The best strategy is always the opposite of what the other player does
(straight, chicken) or (chicken, straight)
choose different strategies
(hawk-dove game)
S C S
C 1, -1 0, 0
Straight Chicken Straight Chicken
Player 1 Player 2 http://en.wikipedia.org/wiki/Game_of_chicken
probability distribution
Is that a Nash equilibrium?
S C S
C 1, -1 0, 0
Straight Chicken Straight Chicken
Player 1 Player 2
P1 chooses S: q(–10) +(1–q)1 = –11q + 1 P1 chooses C: q(–1) + (1–q)0 = –q
has to be indifferent between its two actions:
–11q + 1 = –q or q = 1/10 Similarly, p = 1/10
P1: Choose S with prob. p P1: Choose C with prob. 1-p P2: Choose S with prob. q
P2: Choose C with prob. 1-q 1, -1 0, 0
Nash equilibrium (which may be a mixed-strategy equilibrium)
equilibrium in which the player plays that strategy and the other player plays the best response to that strategy
exists a Nash equilibrium in which they play those strategies
programming problem
running time that is exponential in the number of actions
things get pretty hairy
player
players to deviate
http://en.wikipedia.org/wiki/Nash_equilibrium
S – A = B
(both players get nothing)
http://en.wikipedia.org/wiki/Ultimatum_game
towns and faces 20 competitors successively
with “aggressive”
Competitor Monopolist Out In Cooperative Aggressive (1, 5) (0, 0) (2, 2) https://en.wikipedia.org/wiki/Chainstore_paradox
should we design the game to achieve a socially desirable outcome?
their choices and determines the outcome
reserve price is set too low or too high
but not to the other bidders
they believe about the other buyers
A malevolent twist on the second-price auction:
gets nothing at all in return
highest bidder has to pay the amount of his last bid
into account the future course of the game
state
utility (externality cost)