9/28/20 1
CSCI 3210: Computational Game Theory
Mohammad T . Irfan Email: mirfan@bowdoin.edu Website: www.mtirfan.com
www.mtirfan.com/CSCI-3210
1
Syllabus and required background
u Course website
u www.mtirfan.com/CSCI-3210
10
CSCI 3210: Computational Game Theory www.mtirfan.com/CSCI-3210 - - PDF document
9/28/20 CSCI 3210: Computational Game Theory www.mtirfan.com/CSCI-3210 Mohammad T . Irfan Email: mirfan@bowdoin.edu Website: www.mtirfan.com 1 Syllabus and required background u Course website u www.mtirfan.com/CSCI-3210 10 1 9/28/20
9/28/20 1
1
u Course website
u www.mtirfan.com/CSCI-3210
10
9/28/20 2
u Be intellectually challenged in a class that is
u The game theory concept! u new algorithms! u I’ve been fascinated by game theory in the
past, so I’m excited to take a look at it through a cs lens.
u Learning something new 😂 u The difference between human decisions and
rational thinking
u Don’t know a lot about game theory, but
most curious about application to human choice
11
12
9/28/20 3
u “Game”
u Ernst Zermelo (1913): In any chess game that
does not end in a draw, a player has a winning strategy u Mathematical theory of strategic
decision making
u John von Neumann (1944)
13
u https://www.youtube.com/watch?v=yM38mR
HY150
u Rules of the game u Outcome
14
9/28/20 4
u What will happen? Payoff matrix
Split Steal Split
Steal
15
Payoff matrix
Split Steal Split
Steal
Lucy Tony
Everyone plays their best response to others simultaneously
John F . Nash Nobel Prize, 1994
16
9/28/20 5
17
u Application: market equilibria
u Predict where the market is heading to
u Mechanism design and auctions
u Google and Yahoo apply game-theoretic techniques
u Keyword search auction
u Spectrum allocation among wireless companies
18
18
9/28/20 6
u Understanding the Internet
u Selfish routing is a constant-factor off from
19
19
u Load balancing and resource allocation u p2p and file sharing systems u Cryptography and security u Social and economic networks, etc.
20
9/28/20 7
21
u One-shot games (simultaneous move) u 3 components
u Players u Strategies/actions u Payoffs
Payoff matrix
Split Steal Split
Steal
Tony Lucy
Call these pure strategies Pure-strategy NE
22
9/28/20 8
u A joint strategy (one strategy/player) s.t.
every player plays their best response to
u (Equiv.) A joint strategy s.t. no player gains
by deviating unilaterally
u Useful for checking whether a cell is NE
23
u What will they do? Payoff matrix
Not Confess Confess Not Confess
Confess
Suspect 1 Suspect 2
24
9/28/20 9
u Payoffs reflect player’s preference u Payoffs are known to all u Actions are known to all (different players
could have different actions– but everyone knows everyone’s actions)
u Each player wants to maximize own payoff
subject to others' actions
25
26
9/28/20 10
u Best strategy of a player, given the other
players’ strategies
u Always exists!
27
u A strategy of a player that is
(strictly/weakly) better than any of their
players do
u Does not always exist
28
9/28/20 11
u Players do not use any probability in
choosing strategies as they do in "mixed- strategy" (to be covered later)
u Every player plays their best response to others
simultaneously
29
u What is the difference between a dominant
strategy and a best response?
u What is the difference between best
response and PSNE?
u What is the difference between weakly and
strictly dominant strategies? Will a player always have one?
30
9/28/20 12
u Watch the following clip from the movie A
Beautiful Mind that tries to portray John Nash’s discovery of Nash equilibrium.
u Is this actually a Nash equilibrium?
32 33
9/28/20 13
u Equilibrium signifies a tie/draw/balance u Equilibrium outcome is the best possible
u Self-interested players want to hurt each
34
u Does NE always exist? (Answer later ...) u If it exists, is it unique?
35
9/28/20 14
36
44
9/28/20 15
45
45
46
46
9/28/20 16
47
47
u E[GK plays Left] = p(1) + (1-p)(-1) = 2p – 1 u E[GK plays Right] = p(-1) + (1-p)(1) = 1 – 2p u 2p – 1 = 1 – 2p è p = ½ u Similarly, q = ½
48
Left (q) Right (1- q) Left (p) -1, +1 +1, -1 Right (1- p)
Goalkeeper Shooter
48
9/28/20 17
u “Professionals Play Minimax”- Ignacio
Palacios-Huerta
Left (0.42) Right (0.58) Left (0.38)
0.58, 0.42 0.95, 0.05
Right (0.62)
0.93, 0.07 0.70, 0.30
Shooter Goalkeeper
Equilibrium probabilities (computed by solving equations) match real-world probabilities from data! From real- world data
49
u Active randomization – tennis, soccer u Proportion interpretation – evolutionary
biology
u Probabilities of player 1 are the beliefs of
player 2 about what player 1 is doing (Bob Aumann)
u Misconception
u Players just choose probabilities
u Correct
u players play pure strategies chosen according to
these probabilities
50
9/28/20 18
u Every finite 2-person zero-sum game has a
mixed equilibrium John von Neumann (1903 – 1957)
51
51
u Every finite game has an equilibrium in
mixed strategies
u Reading
John F . Nash (1928 – 2015) Nobel Prize, 1994
52
52
9/28/20 19
53
u Players act simultaneously, but NE outcome
is stable in the sense that there is no incentive for unilateral deviation.
u There is always at least one mixed-strategy
u The concept of NE doesn't say how NE
happens.
u NE is not a balance or tie. It is often times a
socially-inefficient outcome.
59
9/28/20 20
61
u Set theory
u Sets u Representation: list, describe properties u Belongs to u Subset u Empty set u Power set u Operations: union, intersection, difference,
product
u Sum, product
62
9/28/20 21
63
u Idea: Connor Marrs u Assume: Cars don’t want to wait; all turns/S same One way St
L/R L/R/S
Photo modified from topdriver.com
Car1 Car2
64
9/28/20 22
u N = {1, 2}
u A1 = {L, R, S}; A2 = {L, R} u Set of action profiles
A= A1 x A2 = {(L, L), (L, R), (R, L), (R, R), (S, L), (S, R)}
u Each element (e.g., (L,R)) is called an action profile
65
L R L 1, 1 0, 0 R 0, 0 1, 1 S 0, 0 1, 1
One way St
L/R L/R/S Car 1 Car 2
66
9/28/20 23
u Is constant sum the same as zero sum?
action profile
67
u You and opponent flip two coins u Same parity (both heads up or tails up)=> you
win (keep both coins)
u Otherwise, opponent wins
You Opponent
68
9/28/20 24
u Mixed strategy of a player and mixed-
strategy profile of all players
u Expected utility u Best response u Nash equilibrium
69
70
9/28/20 25
71
72
9/28/20 26
73