CSC304 Lecture 3 Game Theory (More examples, PoA, PoS)
CSC304 - Nisarg Shah 1
Game Theory (More examples, PoA, PoS) CSC304 - Nisarg Shah 1 - - PowerPoint PPT Presentation
CSC304 Lecture 3 Game Theory (More examples, PoA, PoS) CSC304 - Nisarg Shah 1 Recap Normal form games Domination among strategies Weak/strict domination Hope 1: Find a weakly/strictly dominant strategy Hope 2: Iterated
CSC304 - Nisarg Shah 1
CSC304 - Nisarg Shah 2
➢ Weak/strict domination
➢ Pure – may be none, unique, or multiple
➢ Mixed – at least one!
CSC304 - Nisarg Shah 3
➢ A strategy profile Ԧ
′, Ԧ
′
➢ Each player’s strategy is only best given the strategies of
No quantifier on Ԧ 𝑡−𝑗
CSC304 - Nisarg Shah 4
➢ That is, player 𝑗 plays a single action w.p. 1
➢ In a fully-mixed strategy, every action is played with a
➢ These are the “cells” in the normal form representation
CSC304 - Nisarg Shah 5
➢ The best response of player 𝑗 to others’ strategies Ԧ
∗ ∈ argmax𝑡𝑗 𝑣𝑗 𝑡𝑗, Ԧ
➢ From each cell Ԧ
∗, Ԧ
∗ = player 𝑗’s best response to Ԧ
➢ Each player is already playing their best response ➢ No outgoing arrows
CSC304 - Nisarg Shah 6
Hunter 1 Hunter 2 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
P1 P2 Rock Paper Scissor Rock (0 , 0) (-1 , 1) (1 , -1) Paper (1 , -1) (0 , 0) (-1 , 1) Scissor (-1 , 1) (1 , -1) (0 , 0)
CSC304 - Nisarg Shah 7
➢ Every normal form game has at least one (possibly mixed)
➢ Proof? We’ll prove a special case later.
➢ How do we find mixed NE?
➢ If 𝑡𝑗, Ԧ
CSC304 - Nisarg Shah 8
➢ Equal expected reward for Stag and Hare given the other hunter’s
strategy
➢ 𝔽 Stag = 𝑞 ∗ 4 + 1 − 𝑞 ∗ 0 ➢ 𝔽 Hare = 𝑞 ∗ 2 + 1 − 𝑞 ∗1 ➢ 4𝑞 = 2𝑞 + 1 − 𝑞
⇒ 𝑞 = 1/3
Hunter 1 Hunter 2 Stag Hare Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1)
CSC304 - Nisarg Shah 9
➢ “Fully mixed”
➢ Symmetric
➢ Check if other cases provide any mixed NE
P1 P2 Rock Paper Scissor Rock (0 , 0) (-1 , 1) (1 , -1) Paper (1 , -1) (0 , 0) (-1 , 1) Scissor (-1 , 1) (1 , -1) (0 , 0)
CSC304 - Nisarg Shah 10
➢ Fare = 10 ➢ Cost of inspection = 1 ➢ Fine if fare not paid = 30 ➢ Total cost to driver if caught = 90
Driver Inspector Inspect Don’t Inspect Pay Fare (-10 , -1) (-10 , 0) Don’t Pay Fare (-90 , 29) (0 , -30)
CSC304 - Nisarg Shah 11
➢ If both report the same number, each gets this value. ➢ If one reports a lower number (𝑡) than the other (𝑢), the
100 99 98 97 96
. . . . . . . . . . . 95
CSC304 - Nisarg Shah 12
➢ The brother at 𝑡 gets 0,
𝑡+𝑢 2 , the other gets 𝑡+𝑢 2 , 1 1 s t
CSC304 - Nisarg Shah 13
➢ Existence: Checking the existence of a pure Nash
➢ Computation: Computing a pure NE can be PLS-complete,
➢ Existence: Always exist due to Nash’s theorem ➢ Computation: Computing a mixed NE is PPAD-complete.
CSC304 - Nisarg Shah 14
CSC304 - Nisarg Shah 15
➢ Rationality is common knowledge.
➢ Rationality is perfect = “infinite wisdom”
➢ Full information about what other players are doing.
CSC304 - Nisarg Shah 16
➢ No binding contracts.
➢ No player can commit first.
➢ No external help.
➢ Humans reason about randomization using expectations.
CSC304 - Nisarg Shah 17
➢ Cannot expect humans to find it if your computer cannot.
CSC304 - Nisarg Shah 18
➢ For human agents, take it with a grain of salt. ➢ For AI agents playing against AI agents, perfect!