A game in normal form consists of: Set of players = {1, , } - - PowerPoint PPT Presentation

β–Ά
a game in normal form consists of
SMART_READER_LITE
LIVE PREVIEW

A game in normal form consists of: Set of players = {1, , } - - PowerPoint PPT Presentation

T RUTH J USTICE A LGOS Game Theory I: Basic Concepts Teachers: Ariel Procaccia (this time) and Alex Psomas NORMAL-FORM GAME A game in normal form consists of: Set of players = {1, , } Strategy set For each


slide-1
SLIDE 1

ALGOS TRUTH JUSTICE

Game Theory I: Basic Concepts

Teachers: Ariel Procaccia (this time) and Alex Psomas

slide-2
SLIDE 2

NORMAL-FORM GAME

  • A game in normal form consists of:
  • Set of players 𝑂 = {1, … , π‘œ}
  • Strategy set 𝑇
  • For each 𝑗 ∈ 𝑂, utility function

𝑣𝑗: π‘‡π‘œ β†’ ℝ: if each j ∈ 𝑂 plays the strategy 𝑑

π‘˜ ∈ 𝑇, the utility of player 𝑗

is 𝑣𝑗(𝑑1, … , π‘‘π‘œ)

slide-3
SLIDE 3

THE PRISONER’S DILEMMA

  • Two men are charged with a crime
  • They are told that:
  • If one rats out and the other does not, the

rat will be freed, other jailed for nine years

  • If both rat out, both will be jailed for six

years

  • They also know that if neither rats out,

both will be jailed for one year

slide-4
SLIDE 4

THE PRISONER’S DILEMMA

  • 1,-1
  • 9,0

0,-9

  • 6,-6

Cooperate Defect Cooperate Defect

What would you do?

slide-5
SLIDE 5

ON TV

http://youtu.be/S0qjK3TWZE8

slide-6
SLIDE 6

THE PROFESSOR’S DILEMMA

106,106

  • 10,0

0,-10 0,0

Make effort Slack off Listen Sleep

Dominant strategies?

Professor Class

slide-7
SLIDE 7

NASH EQUILIBRIUM

  • In a Nash equilibrium, no player wants

to unilaterally deviate

  • Each player’s strategy is a best

response to strategies of others

  • Formally, a Nash equilibrium is a vector
  • f strategies 𝒕 = 𝑑1 … , π‘‘π‘œ ∈ π‘‡π‘œ such

that for all 𝑗 ∈ 𝑂, 𝑑𝑗

β€² ∈ 𝑇,

𝑣𝑗 𝒕 β‰₯ 𝑣𝑗(𝑑1, … , π‘‘π‘—βˆ’1, 𝑑𝑗

β€², 𝑑𝑗+1, … , π‘‘π‘œ)

slide-8
SLIDE 8

THE PROFESSOR’S DILEMMA

106,106

  • 10,0

0,-10 0,0

Make effort Slack off Listen Sleep

Nash equilibria?

Professor Class

slide-9
SLIDE 9

R P S R

0,0

  • 1,1

1,-1

P

1,-1 0,0

  • 1,1

S

  • 1,1

1,-1 0,0

ROCK-PAPER-SCISSORS

Nash equilibria?

slide-10
SLIDE 10

MIXED STRATEGIES

  • A mixed strategy is a probability

distribution over (pure) strategies

  • The mixed strategy of player 𝑗 ∈ 𝑂 is 𝑦𝑗,

where 𝑦𝑗(𝑑𝑗) = Pr[𝑗 plays 𝑑𝑗]

  • The utility of player 𝑗 ∈ 𝑂 is

𝑣𝑗 𝑦1, … , π‘¦π‘œ = ෍

(𝑑1,…,π‘‘π‘œ)βˆˆπ‘‡π‘œ

𝑣𝑗 𝑑1, … , π‘‘π‘œ β‹… ΰ·‘

π‘˜=1 π‘œ

π‘¦π‘˜(𝑑

π‘˜)

slide-11
SLIDE 11

EXERCISE: MIXED NE

R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

  • Exercise: player 1 plays

1 2 , 1 2 , 0 , player 2

plays 0,

1 2 , 1 2 . What is 𝑣1?

  • Exercise: Both players play

1 3 , 1 3 , 1 3 . What is

𝑣1?

slide-12
SLIDE 12

EXERCISE: MIXED NE

Which is a NE?

1.

1 2 , 1 2 , 0 , 1 2 , 1 2 , 0

3.

1 3 , 1 3 , 1 3 , 1 3 , 1 3 , 1 3

2.

1 2 , 1 2 , 0 , 1 2 , 0, 1 2

4.

1 3 , 2 3 , 0

,

2 3 , 0, 1 3

R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

?

Poll 1

slide-13
SLIDE 13

NASH’S THEOREM

  • Theorem [Nash, 1950]: In any (finite)

game there exists at least one (possibly mixed) Nash equilibrium

  • What about computing a Nash

equilibrium? Stay tuned…

slide-14
SLIDE 14

DOES NE MAKE SENSE?

  • Two players, strategies are {2, … , 100}
  • If both choose the same number, that is

what they get

  • If one chooses 𝑑, the other 𝑒, and 𝑑 < 𝑒, the

former player gets 𝑑 + 2, and the latter gets 𝑑 βˆ’ 2

  • Poll 2: What would you choose?

100 99 98 97 96 95

slide-15
SLIDE 15

CORRELATED EQUILIBRIUM

  • Let 𝑂 = {1,2} for simplicity
  • A mediator chooses a pair of strategies

(𝑑1, 𝑑2) according to a distribution π‘ž

  • ver 𝑇2
  • Reveals 𝑑1 to player 1 and 𝑑2 to player 2
  • When player 1 gets 𝑑1 ∈ 𝑇, he knows the

distribution over strategies of 2 is Pr 𝑑2 𝑑1 = Pr 𝑑1 ∧ 𝑑2 Pr 𝑑1 = π‘ž 𝑑1, 𝑑2 Pr[𝑑1]

slide-16
SLIDE 16

CORRELATED EQUILIBRIUM

  • Player 1 is best responding if for all 𝑑1

β€² ∈ 𝑇

෍

𝑑2βˆˆπ‘‡

Pr 𝑑2 𝑑1 𝑣1 𝑑1, 𝑑2 β‰₯ ෍

𝑑2βˆˆπ‘‡

Pr 𝑑2 𝑑1 𝑣1(𝑑1

β€², 𝑑2)

  • Equivalently,

෍

𝑑2βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣1 𝑑1, 𝑑2 β‰₯ ෍

𝑑2βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣1(𝑑1

β€², 𝑑2)

  • π‘ž is a correlated equilibrium (CE) if both

players are best responding

  • Every Nash equilibrium is a correlated

equilibrium, but not vice versa

slide-17
SLIDE 17

GAME OF CHICKEN

http://youtu.be/u7hZ9jKrwvo

slide-18
SLIDE 18

GAME OF CHICKEN

Dare Chicken Dare

0,0 4,1

Chicken

1,4 3,3

  • Social welfare is the

sum of utilities

  • Pure NE: (C,D) and

(D,C), social welfare = 5

  • Mixed NE: both

(1/2,1/2), social welfare = 4

  • Optimal social welfare

= 6

slide-19
SLIDE 19

GAME OF CHICKEN

  • Correlated equilibrium:
  • (D,D): 0
  • (D,C):

1 3

  • (C,D):

1 3

  • (C,C):

1 3

  • Social welfare of CE =

16 3

Dare Chicken Dare

0,0 4,1

Chicken

1,4 3,3

slide-20
SLIDE 20

IMPLEMENTATION OF CE

  • Instead of a mediator, use a

hat!

  • Balls in hat are labeled with

β€œchicken” or β€œdare”, each blindfolded player takes a ball

Which balls implement the distribution

  • f the previous slide?
  • 1. 1 chicken, 1 dare
  • 3. 2 chicken, 1 dare
  • 2. 1 chicken, 2 dare
  • 4. 2 chicken, 2 dare

Poll 3

?

slide-21
SLIDE 21

CE AS LP

  • Can compute CE via linear

programming in polynomial time!

find βˆ€π‘‘1, 𝑑2 ∈ 𝑇, π‘ž 𝑑1, 𝑑2 s.t .t. βˆ€π‘‘1, 𝑑1

β€² ∈ 𝑇,

βˆ€π‘‘2, 𝑑2

β€² ∈ 𝑇,

෍

𝑑1,𝑑2βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 = 1 βˆ€π‘‘1, 𝑑2 ∈ 𝑇, π‘ž 𝑑1, 𝑑2 ∈ [0,1]

෍

𝑑2βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣1 𝑑1, 𝑑2 β‰₯ ෍

𝑑2βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣1(𝑑1

β€², 𝑑2)

෍

𝑑1βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣2 𝑑1, 𝑑2 β‰₯ ෍

𝑑1βˆˆπ‘‡

π‘ž 𝑑1, 𝑑2 𝑣2(𝑑1, 𝑑2

β€²)