G AME T HEORY 2 I NSTRUCTOR : G IANNI A. D I C ARO T HE PROFESSOR S - - PowerPoint PPT Presentation

g ame t heory 2
SMART_READER_LITE
LIVE PREVIEW

G AME T HEORY 2 I NSTRUCTOR : G IANNI A. D I C ARO T HE PROFESSOR S - - PowerPoint PPT Presentation

15-382 C OLLECTIVE I NTELLIGENCE S18 L ECTURE 27: G AME T HEORY 2 I NSTRUCTOR : G IANNI A. D I C ARO T HE PROFESSOR S DILEMMA Class Simultaneous move Listen Sleep Non-cooperative game Complete information Make 10 6 ,10 6


slide-1
SLIDE 1

LECTURE 27: GAME THEORY 2

INSTRUCTOR: GIANNI A. DI CARO

15-382 COLLECTIVE INTELLIGENCE – S18

slide-2
SLIDE 2

15781 Fall 2016: Lecture 22

THE PROFESSOR’S DILEMMA

106,106

  • 10,0

0,-10 0,0

Make effort Slack

  • ff

Listen Sleep Dominant strategies? Professor Class

2

Nope, if Class listen, and Professor slacks off, Sleep provides a higher payoff! No dominant strategy: best strategy it doesn’t matter what other player’s strategy

  • Simultaneous move
  • Non-cooperative game
  • Complete information
  • Imperfect information
  • Solution concept: predict

how the game will be played with rational agents

  • Prediction ≡ Solution
  • Nash: Equilibrium concept
slide-3
SLIDE 3

15781 Fall 2016: Lecture 22

NASH EQUILIBRIUM (1951)

  • At equilibrium, each player’s strategy is a best

response to strategies of others

  • Formally, a Nash equilibrium is strategy profile

𝑡 = 𝑡1 … , 𝑡𝑜 ∈ 𝑇𝑜 such that: ∀𝑗 ∈ 𝑂, ∀𝑡𝑗

′ ∈ 𝑇, 𝑣𝑗 𝑡 ≥ 𝑣𝑗(𝑡𝑗 ′, 𝑡−𝑗)

3

John F. Nash, Nobel Prize in Economics, 1994

  • Can we find an equilibrium also in absence
  • f a dominant strategy?
slide-4
SLIDE 4

15781 Fall 2016: Lecture 22

NASH EQUILIBRIUM

  • In equilibrium, each player is playing the strategy that is a “best

response” to the strategies of the other players. No one has an incentive to change strategy given the strategy choices of the others

  • A NE is an equilibrium where each player’s strategy is optimal given

the strategies of all other players.

  • A Nash Equilibrium exists when there is no unilateral profitable

deviation from any of the players involved

  • Nash Equilibria are self-enforcing strategies: when players are at a

Nash Equilibrium they have no desire to move because they will be worse off → Equilibrium in the policy space

  • Dominant strategy ⟹ Nash equilibrium: All solutions in dominant

strategies are also Nash equilibria, but the vice versa is not necessarily (and not usually) true

4

slide-5
SLIDE 5

15781 Fall 2016: Lecture 22

NASH EQUILIBRIUM

  • The best possible outcome of the game.

Equilibrium in the one-shot prisoners’ dilemma is for both players to confess, which is not the best possible

  • utcome (not Pareto optimal)
  • A situation where players always choose the same
  • action. Sometimes equilibrium will involve changing

action choices (mixed strategy equilibrium).

5

Equilibrium is not:

slide-6
SLIDE 6

15781 Fall 2016: Lecture 22

NASH EQUILIBRIUM

  • How many Nash equilibria does the Professor’s Dilemma have?

106,106

  • 10,0

0,-10 0,0

Make effort Slack off Listen Sleep

6

ML - SS

slide-7
SLIDE 7

15781 Fall 2016: Lecture 22

NASH EQUILIBRIA: HOW DO WE FIND THEM?

  • Nash equilibrium: A play of the game where each strategy is a best reply to the

given strategy of the other.

  • Let’s examine all the possible pure strategy profiles and check if for a profile (X,Y)
  • ne player could improve its payoff, given the strategy of the other

(M, L)? If Prof plays M, then L is the best reply given M. Neither player can increase its the payoff by choosing a different action

  • (S,L)? If Prof plays S, S is the best reply given S, not L
  • (M, S)? If Prof plays M, then L is the best reply given M, not S

(S,S)? If Prof plays S, then S is the best reply given S. Neither player can increase its the payoff by choosing a different action

7

slide-8
SLIDE 8

15781 Fall 2016: Lecture 22

NASH EQUILIBRIUM FOR PRISONER’S DILEMMA

Prisoner B

Don’t confess

Confess

Prisoner A

Don’t Confess

  • 1,-1
  • 9,0

Confess

0,-9

  • 6,-6

8

slide-9
SLIDE 9

COORDINATION GAME: STAG HUNT

9

(Originally from J.J. Rousseau)

  • Two equilibria at (stag, stag) and (rabbit, rabbit) → Players' optimal strategy

depend on their expectation on what the other player may do.

  • This game has been used as an analogy for social cooperation, and mutual trust
  • In Prisoner’s dilemma, the Nash equilibrium corresponds to defect, no cooperate!
slide-10
SLIDE 10

15781 Fall 2016: Lecture 22

COMPETITION GAME

10

  • Both players simultaneously choose an integer from 0 to 3
  • They both win the smaller of the two numbers in points.
  • In addition, if one player chooses a larger number than the
  • ther, then it has to give up two points to the other.

Does the (unique) NE at (0,0) make sense?

slide-11
SLIDE 11

15781 Fall 2016: Lecture 22

R P S R

0,0

  • 1,1

1,-1

P

1,-1 0,0

  • 1,1

S

  • 1,1

1,-1 0,0

ROCK-PAPER-SCISSORS

Nash equilibrium? Is there a pure strategy as best response?

11

slide-12
SLIDE 12

15781 Fall 2016: Lecture 22

R P S R 0,0

  • 1,1

1,-1 P 1,-1 0,0

  • 1,1

S

  • 1,1

1,-1 0,0

ROCK-PAPER-SCISSORS

No (pure) Nash equilibria: Best response: randomize!

  • For every pure strategy (X,Y), there

is a different strategy choice that increases the payoff of a player

  • E.g., for strategy (P,R), player B can

get a higher payoff playing strategy S instead R

  • E.g., for strategy (S,R), player A can

get a higher payoff playing strategy P instead S

  • No strategy equilibrium can be

settled, players have the incentive to keep switching their strategy

12

slide-13
SLIDE 13

15781 Fall 2016: Lecture 22

MIXED STRATEGIES

  • Mixed strategy: a probability distribution over (pure) strategies
  • The mixed strategy of player 𝑗 ∈ 𝑂 is 𝑦𝑗, where 𝑦𝑗(𝑡𝑗) =

Pr[𝑗 plays 𝑡𝑗] (e.g., 𝑦𝑗 𝑆 = 0.3, 𝑦𝑗 𝑄 = 0.5, 𝑦𝑗 𝑇 = 0.2)

  • The (expected) utility of player 𝑗 ∈ 𝑂 is

𝑣𝑗 𝑦1, … , 𝑦𝑜 = ෍

(𝑡1,…,𝑡𝑜)∈𝑇𝑜

𝑣𝑗 𝑡1, … , 𝑡𝑜 ⋅ ෑ

𝑘=1 𝑜

𝑦𝑘(𝑡

𝑘)

13

Mixed strategy profile Joint probability of the pure strategy profile given the mixed profile Pure strategy profile Utility of pure strategy profile

slide-14
SLIDE 14

15781 Fall 2016: Lecture 22

EXERCISE: MIXED NE

  • Player 1 plays

1 2 , 1 2 , 0 , player 2

plays 0,

1 2 , 1 2 . What is 𝑣1?

  • Both players play

1 3 , 1 3 , 1 3 . What

is 𝑣1?

R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

14

slide-15
SLIDE 15

15781 Fall 2016: Lecture 22

EXERCISE: MIXED NE

R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

In the second case, because of symmetry, the utility is zero: It’s a zero-sum game

15

Player 1 plays

1 2 , 1 2 , 0 , player 2 plays 0, 1 2 , 1 2 . What is 𝑣1?

+ +

slide-16
SLIDE 16

15781 Fall 2016: Lecture 22

MIXED STRATEGIES EQUILIBRIUM IS NASH

  • 𝑣𝑗 𝑦 is player 𝑗’s expected utility with mixed strategy

profile 𝑦

  • → Same definition as in the case f pure strategies, where

𝑣𝑗 was the utility of a pure strategy instead of a mixed strategy

16

𝑣𝑗 𝑦𝑗

∗, 𝑦−𝑗 ∗

≥ 𝑣𝑗 𝑦𝑗, 𝑦−𝑗

∀ 𝑦𝑗 and 𝑗

  • The mixed strategy profile 𝑦∗ in a strategic game is a

mixed strategy Nash equilibrium if

slide-17
SLIDE 17

15781 Fall 2016: Lecture 22

MIXED STRATEGIES NASH EQUILIBRIUM

  • Using best response functions, 𝑦∗ is a mixed strategy NE iff 𝑦𝑗

∗ is

the best response for every player 𝑗.

  • If a mixed strategy 𝑦∗ is a best response, then each of the pure

strategies in the mix must be best response: they must yield the same expected payoff (otherwise it would just make sense to choose the one with the better payoff)

  • → If a mixed strategy is a best response for player 𝑗, then the

player must be indifferent among the pure strategies in the mix

  • E.g., in the RPS game, if the mixed strategy of player 𝑗 assigns

non-zero probabilities pR for playing R and pP for playing P, then 𝑗’s expected utility for playing R or P has to be the same

17

slide-18
SLIDE 18

15781 Fall 2016: Lecture 22

EXERCISE: MIXED NE

  • Which is a NE?

1.

1 2 , 1 2 , 0 , 1 2 , 1 2 , 0

2.

1 2 , 1 2 , 0 , 1 2 , 0, 1 2

3.

1 3 , 1 3 , 1 3 , 1 3 , 1 3 , 1 3

4.

1 3 , 2 3 , 0

,

2 3 , 0, 1 3

R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

18

Any other NE?

slide-19
SLIDE 19

15781 Fall 2016: Lecture 22

NASH’S THEOREM

  • Theorem [Nash, 1950]: In any game with finite number of

strategies there exists at least one (possibly mixed) Nash equilibrium

19

Player A

1,2 0,4 0,5 3,2

Up Down Left Right Player B This game has no pure strategy Nash equilibria but it does have a Nash equilibrium in mixed strategies. How is it computed?

slide-20
SLIDE 20

15781 Fall 2016: Lecture 22 Player A

Player A plays Up with probability pU and plays Down with probability 1-pU Player B plays Left with probability pL and plays Right with probability 1-pL.

1,2 0,4 0,5 3,2

Up Down Left Right Player B

COMPUTATION OF MS NE

20

slide-21
SLIDE 21

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B

COMPUTATION OF MS NE

21

slide-22
SLIDE 22

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B If B plays Left, its expected utility is

2 5 1 p p

U U

  ( )

COMPUTATION OF MS NE

22

slide-23
SLIDE 23

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B If B plays Right, its expected utility is

COMPUTATION OF MS NE

23

4 2 1 p p

U U

  ( ).

slide-24
SLIDE 24

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B

2 5 1 4 2 1 p p p p

U U U U

     ( ) ( )

If then

B would play only Left, which would be a pure strategy. But there are no (pure) Nash equilibria in which B plays only Left.

COMPUTATION OF MS NE

24

slide-25
SLIDE 25

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B

2 5 1 4 2 1 p p p p

U U U U

     ( ) ( )

If then

B would play only Right. But there are no (pure) Nash equilibria in which B plays only Right

COMPUTATION OF MS NE

25

slide-26
SLIDE 26

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B

For there to exist a MS Nash equilibrium, B must be indifferent between playing Left or Right:

2 5 1 4 2 1 p p p p

U U U U

     ( ) ( )

COMPUTATION OF MS NE

26

slide-27
SLIDE 27

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U,pU D,1-pU L,pL R,1-pL Player B

COMPUTATION OF MS NE

27

2 5 1 4 2 1 3 5 p p p p p

U U U U U

       ( ) ( ) / .

slide-28
SLIDE 28

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5

1 pU =

2 5

pU =

3 5 2 5

COMPUTATION OF MS NE

28

slide-29
SLIDE 29

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

If A plays Up its expected payoff is

COMPUTATION OF MS NE

29

. ) 1 ( 1

L L L

p  p    p 

slide-30
SLIDE 30

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

If A plays Down his expected payoff is

COMPUTATION OF MS NE

30

). 1 ( 3 ) 1 ( 3

L L L

p   p    p 

slide-31
SLIDE 31

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

p p

L L

  3 1 ( )

If then

A would play only Up. But there are no pure Nash equilibria in which A plays only Up.

COMPUTATION OF MS NE

31

slide-32
SLIDE 32

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

If then

A would play only Down. But there are no Nash equilibria in which A plays only Down.

p p

L L

  3 1 ( ) COMPUTATION OF MS NE

32

slide-33
SLIDE 33

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

For there to exist a Mixed Nash equilibrium, A must be indifferent between playing Up or Down:

p p

L L

  3 1 ( ) COMPUTATION OF MS NE

33

slide-34
SLIDE 34

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L,pL R,1-pL Player B

3 5 2 5

COMPUTATION OF MS NE

34

p p p

L L L

    3 1 3 4 ( ) / .

slide-35
SLIDE 35

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L, 3

4

R, 1

4

Player B

3 5 2 5

1 pL =

1 4

pL =

3 4

COMPUTATION OF MS NE

35

slide-36
SLIDE 36

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L, 3

4

R, 1

4

Player B

3 5 2 5

Game’s only Nash equilibrium has A playing the mixed strategy (3

5, 2 5)

and B playing the mixed strategy (3

4, 1 4)

COMPUTATION OF MS NE

36

slide-37
SLIDE 37

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L, 3

4

R, 1

4

Player B

3 5 2 5

Payoffs:

  • (1,2) with probability (3

5 × 3 4) = 9 20

  • (0,4) with probability (3

5 × 1 4) = 3 20

  • (0,5) with probability (2

5 × 3 4) = 6 20

  • (3,2) with probability (2

5 × 1 4) = 2 20

COMPUTATION OF MS NE

37

slide-38
SLIDE 38

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L, 3

4

R, 1

4

Player B

3 5 2 5

A’s expected Nash equilibrium payoff:

COMPUTATION OF MS NE

38

1 9 20 3 20 6 20 3 2 20 3 4         .

slide-39
SLIDE 39

15781 Fall 2016: Lecture 22 Player A

1,2 0,4 0,5 3,2

U, D, L, 3

4

R, 1

4

Player B

3 5 2 5

B’s expected Nash equilibrium payoff:

COMPUTATION OF MS NE

39

2 9 20 4 3 20 5 6 20 2 2 20 16 5         .