Extensive Games with Perfect Information A Mini Tutorial Krzysztof - - PowerPoint PPT Presentation

extensive games with perfect information a mini tutorial
SMART_READER_LITE
LIVE PREVIEW

Extensive Games with Perfect Information A Mini Tutorial Krzysztof - - PowerPoint PPT Presentation

Extensive Games with Perfect Information A Mini Tutorial Krzysztof R. Apt (so not Krzystof and definitely not Krystof) CWI, Amsterdam, the Netherlands , University of Amsterdam Extensive Games withPerfect InformationA Mini Tutorial p. 1/29


slide-1
SLIDE 1

Extensive Games with Perfect Information A Mini Tutorial

Krzysztof R. Apt

(so not Krzystof and definitely not Krystof)

CWI, Amsterdam, the Netherlands, University of Amsterdam

Extensive Games withPerfect InformationA Mini Tutorial – p. 1/29

slide-2
SLIDE 2

Strategic Games: Definition

Strategic game for n ≥ 2 players: (possibly infinite) set Si of strategies, payoff function pi : S1 × · · · × Sn → R, for each player i. Basic assumptions: players choose their strategies simultaneously, each player is rational: his objective is to maximize his payoff, players have common knowledge of the game and of each others’ rationality.

Extensive Games withPerfect InformationA Mini Tutorial – p. 2/29

slide-3
SLIDE 3

Three Examples

Prisoner’s Dilemma C D C 2, 2 0, 3 D 3, 0 1, 1 The Battle of the Sexes F B F 2, 1 0, 0 B 0, 0 1, 2 Matching Pennies H T H 1, −1 −1, 1 T −1, 1 1, −1

Extensive Games with Perfect Information A Mini Tutorial – p. 3/29

slide-4
SLIDE 4

Nash Equilibrium

Notation: G := (S1, . . ., Sn, p1, . . ., pn). Notation: si, s′

i ∈ Si, s, s′, (si, s−i) ∈ S1 × · · · × Sn.

si is a best response to s−i if ∀s′

i ∈ Si pi(si, s−i) ≥ pi(s′ i, s−i).

s is a Nash equilibrium if ∀i si is a best response to s−i: ∀i ∈ {1, . . ., n} ∀s′

i ∈ Si pi(si, s−i) ≥ pi(s′ i, s−i).

Intuition: In a Nash equilibrium no player can gain by unilaterally switching to another strategy.

Extensive Games with Perfect Information A Mini Tutorial – p. 4/29

slide-5
SLIDE 5

Nash Equilibrium

Prisoner’s Dilemma:

1 Nash equilibrium

C D C 2, 2 0, 3 D 3, 0 1, 1 The Battle of the Sexes:

2 Nash equilibria

F B F 2, 1 0, 0 B 0, 0 1, 2 Matching Pennies:

no Nash equlibrium

H T H 1, −1 −1, 1 T −1, 1 1, −1

Extensive Games with Perfect Information A Mini Tutorial – p. 5/29

slide-6
SLIDE 6

Example 1: Prisoner’s Dilemma

1 2 (2,2) C (0,3) D C 2 (3,0) C (1,1) D D

Extensive Games with Perfect Information A Mini Tutorial – p. 6/29

slide-7
SLIDE 7

Example 2: Battle of the Sexes

1 2 (2,1) F (0,0) B F 2 (0,0) F (1,2) B B

Extensive Games with Perfect Information A Mini Tutorial – p. 7/29

slide-8
SLIDE 8

Example 3: Matching Pennies

1 2 (1,-1) H (-1,1) T H 2 (-1,1) H (1,-1) T T

Extensive Games with Perfect Information A Mini Tutorial – p. 8/29

slide-9
SLIDE 9

Discussion

These are examples of two-player games with two stages. In general there may be more players and more stages. We limit ourselves to the games with finitely many stages (games with finite horizon) and such that at each stage exactly one player proceeds.

  • Note. At each stage a player can have infinitely many

choices. We assume here perfect information: each player knows the previous moves.

Extensive Games with Perfect Information A Mini Tutorial – p. 9/29

slide-10
SLIDE 10

Extensive Game: Definition

Extensive game for n ≥ 1 players: game tree: a finite depth tree T := (V, E) with a turn function D : V \ Z → {1, . . ., n}, where Z is the set of leaves of T,

  • utcome function oi : Z → R, for each player i.

We denote it by (T, D, o1, . . ., on). Given v ∈ V \ Z we call {w | (v, w) ∈ E} the set of actions available to player D(v) at node v. Sometimes we identify the actions with the labels put

  • n the edges.

Extensive Games with Perfect Information A Mini Tutorial – p. 10/29

slide-11
SLIDE 11

Strategies

Consider an extensive game EG := (T, D, o1, . . ., on). Let Ni := {v ∈ V | D(v) = i}.

Ni is the set of nodes at which player i takes an action.

Strategy for player i:

si : Ni → V , such that for all v ∈ Ni, (v, si(v)) ∈ E.

Joint strategy: s = (s1, . . ., sn). It assigns a unique edge to every node in V \ Z. To each joint strategy s there corresponds a finite path

path(s) := (v1, . . ., vh) in T defined inductively: v1 is the root of T,

if vk ∈ Z, then vk+1 := si(vk), where D(vk) = i. When each player i selects si we call (o1(z), . . ., on(z)), where z is the last element of path(s), the outcome of EG.

Extensive Games with Perfect Information A Mini Tutorial – p. 11/29

slide-12
SLIDE 12

Example of Strategies: Matching Pennies

1 2 (1,-1) H (-1,1) T H 2 (-1,1) H (1,-1) T T Strategies for player 1: H, T. Strategies for player 2: HH, HT, TH, TT. Thick lines correspond with the joint strategy (T,HH).

Extensive Games with Perfect Information A Mini Tutorial – p. 12/29

slide-13
SLIDE 13

Strategic Forms

With each extensive game EG := (T, D, o1, . . ., on) we associate a strategic game G := (S1, . . ., Sn, p1, . . ., pn) defined as follows:

Si is the set of strategies of player i in EG, pi(s) := oi(z), where z is the last element of path(s). G is called the strategic form of EG. s is called a Nash equilibrium of EG if it is a Nash

equilibrium of G.

Extensive Games with Perfect Information A Mini Tutorial – p. 13/29

slide-14
SLIDE 14

Example: Matching Pennies

1 2 (1,-1) H (-1,1) T H 2 (-1,1) H (1,-1) T T Strategic form

HH HT TH TT H 1, −1 1, −1 −1, 1 −1, 1 T −1, 1 1, −1 −1, 1 1, −1

  • Note. Two Nash equilibria: (H, TH) and (T, TH).

Extensive Games with Perfect Information A Mini Tutorial – p. 14/29

slide-15
SLIDE 15

Win or Lose Games

A two-player extensive game is called a win or lose game if the only possible outcomes are (1, −1) and

(−1, 1). si is called a winning strategy of player i in a win or lose

game EG if

∀s−i ∈ S−ipi(si, s−i) = 1,

where (S1, S2, p1, p2) is the strategic form of EG. Theorem (Zermelo, 1913) In every win or lose game one of the players has a winning strategy.

Extensive Games with Perfect Information A Mini Tutorial – p. 15/29

slide-16
SLIDE 16

Proof of Zermelo’s Theorem

Theorem In every win or lose game one of the players has a winning strategy. We can assume that the players alternate their moves. We can extend all the paths in the game so that all paths in T are of the same depth, say 2k. Let W denote the sentence “player 1 wins after 2k stages”. Then the formula

φ1 := ∃x1∀y1. . .∃xk∀ykW

denotes “player 1 has a winning strategy” and

φ2 := ∀x1∃y1. . .∀xk∃yk¬W

denotes “player 2 has a winning strategy”. But ¬φ1 ≡ φ2, i.e., φ1 ∨ φ2 holds.

Extensive Games with Perfect Information A Mini Tutorial – p. 16/29

slide-17
SLIDE 17

Example: Ultimatum Game

Player 1 claims x ∈ {0, 1, . . ., 100}. Player 2 accepts - the outcome is then (x, 100 − x), or rejects - the outcome is then (0, 0). For each x ∈ {0, 1, . . ., 100} the root 1 has the subtree 1 2

(x, 100 − x)

A

(0, 0)

R

x

Extensive Games with Perfect Information A Mini Tutorial – p. 17/29

slide-18
SLIDE 18

Nash Equilibria in the Ultimatum Game

1 2

(x, 100 − x)

A

(0, 0)

R

x

  • Note. For each x ∈ {0, 1, . . ., 100} there is a Nash

equilibrium with the outcome (x, 100 − x).

  • Proof. Take x∗ ∈ {0, 1, . . ., 100}.

Strategy for player 1: x∗. Strategy for player 2: if x ≤ x∗ then A else R fi. This is a Nash equilibrium with the outcome (x∗, 100 − x∗).

Extensive Games with Perfect Information A Mini Tutorial – p. 18/29

slide-19
SLIDE 19

Example: Ultimatum Game, ctd

Illustration. Strategy for player 1: x∗. Strategy for player 2:

s2 :=if x ≤ x∗ then A else R fi.

Consider two deviations of player 1, x1 < x∗ < x2.

s2 . . . . . . . . . x1 x1, 100 − x1 . . . . . . x∗ x∗

1, 100 − x∗ 1

. . . . . . x2 0, 0 . . . . . . . . . . . . . . .

  • Conclusion. The notion of a Nash equilibrium is not

informative here.

Extensive Games with Perfect Information A Mini Tutorial – p. 19/29

slide-20
SLIDE 20

Subgames

Consider EG := (T, D, o1, . . ., on). We define the subgame of EG rooted at node v of T,

EGv, as expected.

  • Note. Each strategy si of player i in EG uniquely

determines his strategy sv

i in EGv.

(s1, . . ., sn) is called a subgame perfect equilibrium in EG if for each node v of T (sv

1, . . ., sv n) is a Nash

equilibrium in EGv. Informally: s is subgame perfect equilibrium in EG if it induces a Nash equilibrium in every subgame of EG.

Extensive Games with Perfect Information A Mini Tutorial – p. 20/29

slide-21
SLIDE 21

Backward Induction

Given a tree (V, E) and v ∈ V , let

desc(v) := {w | (v, w) ∈ E}.

Fix a finite extensive game EG := ((V, E), D, o1, . . ., on). Backward induction algorithm while |V | > 1 do choose v ∈ V such that all its descendants are leaves;

i := D(v);

choose w ∈ desc(v) such that oi(w) is maximal;

si(v) := w;

for j ∈ {1, . . ., n} do oj(v) := oj(w) od;

V := V \ desc(v); E := E ∩ (V × V );

  • d
  • Note. This process generates a set of joint strategies.

Multiple joint strategies may arise due to the second choose statement.

Extensive Games with Perfect Information A Mini Tutorial – p. 21/29

slide-22
SLIDE 22

Kuhn and Selten Theorems

Theorem (Kuhn, 1950) Every finite extensive game (with perfect information) has a Nash equilibrium. Theorem (Selten, 1965) Every finite extensive game (with perfect information) has a subgame perfect equilibrium.

  • Proof. A stronger claim holds:

A joint strategy is a subgame perfect equilibrium iff it can be generated by the backward induction algorithm.

Extensive Games with Perfect Information A Mini Tutorial – p. 22/29

slide-23
SLIDE 23

Example: Ultimatum Game

1 2

(x, 100 − x)

A

(0, 0)

R

x

Player 2 has two best responses to the strategy 100: A and R.

  • Note. There are two subgame perfect equilibria:

(100, always A), with the outcome (100, 0), (99, if x = 100 then A else R fi), with the outcome

(99, 1).

Extensive Games with Perfect Information A Mini Tutorial – p. 23/29

slide-24
SLIDE 24

Example: the Centipede Game

(Rosenthal, 1981)

S S S S S S C C C C C C

1a 1b 1c 2a 2b 2c

(1, 1) (0, 3) (2, 2) (1, 4) (3, 3) (2, 5) (4, 4)

General rule: Initial situation: (1, 1). If a player continues he loses 1 and the opponent gains 2.

  • Note. Backward induction shows that in the unique

subgame perfect equilibrium each player selects at each node S. So the outcome of the game is (1, 1).

(1, 1) is also the outcome of the game in each Nash

equilibrium.

Extensive Games with Perfect Information A Mini Tutorial – p. 24/29

slide-25
SLIDE 25

Extensive Games with Imperfect Information

Example: Matching Pennies 1 2 (1,-1) H (-1,1) T H 2 (-1,1) H (1,-1) T T

  • Intuition. Player 2 does not know the action of player 1.

Imperfect information: players do not need to know some of the previous moves.

Extensive Games with Perfect Information A Mini Tutorial – p. 25/29

slide-26
SLIDE 26

Definition

Extensive game with imperfect information for n ≥ 1 players: labelled game tree: a finite depth tree T := (V, E) with labelled edges and a turn function D : V \ Z → {1, . . ., n}, where Z is the set of leaves of T, such that there is a partition I of the nodes from V \ Z; each element of I is called an information set, if v and v′ are in the same information set, then

D(v) = D(v′) and E(v) = E(v′),

where E(v) is the set of labels on the edges starting in v (called actions available to player D(v) at node

v),

  • utcome function oi : Z → R, for each player i.

We denote it by (T, D, I, o1, . . ., on).

Extensive Games with Perfect Information A Mini Tutorial – p. 26/29

slide-27
SLIDE 27

Strategies

Consider an extensive game with imperfect information

EG := (T, D, I, o1, . . ., on).

Let Ii := {J ∈ I | for all v ∈ J, D(v) = i}.

Ii is the set of information sets at which player i takes

an action. Strategy for player i:

si : Ii → ∪v∈V E(v), such that for all J ∈ Ii and v ∈ J, si(J) ∈ E(v).

Joint strategy: s = (s1, . . ., sn). It assigns a unique edge to every node in V \ Z. If two nodes lie in the same information set, then the edges with the same label are assigned to them.

Extensive Games with Perfect Information A Mini Tutorial – p. 27/29

slide-28
SLIDE 28

Strategies in the Matching Pennies Game

1 2 (1,-1) H (-1,1) T H 2 (-1,1) H (1,-1) T T Strategies for player 1: H, T. Strategies for player 2: H, T. Thick lines correspond with the joint strategy (T,H). So this game coincides with the strategic game

H T H 1, −1 −1, 1 T −1, 1 1, −1

Extensive Games with Perfect Information A Mini Tutorial – p. 28/29

slide-29
SLIDE 29

Strategic Forms

With each extensive game with imperfect information we associate a strategic game defined as before. Note Extensive games with imperfect information do not need to have a Nash equilibrium. A fortiori these games do not need to have a subgame perfect equilibrium. Caveat The notion of a subgame has to be redefined.

Extensive Games with Perfect Information A Mini Tutorial – p. 29/29