ECE700.07: Game Theory with Engineering Applications Le Lecture 5: - - PowerPoint PPT Presentation

โ–ถ
ece700 07 game theory with engineering applications
SMART_READER_LITE
LIVE PREVIEW

ECE700.07: Game Theory with Engineering Applications Le Lecture 5: - - PowerPoint PPT Presentation

ECE700.07: Game Theory with Engineering Applications Le Lecture 5: 5: Ga Games in Ext Extensi ensive e Form Seyed Majid Zahedi Outline Perfect information extensive form games Subgame perfect equilibrium Backward induction


slide-1
SLIDE 1

ECE700.07: Game Theory with Engineering Applications

Seyed Majid Zahedi

Le Lecture 5: 5: Ga Games in Ext Extensi ensive e Form

slide-2
SLIDE 2

Outline

  • Perfect information extensive form games
  • Subgame perfect equilibrium
  • Backward induction
  • One-shot deviation principle
  • Imperfect information extensive form games
  • Readings:
  • MAS Sec. 5, GT Sec. 3 (skim through Sec. 3.4 and 3.6), Sec. 4.1, and Sec 4.2
slide-3
SLIDE 3

Extensive Form Games

  • So far, we have studied strategic form games
  • Agents take actions once and simultaneously
  • Next, we study extensive form games
  • Agents sequentially make decisions in multi-stage games
  • Some agents may move simultaneously at some stage
  • Extensive form games can be conveniently represented by game trees
slide-4
SLIDE 4

Example: Entry Deterrence Game

  • Entrant chooses to enter market or stay out
  • Incumbent, after observing entrantโ€™s action, chooses to accommodate or fight
  • Utilities are given by (๐‘ฆ, ๐‘ง) at leaves for each action profile (or history)
  • ๐‘ฆ denotes utility of agent 1 (entrant) and ๐‘ง denotes utility of agent 2 (incumbent)
slide-5
SLIDE 5

Example: Investment in Duopoly

  • Agent 1 chooses to invest or not invest
  • After that, both agents engage in Cournot competition
  • If agent 1 invests, then they engage in Cournot game with ๐‘‘' = 0 and ๐‘‘* = 2
  • Otherwise, they engage in Cournot game with ๐‘‘' = ๐‘‘* = 2
slide-6
SLIDE 6

Finite Perfect-Information Extensive Form Games

  • Formally, each game is tuple ๐ป = โ„, ๐’/ /โˆˆโ„, โ„‹, ๐’ถ, ๐›ฝ, ๐›พ/ /โˆˆโ„, ๐œ, ๐‘ฃ/ /โˆˆโ„
  • โ„ is finite set of agents
  • ๐’7 is set of actions available to agent ๐‘—
  • โ„‹ is set of choice nodes (internal nodes of game tree)
  • ๐’ถ is set of terminal nodes (leaves of game tree)
  • ๐›ฝ: โ„‹ โ†ฆ 2โ„ is agent function, which assigns to each choice node set of agents
  • ๐›พ7: โ„‹ โ†ฆ 2๐’; is action function, which maps choice nodes to set of actions available to agent ๐‘—
  • ๐œ: โ„‹ร—๐’ โ†ฆ โ„‹ โˆช ๐’ถ is successor function, which maps choice nodes and action profiles to new

choice or terminal node, such that if ๐œ โ„Ž', ๐‘' = ๐œ โ„Ž*, ๐‘* , then โ„Ž' = โ„Ž* and ๐‘' = ๐‘*

  • ๐‘ฃ/: ๐’ถ โ†ฆ โ„ is utility function, which assigns real-valued utility to agent ๐‘— at terminal nodes
slide-7
SLIDE 7

History in Extensive Form Games

  • Let ๐ผB = โ„ŽB โІ โ„‹ โˆช ๐’ถ be set of all possible stage ๐‘™ nodes in gameโ€™s tree
  • โ„ŽE = โˆ…

initial history

  • ๐‘E = ๐‘/

E /โˆˆG HI

stage 0 action profile

  • โ„Ž' = ๐‘E

history after stage 0

  • ๐‘' = ๐‘/

' /โˆˆG HJ

stage 1 action profile

  • โ„Ž* = ๐‘E, ๐‘'

history after stage 1

  • โ‹ฎ

โ‹ฎ

  • โ„ŽB = ๐‘E, โ€ฆ , ๐‘BM'

history after stage ๐‘™ โˆ’ 1

  • If number of stages is finite, then game is called finite horizon game
  • In perfect information extensive form games, each choice (and terminal) node is

associated with unique history and vice versa

slide-8
SLIDE 8

Strategies in Extensive Form Games

  • Pure strategies for agent ๐‘— is defined as contingency plan for every

choice node that agent ๐‘— is assigned to

  • Example:
  • Agent 1โ€™s strategies: ๐‘ก' โˆˆ ๐‘‡' = ๐ท, ๐ธ
  • Agent 2โ€™s strategies: ๐‘ก* โˆˆ ๐‘‡* = ๐น๐ป, ๐น๐ผ, ๐บ๐ป, FH
  • For strategy profile ๐‘ก = ๐ท, ๐น๐ป , outcome is terminal node ๐ท, ๐น
slide-9
SLIDE 9

Randomized Strategies in Extensive Form Games

  • Mixed strategy: randomizing over pure strategies
  • Behavioral strategy: randomizing at each choice node
  • Example:
  • Give behavioral strategy for agent 1
  • L with probability 0.2 and L with probability 0.5
  • Give mixed strategy for agent 1 that is not behavioral strategy
  • LL with probability 0.4 and RR with probability 0.6 (why this is not behavioral?)

2,4 5,3 3,2 1,0 0,1

Agent 1 Agent 2 Agent 1 Agent 2

L R L R L R L R

slide-10
SLIDE 10

Example: Sequential Matching Pennies

  • Consider following extensive form version of matching pennies
  • How many strategies does agent 2 have?
  • ๐‘ก* โˆˆ ๐‘‡* = ๐ผ๐ผ, ๐ผ๐‘ˆ, ๐‘ˆ๐ผ, ๐‘ˆ๐‘ˆ
  • Extensive form games can be represented as normal form games
  • What will happen in this game?

Agent 2 Agent 1 HH HT TT TH Heads (-1, 1) (-1, 1) (1, -1) (1, -1) Tails (1, -1) (-1, 1) (-1, 1) (1, -1)

slide-11
SLIDE 11

Example: Entry Deterrence Game

  • Consider following extensive form game
  • What is equivalent strategic form representation?
  • Two pure Nash equilibrium: (In, A) and (Out, F)
  • Are Nash equilibria of this game reasonable in reality?
  • (Out, F) is sustained by noncredible threat of Entrant

Incumbent Entrant A F In (2, 1) (0, 0) Out (1, 2) (1, 2)

slide-12
SLIDE 12

Subgames

  • Suppose that ๐‘Š

Z represents set of all nodes in ๐ปโ€™s game tree

  • Subgame ๐ปโ€ฒ of ๐ป consists of one choice node and all its successors
  • Restriction of strategy ๐‘ก to subgame ๐ป\ is denoted by ๐‘กZ]
  • Subgame ๐ปโ€ฒ can be analyzed as its own game
  • Example: sequential matching pennies
  • How many subgame does this game have?
  • Given that game itself is also considered as subgame, there are three subgames
slide-13
SLIDE 13

Matrix Representation of Subgames

Agent 2 Agent 1 LL LR RL RR LL

2, 4 2, 4 5, 3 5, 3

LR

2, 4 2, 4 5, 3 5, 3

RL

3, 2 1, 0 3, 2 1, 0

RR

3, 2 0, 1 3, 2 0, 1

Agent 2 Agent 1 ** *L

1, 0

*R

0, 1

Agent 2 Agent 1 *L *R *L

3, 2 1, 0

*R

3, 2 0, 1

Agent 2 Agent 1 L* R* **

2, 4 5, 3

2,4 5,3 3,2 1,0 0,1

Agent 1 Agent 2 Agent 1 Agent 2

L R L R L R L R

slide-14
SLIDE 14

Subgame Perfect Equilibrium (SPE)

  • Profile ๐‘กโˆ— is SPE of game ๐ป if for any subgame ๐ป\ of ๐ป, ๐‘กZ]

โˆ— is NE of ๐ป\

  • Loosely speaking, subgame perfection will remove noncredible threats
  • Noncredible threads are not NE in their subgames
  • How to find SPE?
  • One could find all of NE, then eliminate those that are not subgame perfect
  • But there are more economical ways of doing it
slide-15
SLIDE 15

Backward Induction for Finite Games

  • (1) Start from โ€œlastโ€ subgames (choice nodes with all terminal children)
  • (2) Find Nash equilibria of those subgames
  • (3) Turn those choice nodes to terminal nodes using NE utilities
  • (4) Go to (1) until no choice node remains
  • [Theorem] Backward induction gives entire set of SPE
slide-16
SLIDE 16

SPE of Extensive Form Game and NE of Subgames

  • (RR, LL) and (LR, LR) are not subgame perfect equilibria because (*R, **) is not an equilibrium
  • (LL, LR) is not subgame perfect because (*L, *R) is not an equilibrium, *R is not a credible threat

1,0 3,2 2,4 3,2

Agent 2 Agent 1 LL LR RL RR LL

2, 4 2, 4 5, 3 5, 3

LR

2, 4 2, 4 5, 3 5, 3

RL

3, 2 1, 0 3, 2 1, 0

RR

3, 2 0, 1 3, 2 0, 1

Agent 2 Agent 1 ** *L

1, 0

*R

0, 1

Agent 2 Agent 1 *L *R *L

3, 2 1, 0

*R

3, 2 0, 1

Agent 2 Agent 1 L* R* **

2, 4 5, 3

2,4 5,3 3,2 1,0 0,1

Agent 1 Agent 2 Agent 1 Agent 2

L R L R L R L R

slide-17
SLIDE 17

Example: Stackleberg Model of Competition

  • Consider variant of Cournot game where firm 1 first chooses ๐‘Ÿ', then

firm 2 chooses ๐‘Ÿ* after observing ๐‘Ÿ' (firm 1 is Stackleberg leader)

  • Suppose that both firms have marginal cost ๐‘‘ and inverse demand

function is given by ๐‘„ ๐‘… = ๐›ฝ โˆ’ ๐›พ๐‘…, where ๐‘… = ๐‘Ÿ' + ๐‘Ÿ*, and ๐›ฝ > ๐‘‘

  • Solve for SPE by backward induction starting firm 2โ€™s subgame
  • Firm 2 chooses ๐‘Ÿ* = arg max

ijE

๐›ฝ โˆ’ ๐›พ ๐‘Ÿ' + ๐‘Ÿ โˆ’ ๐‘‘ ๐‘Ÿ

  • ๐‘Ÿ* = ๐›ฝ โˆ’ ๐‘‘ โˆ’ ๐›พ๐‘Ÿ' /2๐›พ
  • Firm 1 chooses ๐‘Ÿ' = arg max

ijE

๐›ฝ โˆ’ ๐›พ ๐‘Ÿ + ๐›ฝ โˆ’ ๐‘‘ โˆ’ ๐›พ๐‘Ÿ /2๐›พ โˆ’ ๐‘‘ ๐‘Ÿ

  • ๐‘Ÿ' = ๐›ฝ โˆ’ ๐‘‘ /2๐›พ
  • ๐‘Ÿ* = ๐›ฝ โˆ’ ๐‘‘ /4๐›พ
slide-18
SLIDE 18

Example: Ultimatum Game

  • Two agents want to split ๐‘‘ dollars
  • 1 offers 2 some amount ๐‘ฆ โ‰ค ๐‘‘
  • If 2 accepts, outcome is ๐‘‘ โˆ’ ๐‘ฆ, ๐‘ฆ
  • If 2 rejects, outcome is 0, 0
  • What is 2โ€™s best response if ๐‘ฆ > 0?
  • Yes
  • What is 2โ€™s best response if ๐‘ฆ = 0?
  • Indifferent between

Yes or No

  • What are 2โ€™s optimal strategies?
  • (a)

Yes for all ๐‘ฆ โ‰ฅ 0

  • (b)

Yes if ๐‘ฆ > 0, No if ๐‘ฆ = 0

๐‘ฆ ๐‘‘ โˆ’ ๐‘ฆ, ๐‘ฆ 0,0

Agent 1

๐‘‘

Agent 2 Yes No

slide-19
SLIDE 19

SPE of Ultimatum Game

  • What is 1โ€™s optimal strategy for each of 2โ€™s optimal strategies?
  • For (a), 1โ€™s optimal strategy is to offer ๐‘ฆ = 0
  • For (b),
  • If agent 1 offers ๐‘ฆ = 0, then her utility is 0
  • If she wants to offer any ๐‘ฆ > 0, then she must offer arg max
  • pE (๐‘‘ โˆ’ ๐‘ฆ)
  • This optimization does not have any optimal solution!
  • No offer of agent 1 is optimal!
  • Unique SPE of ultimatum game is:

โ€œAgent 1 offers 0, and agent 2 accepts all offersโ€

slide-20
SLIDE 20

Modified Ultimatum Game

  • If ๐‘‘ is in multiples of cent, what are 2โ€™s optimal strategies?
  • (a)

Yes for all ๐‘ฆ โ‰ฅ 0

  • (b)

Yes if ๐‘ฆ > 0, No if ๐‘ฆ = 0

  • What are 1โ€™s optimal strategies for each of 2โ€™s?
  • For (a), offer ๐‘ฆ = 0
  • For (b), offer ๐‘ฆ = 1 cent
  • What are SPE of modified ultimatum game?
  • Agent 1 offers 0, and agent 2 accepts all offers
  • Agent 1 offers 1 cent, and agent 2 accept all offers except 0
  • Show that for every ฬ…

๐‘ฆ โˆˆ 0, ๐‘‘ , there exists NE in which 1 offers ฬ… ๐‘ฆ

  • What is agent 2โ€™s optimal strategy?
slide-21
SLIDE 21

limitation of Backward Induction

  • If there are ties, how they are broken affects what happens up in tree
  • There could be too many equilibria

Agent 1 Agent 2 Agent 2

3,2 2,3 4,1 0,1 0.87655 0.12345 1/2 1/2

slide-22
SLIDE 22

Example: Bargaining Game

  • Two agents want to split ๐‘‘ = 1 dollar
  • First, 1 makes her offer
  • Then, 2 decides to accept or reject
  • If 2 rejects, then 2 makes new offer
  • Then, 1 decides to accept or reject
  • Let ๐‘ฆ = ๐‘ฆ', ๐‘ฆ* with ๐‘ฆ' + ๐‘ฆ* = 1

denote allocations in 1st round

  • Let ๐‘ง = (๐‘ง', ๐‘ง*) with ๐‘ง' + ๐‘ง* = 1

denote allocations in 2nd round

๐‘ฆ 1 โˆ’ ๐‘ฆ, ๐‘ฆ

Agent 1

1

Agent 2 Yes No Agent 2

1

๐‘ง ๐‘ง, 1 โˆ’ ๐‘ง 0,0

Agent 1 Yes No

slide-23
SLIDE 23

Backward Induction for Bargaining Game

  • Second round is ultimatum game with unique SPE
  • Agent 2 offers 0, and agent 1 accepts all offers
  • What is 2โ€™s optimal strategy in her round 1โ€™s subgame?
  • (a) If ๐‘ฆ* โ‰ค 1, reject
  • (b) If ๐‘ฆ* = 1, accept, and reject otherwise
  • What are 1โ€™s optimal strategies in round 1 for each of 2โ€™s?
  • For both (a) and (b), agent 1 is indifferent between all strategies
  • Agent 1โ€™s weakly dominant strategy is to offer ๐‘ฆ* = 1
  • How many SPE does this game have?
  • Infinitely many! In all SPE, agent 2 gets everything
  • Last moverโ€™s advantage: In every SPE, agent who makes offer in last round obtains everything
slide-24
SLIDE 24

Example: Discounted Bargaining Game

  • Suppose utilities are discounted every

round by discount factor, 0 < ๐œ€/ < 1

  • What is unique SPE of (1)?
  • 2 offers ๐‘ง' = 0 and 1 accepts all offers
  • What are optimal strategies in (2)?
  • (a) Yes if ๐‘ฆ* โ‰ฅ ๐œ€*, No otherwise
  • (b) Yes if ๐‘ฆ* > ๐œ€*, No otherwise
  • What are optimal strategies in (3)?
  • For (a), offer ๐‘ฆ* = ๐œ€*
  • For (b), there is no optimal strategy

๐‘ฆ ๐‘ฆ', ๐‘ฆ*

Agent 1

1

Agent 2 Yes No Agent 2

1

๐‘ง ๐œ€'๐‘ง', ๐œ€*๐‘ง* 0,0

Agent 1 Yes No (1) 1) (2) 2) (3) 3)

slide-25
SLIDE 25

Unique SPE of Discounted Bargaining Game

  • What are SPE strategies?
  • Agent 1โ€™s proposes 1 โˆ’ ๐œ€*, ๐œ€*
  • Agent 2 only accepts proposals with ๐‘ฆ* โ‰ฅ ๐œ€*
  • Agent 2 proposes 0,1 after any history in which1โ€™s proposal is rejected
  • Agent 1 accepts all proposals of Agent 2
  • What is SPE outcome of game?
  • Agent 1 proposes 1 โˆ’ ๐œ€*, ๐œ€*
  • Agent 2 accepts
  • Resulting utilities are 1 โˆ’ ๐œ€*, ๐œ€*
  • Desirability of earlier agreement yields positive utility for agent 1
slide-26
SLIDE 26

Stahlโ€™s Bargaining Model (for Finite Horizon Games)

  • 2 rounds:

1 โˆ’ ๐œ€*

  • 3 rounds:

1 โˆ’ ๐œ€* + ๐œ€'๐œ€*

  • 4 rounds:

(1 โˆ’ ๐œ€*) 1 + ๐œ€'๐œ€*

  • 5 rounds:

(1 โˆ’ ๐œ€*) 1 + ๐œ€'๐œ€* + ๐œ€'๐œ€*

  • 2k rounds:

1 โˆ’ ๐œ€*

'M xJxy z 'MxJxy

  • 2k+1 rounds:

1 โˆ’ ๐œ€*

'M xJxy z 'MxJxy

+ ๐œ€'๐œ€* B

  • Taking limit as ๐‘™ โ†’ โˆž, we see that agent 1 gets ๐‘ฆ'

โˆ— = 'Mxy 'MxJxy at SPE

slide-27
SLIDE 27

Rubinsteinโ€™s Infinite Horizon Bargaining Model

  • Suppose agent can alternate offers forever
  • There are two types of outcome to consider
  • At round ๐‘ข, one agent accepts her offer ๐‘ฆ', No, ๐‘ฆ*, No, โ€ฆ , ๐‘ฆโ‚ฌ, Yes
  • Every offer gets rejected: ๐‘ฆ', No, ๐‘ฆ*, No, โ€ฆ , ๐‘ฆB, No, โ€ฆ
  • This is not finite horizon game, backward induction cannot be used
  • We need different method to verify any SPE
slide-28
SLIDE 28

One-Shot Deviation Principle

  • One-shot deviation from strategy ๐‘ก means deviating from ๐‘ก in single

stage and conforming to it thereafter

  • Strategy profile ๐‘กโˆ— is SPE if and only if there exists no profitable one-

shot deviation for each subgame and every agent

  • This follows from principle of optimality of dynamic programming
slide-29
SLIDE 29

SPE for Rubinsteinโ€™s Model

  • Recall that in Stahlโ€™s model, for ๐‘™ โ†’ โˆž, ๐‘ฆ'

โˆ— = 'Mxy 'MxJxy

  • Is following strategy profile ๐‘กโˆ— SPE?
  • Agent 1 proposes ๐‘ฆโˆ— and accepts ๐‘ง if and only if ๐‘ง โ‰ฅ ๐‘ง'

โˆ—

  • Agent 2 proposes ๐‘งโˆ— and accepts ๐‘ฆ if and only if ๐‘ฆ โ‰ฅ ๐‘ฆ*

โˆ—

  • ๐‘ฆโˆ— = ๐‘ฆ'

โˆ—, ๐‘ฆ* โˆ— , ๐‘ฆ' โˆ— = 'Mxy 'MxJxy ,

๐‘ฆ*

โˆ— = xy('MxJ) 'MxJxy

  • ๐‘งโˆ— = ๐‘ง'

โˆ—, ๐‘ง* โˆ— , ๐‘ง' โˆ— = xJ('Mxy) 'MxJxy ,

๐‘ง*

โˆ— = 'MxJ 'MxJxy

slide-30
SLIDE 30

One-Shot Deviation Principle for Rubinsteinโ€™s Model

  • First note that this game has two types of subgames
  • (1) first move is offer
  • (2) first move is response to offer
  • For (1), suppose offer is made by agent 1
  • If agent 1 adopts ๐‘กโˆ—, agent 2 accepts, agent 1 gets ๐‘ฆ'

โˆ—

  • If agent 1 offers > ๐‘ฆ*

โˆ—, agent 2 accepts, and agent 1 gets ๐‘ฆ' < ๐‘ฆ' โˆ—

  • If agent 1 offers < ๐‘ฆ*

โˆ—, agent 2 rejects and offers ๐‘ง' โˆ—, agent 1 accepts and gets ๐œ€'๐‘ง' โˆ— < ๐‘ฆ' โˆ—

  • For (2), suppose agent 1 is responding to offer ๐‘ง' โ‰ฅ ๐‘ง'

โˆ—

  • If agent 1 adopts ๐‘กโˆ—, she accepts and gets ๐‘ง'
  • If agent 1 rejects and offers ๐‘ฆ*

โˆ— in next round, agent 2 accepts, agent 1 gets ๐œ€'๐‘ฆ' โˆ— = ๐‘ง' โˆ— โ‰ค ๐‘ง'

  • For (2), suppose agent 1 is responding to offer ๐‘ง' < ๐‘ง'

โˆ—

  • If agent 1 adopts ๐‘กโˆ—, she rejects and offers ๐‘ฆ*

โˆ— in next round, agent 2 accepts, agent 1 gets ๐œ€'๐‘ฆ' โˆ— = ๐‘ง' โˆ— > ๐‘ง'

  • If agent 1 accepts, she gets ๐‘ง' < ๐‘ง'

โˆ—

  • Hence ๐‘กโˆ— is SPE (in fact unique SPE, check GT, Section 4.4.2 to verify)
slide-31
SLIDE 31

Rubinsteinโ€™s Model for Symmetric Agents

  • Suppose that ๐œ€' = ๐œ€*
  • If agent 1 moves first, division is

' 'ฦ’x , x 'ฦ’x

  • If agent 2 moves first, division is

x 'ฦ’x , ' 'ฦ’x

  • First moverโ€™s advantage is related to impatience of agents
  • If ๐œ€ โ†’ 1, FMA disappears and outcome tends to '

* , ' *

  • If ๐œ€ โ†’ 0, FMA dominates and outcome tends to 1,0
slide-32
SLIDE 32

Imperfect Information Extensive Form Games

  • In perfect information games, agents know choice nodes they are in
  • Agents know all prior actions
  • Recall that in such games choice nodes are equal to histories that led to them
  • Agents may have partial or no knowledge of actions taken by others
  • Agents may also have imperfect recall of actions taken by themselves
slide-33
SLIDE 33

Example: Imperfect Information Sequential Matching Pennies

  • Agent 1 takes action
  • Agent 2 does not see agent 1โ€™s action
  • Agent 2 takes action, and outcome is revealed
  • Information set is collection of choice nodes that cannot be

distinguished by agents whose turn it is

  • Set of agents and their actions at each choice node in information set

has to be the same, otherwise, agents could distinguish between nodes

Agent 1

H T

Agent 2

H H T T

  • 1
  • 1

1 1

slide-34
SLIDE 34

Finite Imperfect-Information Extensive Form Games

  • Formally, each game is tuple ๐ป = โ„, ๐’/ /โˆˆโ„, โ„‹, ๐’ถ, ๐›ฝ, ๐›พ/ /โˆˆโ„, ๐œ, ๐‘ฃ/ /โˆˆโ„, ๐ฝ
  • โ„, ๐’/ /โˆˆโ„, โ„‹, ๐’ถ, ๐›ฝ, ๐›พ/ /โˆˆโ„, ๐œ, ๐‘ฃ/ /โˆˆโ„ is perfect information, extensive form game
  • ๐ฝ = ๐ฝ', โ€ฆ , ๐ฝโ€ฆ , where ๐ฝ

โ€  = โ„Žโ€ ,', โ€ฆ , โ„Žโ€ ,Bโ€ก , is partition of โ„‹ such that if โ„Ž, โ„Ž\ โˆˆ ๐ฝ โ€ , then

๐›ฝ โ„Ž = ๐›ฝ โ„Ž\ , and for all ๐‘— โˆˆ ๐›ฝ โ„Ž , ๐›พ/ โ„Ž = ๐›พ/ โ„Ž\

slide-35
SLIDE 35

Example: Poker-Like Game

  • What are agent 1โ€™s strategies?
  • ๐‘†๐‘†, ๐‘†๐ท, ๐ท๐‘†, ๐ท๐ท
  • What are agent 2โ€™s strategies?
  • ๐ท๐ท, ๐ท๐บ, ๐บ๐ท, ๐บ๐บ
  • How can we find NE of this game?
  • Model game as normal form zero-sum game
  • Each cell represents expected utilities (natureโ€™s coin toss)
  • Eliminated (weakly) dominated strategies
  • Solve for (mixed strategy) NE

Agent 2 Agent 1 CC CF FC FF RR 0, 0 0, 0 1, -1 1, -1 RC 0.5, -0.5 1.5, -1.5 0, 0 1, -1 CR

  • 0.5, 0.5
  • 0.5, 0.5

1, -1 1, -1 CC 0, 0 1, -1 0, 0 1, -1

Nature

Give 1 King Give 1 Jack 50% 50%

Agent 1 Agent 1

Raise Raise Check Check

2/3 1/3 1/3 2/3

Agent 2

1

  • 1

1 1

call fold call fold

1

  • 2

1 2

call fold call fold

Agent 2

slide-36
SLIDE 36

Example: Kune Poker

https://justinsermeno.com/posts/cfr/

slide-37
SLIDE 37

Imperfect Recall, Mixed vs Behavioral Strategies

  • Consider mixed strategies
  • What is NE of this game?
  • (R,D) with outcome utilities (2,2)
  • Consider behavioral strategies
  • What is 1โ€™s expected utility if she does ๐‘ž, 1 โˆ’ ๐‘ž
  • ๐‘ž* + 100๐‘ž 1 โˆ’ ๐‘ž + 2 1 โˆ’ ๐‘ž
  • What is 1โ€™s best response?
  • ๐‘ž = ล โ€น

'ล โ€น

  • What is NE of this game?
  • ล โ€น

'ล โ€น , 'EE 'ล โ€น , 0,1

Agent 1

L R

Agent 1

L U R D 1,1 2,2 100,100 5,1

Agent 2

slide-38
SLIDE 38

Solving Extensive Form Games: Perfect vs Imperfect Information

  • In perfect information games, optimal strategy for each subgame can be determined

by that subgame alone (how backward induction works!)

  • We can forget how we got here
  • We can ignore rest of game
  • In imperfect information games, this is not necessarily true
  • We cannot forget about path to current node
  • We cannot ignore other subgames
https://www.chess.com
slide-39
SLIDE 39

Example

  • Is always accommodating good strategy?
  • No, leads to utility of -2.5 for incumbent
  • Is always fighting good strategy?
  • No, leads to utility of -1.5 for incumbent
  • What should incumbent do?
  • A with 3/8 probability and F with 5/8
  • What if we swap 2 and -2?
  • A with 7/8 probability and F with 1/8

In In

Entrant Entrant

Out Out

2

Nature

Heads Tails 50% 50%

  • 2

Incumbent

  • 5

3 5

  • 3

A F A F

slide-40
SLIDE 40

Subgame Perfection and Imperfect Information

  • There are two subgames: game itself and subgame after agent 1 plays R
  • (R, RR) is NE and SPE
  • But, why should 2 play R after 1 plays L/M?
  • This is noncredible threat
  • There are more sophisticated equilibrium refinements that rule this out

Agent 1 Agent 2 Agent 2 4, 1 0, 0 5, 1 1, 0 Agent 2 3, 2 2, 3

L M R L R L R L R

slide-41
SLIDE 41

Questions?

slide-42
SLIDE 42

Acknowledgement

  • This lecture is a slightly modified version of ones prepared by
  • Asu Ozdaglar [MIT 6.254]
  • Vincent Conitzer [Duke CPS 590.4]