Perfect-Information Extensive Form Games
CMPUT 654: Modelling Human Strategic Behaviour
S&LB §5.1
Perfect-Information Extensive Form Games CMPUT 654: Modelling Human - - PowerPoint PPT Presentation
Perfect-Information Extensive Form Games CMPUT 654: Modelling Human Strategic Behaviour S&LB 5.1 Recap: Best Response and Nash Equilibrium Definition: The set of 's best responses to a strategy profile is i s
CMPUT 654: Modelling Human Strategic Behaviour
S&LB §5.1
Definition: The set of 's best responses to a strategy profile is Definition: A strategy profile is a Nash equilibrium iff
equilibrium
i s−i ∈ S−i s ∈ S si s
BRi(s−i) ≐ {s*
i ∈ S ∣ ui(s* i , s−i) ≥ ui(si, s−i) ∀si ∈ Si}
∀i ∈ N, si ∈ BR−i(s−i)
more than 𝜁
correlated randomizing device
ϵ
actions happen simultaneously
includes temporal structure (i.e., a game tree)
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
There are two kinds of extensive form game:
players (including Nature)
Definition: A finite perfect-information game in extensive form is a tuple where
is a set of players,
is a set of nonterminal choice nodes,
),
is the action function,
is the player function,
is the successor function,
is a profile of utility functions for each player, with .
G = (N, A, H, Z, χ, ρ, σ, u), N n A H Z H χ : H → 2A ρ : H → N σ : H × A → H ∪ Z u = (u1, u2, …, un) ui : Z → ℝ
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
Question: What are the pure strategies in an extensive form game? Definition: Let be a perfect information game in extensive form. Then the pure strategies of player consist of the cross product of actions available to player at each of their choice nodes, i.e.,
Note: A pure strategy associates an action with each choice node, even those that will never be reached.
G = (N, A, H, Z, χ, ρ, σ, u)
i
i ∏
h∈H∣ρ(h)=i
χ(h)
Question: What are the pure strategies for player 2?
Question: What are the pure strategies for player 1?
second choice node even when it can never be reached; e.g., and .
{(C, E), (C, F), (D, E), (D, F)} {(A, G), (A, H), (B, G), (B, H)} (A, G) (A, H)
A B
C D
E F
G H
agent (why?)
A B
C D
E F
G H
C,E C,F D,E D,F A,G 3,8 3,8 8,3 8,3 A,H 3,8 3,8 8,3 8,3 B,G 5,5 2,10 5,5 2,10 B,H 5,5 1,0 5,5 1,0
Question: Which representation is more compact?
existing definitions for:
Question: What is the definition
in an extensive form game?
Theorem: [Zermelo 1913] Every finite perfect-information game in extensive form has at least one pure strategy Nash equilibrium.
randomize, because they already know the best response
cases where an agent has multiple best responses at a single choice node
C,E C,F D,E D,F A,G 3,8 3,8 8,3 8,3 A,H 3,8 3,8 8,3 8,3 B,G 5,5 2,10 5,5 2,10 B,H 5,5 1,0 5,5 1,0
A B
C D
E F
G H
: has payoff 0 for player 2, because player 1 plays , so their best response is to play .
if they got to that choice node?
that is not credible.
rely on non-credible threats.
(BH, CE) F H E H
A B
C D
E F
G H
Definition: The subgame of rooted at is the restriction of G to the descendants of . Definition: The subgames of are the subgames of rooted at for every choice node . Examples:
G h
h
G
G h h ∈ H
C D
E F
G H
G H
A B
C D
E F
G H
Definition: An strategy profile is a subgame perfect equilibrium of iff, for every subgame
, the restriction of to is a Nash equilibrium of .
s G G′ G s G′ G′
C,E C,F D,E D,F A,G 3,8 3,8 8,3 8,3 A,H 3,8 3,8 8,3 8,3 B,G 5,5 2,10 5,5 2,10 B,H 5,5 1,0 5,5 1,0
A B
C D
E F
G H
to compute a subgame perfect equilibrium.
A B
C D
E F
G H
A B
C D
E F
G
G: (2,10)
A B
C D
E F
G
G: (2,10) F ,G: (2,10)
A B
C D
E F
G
G: (2,10) F ,G: (2,10) C: (3,8) A,C,F ,G: (3,8)
A D
A D
A D
A D
A D
Question: What is the unique subgame perfect equilibrium for Centipede?
first opportunity.
not to go down, then going down is no longer their only rational choice...
A D
A D
A D
A D
A D