Game Theory for Sequential Interactions
CMPUT 366: Intelligent Systems
S&LB §5.0-5.2.2
Game Theory for Sequential Interactions CMPUT 366: Intelligent - - PowerPoint PPT Presentation
Game Theory for Sequential Interactions CMPUT 366: Intelligent Systems S&LB 5.0-5.2.2 Lecture Outline 1. Recap 2. Perfect Information Games 3. Backward Induction 4. Imperfect Information Games Recap: Game Theory Game
CMPUT 366: Intelligent Systems
S&LB §5.0-5.2.2
behaviour
making some other agent worse off
choice of the other agents' strategies
Ballet Soccer Ballet 2, 1 0, 0 Soccer 0, 0 1, 2
actions happen simultaneously
includes temporal structure (i.e., a game tree)
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
All Half None
There are two kinds of extensive form game:
Definition: A finite perfect-information game in extensive form is a tuple where
G = (N, A, H, Z, χ, ρ, σ, u), χ : H → 2A ρ : H → N σ : H × A → H ∪ Z
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
All Half None
rejects
2–0 1–1 0–2
no yes
no yes
no yes
Figure 5.1: The Sharing game.
All Half None
Question: What are the pure strategies in an extensive form game? Definition: Let be a perfect information game in extensive form. Then the pure strategies of player i consist of the cross product of actions available to player i at each of their choice nodes, i.e.,
even those that will never be reached
G = (N, A, H, Z, χ, ρ, σ, u) ∏
h∈H∣ρ(h)=i
χ(h)
Question: What are the pure strategies for player 2?
Question: What are the pure strategies for player 1?
second choice node even when it can never be reached
A B
C D
E F
G H
A B
C D
E F
G H
C,E C,F D,E D,F A,G 3,8 3,8 8,3 8,3 A,H 3,8 3,8 8,3 8,3 B,G 5,5 2,10 5,5 2,10 B,H 5,5 1,0 5,5 1,0
Question: Which representation is more compact?
(B,G) and (C,F) → (2,10)
existing definitions for:
Question: What is the definition
in an extensive form game?
Theorem: [Zermelo, 1913] Every finite perfect-information game in extensive form has at least one pure strategy Nash equilibrium.
randomize, because they already know the best response
cases where an agent has multiple best responses at a single choice node
strategy equilibrium in a perfect-information extensive-form game.
BACKWARDINDUCTION(h): if h is terminal: return u(h) i := 𝜍(h) U := -∞ for each h' in 𝜓(h): V = BACKWARDINDUCTION(h') if Vi > Ui: Ui := Vi return U
A D
A D
A D
A D
A D
not to go down, then going down is no longer their only rational choice...
A D
A D
A D
A D
A D
by all players
constant utility and known mixed strategy
actions are hidden, sometimes both
sequential actions, some of which may be hidden
Definition: An imperfect information game in extensive form is a tuple where
and
(i.e., partition of) with the property that and whenever there exists a j for which
G = (N, A, H, Z, χ, ρ, σ, u, I), (N, A, H, Z, χ, ρ, σ, u) I = (I1, …, In), where Ii = (Ii,1, …, Ii,ki) {h ∈ H : ρ(h) = i} χ(h) = χ(h′) ρ(h) = ρ(h′) h ∈ Ii,j and h′ ∈ Ii,j .
L R
A B
ℓ
r
ℓ
r
Question: What are the pure strategies in an imperfect information game? Definition: Let be an imperfect information game in extensive form. Then the pure strategies of player i consist of the cross product of actions available to player i at each of their information sets, i.e.,
even those that will never be reached
G = (N, A, H, Z, χ, ρ, σ, u, I) ∏
Ii,j∈Ii
χ(h)
Questions: In an imperfect information game:
mixed strategies?
best response?
Nash equilibrium?
A B L,ℓ 0,0 2,4 L,r 2,4 0,0 R,ℓ 1,1 1,1 R,r 1,1 1,1
L R
A B
ℓ
r
ℓ
r
Question: Can you represent an arbitrary perfect information extensive form game as an imperfect information game?
information extensive form game
between histories in the same information set