Extensive Form Games
Game Theory MohammadAmin Fazli
Algorithmic Game Theory 1
Extensive Form Games Game Theory MohammadAmin Fazli Algorithmic - - PowerPoint PPT Presentation
Extensive Form Games Game Theory MohammadAmin Fazli Algorithmic Game Theory 1 TOC Perfect Information Extensive Form Games Backward Induction and MinMax Algorithms Imperfect Information Extensive Form Games The Sequence Form
Game Theory MohammadAmin Fazli
Algorithmic Game Theory 1
MohammadAmin Fazli
Algorithmic Game Theory 2 MohammadAmin Fazli
MohammadAmin Fazli
any notion of sequence, or time, of the actions of the player
the temporal structure explicit.
Algorithmic Game Theory 3
MohammadAmin Fazli
the tuple (N, A, H, Z, χ, ρ, σ, u), where:
chooses an action at h
Algorithmic Game Theory 4
MohammadAmin Fazli
the tuple (N, A, H, Z, χ, ρ, σ, u), where:
new choice node or terminal node such that for all ℎ1, ℎ2 ∈ 𝐼 and 𝑏1, 𝑏2 ∈ 𝐵, if 𝜏 ℎ1, 𝑏1 = 𝜏(ℎ2, 𝑏2) then ℎ1 = ℎ2 and 𝑏1 = 𝑏2
Algorithmic Game Theory 5
MohammadAmin Fazli
game’s formal definition elements?
strategies player does each player has?
Algorithmic Game Theory 6
MohammadAmin Fazli
complete specification of which action to take at each node belonging to that player.
extensive-form game. Then the pure strategies of player i consist of the cross product
ℎ∈𝐼,𝜍 ℎ =𝑗
𝜓(ℎ)
Algorithmic Game Theory 7
MohammadAmin Fazli
Algorithmic Game Theory 8
MohammadAmin Fazli
Algorithmic Game Theory 9
MohammadAmin Fazli
Algorithmic Game Theory 10
MohammadAmin Fazli
node?
Algorithmic Game Theory 11
him from choosing F , and so gets 5
threat
would he really follow through and play H?
MohammadAmin Fazli
restriction of G to the descendents of h.
iff for any subgame G′ of G, the restriction of s to G′ is a Nash equilibrium of G′. Since G is its own subgame, every SPE is a NE.
Algorithmic Game Theory 12
MohammadAmin Fazli
Algorithmic Game Theory 13
MohammadAmin Fazli
Algorithmic Game Theory 14
MohammadAmin Fazli
the algorithm is called the MinMax Algorithm
things up by pruning nodes that will never be reached in play: “alpha-beta pruning”.
Algorithmic Game Theory 15
MohammadAmin Fazli
game with L leaves, the set of subgame-perfect equilibrium payoffs can be computed in time 𝑃(𝑀3)
Algorithmic Game Theory 16
MohammadAmin Fazli
Z, χ, ρ, σ, u, I), where
partition of) {ℎ ∈ 𝐼, 𝜍 ℎ = 𝑗} with the property that χ(h) = χ(h′) and ρ(h) = ρ(h′) whenever there exists a j for which h ∈ 𝐽𝑗,𝑘 and h′∈ 𝐽𝑗,𝑘.
Algorithmic Game Theory 17
MohammadAmin Fazli
information extensive-form game. Then the pure strategies of player i consist of the cartesian product 𝐽𝑗,𝑘∈𝐽𝑗 𝜓(𝐽𝑗,𝑘)
Algorithmic Game Theory 18
MohammadAmin Fazli
Algorithmic Game Theory 19
MohammadAmin Fazli
in imperfect information extensive form games
every time an information set is encountered
Algorithmic Game Theory 20
MohammadAmin Fazli
equivalent
before him, or if he’s the first one.
Algorithmic Game Theory 21
MohammadAmin Fazli
Algorithmic Game Theory 22
MohammadAmin Fazli
game G if for any two nodes h, h′ that are in the same information set for player i, for any path ℎ0, 𝑏0, ℎ1, 𝑏1, … , ℎ𝑛, 𝑏𝑛, ℎ from the root of the game to h (where the hj are decision nodes and the aj are actions) and for any path ℎ0, 𝑏′0, ℎ′1, 𝑏′1, … , ℎ′𝑛, 𝑏′𝑛, ℎ′ from the root to h′ it must be the case that:
′ are in the same equivalence
class
′
Algorithmic Game Theory 23
MohammadAmin Fazli
given agent can be replaced by an equivalent behavioral strategy, and any behavioral strategy can be replaced by an equivalent mixed
induce the same probabilities on outcomes, for any fixed strategy profile (mixed or behavioral) of the remaining agents.
Algorithmic Game Theory 24
MohammadAmin Fazli
sequence form game of perfect recall. The sequence-form representation of G is a tuple(N,Σ,g,C):
probabilities of agent i.
Algorithmic Game Theory 25
MohammadAmin Fazli
node h ∈ H ∪ Z of the game tree, is the ordered set of player i’s actions that lie on the path from the root to h. Let ∅ denote the sequence corresponding to the root node. The set of sequences of player i is denoted Σi, and Σ = Σ1 × · · · × Σn is the set of all sequences.
function given by g(σ) = u(z) if a leaf node z ∈ Z would be reached when each player played his sequence σi ∈ σ, and by g(σ) = 0
Algorithmic Game Theory 26
MohammadAmin Fazli
Algorithmic Game Theory 27
MohammadAmin Fazli
probability βi(h,ai) to taking action ai at a given decision node h.
mapping ri : Σi →[0, 1] defined as ri(σi) = 𝑑∈𝜏𝑗 𝛾𝑗(𝑑). Each value ri(σi) is called a realization probability.
Algorithmic Game Theory 28
MohammadAmin Fazli
must be one single sequence that player i can play to reach all of his nonterminal choice nodes h ∈ I. We denote this mapping as seqi : Ii →Σi, and call seqi(I) the sequence leading to information set I.
extends the sequence σi. We denote by Exti : Σi → 2Σi a function mapping from sequences to sets of sequences, where Exti(σi) denotes the set of sequences that extend the sequence σi.
satisfying the following constraints:
Algorithmic Game Theory 29
MohammadAmin Fazli
representation:
However, if we treated both r1 and r2 as variables then the objective function would no longer be linear.
Algorithmic Game Theory 30
MohammadAmin Fazli
information set I ∈ I1 and one additional variable v0 (corresponding to the first constraint)
set I ∈ Ii in which the final action in σ𝑗 was taken otherwise.
Algorithmic Game Theory 31
MohammadAmin Fazli
Algorithmic Game Theory 32