Open problems in repeated games with finite automata Abraham Neyman - - PowerPoint PPT Presentation

open problems in repeated games with finite automata
SMART_READER_LITE
LIVE PREVIEW

Open problems in repeated games with finite automata Abraham Neyman - - PowerPoint PPT Presentation

Open problems in repeated games with finite automata Abraham Neyman Jerusalem, May 23, 2011 subject, date p. 1/13 Two-person zero-sum games subject, date p. 2/13 Two-person zero-sum games Quantify the advantage of larger automata in


slide-1
SLIDE 1

Open problems in repeated games with finite automata

Abraham Neyman

Jerusalem, May 23, 2011

subject, date – p. 1/13

slide-2
SLIDE 2

Two-person zero-sum games

subject, date – p. 2/13

slide-3
SLIDE 3

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

subject, date – p. 2/13

slide-4
SLIDE 4

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

subject, date – p. 2/13

slide-5
SLIDE 5

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

a stage game G is repeated T ≤ ∞ times, player i is confined to play a strategy that is implementable by a deterministic automaton of size

ki ≤ ∞,

the repeated game payoff is the average per-stage payoff.

subject, date – p. 2/13

slide-6
SLIDE 6

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

a stage game G is repeated T ≤ ∞ times, player i is confined to play a strategy that is implementable by a deterministic automaton of size

ki ≤ ∞,

the repeated game payoff is the average per-stage payoff. Explicitly, what is the asymptotic behavior of VAL GT (k1, k2) as T, k1, k2 → ∞ .

subject, date – p. 2/13

slide-7
SLIDE 7

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

a stage game G is repeated T ≤ ∞ times, player i is confined to play a strategy that is implementable by a deterministic automaton of size

ki ≤ ∞,

the repeated game payoff is the average per-stage payoff. Explicitly, what is the asymptotic behavior of VAL GT (k1, k2) as T, k1, k2 → ∞ .

subject, date – p. 2/13

slide-8
SLIDE 8

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

a stage game G is repeated T ≤ ∞ times, player i is confined to play a strategy that is implementable by a deterministic automaton of size

ki ≤ ∞,

the repeated game payoff is the average per-stage payoff. Explicitly, what is the asymptotic behavior of VAL GT (k1, k2) as T, k1, k2 → ∞ .

subject, date – p. 2/13

slide-9
SLIDE 9

Two-person zero-sum games

Quantify the advantage of larger automata in repeated games.

GT (k1, k2) is the repeated game where

a stage game G is repeated T ≤ ∞ times, player i is confined to play a strategy that is implementable by a deterministic automaton of size

ki ≤ ∞,

the repeated game payoff is the average per-stage payoff. Explicitly, what is the asymptotic behavior of VAL GT (k1, k2) as T, k1, k2 → ∞ .

subject, date – p. 2/13

slide-10
SLIDE 10

Non-zero-sum games

subject, date – p. 3/13

slide-11
SLIDE 11

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata

subject, date – p. 3/13

slide-12
SLIDE 12

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-13
SLIDE 13

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-14
SLIDE 14

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-15
SLIDE 15

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-16
SLIDE 16

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-17
SLIDE 17

Non-zero-sum games

Quantify the equilibrium payoff of non-zero-sum repeated games with large automata Explicitly, what is the asymptotic behavior of the equilibrium payoffs of GT (k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 3/13

slide-18
SLIDE 18

Finite automata strategies

subject, date – p. 4/13

slide-19
SLIDE 19

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game

subject, date – p. 4/13

slide-20
SLIDE 20

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players

subject, date – p. 4/13

slide-21
SLIDE 21

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i

subject, date – p. 4/13

slide-22
SLIDE 22

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i

subject, date – p. 4/13

slide-23
SLIDE 23

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i

subject, date – p. 4/13

slide-24
SLIDE 24

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state

subject, date – p. 4/13

slide-25
SLIDE 25

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M

subject, date – p. 4/13

slide-26
SLIDE 26

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M τ : S × A → S – the transition function of M

subject, date – p. 4/13

slide-27
SLIDE 27

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M τ : S × A → S – the transition function of M σ = σ(M) – the strategy defined by the automaton

subject, date – p. 4/13

slide-28
SLIDE 28

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M τ : S × A → S – the transition function of M σ = σ(M) – the strategy defined by the automaton σ1 = f(s1), σ(a1, a2, . . . , ak) = f(sk) where

subject, date – p. 4/13

slide-29
SLIDE 29

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M τ : S × A → S – the transition function of M σ = σ(M) – the strategy defined by the automaton σ1 = f(s1), σ(a1, a2, . . . , ak) = f(sk) where sk = τ(sk−1, ak−1) the state of the automaton before

play at stage k

subject, date – p. 4/13

slide-30
SLIDE 30

Finite automata strategies

G = N, A = ×i∈NAi, g = (gi)i∈N – the stage game N – set of players Ai – stage actions of player i gi : A → R – payoff function of player i M = S, s1, α, τ – an automata of player i S – states of the automaton, s0 ∈ S – the initial state α : S → Ai – action function of the automaton M τ : S × A → S – the transition function of M σ = σ(M) – the strategy defined by the automaton σ1 = f(s1), σ(a1, a2, . . . , ak) = f(sk) where sk = τ(sk−1, ak−1) the state of the automaton before

play at stage k

subject, date – p. 4/13

slide-31
SLIDE 31

Open Problem - I

subject, date – p. 5/13

slide-32
SLIDE 32

Open Problem - I

I.1 Do the values of G(k, nk) := G∞(k, nk) converge as

nk ≥ k → ∞ and limk→∞

log nk k

= x?

subject, date – p. 5/13

slide-33
SLIDE 33

Open Problem - I

I.1 Do the values of G(k, nk) := G∞(k, nk) converge as

nk ≥ k → ∞ and limk→∞

log nk k

= x?

I.2 For those values of x for which the limit exists, what is the limit as a function of the stage game G and x?

subject, date – p. 5/13

slide-34
SLIDE 34

Open Problem - I

I.1 Do the values of G(k, nk) := G∞(k, nk) converge as

nk ≥ k → ∞ and limk→∞

log nk k

= x?

I.2 For those values of x for which the limit exists, what is the limit as a function of the stage game G and x? Exact asymptotic – x = 0: Ben-Porath 93,

x = log min(|A1|, |A2|): N-Spencer 10, x > log min(|A1|, |A2|): N 97

subject, date – p. 5/13

slide-35
SLIDE 35

Open Problem - I

I.1 Do the values of G(k, nk) := G∞(k, nk) converge as

nk ≥ k → ∞ and limk→∞

log nk k

= x?

I.2 For those values of x for which the limit exists, what is the limit as a function of the stage game G and x? Exact asymptotic – x = 0: Ben-Porath 93,

x = log min(|A1|, |A2|): N-Spencer 10, x > log min(|A1|, |A2|): N 97

A lower bound – 0 < x < log min(|A1|, |A2|): N 08

subject, date – p. 5/13

slide-36
SLIDE 36

Example

The stage game

L R T 2 B 2

subject, date – p. 6/13

slide-37
SLIDE 37

Information Theoretic Tools

Example: Ω = {0, 1}, p(0) = p.

subject, date – p. 7/13

slide-38
SLIDE 38

Information Theoretic Tools

Example: Ω = {0, 1}, p(0) = p. H(X) = −p log2 p − (1 − p) log2(1 − p).

subject, date – p. 7/13

slide-39
SLIDE 39

Information Theoretic Tools

Example: Ω = {0, 1}, p(0) = p. H(X) = −p log2 p − (1 − p) log2(1 − p).

0.25 0.5 0.75 1.0 p 0.2 0.4 0.6 0.8 1.0

HX

subject, date – p. 7/13

slide-40
SLIDE 40

A lower bound

subject, date – p. 8/13

slide-41
SLIDE 41

A lower bound

subject, date – p. 8/13

slide-42
SLIDE 42

A lower bound

0.2 0.4 0.6 0.8 1 1.2 1.4 x: log n

  • k

0.2 0.4 0.6 0.8 1 Vx 2H11x

subject, date – p. 8/13

slide-43
SLIDE 43

A lower bound

0.2 0.4 0.6 0.8 1 1.2 1.4 x: log n

  • k

0.2 0.4 0.6 0.8 1 Vx 2H11x

x = 0: Ben-Porath x = 1: 93 N-Spencer 10 x > 1: N 97

subject, date – p. 8/13

slide-44
SLIDE 44

A lower bound

0.2 0.4 0.6 0.8 1 1.2 1.4 x: log n

  • k

0.2 0.4 0.6 0.8 1 Vx 2H11x

x = 0: Ben-Porath x = 1: 93 N-Spencer 10 x > 1: N 97 0 < x < 1 : N 08

subject, date – p. 8/13

slide-45
SLIDE 45

Open Problem - II

subject, date – p. 9/13

slide-46
SLIDE 46

Open Problem - II

Do the values of GTk(k, ∞) converge as k → ∞ and

limk→∞

Tk k log = x?

subject, date – p. 9/13

slide-47
SLIDE 47

Open Problem - II

Do the values of GTk(k, ∞) converge as k → ∞ and

limk→∞

Tk k log = x?

For those values of x for which the limit exists, what is the limit as a function of the stage game G and x?

subject, date – p. 9/13

slide-48
SLIDE 48

Open Problem - II

Do the values of GTk(k, ∞) converge as k → ∞ and

limk→∞

Tk k log = x?

For those values of x for which the limit exists, what is the limit as a function of the stage game G and x? Exact asymptotic (N-Okada 99),

x = ∞ = ⇒

VAL GTk(k, ∞) → v∗

subject, date – p. 9/13

slide-49
SLIDE 49

Open Problem - II

Do the values of GTk(k, ∞) converge as k → ∞ and

limk→∞

Tk k log = x?

For those values of x for which the limit exists, what is the limit as a function of the stage game G and x? Exact asymptotic (N-Okada 99),

x = ∞ = ⇒

VAL GTk(k, ∞) → v∗ A conjecture (N 97)

x = 0 = ⇒

VAL GTk(k, ∞) → v

subject, date – p. 9/13

slide-50
SLIDE 50

Open Problem - non-zero-sum

subject, date – p. 10/13

slide-51
SLIDE 51

Open Problem - non-zero-sum

What is the limit behavior of the equilibrium payoffs of

GT(k1, k2, . . . , kn) as T, ki → ∞

subject, date – p. 10/13

slide-52
SLIDE 52

Open Problem - non-zero-sum

What is the limit behavior of the equilibrium payoffs of

GT(k1, k2, . . . , kn) as T, ki → ∞

Known for two-player games

subject, date – p. 10/13

slide-53
SLIDE 53

Open Problem - non-zero-sum

What is the limit behavior of the equilibrium payoffs of

GT(k1, k2, . . . , kn) as T, ki → ∞

Known for two-player games For T = ∞, the essential issue is the individual rational level.

subject, date – p. 10/13

slide-54
SLIDE 54

Open Problem - non-zero-sum

What is the limit behavior of the equilibrium payoffs of

GT(k1, k2, . . . , kn) as T, ki → ∞

Known for two-player games For T = ∞, the essential issue is the individual rational level. For T < ∞ additional issues.

subject, date – p. 10/13

slide-55
SLIDE 55

Example 2

The stage game

L R T −1, 1, 1 0, 0, 0 B 0, 0, 0 0, 0, 0 L R 0, 0, 0 0, 0, 0 0, 0, 0 −1, 1, 1

subject, date – p. 11/13

slide-56
SLIDE 56

Example 2

The stage game

L R T 1, 1, 1 2, 0, 2 B 0, 2, 2 1, 1, 3 L R 2, 2, 0 3, 1, 1 1, 3, 1 2, 2, 2

subject, date – p. 12/13

slide-57
SLIDE 57

Variations

Exact asymptotics VAL GT(k[obl], n) N 08, Peretz 10 VAL GT(k[rec], n[rec]) Peretz 10

subject, date – p. 13/13