Admissibility in Infinite Games Dietmar Berwanger EPFL, Lausanne - - PowerPoint PPT Presentation

admissibility in infinite games
SMART_READER_LITE
LIVE PREVIEW

Admissibility in Infinite Games Dietmar Berwanger EPFL, Lausanne - - PowerPoint PPT Presentation

Admissibility in Infinite Games Dietmar Berwanger EPFL, Lausanne STACS , Aachen Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS07 1 / 18 Strategic Interaction Models of reactive systems: zero-sum two-player


slide-1
SLIDE 1

Admissibility in Infinite Games

Dietmar Berwanger

EPFL, Lausanne

STACS , Aachen

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 1 / 18

slide-2
SLIDE 2

Strategic Interaction

Models of reactive systems: zero-sum two-player games: strict competition

▸ optimal behaviour – guarantee a win; ▸ determinacy neutralises rationality.

non-zero sum, or n-player games: potential for cooperation

▸ various solution concepts; ▸ how to reason about how other players reason. Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 2 / 18

slide-3
SLIDE 3

Strategic Interaction

Models of reactive systems: zero-sum two-player games: strict competition

▸ optimal behaviour – guarantee a win; ▸ determinacy neutralises rationality.

non-zero sum, or n-player games: potential for cooperation

▸ various solution concepts; ▸ how to reason about how other players reason.

Understanding of rationality needed – borrow from Game Teory? Yes, but with care.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 2 / 18

slide-4
SLIDE 4

Computational agents are special players

non-terminating behaviour

▸ open or infinite horizon

extensive structure

▸ state transition graphs ▸ inherently sequential

preplay commitment no steady state, no extraneous correlation particular payoff structures

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 3 / 18

slide-5
SLIDE 5

Program

Identify criteria for interactively optimal behaviour for computational agents. adapt deductive solution concepts from noncooperative game theory, preserve automata-theoretic and logical foundations.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 4 / 18

slide-6
SLIDE 6

Program

Identify criteria for interactively optimal behaviour for computational agents. adapt deductive solution concepts from noncooperative game theory, preserve automata-theoretic and logical foundations. Goals: Describe what to expect from the interaction of rational agents. Prescribe individually rational behaviour. Design mechanisms that promote favourable evolution.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 4 / 18

slide-7
SLIDE 7

Te Minimax principle

Idea: play strategies that guarantee your security payoff:

max

s∈S min s−∈S− .

Problems: ignores that players are rational and aware of each other. ▸ inefficient and unstable solutions.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 5 / 18

slide-8
SLIDE 8

Equilibrium

Nash Equilibrium: a self-enforcing profile of strategies: Each player, given the strategies of the others, should not have an alternate strategy that he striclty prefers. Problems: Stable state assumption:

▸ the strategies of the others are not given

Multiplicity

▸ coordination failure

Dynamic inconsistency (in extensive games):

▸ Non-credible threats Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 6 / 18

slide-9
SLIDE 9

Rationality Obliges

Strategy s dominates r, if

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Reasoning...

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-10
SLIDE 10

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Player Row cannot decide between T and B.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-11
SLIDE 11

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Player Column cannot decide between L and R.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-12
SLIDE 12

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Player Matrix finds X to be better than Y.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-13
SLIDE 13

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ All players follow his reasoning, and discard the matrix Y.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-14
SLIDE 14

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Player Row now finds T to be than B.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-15
SLIDE 15

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Row discards the row B and Column follows his reasoning.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-16
SLIDE 16

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Player Column now finds R to be better than L

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-17
SLIDE 17

Rationality Obliges

Strategy s dominates r, if – given the player’s belief

  • s is not worse than r, against any counter-strategy, and
  • s is strictly better, against some counter-strategy.

Admissibility criterion: avoid dominated strategies Assume common reasoning about admissibility.

▸ iterated elimination ↝ fixed point

L R T

+ − + + + −

B

+ + + − − +

X L R T

+ − + − + −

B

+ + + + − −

Y

▸ Te predicted outcome is (T, R, X).

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 7 / 18

slide-18
SLIDE 18

Infinite Sequential Games

▸ n - players interact to form an infinite path in a graph each player has preferences over outcoming path – may be conciliated Strategies: ω-trees si (colouring the graph unravelling) – space Si uncountable ▸ perfect information, win-or-lose objectives Particular case: regular games over finite graphs: (G, (Wi)i<n) G game graph (V = ˙ ⋃i<nVi, E), Wi∈ V ω regular set of paths through G Canonical case: parity games.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 8 / 18

slide-19
SLIDE 19

Weak Dominance – Admissibility

Fix strategy subspace Q ⊆ S – frame of reference. A strategy s ∈ S dominates r ∈ S on Q, if u(s, t) ≥ u(r, t) for all t ∈ Q, and u(s, t) > u(r, t) for some t ∈ Q. s ∈ S is admissible w.r.t. Q, if no s′ ∈ Q dominates s on Q.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 9 / 18

slide-20
SLIDE 20

Iterated Admissibility

Incorporate admissibility as principle of rationality. Assume common knowledge of rationality – stages: Qi

 ∶= Si

Qi

α+ ∶= {s ∈ Qi α ∶ s admissible w.r.t. Qα}

Qi

λ ∶= ⋂α<λ Qi α

▸ Reach deflationary fixed point Q∞ when Qα = Qα+. Solution concept: Qi

∞ – iteratively admissible strategies

(Strategies s ∈ Qi

α are α-admissible.)

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 10 / 18

slide-21
SLIDE 21

Issues

Procedural soundness

▸ non-emptiness Qα ≠ ∅; ▸ progress guarantee.

Metatheory

▸ representation independence; ▸ stability under irrational deviations; ▸ concept justifiable by player introspection.

Computational aspects

▸ automatic recognition and synthesis of IAS; ▸ restriction to manageable strategies. Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 11 / 18

slide-22
SLIDE 22

Answers

In the general case

One-step soundness: Each strategy eliminated in stage α is dominated by an α + -admissible strategy. Non-stagnation: Every sequence of n stages solves a new subgame.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 12 / 18

slide-23
SLIDE 23

Answers

In the general case

One-step soundness: Each strategy eliminated in stage α is dominated by an α + -admissible strategy. Non-stagnation: Every sequence of n stages solves a new subgame.

Regular games over finite graphs

Foundedness: Games stabilise in a finite number of stages. Finite-state compatibility: Te set of IA strategies is regular.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 12 / 18

slide-24
SLIDE 24

Key: Value Characterisation, Truncation

A strategy subspace Q ⊆ S associates values to rooted subgames: valuei(Q, p) = ⎧ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎩ + player i has a winning strategy in (Γ, Q)↾p − player i loses all plays in (Γ, Q)↾p 

  • therwise

Characterisation s is admissible w.r.t. Q ⇐ ⇒ valuei(Q, p) = valuei({s}, p). Truncation Substitution of equivalent subgames preserves α-admissibility.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 13 / 18

slide-25
SLIDE 25

One-step Soundness

Teorem Each strategy eliminated in stage α is dominated by an (α + )-admissible strategy. Proof: Set out with r ∈ Qi

α ∖ Qi α+.

Construct ascending domination chain r = s, s, . . . if sℓ does not attain value of Q at a position of depth ℓ, shif to some strategy from Qi that does Te pointwise limit σ of this sequence is fine:

▸ σ ∈ Q

α,

▸ σ is (α + )-admissible, and ▸ σ dominates r on Q

α.

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 14 / 18

slide-26
SLIDE 26

Non-Stagnation

Teorem Every sequence of n stages solves a new subgame. When is a subgame Γ↾p solved? For player i: at stage α, such that for every β ≥ α: p reachable in Qi

β

⇒ Qi

β↾p= Qi α↾p .

Globally: maximum over all these α. Examples:

  • p is found to be winning/losing/unreachable for player i
  • strategies in Qα↾p are self-supporting

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 15 / 18

slide-27
SLIDE 27

Non-Stagnation – Proof idea

Two-player case: some position is solved for player : Assume value(Qβ, ⋅) remains unchanged for α ≤ β ≤ α + . Ten eliminated strategies are either (A) non-winning from positions of value(Q, p) = +

▸ can occur only at stage α.

(B) non-pending from positions of value(Q, p) = 

▸ triggered at stage α +  by eliminations of type (A) in stage α, and ▸ in stage α +  due to eliminations of type (B) in stage α +  ▸ and never more.

n-players case: recurse into solved subgames (involve one player less).

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 16 / 18

slide-28
SLIDE 28

Finite-state compatibility

Teorem In games on finite graphs with regular winning conditions, the sets of IA strategies are regular. Proof: Reduction to multi-parity games For regular Qi, the set of admissible strategies is regular. Iteration terminates afer afer n∣G∣ stages:

▸ parity games are prefix independent; ▸ for bisimilar positions p, p′ (reachable!) Qα↾p= Qα↾p′; ▸ if p closes, then all bisimilar ancestors close; ▸ there are at most ∣G∣ bisimulation classes. Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 17 / 18

slide-29
SLIDE 29

Conclusion & Outlook

Admissibility generalises to infinite games of perfect information. Te procedure is sound and robust. Work in progress foundation of backwards induction for perfect information games; forward inductive solution for infinite signalling games; Future tractable automata construction; synthesisable strategies;

Dietmar Berwanger (EPFL) Admissibility in Infinite Games STACS’07 18 / 18