The Complexity of Admissibility in -Regular Games R. Brenguier - - PowerPoint PPT Presentation

the complexity of admissibility in regular games
SMART_READER_LITE
LIVE PREVIEW

The Complexity of Admissibility in -Regular Games R. Brenguier - - PowerPoint PPT Presentation

The Complexity of Admissibility in -Regular Games R. Brenguier J.-F. Raskin M. Sassolas Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 1 / 23 Multiplayer non-zero-sum games Romain Brenguier (ULB) Admissibility Thursday,


slide-1
SLIDE 1

The Complexity of Admissibility in ω-Regular Games

  • R. Brenguier

J.-F. Raskin

  • M. Sassolas

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 1 / 23

slide-2
SLIDE 2

Multiplayer non-zero-sum games

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 2 / 23

slide-3
SLIDE 3

Models of rationality

Nash equilibria no player has interest in deviating. Regret minimization players prefer moves that would induce less regret had they known the other players strategy. Elimination of dominated strategies players eliminate “bad” strategies

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 3 / 23

slide-4
SLIDE 4

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. v0 v1 v2 :-) :-( a b c e f Should player play a or b?

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 4 / 23

slide-5
SLIDE 5

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. v0 v1 v2 :-) :-( a b c d f

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 4 / 23

slide-6
SLIDE 6

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. v0 v1 v2 :-) :-( a b c d e f

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 4 / 23

slide-7
SLIDE 7

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b a b a b a b a b a b a b

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-8
SLIDE 8

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b a b a b a b a b a b a b

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-9
SLIDE 9

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b a b a b a b a b a b a b

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-10
SLIDE 10

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b b a b a b a b a

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-11
SLIDE 11

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b b a b a b a b a

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-12
SLIDE 12

Iterative elimination of dominated strategies

σ is dominated by σ′ (wrt S): for all strategies of the other players (in S), if σ wins, then σ′ wins. and for some strategy of the other players (in S), σ loses while σ′ wins. 1 2 3 a b , , , , , , , , , , , , , , a b a b a b a b a

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 5 / 23

slide-13
SLIDE 13

Our setting

Turn based games on graphs. Muller objectives: ρ ∈ Wini iff Inf(ρ) ∈ F. Weak Muller objectives: ρ ∈ Wini iff Occ(ρ) ∈ F.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 6 / 23

slide-14
SLIDE 14

Our setting

Turn based games on graphs. Muller objectives: ρ ∈ Wini iff Inf(ρ) ∈ F. Weak Muller objectives: ρ ∈ Wini iff Occ(ρ) ∈ F. Dominance: σ′

i ≻Sn σi if σ′ i dominates σi w.r.t Sn.

Iterative admissibility: S0

i = Si

(all strategies) Sn+1

i

= Sn

i \

  • σi
  • ∃σ′

i ∈ Sn i , σ′ i ≻Sn σi

  • .

Set of iteratively admissible strategies: S∗ =

n∈N Sn

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 6 / 23

slide-15
SLIDE 15

Our setting

Turn based games on graphs. Muller objectives: ρ ∈ Wini iff Inf(ρ) ∈ F. Weak Muller objectives: ρ ∈ Wini iff Occ(ρ) ∈ F. Dominance: σ′

i ≻Sn σi if σ′ i dominates σi w.r.t Sn.

Iterative admissibility: S0

i = Si

(all strategies) Sn+1

i

= Sn

i \

  • σi
  • ∃σ′

i ∈ Sn i , σ′ i ≻Sn σi

  • .

Set of iteratively admissible strategies: S∗ =

n∈N Sn

“Admissibility in Infinite Games” [Berwanger, STACS’07]: S∗ is well defined and is reached after a finite number of iterations.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 6 / 23

slide-16
SLIDE 16

Values [Berwanger, STACS’07]

:-)

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 7 / 23

slide-17
SLIDE 17

Values [Berwanger, STACS’07]

:-) Winning Losing

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 7 / 23

slide-18
SLIDE 18

Values [Berwanger, STACS’07]

:-) Winning Losing

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 7 / 23

slide-19
SLIDE 19

Values [Berwanger, STACS’07]

:-) Winning Surely Losing Potentially Losing

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 7 / 23

slide-20
SLIDE 20

Values [Berwanger, STACS’07]

:-) Winning Surely Losing Potentially Losing Val = 0 Val = −1 Val = 1 no strategy profile σP in Sn such that h · outcome(σP) winning for player i = ⇒ Valn

i (h) = −1;

∃σi ∈ Sn

i such that ∀σ−i ∈ Sn −i, h · outcome(σP) winning for player i

= ⇒ Valn

i (h) = 1;

  • therwise Valn

i (h) = 0;

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 7 / 23

slide-21
SLIDE 21

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 8 / 23

slide-22
SLIDE 22

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 9 / 23

slide-23
SLIDE 23

Simple Safety

Safety objective: avoid Bad states Simple safety: Bad states are absorbing v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 10 / 23

slide-24
SLIDE 24

A local notion of dominance

In simple safety games the rule to never decrease one’s own value is sufficient for admissibility. 0∗1ω 0∗ − 1ω 0ω

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 11 / 23

slide-25
SLIDE 25

Algorithm

n := 0 ; T −1

i

:= ∅ repeat forall the s ∈ V do if there is a winning strategy for player i from s in G \ T n−1 then Valn

i (s) := 1;

else if there is no winning run for player i from s in G \ T n−1 then Valn

i (s) := −1;

else Valn

i (s) := 0;

forall the i ∈ P do T n

i := T n−1 i

∪ {(s, s′) ∈ E | s ∈ Vi ∧ Valn

i (s) > Valn i (s′)}

n := n + 1 until ∀i ∈ P. T n

i = T n−1 i

;

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 12 / 23

slide-26
SLIDE 26

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-27
SLIDE 27

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad Which states have value −1 for player ? Which states have value 1 for player ?

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-28
SLIDE 28

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad Val, Val 0, 0 0, 0 −1, 0 −1, 0 −1, 0 −1, 0 −1, −1 −1, 0

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-29
SLIDE 29

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad Val, Val 0, 0 0, 0 −1, 0 −1, 0 −1, 0 −1, 0 −1, −1 −1, 0 On which transition would a player decrease its own value?

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-30
SLIDE 30

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad Val, Val 0, 0 0, 0 −1, 0 −1, 0 −1, 0 −1, 0 −1, −1 −1, 0

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-31
SLIDE 31

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-32
SLIDE 32

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-33
SLIDE 33

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad −1, 0 −1, 0 −1, 0 −1, −1 −1, 0 0, 1 −1, 0 0, 1

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-34
SLIDE 34

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad −1, 0 −1, 0 −1, 0 −1, −1 −1, 0 0, 1 −1, 0 0, 1

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-35
SLIDE 35

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad automaton recognizing the outcomes of admissible strategies

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-36
SLIDE 36

The algorithm in action

v4 v3 v2 v6 v7 v1 v0 v5 Bad Bad automaton recognizing the outcomes of admissible strategies strategies playing transitions of the automaton are admissible

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 13 / 23

slide-37
SLIDE 37

Application to Safety: Unfolding the graph

v0 v1 Bad1 Bad2 v2

  • v′

v′

1

Bad1 Bad2 v′

2

v1 v0 v2 Nobody lost Player 1 already lost Player 1 and 2 already lost

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 14 / 23

slide-38
SLIDE 38

Complexity

The algorithm is exponential because of the unfolding. The structure allows recursive computation in PSPACE. Hardness: encoding of QSAT.

Theorem

The winning coalition problem is PSPACE-complete for safety.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 15 / 23

slide-39
SLIDE 39

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 16 / 23

slide-40
SLIDE 40

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 17 / 23

slide-41
SLIDE 41

In general: the local condition is not sufficient

s1 s2 :-) wait request grant deny

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 18 / 23

slide-42
SLIDE 42

In general: the local condition is not sufficient

s1 s2 s2 :-) wait request grant deny In case the value is 0, need to allow other players to help. “Help!”-state for i: a state where j = i has several choices with value ≥ 0 for i, while not changing the value for j.

  • utcomes of admissible strategies visit infinitely often those states.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 18 / 23

slide-43
SLIDE 43

Value computation

Outcomes of admissible strategies are:

winning outcomes if the value is 1; all outcomes if the value is −1;

  • utcomes that either win or visit“Help!”-states infinitely often.

֒ → + local conditions automaton recognizing Out(Sn)

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 19 / 23

slide-44
SLIDE 44

Value computation

Outcomes of admissible strategies are:

winning outcomes if the value is 1; all outcomes if the value is −1;

  • utcomes that either win or visit“Help!”-states infinitely often.

֒ → + local conditions automaton recognizing Out(Sn)

Computing the values: Value 1: ∃σi ∈ Sn

i . Out(Sn) ⇒ Wini

two-player circuit game (PSPACE-complete [Hunter, 2007]).

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 19 / 23

slide-45
SLIDE 45

Value computation

Outcomes of admissible strategies are:

winning outcomes if the value is 1; all outcomes if the value is −1;

  • utcomes that either win or visit“Help!”-states infinitely often.

֒ → + local conditions automaton recognizing Out(Sn)

Computing the values: Value 1: ∃σi ∈ Sn

i . Out(Sn) ⇒ Wini

two-player circuit game (PSPACE-complete [Hunter, 2007]).

Value −1: Out(Sn) ∩ Wini = ∅.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 19 / 23

slide-46
SLIDE 46

Complexity for Muller Objectives

Solving the winning coalition problem is an emptiness check.

Theorem

The winning coalition problem PSPACE-complete for Muller objectives.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 20 / 23

slide-47
SLIDE 47

Complexity for Muller Objectives

Solving the winning coalition problem is an emptiness check.

Theorem

The winning coalition problem PSPACE-complete for Muller objectives. For B¨ uchi: value 1 is computed by solving a parity game.

Theorem

The winning coalition problem with B¨ uchi objectives is in NP ∩ coNP. The results on circuits can be adapted for weak Muller

Theorem

The winning coalition problem for weak Muller is PSPACE-complete.

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 20 / 23

slide-48
SLIDE 48

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 21 / 23

slide-49
SLIDE 49

Outline

1

Introduction

2

Setting

3

Simple Safety

4

Muller objectives

5

Conclusion

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 22 / 23

slide-50
SLIDE 50

Summary

Automata representing all outcomes of admissible strategies Polynomial space algorithms for Muller and Weak-Muller

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 23 / 23

slide-51
SLIDE 51

Summary

Automata representing all outcomes of admissible strategies Polynomial space algorithms for Muller and Weak-Muller

Future work

Imperfect information Quantitative objectives

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 23 / 23

slide-52
SLIDE 52

Summary

Automata representing all outcomes of admissible strategies Polynomial space algorithms for Muller and Weak-Muller

Future work

Imperfect information Quantitative objectives

Thank you

Romain Brenguier (ULB) Admissibility Thursday, July 17, 2014 23 / 23