Playing Games with Counter Automata Antonn Ku cera Bordeaux, - - PowerPoint PPT Presentation

playing games with counter automata
SMART_READER_LITE
LIVE PREVIEW

Playing Games with Counter Automata Antonn Ku cera Bordeaux, - - PowerPoint PPT Presentation

Playing Games with Counter Automata Antonn Ku cera Bordeaux, September 2012 Antonn Ku cera (FI MU Brno) Counter Games RP 2012 1 / 25 Outline Multicounter games Game objectives, determinacy, optimal strategies Non-stochastic


slide-1
SLIDE 1

Playing Games with Counter Automata

Antonín Kuˇ cera

Bordeaux, September 2012

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 1 / 25

slide-2
SLIDE 2

Outline

Multicounter games Game objectives, determinacy, optimal strategies Non-stochastic multicounter games eVASS games Consumption games Multiweighted energy games Stochastic one-counter games Solvency games One counter MDPs and games Open problems

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 2 / 25

slide-3
SLIDE 3

Multi-counter stochastic games

p q r s t u v w (−5, 0, ω) (0, ω, 16) (2, 2, −3) (−8, ω, 0) (1, ω, 0)

1 3

(−3, −1, ω)

2 3

(1, −3, 2) (2, −2, ω) (0, 0, 0)

1 2

(4, 1, 0)

1 2

(0, −7, 0) (2, ω, ω) (1, −1, ω)

1 4

(1, −1, ω), 3

4

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 3 / 25

slide-4
SLIDE 4

The semantics of multi-counter stochastic games

Each MCSG determines an infinite game graph which can be infinitely-branching. Vertices are configurations of the form p v. Z-semantics: v ∈ Zk, non-blocking transitions. N-semantics: v ∈ Nk, blocking transitions (Petri net style).

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 4 / 25

slide-5
SLIDE 5

An example

p q r ω −1, 1

2

0, 1

2

0, 1

p(1) q(0) r(0) q(1) r(1) q(2) r(2) q(3) r(3) q(4) r(4) q(i) r(i)

1 2 1 2

1

1 2 1 2

1

1 2 1 2

1

1 2 1 2

1

1 2 1 2

1 1 1

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 5 / 25

slide-6
SLIDE 6

Winning objectives in stochastic games (1)

We deal with infinite-state and possibly infinitely-branching stochastic games. Some of very basic properties of finite-state games do not carry over to infinite-state games. In particular, this applies to the existence and type of optimal/winning strategies, even for simple objectives such as reachability or safety.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 6 / 25

slide-7
SLIDE 7

Winning objectives in stochastic games (2)

Let f be a function assigning a real-valued payoff to every run if a given game graph G. The aim of Player / is to maximize/minimize E[f]. If f is Borel and bounded, then every vertex v has a value val(v) given by sup

σ∈HR

inf

π∈HR

E[f] = inf

π∈HR

sup

σ∈HR

E[f] This holds also for infinitely-branching games. The existence of val(v) induces the notions of optimal and ε-optimal strategy.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 7 / 25

slide-8
SLIDE 8

Reachability objective in stochastic games

The aim of Player / is to maximize/minimize the probability of visiting a target vertex. Let Reach be a function which to every run assigns either 1 or 0 depending

  • n whether or not the run visits a target vertex.

In finite-state stochastic games, both players have optimal MD strategies. Hence, finite-state reachability games are determined for every strategy class C that subsumes MD strategies: sup

σ∈C

inf

π∈C

E[Reach] = inf

π∈C

sup

σ∈C

E[Reach]

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 8 / 25

slide-9
SLIDE 9

Reachability in infinite-state stochastic games

In infinitely-branching games: Optimal strategies do not necessarily exist. Even if they do exist, they may require infinite memory. The games are not determined for finite-memory strategies. In finitely-branching games: An optimal strategy for Player (Max) does not necessarily exist. Player (Min) has an optimal MD strategy. The games are determined for any strategy type that subsumes MD strategies, and the value is the same. Similar results hold also for the (unbounded!) total accumulated reward payoff function (Brázdil, K, Novotný; MEMICS 2012).

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 9 / 25

slide-10
SLIDE 10

Counterexamples (1)

Player (Max) does not necessarily have an optimal strategy, even in finitely-branching MDPs.

v 1

1 2 1 2

1

1 2 1 2

1

1 2 1 2

1

1 2 1 2

1

1 2

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 10 / 25

slide-11
SLIDE 11

Counterexamples (2)

An optimal strategy for Player (Max) may require infinite memory, even in finitely-branching games.

ˆ v d1 e1 s1

1 2 1 2

d2 e2 s2

1 2 1 2

d3 e3 s3

1 2 1 2

d4 e4 s4

1 2 1 2

d5 e5 s5

1 2 1 2

v

1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 11 / 25

slide-12
SLIDE 12

Counterexamples (3)

Optimal minimizing strategies do not necessarily exist, and (ε-) optimal minimizing strategies may require infinite memory.

v s1 s2 s3 si 1

1 2 1 2 1 4 3 4 1 8 7 8 1 2i

1 − 1

2i

r

1 2 1 2

1

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 12 / 25

slide-13
SLIDE 13

Counterexamples (4)

Infinitely-branching games are not determined for finite-memory strategies.

v s u t p

supσ∈MD infπ∈MD E[Reach] = 0 infπ∈MD supσ∈MD E[Reach] = 1

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 13 / 25

slide-14
SLIDE 14

Existing results about multi-counter games

Only for restricted models: non-stochastic games with multiple counters; stochastic games with only one counter. Objectives: zero-safety (selective) zero-reachability expected zero-termination time

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 14 / 25

slide-15
SLIDE 15

Non-stochastic eVASS games

Let G be an eVASS game, i.e., a non-stochastic multi-counter game with k counters where update vectors belong to {0, 1, −1, ω}k. We consider N-semantics. The aim of Player is to avoid visiting configurations with zero in some

  • counter. Player aims at the opposite.

Let safe be the set of all configurations where Player has a winning strategy.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 15 / 25

slide-16
SLIDE 16

Non-stochastic eVASS games (2)

Observation 1

The set safe is upwards-closed w.r.t. component-wise ordering. Hence, it is fully characterized by its finitely many minimal elements. Consequently, the membership to safe is semi-decidable.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 16 / 25

slide-17
SLIDE 17

Non-stochastic eVASS games (2)

Observation 1

The set safe is upwards-closed w.r.t. component-wise ordering. Hence, it is fully characterized by its finitely many minimal elements. Consequently, the membership to safe is semi-decidable.

Observation 2

Suppose that the update vectors of G do not contain any ω’s. Then the game graph of G is finitely-branching, and a winning strategy for Player can be encoded by a finite tree. Consequently, the non-membership to safe is semi-decidable.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 16 / 25

slide-18
SLIDE 18

Non-stochastic eVASS games (2)

Theorem 3 (Brázdil, Janˇ car, K.; Icalp 2010)

The set of minimal elements of safe is computable in k−1-EXPTIME. The membership problem for safe is EXPSPACE-hard. A symbolic configuration is a pair p v where vi ∈ N ∪ {∗}. The precision of p v is the number of non-∗ elements in v. An instance of a symbolic configuration p v is obtained by substituting all ∗-elements in v with concrete values. The component-wise ordering is extended to symbolic configurations by stipulating n < ∗ for all n ∈ N. For j = 0, 1, 2, . . . , k, we inductively compute the set Cj of all minimal symbolic configurations p v of precision j such that some instance of p v is safe. a bound Bj such that for every p v ∈ Cj we have that the instance

  • btained by substituting every ∗ in

v with Bj is safe. Clearly, Ck is the set of all minimal safe configurations.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 17 / 25

slide-19
SLIDE 19

Non-stochastic eVASS games (3)

Further remarks. For k = 2 and no ω components, the complexity has been improved from EXPTIME to P by Chaloupka (RP 2010). The exact complexity classification for a fixed number of counters is open.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 18 / 25

slide-20
SLIDE 20

Consumption games

Let G be an consumption game, i.e., a non-stochastic multi-counter game with k counters where the components of update vectors are non-positive integers or ω’s. Let S be the set of states of G, and ℓ the maximal absolute value of a counter update.

Theorem 4 (Brázdil, Chatterjee, K., Novotný; CAV 2012)

The problem whether p v ∈ safe is PSPACE-hard and solvable in time | v| · (k · ℓ · |S|)O(k). Further, the set of minimal elements of safe is computable in time (k · ℓ · |S|)O(k).

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 19 / 25

slide-21
SLIDE 21

Consumption games (2)

Let G be an consumption game. Let cover be the set of all p v where Player can play so that all counters stay positive; every visited configuration q u satisfies u ≤ v. Clearly, cover ⊆ safe, and the set cover is upwards closed.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 20 / 25

slide-22
SLIDE 22

Consumption games (2)

Let G be an consumption game. Let cover be the set of all p v where Player can play so that all counters stay positive; every visited configuration q u satisfies u ≤ v. Clearly, cover ⊆ safe, and the set cover is upwards closed.

Theorem 5 (Brázdil, Chatterjee, K., Novotný; CAV 2012)

The problem whether p v ∈ cover is PSPACE-hard and solvable in O(Λ2 · |S|2) time, where Λ = Πk

i=1

  • vi. Further, the set of all minimal elements of cover is

computable in (k · ℓ · |S|)O(k·k!) time.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 20 / 25

slide-23
SLIDE 23

Multiweighted energy games

Let G be a multiweighted (or generalized) energy game, i.e., a non-stochastic multicounter game with Z-semantics with no ω-components in update vectors. Let b ∈ N. For every control state p, let boundb(p) be the set of all v ∈ Nk where Player can play so that all counters stay non-negative and bounded by b.

Theorem 6 (Fahrenberg, Juhl, Larsen, Srba; ICTAC 2011)

The problem whether v ∈ boundb(p) is EXPTIME-complete. If all control states belong to Player , then the problem is PSPACE-complete.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 21 / 25

slide-24
SLIDE 24

Stochastic games with one counter

Solvency games (Berger, Kapur, Schulman, Vazirani; FST& TCS 2008)

v s1 s2 sn 7

1 4

−3

1 4

−1

1 2

5

1 3

−4

2 3

3

1 3

−2

1 6

2

1 4

−5

1 4 Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 22 / 25

slide-25
SLIDE 25

One-counter games and MDPs

Let G be a one-counter game, i.e., a multi-counter game with one counter and Z-semantics where the counter updates range over {0, 1, −1}. So far, the following payoff functions have been studied for one-counter games: Cover negatives (CN), which to every run assigns 1 or 0 depending on whether or not lim inf of all counter values visited along the run is equal to −∞. Zero reachability (Z), which to every run assigns 1 or 0 depending on whether or not the run visits a configuration with zero counter. Selective zero reachability (SZ), which to every run assigns 1 or 0 depending on whether or not the run visits a configuration with zero counter in one of the selected control states, and the counter stays positive in all preceding configurations. Termination time (T), which to every run assigns the number of transitions performed before visiting a configuration with zero counter.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 23 / 25

slide-26
SLIDE 26

One-counter games and MDPs (2)

The results concern the following problems: Qualitative questions: Is the value of a given configuration equal to one? Is there a strategy for Player which achieves the outcome one against every strategy of Player ? Quantitative questions: Can we compute/approximate the value of a given configuration? Can we compute (ε-optimal) strategies? Many answers exist and many questions remain open. The underlying analytical tools are nontrivial.

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 24 / 25

slide-27
SLIDE 27

Open problems

The tractability frontier for non-stochastic multi-counter games is still not well understood, and only very basic payoff functions have been studied. The structure of optimal strategies in one-counter games and MDPs is

  • unclear. Are optimal strategies in solvency games “ultimately periodic”?

What can be done for stochastic games with multiple counters?

Antonín Kuˇ cera (FI MU Brno) Counter Games RP 2012 25 / 25