Games with Costs and Delays Martin Zimmermann Saarland University - - PowerPoint PPT Presentation

games with costs and delays
SMART_READER_LITE
LIVE PREVIEW

Games with Costs and Delays Martin Zimmermann Saarland University - - PowerPoint PPT Presentation

Games with Costs and Delays Martin Zimmermann Saarland University June 20th, 2017 LICS 2017, Reykjavik, Iceland Martin Zimmermann Saarland University Games with Costs and Delays 1/14 Gale-Stewart Games Bchi-Landweber: The winner of a


slide-1
SLIDE 1

Games with Costs and Delays

Martin Zimmermann

Saarland University

June 20th, 2017

LICS 2017, Reykjavik, Iceland

Martin Zimmermann Saarland University Games with Costs and Delays 1/14

slide-2
SLIDE 2

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively.

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-3
SLIDE 3

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-4
SLIDE 4

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b O:

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-5
SLIDE 5

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b O: a

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-6
SLIDE 6

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a O: a

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-7
SLIDE 7

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a O: a a

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-8
SLIDE 8

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b O: a a

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-9
SLIDE 9

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b · · · O: a a · · · I wins!

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-10
SLIDE 10

Gale-Stewart Games

Büchi-Landweber: The winner of a zero-sum two-player game of infinite duration with ω-regular winning condition can be deter- mined effectively. α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b · · · O: a a · · · I wins! Many possible extensions... we consider two: Interaction: one player may delay her moves. Winning condition: quantitative instead of qualitative.

Martin Zimmermann Saarland University Games with Costs and Delays 2/14

slide-11
SLIDE 11

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-12
SLIDE 12

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b O:

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-13
SLIDE 13

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a O:

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-14
SLIDE 14

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b O:

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-15
SLIDE 15

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b O: b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-16
SLIDE 16

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b O: b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-17
SLIDE 17

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b O: b b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-18
SLIDE 18

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a O: b b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-19
SLIDE 19

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a O: b b a

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-20
SLIDE 20

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a O: b b a

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-21
SLIDE 21

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a O: b b a a

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-22
SLIDE 22

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b O: b b a a

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-23
SLIDE 23

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b O: b b a a b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-24
SLIDE 24

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b b O: b b a a b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-25
SLIDE 25

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b b O: b b a a b b

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-26
SLIDE 26

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b b · · · O: b b a a b b · · · O wins!

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-27
SLIDE 27

Delay Games

Allow Player O to delay her moves: α(0) β(0) α(1) β(1)

  • · · · ∈ L, if β(i) = α(i + 2) for every i

I: b a b b a a b b · · · O: b b a a b b · · · O wins! Typical questions: How often does Player O have to delay to win? How hard is determining the winner of a delay game? Does the ability to delay allow Player O to improve the quality

  • f her strategies?

Martin Zimmermann Saarland University Games with Costs and Delays 3/14

slide-28
SLIDE 28

Previous Work

If winning conditions given by deterministic parity automata:

Theorem (Klein, Z. ’15)

If Player O wins delay game induced by A, then also by delaying at most 2|A|2 times.

Martin Zimmermann Saarland University Games with Costs and Delays 4/14

slide-29
SLIDE 29

Previous Work

If winning conditions given by deterministic parity automata:

Theorem (Klein, Z. ’15)

If Player O wins delay game induced by A, then also by delaying at most 2|A|2 times. Lower bound 2|A| (already for safety automata).

Martin Zimmermann Saarland University Games with Costs and Delays 4/14

slide-30
SLIDE 30

Previous Work

If winning conditions given by deterministic parity automata:

Theorem (Klein, Z. ’15)

If Player O wins delay game induced by A, then also by delaying at most 2|A|2 times. Lower bound 2|A| (already for safety automata). Determining the winner is EXPTIME-complete (hardness already for safety automata).

Martin Zimmermann Saarland University Games with Costs and Delays 4/14

slide-31
SLIDE 31

Previous Work

If winning conditions given by deterministic parity automata:

Theorem (Klein, Z. ’15)

If Player O wins delay game induced by A, then also by delaying at most 2|A|2 times. Lower bound 2|A| (already for safety automata). Determining the winner is EXPTIME-complete (hardness already for safety automata). Note: This improved similar results by Holtmann, Kaiser, and Thomas with doubly-exponential upper bounds and no lower bounds.

Martin Zimmermann Saarland University Games with Costs and Delays 4/14

slide-32
SLIDE 32

Previous Work

If winning conditions given by formula in (quantitative) linear temporal logics:

Theorem (Klein, Z. ’16)

If Player O wins delay game induced by ϕ, then also by delaying at most 222|ϕ| times. There is a matching lower bound. Determining the winner is 3EXPTIME-complete. Note: Quantitative conditions not harder than qualitative ones.

Martin Zimmermann Saarland University Games with Costs and Delays 5/14

slide-33
SLIDE 33

Uniformization of Relations

A strategy σ for O in a game induces a mapping fσ : Σω

I → Σω O

σ is winning ⇔ {

  • α

fσ(α)

  • | α ∈ Σω

I } ⊆ L

(fσ uniformizes L)

Martin Zimmermann Saarland University Games with Costs and Delays 6/14

slide-34
SLIDE 34

Uniformization of Relations

A strategy σ for O in a game induces a mapping fσ : Σω

I → Σω O

σ is winning ⇔ {

  • α

fσ(α)

  • | α ∈ Σω

I } ⊆ L

(fσ uniformizes L) Continuity in terms of strategies (in Cantor metric): Strategy without lookahead: i-th letter of fσ(α) only depends

  • n first i letters of α (very strong notion of continuity).

Martin Zimmermann Saarland University Games with Costs and Delays 6/14

slide-35
SLIDE 35

Uniformization of Relations

A strategy σ for O in a game induces a mapping fσ : Σω

I → Σω O

σ is winning ⇔ {

  • α

fσ(α)

  • | α ∈ Σω

I } ⊆ L

(fσ uniformizes L) Continuity in terms of strategies (in Cantor metric): Strategy without lookahead: i-th letter of fσ(α) only depends

  • n first i letters of α (very strong notion of continuity).

Strategy with bounded delay: fσ Lipschitz-continuous.

Martin Zimmermann Saarland University Games with Costs and Delays 6/14

slide-36
SLIDE 36

Uniformization of Relations

A strategy σ for O in a game induces a mapping fσ : Σω

I → Σω O

σ is winning ⇔ {

  • α

fσ(α)

  • | α ∈ Σω

I } ⊆ L

(fσ uniformizes L) Continuity in terms of strategies (in Cantor metric): Strategy without lookahead: i-th letter of fσ(α) only depends

  • n first i letters of α (very strong notion of continuity).

Strategy with bounded delay: fσ Lipschitz-continuous. Strategy with arbitrary (finite) delay: fσ (uniformly) continuous.

Martin Zimmermann Saarland University Games with Costs and Delays 6/14

slide-37
SLIDE 37

Uniformization of Relations

A strategy σ for O in a game induces a mapping fσ : Σω

I → Σω O

σ is winning ⇔ {

  • α

fσ(α)

  • | α ∈ Σω

I } ⊆ L

(fσ uniformizes L) Continuity in terms of strategies (in Cantor metric): Strategy without lookahead: i-th letter of fσ(α) only depends

  • n first i letters of α (very strong notion of continuity).

Strategy with bounded delay: fσ Lipschitz-continuous. Strategy with arbitrary (finite) delay: fσ (uniformly) continuous. Holtmann, Kaiser, Thomas: for ω-regular L L uniformizable by continuous function ⇔ L uniformizable by Lipschitz-continuous function

Martin Zimmermann Saarland University Games with Costs and Delays 6/14

slide-38
SLIDE 38

Finitary Parity Automata

1 2 a a a b

Martin Zimmermann Saarland University Games with Costs and Delays 7/14

slide-39
SLIDE 39

Finitary Parity Automata

1 2 a a a b Parity acceptance: Almost every odd priority is followed by a larger even one. L(A) = a(b∗aaa)∗bω + a(b∗aaa)ω

Martin Zimmermann Saarland University Games with Costs and Delays 7/14

slide-40
SLIDE 40

Finitary Parity Automata

1 2 a a a b Parity acceptance: Almost every odd priority is followed by a larger even one. L(A) = a(b∗aaa)∗bω + a(b∗aaa)ω Finitary parity acceptance: There is a bound n such that almost every odd priority is followed by a larger even one within n steps. L(A) = a(b∗aaa)∗bω +

  • n∈N

a(b≤naaa)ω

Martin Zimmermann Saarland University Games with Costs and Delays 7/14

slide-41
SLIDE 41

Finitary Parity Automata

Remark

Safety automata can be transformed into finitary parity automata

  • f the same size.

Martin Zimmermann Saarland University Games with Costs and Delays 8/14

slide-42
SLIDE 42

Finitary Parity Automata

Remark

Safety automata can be transformed into finitary parity automata

  • f the same size.

Proof: Turn all unsafe states into sinks with an odd color, all safe states get even color.

Martin Zimmermann Saarland University Games with Costs and Delays 8/14

slide-43
SLIDE 43

Finitary Parity Automata

Remark

Safety automata can be transformed into finitary parity automata

  • f the same size.

Proof: Turn all unsafe states into sinks with an odd color, all safe states get even color. Thus: exponential lower bounds on complexity and necessary lookahead for delay games with finitary parity conditions.

Martin Zimmermann Saarland University Games with Costs and Delays 8/14

slide-44
SLIDE 44

Results

If winning conditions given by deterministic finitary parity automata:

Theorem

If Player O wins delay game induced by A, then also by delaying at most 2|A|6 times.

Martin Zimmermann Saarland University Games with Costs and Delays 9/14

slide-45
SLIDE 45

Results

If winning conditions given by deterministic finitary parity automata:

Theorem

If Player O wins delay game induced by A, then also by delaying at most 2|A|6 times. Lower bound 2|A|.

Martin Zimmermann Saarland University Games with Costs and Delays 9/14

slide-46
SLIDE 46

Results

If winning conditions given by deterministic finitary parity automata:

Theorem

If Player O wins delay game induced by A, then also by delaying at most 2|A|6 times. Lower bound 2|A|. Determining the winner is EXPTIME-complete. Note: Again, quantitative conditions not harder than qualitative ones.

Martin Zimmermann Saarland University Games with Costs and Delays 9/14

slide-47
SLIDE 47

Tradeoff Lookahead vs. Quality

. . . ∗

1

  • 1

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 10/14

slide-48
SLIDE 48

Tradeoff Lookahead vs. Quality

. . . ∗

1

  • 1

  • 1

  • Theorem

For every n > 0, there is a language Ln recognized by a finitary Büchi automaton with n + 2 states such that an optimal strategy without delay has cost n, but an optimal strategy delaying once has cost 1.

Martin Zimmermann Saarland University Games with Costs and Delays 10/14

slide-49
SLIDE 49

Tradeoff Lookahead vs. Quality

Theorem

For every n > 0, there is a language L′

n recognized by a finitary

Büchi automaton with O(n) states such that an optimal strategy delaying 2n times has cost 0, and an optimal strategy delaying less than 2n times has cost n. · · · ∗

1

  • delay 2n

cost 0

Martin Zimmermann Saarland University Games with Costs and Delays 11/14

slide-50
SLIDE 50

Tradeoff Lookahead vs. Quality

b c d e f a

(3,0)

(3,1)

  • 1

(2,0)

(2,1)

  • 1

(1,0)

(1,1)

  • 1

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 12/14

slide-51
SLIDE 51

Tradeoff Lookahead vs. Quality

a

(3,0)

(3,1)

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 12/14

slide-52
SLIDE 52

Tradeoff Lookahead vs. Quality

b c a

(2,0)

(2,1)

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 12/14

slide-53
SLIDE 53

Tradeoff Lookahead vs. Quality

b c d e a

(1,0)

(1,1)

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 12/14

slide-54
SLIDE 54

Tradeoff Lookahead vs. Quality

b c d e f a

(1,0)

(1,1)

  • 1

  • Martin Zimmermann

Saarland University Games with Costs and Delays 12/14

slide-55
SLIDE 55

Tradeoff Lookahead vs. Quality

b c d e f a

(3,0)

(3,1)

  • 1

(2,0)

(2,1)

  • 1

(1,0)

(1,1)

  • 1

  • 1

  • Theorem

For every n > 0, there is a language L′′

n recognized by a finitary

Büchi automaton with O(n2) states such that for every 0 ≤ j ≤ n: an optimal strategy delaying j times has cost 2(n + 1) − j.

Martin Zimmermann Saarland University Games with Costs and Delays 12/14

slide-56
SLIDE 56

More Results

acceptance lookahead complexity

Martin Zimmermann Saarland University Games with Costs and Delays 13/14

slide-57
SLIDE 57

More Results

acceptance lookahead complexity parity exp. EXPTIME-complete finitary parity exp. EXPTIME-complete

Martin Zimmermann Saarland University Games with Costs and Delays 13/14

slide-58
SLIDE 58

More Results

acceptance lookahead complexity parity exp. EXPTIME-complete finitary parity exp. EXPTIME-complete parity w. costs exp. EXPTIME-complete

Martin Zimmermann Saarland University Games with Costs and Delays 13/14

slide-59
SLIDE 59

More Results

acceptance lookahead complexity parity exp. EXPTIME-complete finitary parity exp. EXPTIME-complete parity w. costs exp. EXPTIME-complete finitary Streett exp./doubly-exp. EXPTIME/2EXPTIME Streett w. costs exp./doubly-exp. EXPTIME/2EXPTIME

Martin Zimmermann Saarland University Games with Costs and Delays 13/14

slide-60
SLIDE 60

More Results

acceptance lookahead complexity parity exp. EXPTIME-complete finitary parity exp. EXPTIME-complete parity w. costs exp. EXPTIME-complete finitary Streett exp./doubly-exp. EXPTIME/2EXPTIME Streett w. costs exp./doubly-exp. EXPTIME/2EXPTIME

Theorem

Optimal strategies in delay games with Streett conditions with costs may require doubly-exponential lookahead.

Martin Zimmermann Saarland University Games with Costs and Delays 13/14

slide-61
SLIDE 61

Conclusion

Quantitative delay games with parity conditions are not harder than qualitative ones. Lookahead allows to improve the quality of strategies.

Martin Zimmermann Saarland University Games with Costs and Delays 14/14

slide-62
SLIDE 62

Conclusion

Quantitative delay games with parity conditions are not harder than qualitative ones. Lookahead allows to improve the quality of strategies. Open Problems Close the gaps for Streett conditions (qualitative and quantitative). Study other tradeoffs, e.g., lookahead vs. memory size. Determine the complexity of finding optimal strategies (smallest cost or smallest lookahead).

Martin Zimmermann Saarland University Games with Costs and Delays 14/14