Approximating Values of Generalized-Reachability Stochastic Games - - PowerPoint PPT Presentation

approximating values of generalized reachability
SMART_READER_LITE
LIVE PREVIEW

Approximating Values of Generalized-Reachability Stochastic Games - - PowerPoint PPT Presentation

Approximating Values of Generalized-Reachability Stochastic Games Maximilian Weininger joint work with Pranav Ashok, Krishnendu Chatterjee, Jan Kretnsk, Tobias Winkler HIGHLIGHTS 2020 (Paper appeared at LICS 2020) Model The problem The


slide-1
SLIDE 1

Approximating Values of Generalized-Reachability Stochastic Games

Maximilian Weininger joint work with Pranav Ashok, Krishnendu Chatterjee, Jan Kretínský, Tobias Winkler HIGHLIGHTS 2020 (Paper appeared at LICS 2020)

slide-2
SLIDE 2

Model

slide-3
SLIDE 3

The problem

slide-4
SLIDE 4

The problem

slide-5
SLIDE 5

The problem

slide-6
SLIDE 6

The problem

slide-7
SLIDE 7

The problem

slide-8
SLIDE 8

The problem

Want: Pareto frontier

slide-9
SLIDE 9

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-10
SLIDE 10

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-11
SLIDE 11

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-12
SLIDE 12

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-13
SLIDE 13

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-14
SLIDE 14

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13]

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-15
SLIDE 15

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop?

[CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-16
SLIDE 16

The problem

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

slide-17
SLIDE 17

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

The problem

slide-18
SLIDE 18

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

The problem

slide-19
SLIDE 19

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

The problem

slide-20
SLIDE 20

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

The problem

slide-21
SLIDE 21

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

The problem

slide-22
SLIDE 22

Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

Approximate values of generalized-reachability stochastic games for arbitrarily small precision.

The problem

slide-23
SLIDE 23

Our contribution

Over-approximation need not converge (multiple fixpoints)

slide-24
SLIDE 24

Over-approximation need not converge (multiple fixpoints)

  • Consider single directions
  • Apply single-dimensional solution

Our contribution

slide-25
SLIDE 25

Over-approximation need not converge (multiple fixpoints)

  • Consider single directions
  • Apply single-dimensional solution
  • Group directions into regions

Our contribution

slide-26
SLIDE 26

Context

Single-dim SG Multi-dim MDP Multi-dim SG Computational complexity

NP ∩ coNP [Con92] PSPACE-complete [RRS15]

Strategy complexity

[Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015.

slide-27
SLIDE 27

Context

Single-dim SG Multi-dim MDP Multi-dim SG Computational complexity

NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open

Strategy complexity

[Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015.

slide-28
SLIDE 28

Context

Single-dim SG Multi-dim MDP Multi-dim SG Computational complexity

NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open

Strategy complexity

Memoryless deterministic [Con92] Randomized memoryless (absorbing) [EKVY07]; Finite mem. in general [RRS15]

[Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015. [EKVY07] Etessami, K., Kwiatkowska, M., Vardi, M. Y., & Yannakakis, M. Multi-objective model checking of Markov decision processes. TACAS 2007.

slide-29
SLIDE 29

Context

Single-dim SG Multi-dim MDP Multi-dim SG Computational complexity

NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open

Strategy complexity

Memoryless deterministic [Con92] Randomized memoryless (absorbing) [EKVY07]; Finite mem. in general [RRS15]

  • Inf. mem. (absorbing)

[CFK+13]

[Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015. [EKVY07] Etessami, K., Kwiatkowska, M., Vardi, M. Y., & Yannakakis, M. Multi-objective model checking of Markov decision processes. TACAS 2007. [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

slide-30
SLIDE 30

Context

Over-approximation need not converge (multiple fixpoints)

Single-dim SG Multi-dim MDP Multi-dim SG Computational complexity

NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open

Strategy complexity

Memoryless deterministic [Con92] Randomized memoryless (absorbing) [EKVY07]; Finite mem. in general [RRS15]

  • Inf. mem. (absorbing)

[CFK+13]