A Discrete Strategy Improvement Algorithm for Solving Parity Games - PDF document

A Discrete Strategy Improvement Algorithm for Solving Parity Games Jens V¨ oge Marcin Jurdzi´ nski Lehrstuhl f¨ ur Informatik VII BRICS RWTH Aachen University of Aarhus Germany Denmark Chicago, USA, 19 July 2000 1 Complexity of parity games — motivations • Equivalent to modal µ -calculus model checking [Emerson, Jutla, Sistla 1993; Stirling 1995] Solving a parity game: Model checking: �− → who is the winner in G K ,ϕ ? does K | = ϕ hold? � � reduction in time O |K| · | ϕ | • Intriguing complexity-theoretic status – in NP ∩ co-NP [EJS’93] (even in UP ∩ co-UP [J’98]) – no polynomial time algorithm known [EL’86, . . . , EJS’93, BCJLM’94, Sei’96, J’00] – parity games ≤ log − space mean payoff games ≤ log − space m m discounted payoff games ≤ log − space simple stochastic games m [Condon’92, Puri’95, ZP’96] Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 2

Strategy improvement algorithms — history 1966 Hoffman-Karp’s algorithm for stochastic games received a lot of attention in Operations Research community 1995 Puri’s algorithm for discounted payoff games Drawbacks of Puri’s algorithm: • Inefficient in practice – solving linear programming instances – high precision arithmetic • Hard to analyze/understand – manipulates real number encodings of discrete values – proof of correctness uses continuous methods (e.g., Banach fixed point theorem) Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 3 Discrete strategy improvement algorithm We alleviate drawbacks of Puri’s algorithm • Fast implementation – O ( n · m ) discrete algorithm for strategy improvement step • Hope for easier analysis/better understanding: – manipulates discrete values explicitly – proof of correctness uses only discrete arguments Experimental evidence: small number of strategy improvement steps Open problem: is it a polynomial time algorithm? Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 4

� � � � � � � � � � Plan 1. Definition of parity games 2. Solving parity games (a) as a decision problem (b) as an optimization problem 3. Strategy Improvement Algorithm (a) generic idea (b) our implementation 4. Time complexity 5. Open problems Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 5 Parity games — definition � � G = V, E, ( M Even , M Odd ) • V = { 0 , 1 , 2 , . . ., n } = M Even ⊎ M Odd • 0 ∈ M Even ; o ∈ M Odd �� 6 � 3 �� 5 0 1 4 2 Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 6

� � � � � � � � � � � � � � Winning play Loop ( P ) � �� 6 � 3 � 0 � 4 Play P : 5 2 λ ( P ) � � Loop ( P ) = { 0 , 2 , 3 , 4 } λ ( P ) = max { 0 , 2 , 3 , 4 } = 4 Loop value λ ( P ) of a play P is defined by � � λ ( P ) = max Loop ( P ) Play P is a winning play for Even iff λ ( P ) is even Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 7 Strategies Function σ : M Even → V is a strategy for Even iff � � v, σ ( v ) ∈ E for every v ∈ M Even �� σ σ � 6 � 6 � 3 0 �� 2 �� 5 5 5 6 3 3 0 0 1 1 1 4 4 4 2 2 σ Play P = � v 1 , v 2 , . . ., v k � is consistent with strategy σ iff v i +1 = σ ( v i ) for every v i ∈ M Even �� σ σ σ � 6 � 6 � 3 � 0 � 4 � 4 � 2 � 2 5 5 5 6 3 3 0 0 4 2 P : Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 8

Winning strategies Plays σ ( v ) is defined to be the set of plays − starting from v , and − consistent with σ Strategy σ is a winning strategy for Even from v iff every play P ∈ Plays σ ( v ) is winning for Even Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 9 Solving parity games — decision problem The winning set � � W Even = v ∈ V : there is a winning strategy for Even from v The problem of solving parity games � � Given: a parity game G = V, E, ( M Even , M Odd ) Find: the winning set W Even ⊆ V Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 10

Solving parity games — optimization problem (Strategies , ⊑ ) ⊑ is a partial order on Strategies Postulates for (Strategies , ⊑ ): 1. The ⊑ -maximum strategy exists 2. The ⊑ -maximum element is a strategy winning from every vertex in W Even The problem of solving parity games � � Given: a parity game G = V, E, ( M Even , M Odd ) Find: the ⊑ -maximum strategy Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 11 Generic Strategy Improvement Algorithm Postulates for a function ( Strategies , ⊑ ) Improve : Strategies → Strategies 1. (Strategy Improvement) ⊑ If σ is not the ⊑ -maximum strategy then σ ⊑ Improve ( σ ) 2. (Optimum Strategy) Improve If σ is the ⊑ -maximum strategy then Improve ( σ ) = σ Strategy Improvement Algorithm: pick a strategy σ for player Even while σ � = Improve ( σ ) do σ := Improve ( σ ) Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 12

Ingredients of Strategy Improvement Algorithm In this talk: 1. Definition of a partial order ⊑ on Strategies 2. Definition of a function Improve : Strategies → Strategies In (full versions of) the paper: 1. Proofs of postulates for ( Strategies , ⊑ ) 2. Proofs of postulates for Improve : (a) Proof of Strategy Improvement Lemma (b) Proof of Optimum Strategy Lemma 3. Efficient implementation of Improve Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 13 Our proposal for ⊑ -order on Strategies Assume we are given: 1. ( PlayValues , � ): a linear order � on PlayValues 2. Θ : Plays → PlayValues : value of a play ( V → PlayValues ) point-wise extension ? � �� ( Strategies , ⊑ ) ( � ) Valuations , Ω σ ′ Ω σ ′ σ Ω σ Ω σ : V → PlayValues Ω σ ( v ) = min � � � Θ( P ) : P ∈ Plays σ ( v ) Intuition: Ω σ is the value of the best counter-strategy against σ Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 14

� � Our proposal for Improve function on Strategies Improve : Strategies → Strategies Improve ( σ ) : M Even → V � � Improve ( σ ) ( v ) = the successor of v maximizing Ω σ (w.r.t. � ) u 1 Improve ( σ ) because Ω σ ( u 1 ) ⊲ Ω σ ( u 2 ) v u 2 Improvement step for strategy σ in a nutshell: 1. global minimization: find Ω σ , the best counter-strategy against σ 2. local maximization: point Improve ( σ ) to � -maximum Ω σ values Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 15 PlayValues and function Θ : Plays → PlayValues �� 6 � 3 � 0 � 4 5 2 Play P : � �� λ ( P ) Prefix ( P ) Prefix ( P ) = { 0 , 3 , 5 , 6 } λ ( P ) = 4, π ( P ) = { 5 , 6 } , #( P ) = 4 � � Primary path value π ( P ) = Prefix ( P ) ∩ v : v > λ ( P ) � � Secondary path value #( P ) = � Prefix ( P ) � V × ℘ ( V ) × N � �� Value of a play function Θ : Plays → PlayValues is defined by: � � Θ( P ) = λ ( P ) , π ( P ) , #( P ) Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 16

Our proposal for � order on PlayValues PlayValues ⊆ V × ℘ ( V ) × N � on PlayValues is the lexicographic order on V × ℘ ( V ) × N • We define a � linear order on loop values V • We define a � linear order on primary path values ℘ ( V ) • We use standard ≤ linear order on secondary path values N Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 17 The � linear orders on V and ℘ ( V ) Definition (The � linear order on loop values V ) (2 k − 1) ≺ · · · ≺ 5 ≺ 3 ≺ 1 ≺ 0 ≺ 2 ≺ 4 ≺ 6 ≺ · · · ≺ 2 k Definition (The � linear order on primary path values ℘ ( V )) P � Q iff FirstDiff ( P ; Q ) � FirstDiff ( Q ; P ) Example � � P = 7 > 6 > 5 > 4 � � = 7 6 4 Q > > � � = 7 6 4 2 R > > > Jens V¨ oge and Marcin Jurdzi´ nski CA V 2000 A Discrete Strategy Improvement Algorithm for Solving Parity Games 18

A Discrete Strategy Improvement Algorithm for Solving Parity Games - PDF document

A Discrete Strategy Improvement Algorithm for Solving Parity Games Jens V oge Marcin Jurdzi nski Lehrstuhl f ur Informatik VII BRICS RWTH Aachen University of Aarhus Germany Denmark Chicago, USA, 19 July 2000 1 Complexity of

Integrating Problem Solving 2020 Integrating Problem Solving 2020 Integrating Problem Solving

Solving Word Problems The strategy for solving word problems, presented in written form, may be

Discrete Mathematics Jeremy Siek Spring 2010 Jeremy Siek Discrete Mathematics 1 / 118 Jeremy

Cyber-Physical Systems Discrete Dynamics IECE 553/453 Fall 2019 Prof. Dola Saha 1 Discrete

CMSC 222: Discrete Mathematics Prof S Fall 2018 What is Discrete Mathematics? Discrete

Cyber-Physical Systems Discrete Dynamics ICEN 553/453 Fall 2018 Prof. Dola Saha 1 Discrete

Plan Discrete paths as Heyting algebras Discrete paths as categories Discrete paths as quantales

Discrete-time Systems in the Time Domain Chaiwoot Boonyasiriwat August 21, 2020 Discrete-time

Evidence evaluation for discrete data Evidence evaluation for discrete data Evidence evaluation

Solving Discrete Logarithms in Smooth-Order Groups with CUDA Ryan Henry Ian Goldberg Solving

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

Quiz I Give the SVD-based algorithm for solving least squares, and I justify the algorithm by that

Continuous Improvement Solving Problems That Change Lives CI Skills Development Problem Solving

Algorithm Analysis October 12, 2016 CMPE 250 Algorithm Analysis October 12, 2016 1 / 66

Discrete Structures of Computer Science Amanda Watson What is Discrete Mathematics?

Logic and discrete mathematics (HKGAB4) Discrete mathematics: contents http://www.ida.liu.se/

From Object Fields to Local Variables: a Practical Approach to Field-Sensitive Analysis Elvira

Stable finiteness and pure infiniteness of the C -algebras of higher-rank graphs Astrid an

Automatically proving linearizability Viktor Vafeiadis University of Cambridge CAV 2010

Pure Functional Programming Functional Programming and Reasoning Dr Hans Georg Schaathun

dependence of J/ suppression in Pb-Pb collisions at s NN =2.76TeV ALICE Collaboration, May

Probabilistic Modelling with Tensor Networks: From Hidden Markov Models to Quantum Circuits Ryan

GCF 26th Meeting of the Board (B.26) Diana Dorman PhD Student | Environmental Studies University

Covid-19- Supply Chain Agility CIPS Bahrain Branch Presented by GP Puri Creating Supply Chain