on the approximation of mean payoff games
play

On the Approximation of Mean-Payoff Games Raffaella Gentilini - PowerPoint PPT Presentation

Mean-Payoff Games Exact Solutions Approximation On the Approximation of Mean-Payoff Games Raffaella Gentilini University of Perugia Convegno Italiano Logica Computazionale (CILC2011) 1 / 20 Mean-Payoff Games Exact Solutions Approximation


  1. Mean-Payoff Games Exact Solutions Approximation On the Approximation of Mean-Payoff Games Raffaella Gentilini University of Perugia Convegno Italiano Logica Computazionale (CILC2011) 1 / 20

  2. Mean-Payoff Games Exact Solutions Approximation Contents 1. Mean-Payoff Games (MPG) Problems 2. Exact Solutions for MPG 3. Approximate Solutions for MPG: The Additive Setting 4. Approximate Solutions for MPG: The Multiplicative Setting 2 / 20

  3. Mean-Payoff Games Exact Solutions Approximation Mean-Payoff Games MPG • 2 players: Maximazer B � b vs Minimazer △ lice 3 / 20

  4. Mean-Payoff Games Exact Solutions Approximation Mean-Payoff Games MPG • 2 players: Maximazer B � b vs Minimazer △ lice • played on a finite graph (arena) 3 / 20

  5. Mean-Payoff Games Exact Solutions Approximation Mean-Payoff Games MPG • 2 players: Maximazer B � b vs Minimazer △ lice • played on a finite graph (arena) • turn based 3 / 20

  6. Mean-Payoff Games Exact Solutions Approximation Mean-Payoff Games MPG • 2 players: Maximazer B � b vs Minimazer △ lice • played on a finite graph (arena) • turn based • infinite number of turns 3 / 20

  7. Mean-Payoff Games Exact Solutions Approximation Mean-Payoff Games MPG • 2 players: Maximazer B � b vs Minimazer △ lice • played on a finite graph (arena) • turn based • infinite number of turns • goal (for Bob): maximazing the long-run average weight 3 / 20

  8. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term In a MPG Γ = ( V , E , w : V → Z , � V � , V △ � ): B � b wants to maximize his payoff, i.e. the long-run average weight in a play . Given a play p = { v i } i ∈ N in Γ, the payoff of B � b on p is: n − 1 � 1 MP( v 0 v 1 . . . v n . . . ) = lim inf n · w ( v i , v i +1 ) n →∞ i =0 4 / 20

  9. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term The value secured by a strategy σ � : V ∗ · V � → V in vertex v is: MP(outcome Γ ( v , σ � , σ △ )) val σ � ( v ) = inf σ △ ∈ Σ △ sup σ � ∈ Σ � (val σ � ( v )) is the optimal value that B � b can secure in v 5 / 20

  10. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term Example The value secured by a strategy σ � : V ∗ · V � → V in vertex v is: MP(outcome Γ ( v , σ � , σ △ )) val σ � ( v ) = inf σ △ ∈ Σ △ sup σ � ∈ Σ � (val σ � ( v )) is the optimal value that B � b can secure in v 5 / 20

  11. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term Example val σ � ( v ) = − 4 2 The value secured by a strategy σ � : V ∗ · V � → V in vertex v is: MP(outcome Γ ( v , σ � , σ △ )) val σ � ( v ) = inf σ △ ∈ Σ △ sup σ � ∈ Σ � (val σ � ( v )) is the optimal value that B � b can secure in v 5 / 20

  12. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term Example The value secured by a strategy σ � : V ∗ · V � → V in vertex v is: MP(outcome Γ ( v , σ � , σ △ )) val σ � ( v ) = inf σ △ ∈ Σ △ sup σ � ∈ Σ � (val σ � ( v )) is the optimal value that B � b can secure in v 5 / 20

  13. Mean-Payoff Games Exact Solutions Approximation MPG in Formal Term Example sup σ � ∈ Σ � (val σ � ( v )) = 2 2 The value secured by a strategy σ � : V ∗ · V � → V in vertex v is: MP(outcome Γ ( v , σ � , σ △ )) val σ � ( v ) = inf σ △ ∈ Σ △ sup σ � ∈ Σ � (val σ � ( v )) is the optimal value that B � b can secure in v 5 / 20

  14. Mean-Payoff Games Exact Solutions Approximation MPG are Memoryless Determined Theorem [Ehrenfeucht&Mycielsky’79] val Γ ( v ) = sup σ � ∈ Σ � inf σ △ ∈ Σ △ MP(outcome Γ ( v , σ � , σ △ )) = = inf σ △ ∈ Σ △ sup σ � ∈ Σ � MP(outcome Γ ( v , σ � , σ � )). There exist uniform memoryless strategies, π � : V � → V for B � b , π △ : V △ → V for △ lice such that: val Γ ( v ) = val π � ( v ) = val π △ ( v ) . 6 / 20

  15. Mean-Payoff Games Exact Solutions Approximation MPG are Memoryless Determined Example Theorem [Ehrenfeucht&Mycielsky’79] val Γ ( v ) = sup σ � ∈ Σ � inf σ △ ∈ Σ △ MP(outcome Γ ( v , σ � , σ △ )) = = inf σ △ ∈ Σ △ sup σ � ∈ Σ � MP(outcome Γ ( v , σ � , σ � )). There exist uniform memoryless strategies, π � : V � → V for B � b , π △ : V △ → V for △ lice such that: val Γ ( v ) = val π � ( v ) = val π △ ( v ) . 6 / 20

  16. Mean-Payoff Games Exact Solutions Approximation MPG are Memoryless Determined Example Theorem [Ehrenfeucht&Mycielsky’79] val Γ ( v ) = sup σ � ∈ Σ � inf σ △ ∈ Σ △ MP(outcome Γ ( v , σ � , σ △ )) = = inf σ △ ∈ Σ △ sup σ � ∈ Σ � MP(outcome Γ ( v , σ � , σ � )). There exist uniform memoryless strategies, π � : V � → V for B � b , π △ : V △ → V for △ lice such that: val Γ ( v ) = val π � ( v ) = val π △ ( v ) . 6 / 20

  17. Mean-Payoff Games Exact Solutions Approximation MPG are Memoryless Determined Example d ∈ Q : 0 ≤ d ≤ | V | and | n | val Γ ( v )= n d ≤ M = max e ∈ E {| w ( e ) |} . Theorem [Ehrenfeucht&Mycielsky’79] val Γ ( v ) = sup σ � ∈ Σ � inf σ △ ∈ Σ △ MP(outcome Γ ( v , σ � , σ △ )) = = inf σ △ ∈ Σ △ sup σ � ∈ Σ � MP(outcome Γ ( v , σ � , σ � )). There exist uniform memoryless strategies, π � : V � → V for B � b , π △ : V △ → V for △ lice such that: 6 / 20 Γ π � π

  18. Mean-Payoff Games Exact Solutions Approximation MPG Problems 7 / 20

  19. Mean-Payoff Games Exact Solutions Approximation MPG Problems 1. Decision Problem Given v ∈ V , µ ∈ Z , decide if B � b has a strategy π � to secure val π � ( v ) ≥ µ . 7 / 20

  20. Mean-Payoff Games Exact Solutions Approximation MPG Problems 1. Decision Problem Given v ∈ V , µ ∈ Z , decide if B � b has a strategy π � to secure val π � ( v ) ≥ µ . 2. Value Problem: Compute the set of (rational) values: { val Γ ( v ) | v ∈ V } 7 / 20

  21. Mean-Payoff Games Exact Solutions Approximation MPG Problems 1. Decision Problem Given v ∈ V , µ ∈ Z , decide if B � b has a strategy π � to secure val π � ( v ) ≥ µ . 2. Value Problem: Compute the set of (rational) values: { val Γ ( v ) | v ∈ V } 3. (Optimal) Strategy Synthesis Construct an (optimal) strategy for B � b. 7 / 20

  22. Mean-Payoff Games Exact Solutions Approximation MPG Problems: Why They Matter? | = Correctness Relation Quantitative Requirements : • limited resources • average performance . . . 8 / 20

  23. Mean-Payoff Games Exact Solutions Approximation MPG Problems: Why They Matter? | = Correctness Relation Quantitative Requirements : System Model? • limited resources • average performance . . . 8 / 20

  24. Mean-Payoff Games Exact Solutions Approximation MPG Problems: Why They Matter? Solved as a game: System vs Environment Solution = Winning Strategy | = Correctness Relation Quantitative Requirements : System Model? • limited resources • average performance . . . 8 / 20

  25. Mean-Payoff Games Exact Solutions Approximation MPG Problems: Why They Matter? • MPG significative for theoretical and applicative aspects PTIME PTIME • µ -calculus model checking ⇐ ⇒ parity games = ⇒ MPG PTIME • MPG = ⇒ simple stochastic games PTIME • MPG = ⇒ discounted payoff games 9 / 20

  26. Mean-Payoff Games Exact Solutions Approximation MPG Problems: Why They Matter? • MPG significative for theoretical and applicative aspects PTIME PTIME • µ -calculus model checking ⇐ ⇒ parity games = ⇒ MPG PTIME • MPG = ⇒ simple stochastic games PTIME • MPG = ⇒ discounted payoff games • MPG problems have an interesting complexity status • MPG decision problem belongs to NP ∩ coNP (and even to UP ∩ coUP) • No polynomial algorithm known so far 9 / 20

  27. Mean-Payoff Games Exact Solutions Approximation Solving MPG Problems Consider Γ = ( V , E , w , � V � , V △ � ), where w : V → [ − M · · · + M ]: 10 / 20

  28. Mean-Payoff Games Exact Solutions Approximation Solving MPG Problems Consider Γ = ( V , E , w , � V � , V △ � ), where w : V → [ − M · · · + M ]: U. Zwick and M. Paterson, 1996 • Θ( EV 2 M ) algorithm for the decision problem • Θ( EV 3 M ) algorithm for the value problem • Θ( EV 4 M log( E V )) algorithm for optimal strategy synthesis 10 / 20

  29. Mean-Payoff Games Exact Solutions Approximation Solving MPG Problems Consider Γ = ( V , E , w , � V � , V △ � ), where w : V → [ − M · · · + M ]: U. Zwick and M. Paterson, 1996 • Θ( EV 2 M ) algorithm for the decision problem H. Bjorklund and S. Vorobyov, 2004: Use a randomized framework • Θ( EV 3 M ) algorithm for the value problem • O (min( EV 2 M , 2 O ( √ V log V ) )) for the decision prob. • Θ( EV 4 M log( E V )) algorithm for optimal strategy synthesis • O (min( EV 3 M (log V + log M ) , 2 O ( √ V log V ) )) for the value prob. • Θ( EV 4 M log( E V )) algorithm for optimal strategy synthesis 10 / 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend