the multiple dimensions of mean payoff games
play

The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS - PowerPoint PPT Presentation

The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS & LSV, ENS Paris-Saclay RP 2017 About Basics about mean-payoff games Algorithms & Complexity Strategy complexity Memory Strategy complexity Memory


  1. The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS & LSV, ENS Paris-Saclay RP 2017

  2. About Basics about mean-payoff games • Algorithms & Complexity • Strategy complexity – Memory • Strategy complexity – Memory Focus • Equivalent game forms • Techniques for memoryless proofs

  3. Mean-Payoff Mean-Payoff

  4. Mean-Payoff

  5. Mean-Payoff

  6. Mean-Payoff

  7. Mean-Payoff Switching policy to get average power (1,1) ?

  8. Mean-Payoff Switching policy to get average power (1,1) ?

  9. Mean-Payoff Mean-payoff value = limit-average of the visited weights

  10. Mean-Payoff Switching policy • Infinite memory: (1,1) vanishing frequency in q 0 • Infinite memory: (1,1) vanishing frequency in q 0

  11. Mean-Payoff limit ?

  12. Mean-Payoff limit ? limsup liminf Mean-payoff is prefix-independent

  13. Mean-Payoff Switching policy • Infinite memory: (1,1) for liminf • Infinite memory: (1,1) for liminf

  14. Mean-Payoff Switching policy • Infinite memory: (1,1) for liminf & (2,2) for limsup • Infinite memory: (1,1) for liminf & (2,2) for limsup

  15. Games Games

  16. Two-player games

  17. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration

  18. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  19. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  20. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  21. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  22. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  23. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  24. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  25. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  26. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  27. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  28. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  29. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Strategies = recipe to extend the play prefix Player 1: Player 2:

  30. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Strategies = recipe to extend the play prefix Player 1: outcome of two strategies is a play Player 2:

  31. Mean-payoff games Mean-payoff games

  32. Mean-payoff games Mean-payoff game: positive and negative weights (encoded in binary) Decision problem: Decide if there exists a player-1 strategy to ensure mean-payoff value ≥ 0 Value problem:

  33. Mean-payoff games Key ingredients: • identify memory requirement: infinite vs. finite vs. memoryless • solve 1-player games (i.e., graphs) Key arguments for memoryless proof: • backward induction • shuffle of plays • nested memoryless objectives

  34. Reduction to Reachability Games

  35. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  36. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  37. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  38. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0) If player 1 wins � only positive cycles are formed � mean-payoff value ≥ 0 If player 2 wins � only negative cycles are formed � mean-payoff value < 0 (Note: limsup vs. liminf does not matter)

  39. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0) Mean-payoff game � Ensuring positive cycles Memoryless strategy transfers to finite-memory mean-payoff winning strategy

  40. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ?

  41. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ?

  42. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  43. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  44. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  45. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  46. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Choose successor to stay above minimum credit minimum credit such that Progress measure: minimum initial credit to stay always positive In choose such that

  47. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Choose successor to stay above minimum credit minimum credit such that Progress measure: minimum initial credit to stay always positive In choose such that

  48. Memoryless proofs Key arguments for memoryless proof: • backward induction • backward induction • shuffle of plays • nested memoryless objectives

  49. Energy Games Energy: min-value of the prefix. (if positive cycle; otherwise ∞) Mean-payoff: average-value of the cycle. Mean-payoff: average-value of the cycle.

  50. Energy Games Winning strategy ?

  51. Energy Games Winning strategy ? Follow the minimum initial credit !

  52. Multi-dimension Multi-dimension games

  53. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) • Mean-payoff:

  54. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) same ? • Mean-payoff: same as positive cycles ?

  55. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) same ? • Mean-payoff: same as positive cycles ? If player 1 can ensure positive simple cycles, then energy and mean-payoff are satisfied. Not the converse !

  56. Multi-dimension games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient

  57. Multi-dimension games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient Then σ’ 1 is winning Let σ 1 be winning On each branch and finite memory L L 1 stop and play ... ... ... ... L 2 as from L 1 ! ... ... ... ... ... ... ... ... With L 1 ≤L 2 ( ℕ d ,≤) is well-quasi ordered wqo + Koenig’s lemma

  58. Multi-energy games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient For player 2 ?

  59. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. ‘left’ game ‘right’ game

  60. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. c l c r c l+ c r ⇒ ‘left’ game ‘right’ game Play is a shuffle of left-game play and right-game play Energy is sum of them

  61. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. c l c r c l+ c r ⇒ ‘left’ game ‘right’ game Value against Value against memoryless Play is a shuffle of left-game play arbitrary strategies and right-game play strategies Energy is sum of them In general, we need

  62. Memoryless proofs Key arguments for memoryless proof: • backward induction • backward induction • shuffle of plays • nested memoryless objectives

  63. Multi-energy games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient For player 2, memoryless strategies are sufficient coNP ?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend