Winning Infinite Games in Finite Time
Wolfgang Thomas
Francqui Lecture, Mons, April 2013
The Problem
Given a Muller game with the collection F of “winning loops” for Player 2, play it like a card game in the evening ... and of course go to sleep at some time. Question: How can one terminate a play after finite time and correctly declare the winner? This is trivial for parity games: terminate a play when some vertex v is repeated for the first time, and declare the winner according to the maximal color seen between the two visits of v. We pursue the question for Muller games.
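The termination rule for parity games can be made concrete. A minimal sketch in Python (the function name and the convention that Player 2 wins on even maximal colors are illustrative assumptions; conventions vary):

```python
def parity_winner_finite(play, color):
    """Decide a parity play in finite time: stop at the first
    repeated vertex and take the maximal color seen between the
    two visits (convention here: Player 2 wins iff it is even)."""
    first_visit = {}                      # vertex -> position of first visit
    for t, v in enumerate(play):
        if v in first_visit:
            m = max(color[u] for u in play[first_visit[v]:t + 1])
            return 2 if m % 2 == 0 else 1
        first_visit[v] = t
    return None                           # no vertex repeated yet

# Vertex 1 repeats; the colors between its two visits are 2, 3, 2,
# so the maximal color 3 is odd and Player 1 wins.
print(parity_winner_finite([0, 1, 2, 1], {0: 1, 1: 2, 2: 3}))  # 1
```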
From McNaughton’s Report (1965)
Scoring
A Muller game (G, F1, F2) consists here of an arena
G = (V, V1, V2, E) and a partition (F1, F2) of 2^V.
Player i wins play ρ iff Inf(ρ) ∈ Fi. Strategies, winning strategies, and winning regions are defined as before. McNaughton’s approach: Count for each loop F how often F (as a set) was completely traversed without interruption. Call this number at time t of a play the score for F at time t. McNaughton (2000): The winner of a Muller game is the player who first reaches score n! for one of his winning loops F.
A Muller Game
Example
[Arena over vertices 0, 1, 2 — figure not preserved]
F2 = {{0, 1, 2}, {0}, {2}}, F1 = {{0, 1}, {1, 2}}
Player 2 has a winning strategy: alternate between 0 and 1 (requires two memory states).
Scoring Functions
For F ⊆ V define ScF : V+ → N:
ScF(w) = max{k | exist words x1, · · · , xk ∈ V+ s.t. x1 · · · xk suffix of w and Occ(xi) = F for all i}
where Occ(w) = {v ∈ V | ∃j s.t. wj = v}.
ScF(w) = k iff all of F was visited k consecutive times.
Example: w = 0 0 1 0 1 0 1 2 0 1 2 0 2

w          0  0  1  0  1  0  1  2  0  1  2  0  2
Sc{0,1}    0  0  1  1  2  2  3  0  0  1  0  0  0
Sc{0,1,2}  0  0  0  0  0  0  0  1  1  1  2  2  2
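The definition of ScF can be implemented directly. A sketch (vertices are assumed to be characters of a string; `score` is an illustrative name, not notation from the talk):

```python
def score(F, w):
    """Sc_F(w): the maximal k such that some suffix of w splits into
    blocks x1 ... xk with Occ(xi) = F for every i."""
    F = set(F)
    n = len(w)
    # f[i] = max number of such blocks exactly covering the suffix w[i:]
    # (-1 if w[i:] admits no such decomposition; the empty suffix gives 0).
    f = [-1] * (n + 1)
    f[n] = 0
    for i in range(n - 1, -1, -1):
        for j in range(i + 1, n + 1):
            if set(w[i:j]) == F and f[j] >= 0:
                f[i] = max(f[i], 1 + f[j])
    return max(f)   # best over all suffix starting points

print(score({'0', '1'}, "0010101"))        # 3: suffix 01|01|01
print(score({'0', '1'}, "0010101201202"))  # 0: the word ends in 2
```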
Accumulator Functions
For F ⊆ V define AccF : V+ → 2^F:
AccF(w) contains the vertices of F seen since the last increase or reset of ScF.
Example: w = 0 0 1 0 1 0 1 2

w           0    0    1     0     1     0     1    2
Sc{0,1}     0    0    1     1     2     2     3    0
Acc{0,1}   {0}  {0}   ∅    {0}    ∅    {0}    ∅    ∅
Sc{0,1,2}   0    0    0     0     0     0     0    1
Acc{0,1,2} {0}  {0} {0,1} {0,1} {0,1} {0,1} {0,1}  ∅
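The accumulator can be simulated step by step alongside the score. A self-contained sketch (the helper `score` re-implements ScF from its definition; the names are illustrative):

```python
def score(F, w):
    """Sc_F(w): maximal k such that a suffix of w splits into k blocks,
    each visiting exactly the set F."""
    F, n = set(F), len(w)
    f = [-1] * (n + 1)        # f[i]: blocks covering w[i:], -1 if impossible
    f[n] = 0
    for i in range(n - 1, -1, -1):
        for j in range(i + 1, n + 1):
            if set(w[i:j]) == F and f[j] >= 0:
                f[i] = max(f[i], 1 + f[j])
    return max(f)

def accumulator(F, w):
    """Acc_F(w): vertices of F seen since Sc_F last increased or was reset."""
    F = set(F)
    acc, prev = set(), 0
    for i in range(1, len(w) + 1):
        s = score(F, w[:i])
        if w[i - 1] not in F or s > prev:
            acc = set()            # reset (left F) or score increase
        else:
            acc.add(w[i - 1])
        prev = s
    return acc

print(accumulator({'0', '1'}, "001010"))  # {'0'}
```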
Finite-time Muller Games
Two properties of the scoring functions (informal versions):
- 1. If you play long enough (i.e., k|G| steps), some score value
will be high (i.e., k).
- 2. At most one score value can increase at a time.
A finite-time Muller game has the format (G, F1, F2, k) with a threshold k ≥ 3 and the following rules: The players move a token through the arena. The play w is stopped as soon as a score of k is reached for the first time. By property 2 there is then a unique F such that ScF(w) = k, and Player i wins w iff F ∈ Fi.
Results
Fearnley, Zimmermann (2010, a GASICS cooperation): Let k ≥ 3. The winning regions in a Muller game (G, F1, F2) and in the finite-time Muller game (G, F1, F2, k) coincide. Stronger statement, which implies the theorem: On her winning region, Player i can prevent her opponent from ever reaching a score of 3 for any set F ∈ F1−i. We obtain two “reductions” of the Muller game:
- 1. to a reachability game on the unravelling up to score 3 (doubly-exponential blowup)
- 2. to a safety game: see next slides.
“Reducing” Muller Games to Safety Games
[Arena over vertices 0, 1, 2 — figure not preserved]
F2 = {{0, 1, 2}, {0}, {2}}, F1 = {{0, 1}, {1, 2}}
Idea: keep track of Player 1’s scores and avoid ScF = 3 for
F ∈ F1.
Ignore scores of Player 2. Identify plays having the same scores and accumulators for Player 1.
w =F1 w′ iff
∀F ∈ F1 : ScF(w) = ScF(w′) and AccF(w) = AccF(w′)
Build unravelling of =F1-equivalence classes up to score
3 for Player 1.
Safety Game Graph
[Unravelling of =F1-classes, nodes labelled by play prefixes 1, 10, 12, 101, 100, 122, 121, . . . up to score 3 — figure not preserved]
Standard Game Reductions
A classical game reduction transforms a complicated game G into a simpler game G′: every play in G is mapped (continuously) to a play in G′ that has the same winner. Solving G′ yields both the winning regions of G and corresponding finite-state winning strategies for both players. Muller games cannot be reduced to safety games in this sense: otherwise we would reduce the Borel level of Muller-recognizable ω-languages (B(Π2)) down to Π1.
Results
- 1. Player i wins the Muller game from v iff she wins the
safety game from [v]=F1.
- 2. Player 2’s winning region in the safety game can be turned into a finite-state winning strategy for her in the Muller game.
- 3. Size of the safety game: (n!)^3.
(Neider, Rabinovich, Zimmermann, GandALF 2011) Remarks: The size of the parity game in the LAR-reduction is n!. But safety games admit simpler algorithms.
- Item 2. does not hold for Player 1.
So the reduction is unilateral, not player-symmetric in the classical sense.
Conclusion
Convincing the referee that one can win the game is not the same as winning the game. One can transform the winner-deciding strategy into a genuine winning strategy. This gives an alternative approach to strategy construction. Task: Study the interplay between symmetric and unilateral game reductions.
Perspective: Quantitative Aspects
Quantitative Games
The games studied so far were win–lose games. In quantitative games a value is associated with each play. Usually one player tries to maximize and the other tries to minimize this value. Another quantitative aspect deals with the economy of strategies (e.g., minimization of memory).
A Mean Payoff Game
[Game graph over vertices u, v, w, x, y, z with edge rewards 2, −4, 8, −1, . . . — figure not preserved]
For a finite play v0 · · · vn we are interested in the mean value
1/n · ∑_{i=0}^{n−1} r(vi, vi+1)
In the limit, Player 0 tries to maximize and Player 1 tries to minimize this value.
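A minimal numeric illustration of the mean value (the reward sequence is hypothetical, not the one from the figure):

```python
def mean_value(rewards):
    """Mean value of a finite play: (1/n) * sum of its n edge rewards."""
    return sum(rewards) / len(rewards)

# On a play that keeps alternating rewards 2 and -4, the running
# means approach (2 - 4) / 2 = -1.
rewards = [2, -4] * 50
print(mean_value(rewards))   # -1.0
```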
Mean Payoff Game – Formal
A mean payoff game is of the form G = (Q, Q0, E, r) where
(Q, Q0, E) is a finite game graph as we know it, and
r : E → Z
is a function assigning a reward to each edge. As usual the players build up a play π = v0v1v2 · · ·, where Player 0 tries to maximize
r0(π) := lim inf_{n→∞} 1/n · ∑_{i=0}^{n−1} r(vi, vi+1)
Player 1 tries to minimize
r1(π) := lim sup_{n→∞} 1/n · ∑_{i=0}^{n−1} r(vi, vi+1)
Strategies
Strategies for Player i are, as before, mappings σ : V∗Vi → V. For two strategies σ and τ of Player 0 and Player 1, respectively, and a starting vertex v we denote by πσ,τ,v the unique play starting in v and played according to σ and τ. The Player 0 value of the game from v is
val0(v) := supσ infτ r0(πσ,τ,v),
the Player 1 value of the game from v is
val1(v) := infτ supσ r1(πσ,τ,v),
where σ ranges over Player 0 strategies and τ over Player 1 strategies.
- Remark. val0(v) ≤ val1(v)
Determinacy of Mean Payoff Games
Theorem (Ehrenfeucht–Mycielski, Zwick–Paterson). For each finite mean payoff game there are positional strategies σ∗ and τ∗ for Player 0 and Player 1, respectively, such that for each vertex v
val0(v) = supσ infτ r0(πσ,τ,v) = infτ r0(πσ∗,τ,v) = supσ r1(πσ,τ∗,v) = infτ supσ r1(πσ,τ,v) = val1(v)
The decision problem “Given a finite mean payoff game and a vertex v, is val(v) > 0?” belongs to NP∩co-NP.
From Parity Games to Mean Payoff Games
- Theorem. For each parity game G one can construct a mean payoff game G′ over the same game graph such that for each vertex v: Player 0 has a winning strategy in G from v iff val(v) ≥ 0 in G′.
Construction: Let n be the number of vertices of G. Let (u, v) be an edge of G and p the color of u. Define
r(u, v) := n^p if p is even, −n^p if p is odd
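The construction is a one-liner; the point of the magnitude n^p is that one visit to a higher color outweighs up to n visits to any lower color. A sketch (the function name is an illustrative assumption):

```python
def reward(n, p):
    """Reward of an edge (u, v) where u has color p, in a parity game
    with n vertices: n^p if p is even, -n^p if p is odd."""
    return n ** p if p % 2 == 0 else -(n ** p)

print(reward(4, 2), reward(4, 3))   # 16 -64
# In a simple cycle (length <= n) the maximal color determines the sign
# of the reward sum: one color-3 edge beats three color-2 edges.
print(reward(4, 3) + 3 * reward(4, 2) < 0)   # True
```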
An Application: Request-Response Games
Request-Response Games
Over a game graph G = (V, E) introduce “request” sets Rqu1, . . . , Rquk ⊆ V and “response” sets Rsp1, . . . , Rspk ⊆ V.
RR-condition:
⋀_{i=1}^{k} ∀s (Rqui(s) → ∃t (s < t ∧ Rspi(t)))
Standard solution via a reduction to Büchi games.
Measuring Quality of Solution
Linear penalty model: for each moment of waiting (for each RR-condition) pay 1 unit.
Quadratic penalty model: for the i-th moment of waiting pay i units.
An activation of the i-th condition in a play ρ is a visit to Rqui such that all previous visits to Rqui are already matched by an Rspi-visit.
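The two penalty models for a single request-response pair can be sketched as follows (the bookkeeping convention — the activating visit itself costs nothing, every later step of an open request is a waiting moment — is an assumption for illustration):

```python
def penalties(play, rqu, rsp):
    """Total linear and quadratic penalties for one RR-condition:
    the i-th moment of waiting costs 1 (linear) resp. i (quadratic)."""
    lin = quad = 0
    waiting = 0                  # waiting moments so far (0 = no open request)
    for v in play:
        if waiting > 0 and v in rsp:
            waiting = 0          # request answered: stop paying
        elif waiting > 0:
            lin += 1             # one more moment of waiting
            quad += waiting      # the i-th moment costs i units
            waiting += 1
        if waiting == 0 and v in rqu:
            waiting = 1          # activation: an unmatched request
    return lin, quad

# Request at 'q', answered at 'r' after two waiting moments:
# linear penalty 1 + 1 = 2, quadratic penalty 1 + 2 = 3.
print(penalties(['q', 'a', 'b', 'r'], {'q'}, {'r'}))  # (2, 3)
```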
Values of Plays and Strategies
For both linear and quadratic penalty define:
wρ(n) = sum of penalties in ρ(0) . . . ρ(n) divided by the number of activations (“average penalty sum per activation”)
w(ρ) = lim sup_{n→∞} wρ(n)
Given a strategy σ for the controller and a strategy τ for the adversary:
ρ(σ, τ) := the play induced by σ and τ
w(σ) := supτ w(ρ(σ, τ))
Call σ optimal if there is no other strategy with smaller value.
On the Quadratic Penalty
For the linear penalty model, a finite-state optimal strategy does not exist in general. For the quadratic penalty model one can decide whether an RR-game is won by the controller, and in this case one can compute a finite-state optimal winning strategy (Horn, Th., Wallmeier, ATVA 2008). Proof ingredients: It suffices to consider strategies with value ≤ M (induced by the bounded waiting time of the standard solution). Conversely, for strategies with value ≤ M one can assume bounded waiting time. Then: reduction to mean payoff games.
Concluding Remarks
What to take home?
A Fascinating Field
Infinite two-person games are an intriguing subject: the field offers interesting automata-theoretic constructions, connections with set theory and logic, many open questions, even from the early papers, and promising perspectives regarding quantitative aspects. For example, here is an open question from Büchi–Landweber 1969: Understand the space of all winning strategies of a game, in order to be able to pick the “best” ones.
A Question of 1969
Summarizing
Church’s Problem is far from closed: Even for the (classical) infinite two-person games, we have not yet understood completely how to construct strategies that are “good” — and it is even less clear how to handle multiple optimization criteria. For the connection between games and logic, a central question is to better understand the relation between definability of games and strategies. In particular: Is there a compositional framework of strategy construction which reflects the structure of the (logical) specifications and works without the detour through automata theory (algorithmic theory of labelled graphs)?