Stochastic Games Reachability objectives The value (in Formal - PowerPoint PPT Presentation

Stochastic games Antonín Kuˇ cera Preliminaries Games Strategies, plays Objectives Stochastic Games Reachability objectives The value (in Formal Verification) Min strategies Max strategies Determinacy Finite-state games BPA games Branching-time objectives Basic properties Antonín Kuˇ cera Deciding the winner Games with time Masaryk University Brno SFM-10:QAPL 2010 1/56

Stochastic Game theory games Antonín Kuˇ cera Preliminaries Games Game theory studies the behavior of rational “players” who can Strategies, plays Objectives make choice and attempt to achieve a certain objective. A Reachability player’s success depends on the choices of the other players. objectives The value Min strategies stochastic games: Max strategies Determinacy Finite-state games the impact of players’ choices in uncertain; BPA games Branching-time the players’ choice can be randomized. objectives Basic properties Deciding the winner games in computer science: Games with time formal semantics; communication protocols; Internet auctions; . . . many other things. SFM-10:QAPL 2010 2/56

Stochastic Stochastic games in formal verification games Antonín Kuˇ cera Preliminaries Games Strategies, plays Objectives Reachability Our setting: objectives The value Min strategies state space: discrete Max strategies Determinacy players: controller, environment Finite-state games BPA games objectives: antagonistic Branching-time objectives Basic properties choice: turn-based, randomized Deciding the winner Games with information: perfect time Is there a strategy for the controller such that the system satisfies a certain property no matter what the environment does? SFM-10:QAPL 2010 3/56

Stochastic Outline games Antonín Kuˇ cera Preliminaries Games Strategies, plays Objectives Reachability objectives Preliminaries. The value Min strategies Games, strategies, objectives. Max strategies Determinacy Finite-state games Stochastic games with reachability objectives. BPA games Branching-time objectives The (non)existence of optimal strategies. Basic properties Deciding the winner Algorithms for finite-state games. Games with time Stochastic games with branching-time objectives. Stochastic games with time. SFM-10:QAPL 2010 4/56

Stochastic Markov chains games Antonín Kuˇ cera Preliminaries Definition 1 (Markov chain) Games Strategies, plays Objectives 1 Reachability 4 M = ( S , → , Prob ) 1 1 objectives s t 2 3 The value 1 S is at most countable set of states; Min strategies 3 Max strategies 1 1 Determinacy 4 3 → ⊆ S × S is a transition relation; Finite-state games BPA games u Prob is a probability assignment. Branching-time objectives 1 Basic properties Deciding the winner Games with time SFM-10:QAPL 2010 5/56

Stochastic Markov chains games Antonín Kuˇ cera Preliminaries Definition 1 (Markov chain) Games Strategies, plays Objectives 1 Reachability 4 M = ( S , → , Prob ) 1 1 objectives s t 2 3 The value 1 S is at most countable set of states; Min strategies 3 Max strategies 1 1 Determinacy 4 3 → ⊆ S × S is a transition relation; Finite-state games BPA games u Prob is a probability assignment. Branching-time objectives 1 Basic properties Deciding the winner Games with time We want to measure the probability of certain subsets of Run ( s ) . For every finite path w initiated in s , we define the probability of Run ( w ) in the natural way. This assignment can be uniquely extended to the (Borel) σ -algebra F generated by all Run ( w ) . Thus, we obtain the probability space ( Run ( s ) , F , P ) . SFM-10:QAPL 2010 5/56

Stochastic Turn-based stochastic games games Antonín Kuˇ cera Preliminaries Games Strategies, plays Objectives Reachability objectives Definition 2 (Turn-based stochastic game) The value Min strategies Max strategies G = ( V , E , ( V � , V � , V � ) , Prob ) Determinacy Finite-state games the set V is at most countable; BPA games 0 . 2 Branching-time each vertex has a successor; objectives 0 . 8 Basic properties Deciding the winner Prob is positive; Games with time G is a Markov decision process (MDP) if V � = ∅ or V � = ∅ . 0 . 4 0 . 6 SFM-10:QAPL 2010 6/56

Stochastic Strategies games Antonín Kuˇ cera Preliminaries Games Strategies, plays Definition 3 (Strategy) Objectives Let G = ( V , E , ( V � , V � , V � ) , Prob ) be a game. A strategy for Reachability objectives player � is a function σ which to every wv ∈ V ∗ V � assigns a The value Min strategies probability distribution over the set of outgoing edges of v. Max strategies Determinacy Finite-state games BPA games A strategy for player � is defined analogously. Branching-time objectives Basic properties We can classify strategies according to Deciding the winner Games with memory requirements: history-dependent (H), time finite-memory (F), memoryless (M) randomization: randomized (R), deterministic (D) Thus, we obtain the classes of MD, MR, FD, FR, HD, and HR strategies. SFM-10:QAPL 2010 7/56

Stochastic Plays games Antonín Kuˇ cera Preliminaries Games Strategies, plays Objectives Reachability objectives Definition 4 (Play) The value Min strategies Let G = ( V , E , ( V � , V � , V � ) , Prob ) be a game. Each pair ( σ, π ) of Max strategies Determinacy strategies for player � and player � determines a unique play Finite-state games G ( σ,π ) , which is a Markov chain where V + is the set of states and BPA games Branching-time transitions are defined accordingly. objectives Basic properties Deciding the winner Games with Plays are infinite trees. time For a pair of memoryless strategies ( σ, π ) , the play G ( σ,π ) can be depicted as a Markov chain with the set of states V . SFM-10:QAPL 2010 8/56

Stochastic Plays (2) games Antonín Kuˇ cera Preliminaries Example 5 (A game and its play) Games Strategies, plays Objectives v u 1 Reachability objectives The value Min strategies Max strategies Determinacy Finite-state games BPA games Branching-time objectives Basic properties Deciding the winner Games with time SFM-10:QAPL 2010 9/56

Stochastic Plays (2) games Antonín Kuˇ cera Preliminaries Example 5 (A game and its play) Games Strategies, plays Objectives v u 1 Reachability objectives The value Min strategies Is there a strategy σ such that v | = G > 0 ( v ) in G σ ? Max strategies Determinacy Finite-state games BPA games Branching-time objectives Basic properties Deciding the winner Games with time SFM-10:QAPL 2010 9/56

Stochastic Plays (2) games Antonín Kuˇ cera Preliminaries Example 5 (A game and its play) Games Strategies, plays Objectives v u 1 Reachability objectives The value Min strategies Is there a strategy σ such that v | = G > 0 ( v ) in G σ ? Max strategies Determinacy Finite-state games BPA games Is there a strategy σ such that v | = G > 0 ( v ∧ F > 0 u ) in G σ ? Branching-time objectives Basic properties Deciding the winner Games with time SFM-10:QAPL 2010 9/56

Stochastic Plays (2) games Antonín Kuˇ cera Preliminaries Example 5 (A game and its play) Games Strategies, plays Objectives v u 1 Reachability objectives The value Min strategies Is there a strategy σ such that v | = G > 0 ( v ) in G σ ? Max strategies Determinacy Finite-state games BPA games Is there a strategy σ such that v | = G > 0 ( v ∧ F > 0 u ) in G σ ? Branching-time objectives Obviously, there is no such MR (or even FR) strategy. Basic properties Deciding the winner Games with time SFM-10:QAPL 2010 9/56

Stochastic Plays (2) games Antonín Kuˇ cera Preliminaries Example 5 (A game and its play) Games Strategies, plays Objectives v u 1 Reachability objectives The value Min strategies Is there a strategy σ such that v | = G > 0 ( v ) in G σ ? Max strategies Determinacy Finite-state games BPA games Is there a strategy σ such that v | = G > 0 ( v ∧ F > 0 u ) in G σ ? Branching-time objectives Obviously, there is no such MR (or even FR) strategy. Basic properties Deciding the winner 1 / 2 | wv | 1 − 1 / 2 | wv | Games with Let σ ( wv ) = v − − − − → u , v − − − − − − → v time 1 / 2 3 / 4 7 / 8 15 / 16 v vv vvv vvvv 1 / 2 1 / 4 1 / 8 1 / 16 vu vvu vvvu vvvvu 1 1 1 1 SFM-10:QAPL 2010 9/56

Stochastic A taxonomy of objectives games Antonín Kuˇ cera Preliminaries Each play of a game G is assigned a (numerical) yield. The Games Strategies, plays goal of player � / � is to maximize/minimize the yield. Objectives Reachability objectives Win-lose objectives assign either 1 or 0 to each play. The value Min strategies P � ̺ ϕ , where ϕ is an LTL formula. Max strategies Determinacy Finite-state games PCTL or PCTL* objectives. BPA games Branching-time objectives Objectives specified by Borel measurable payoffs. Basic properties Deciding the winner yield ( G σ,π ) = E ( f σ,π ) , where f : Run ( G ) → R is measurable. Games with time Qualitative payoffs assign either 1 or 0 to each run Büchi, parity, Rabin, Street, Muller, etc. Quantitative payoffs � n i = 0 rew ( w ( i )) Mean payoff: MP ( w ) = lim n →∞ n Discounted payoff: DP ( w ) = � ∞ i = 0 λ i · rew ( w ( i )) SFM-10:QAPL 2010 10/56

Stochastic Games Reachability objectives The value (in Formal - PowerPoint PPT Presentation

Stochastic games Antonn Ku cera Preliminaries Games Strategies, plays Objectives Stochastic Games Reachability objectives The value (in Formal Verification) Min strategies Max strategies Determinacy Finite-state games BPA games

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

Empirical-evidence Equilibria in Stochastic Games Nicolas Dudebout Outline 2 Stochastic

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Multigrid methods for two player zero-sum stochastic games Sylvie Detournay INRIA Saclay and

Strategy recovery for stochastic mean payoff games Marcello Mamino TU Dresden GRASTA 15,

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

CS440/ECE448 Lecture 12: Stochastic Games, Stochastic Search, and Learned Evaluation Functions

Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6th, 2006 CS286r

Dynamic Games in Environmental Economics PhD minicourse Part II: Stochastic Games and Contracts

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Cheap Talk Games with three types Felix Munoz-Garcia Strategy and Game Theory - Washington State

Personalised Care James Sanderson May 2019 www.england.nhs.uk Comprehensive Model for

STAT 213 Multiple Comparisons and the Family-Wise Error Rate Colin Reimer Dawson Oberlin

Extending the Palladio Component Model using Profiles and Stereotypes Palladio Days 2012,

Dialogue categories and Frobenius monoids Paul-Andr Mellis CNRS & Universit Paris

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of

Chapter 4, Requirements Elicitation, examples Example: Selection of Software Lifecycle Activities

Verification Games Verifying Fire Hydrants 10 mins http://pygy.co/bQn

Sambuz

Useful Links

Newsletter

Mail Us

Stochastic Games Reachability objectives The value (in Formal - PowerPoint PPT Presentation

Stochastic games Antonn Ku cera Preliminaries Games Strategies, plays Objectives Stochastic Games Reachability objectives The value (in Formal Verification) Min strategies Max strategies Determinacy Finite-state games BPA games

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

Empirical-evidence Equilibria in Stochastic Games Nicolas Dudebout Outline 2 Stochastic

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Multigrid methods for two player zero-sum stochastic games Sylvie Detournay INRIA Saclay and

Strategy recovery for stochastic mean payoff games Marcello Mamino TU Dresden GRASTA 15,

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

CS440/ECE448 Lecture 12: Stochastic Games, Stochastic Search, and Learned Evaluation Functions

Nash Q-Learning for General-Sum Stochastic Games Hu &amp; Wellman March 6th, 2006 CS286r

Dynamic Games in Environmental Economics PhD minicourse Part II: Stochastic Games and Contracts

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Cheap Talk Games with three types Felix Munoz-Garcia Strategy and Game Theory - Washington State

Personalised Care James Sanderson May 2019 www.england.nhs.uk Comprehensive Model for

STAT 213 Multiple Comparisons and the Family-Wise Error Rate Colin Reimer Dawson Oberlin

Extending the Palladio Component Model using Profiles and Stereotypes Palladio Days 2012,

Dialogue categories and Frobenius monoids Paul-Andr Mellis CNRS &amp; Universit Paris

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI &amp; University of

Chapter 4, Requirements Elicitation, examples Example: Selection of Software Lifecycle Activities

Verification Games Verifying Fire Hydrants 10 mins http://pygy.co/bQn

Sambuz

Useful Links

Newsletter

Mail Us

Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6th, 2006 CS286r

Dialogue categories and Frobenius monoids Paul-Andr Mellis CNRS & Universit Paris

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of