Expectations or Guarantees? I Want It All! A Crossroad between Games - PowerPoint PPT Presentation

Expectations or Guarantees? I Want It All! A Crossroad between Games and MDPs V. Bruy` ere (UMONS) E. Filiot (ULB) M. Randour (UMONS-ULB) J.-F. Raskin (ULB) Grenoble - 05.04.2014 SR 2014 - 2nd International Workshop on Strategic Reasoning

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion The talk in two slides (1/2) Verification and synthesis: � a reactive system to control , � an interacting environment , � a specification to enforce . Focus on quantitative properties . Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 1 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion The talk in two slides (1/2) Verification and synthesis: � a reactive system to control , � an interacting environment , � a specification to enforce . Focus on quantitative properties . Several ways to look at the interactions, and in particular, the nature of the environment . Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 1 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion The talk in two slides (2/2) Games MDPs → antagonistic adversary → stochastic adversary → guarantees on worst-case → optimize expected value Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 2 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion The talk in two slides (2/2) Games MDPs → antagonistic adversary → stochastic adversary → guarantees on worst-case → optimize expected value ∧ BWC synthesis → ensure both Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 2 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion The talk in two slides (2/2) Games MDPs → antagonistic adversary → stochastic adversary → guarantees on worst-case → optimize expected value ∧ BWC synthesis → ensure both Studied Mean-Payoff Shortest Path value functions Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 2 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Advertisement Featured in STACS’14 [BFRR14] Full paper available on arXiv: abs/1309.5439 Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 3 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion 1 Context 2 BWC Synthesis 3 Mean-Payoff 4 Shortest Path 5 Conclusion Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 4 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Quantitative games on graphs Graph G = ( S , E , w ) with w : E → Z Two-player game G = ( G , S 1 , S 2 ) � P 1 states = 2 2 � P 2 states = 5 Plays have values � f : Plays( G ) → R ∪ {−∞ , ∞} − 1 7 Players follow strategies − 4 � λ i : Prefs i ( G ) → D ( S ) � Finite memory ⇒ stochastic output Moore machine M ( λ i ) = (Mem , m 0 , α u , α n ) Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 6 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Quantitative games on graphs Graph G = ( S , E , w ) with w : E → Z Two-player game G = ( G , S 1 , S 2 ) � P 1 states = 2 2 � P 2 states = 5 Plays have values � f : Plays( G ) → R ∪ {−∞ , ∞} − 1 7 Players follow strategies − 4 � λ i : Prefs i ( G ) → D ( S ) � Finite memory ⇒ stochastic output Moore Then, (2 , 5 , 2) ω machine M ( λ i ) = (Mem , m 0 , α u , α n ) Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 6 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Markov decision processes MDP P = ( G , S 1 , S ∆ , ∆) with ∆: S ∆ → D ( S ) 2 2 � P 1 states = 5 � stochastic states = MDP = game + strategy of P 2 − 1 7 � P = G [ λ 2 ] − 4 1 2 1 2 Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 7 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Markov chains MC M = ( G , δ ) with δ : S → D ( S ) MC = MDP + strategy of P 1 = game + both strategies 2 2 � M = P [ λ 1 ] = G [ λ 1 , λ 2 ] 1 5 4 3 4 − 1 7 − 4 1 2 1 2 Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 8 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Markov chains MC M = ( G , δ ) with δ : S → D ( S ) MC = MDP + strategy of P 1 = game + both strategies 2 2 � M = P [ λ 1 ] = G [ λ 1 , λ 2 ] 1 5 4 Event A ⊆ Plays( G ) 3 4 � probability P M − 1 s init ( A ) 7 − 4 1 Measurable f : Plays( G ) → R ∪ {−∞ , ∞} 2 1 � expected value E M s init ( f ) 2 Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 8 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Classical interpretations System trying to ensure a specification = P 1 � whatever the actions of its environment Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 9 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Classical interpretations System trying to ensure a specification = P 1 � whatever the actions of its environment The environment can be seen as � antagonistic two-player game, worst-case threshold problem for µ ∈ Q ∃ ? λ 1 ∈ Λ 1 , ∀ λ 2 ∈ Λ 2 , ∀ π ∈ Outs G ( s init , λ 1 , λ 2 ) , f ( π ) ≥ µ Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 9 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion Classical interpretations System trying to ensure a specification = P 1 � whatever the actions of its environment The environment can be seen as � antagonistic two-player game, worst-case threshold problem for µ ∈ Q ∃ ? λ 1 ∈ Λ 1 , ∀ λ 2 ∈ Λ 2 , ∀ π ∈ Outs G ( s init , λ 1 , λ 2 ) , f ( π ) ≥ µ � fully stochastic MDP, expected value threshold problem for ν ∈ Q ∃ ? λ 1 ∈ Λ 1 , E P [ λ 1 ] s init ( f ) ≥ ν Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 9 / 26

Context BWC Synthesis Mean-Payoff Shortest Path Conclusion What if you want both? In practice, we want both 1 nice expected performance in the everyday situation, 2 strict (but relaxed) performance guarantees even in the event of very bad circumstances. Beyond Worst-Case Synthesis Bruy` ere, Filiot, Randour, Raskin 11 / 26

Expectations or Guarantees? I Want It All! A Crossroad between Games - PowerPoint PPT Presentation

Expectations or Guarantees? I Want It All! A Crossroad between Games and MDPs V. Bruy` ere (UMONS) E. Filiot (ULB) M. Randour (UMONS-ULB) J.-F. Raskin (ULB) Grenoble - 05.04.2014 SR 2014 - 2nd International Workshop on Strategic Reasoning

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

CULTURAL CULTURAL TOURISM TOURISM B&H territory represents a crossroad of East and West. It

EXPECTATIONS OF US/EU BUYER EXPECTATIONS OF US/EU BUYER EXPECTATIONS OF US/EU BUYER EXPECTATIONS

IRMA Initiative for Risk Mitigation in Africa PARTIAL RISK GUARANTEES AND INSURANCE PRODUCTS

Incremental Consistency Guarantees For Replicated Objects Rachid Guerraoui, Matej Pavlovic,

Intersections & Turnabouts Intersections Come in a Variety of Designs + A crossroad - Two

Africa. Lebanon has been the crossroad of many civilizations; the traces of which can still be

I-229 Exit 5 (26 th Street) Crossroad Corridor Study Public Open House #3 Jan 15 th , 2014 5:30

NEW ZEALAND WINE REGIONS Clos Henri at the crossroad of a fascina0ng geological

Artificial Intelligence and Security Whats at the crossroad? Our first policy considerations

Workshop on AGRO-RESIDUES AT THE CROSSROAD TOWARDS 2030 Brussels, European Parliament, 17 May

at a Critical Crossroad Stephen Tapp Ari Van Assche Robert Wolfe Research Director, Associate

My colleges Jordyn keuchler-Carey I want to major in being a lawyer. I want to be a lawyer

Understanding Business Expectations: Understanding Business Expectations: Understanding Business

MCAS 2.0 2016-2017 PARCC Achievement Levels Level 5 Exceeded Expectations Level 4 Met

Meet Your Expectations With Guarantees: Beyond Worst-Case Synthesis in Quantitative Games V.

Data Stream Classification using Random Feature Functions and Novel Method Combinations Jesse

Learning Theory CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2016

Planning and Optimization December 16, 2019 G8. Monte-Carlo Tree Search Algorithms (Part II)

Assume we are reading a stream of n distinct integers in { 1 , . . . , n + 1 } .

SAMOA: A Platform for Mining Big Data Streams Nicolas Kourtellis Associate Researcher

Last week 1. We introduced the L p spaces: f is A -measurable L p = f :

Last week 1. We talked about some Hilbert space facts you knew from before. 2. We looked at the

Quasiregularly Elliptic Manifolds and Cohomology Eden Prywes University of California, Los

Sambuz

Useful Links

Newsletter

Mail Us

Expectations or Guarantees? I Want It All! A Crossroad between Games - PowerPoint PPT Presentation

Expectations or Guarantees? I Want It All! A Crossroad between Games and MDPs V. Bruy` ere (UMONS) E. Filiot (ULB) M. Randour (UMONS-ULB) J.-F. Raskin (ULB) Grenoble - 05.04.2014 SR 2014 - 2nd International Workshop on Strategic Reasoning

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

CULTURAL CULTURAL TOURISM TOURISM B&amp;H territory represents a crossroad of East and West. It

EXPECTATIONS OF US/EU BUYER EXPECTATIONS OF US/EU BUYER EXPECTATIONS OF US/EU BUYER EXPECTATIONS

IRMA Initiative for Risk Mitigation in Africa PARTIAL RISK GUARANTEES AND INSURANCE PRODUCTS

Incremental Consistency Guarantees For Replicated Objects Rachid Guerraoui, Matej Pavlovic,

Intersections &amp; Turnabouts Intersections Come in a Variety of Designs + A crossroad - Two

Africa. Lebanon has been the crossroad of many civilizations; the traces of which can still be

I-229 Exit 5 (26 th Street) Crossroad Corridor Study Public Open House #3 Jan 15 th , 2014 5:30

NEW ZEALAND WINE REGIONS Clos Henri at the crossroad of a fascina0ng geological

Artificial Intelligence and Security Whats at the crossroad? Our first policy considerations

Workshop on AGRO-RESIDUES AT THE CROSSROAD TOWARDS 2030 Brussels, European Parliament, 17 May

at a Critical Crossroad Stephen Tapp Ari Van Assche Robert Wolfe Research Director, Associate

My colleges Jordyn keuchler-Carey I want to major in being a lawyer. I want to be a lawyer

Understanding Business Expectations: Understanding Business Expectations: Understanding Business

MCAS 2.0 2016-2017 PARCC Achievement Levels Level 5 Exceeded Expectations Level 4 Met

Meet Your Expectations With Guarantees: Beyond Worst-Case Synthesis in Quantitative Games V.

Data Stream Classification using Random Feature Functions and Novel Method Combinations Jesse

Learning Theory CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2016

Planning and Optimization December 16, 2019 G8. Monte-Carlo Tree Search Algorithms (Part II)

Assume we are reading a stream of n distinct integers in { 1 , . . . , n + 1 } .

SAMOA: A Platform for Mining Big Data Streams Nicolas Kourtellis Associate Researcher

Last week 1. We introduced the L p spaces: f is A -measurable L p = f :

Last week 1. We talked about some Hilbert space facts you knew from before. 2. We looked at the

Quasiregularly Elliptic Manifolds and Cohomology Eden Prywes University of California, Los

Sambuz

Useful Links

Newsletter

Mail Us

CULTURAL CULTURAL TOURISM TOURISM B&H territory represents a crossroad of East and West. It

Intersections & Turnabouts Intersections Come in a Variety of Designs + A crossroad - Two