percentile queries in multi dimensional markov decision
play

Percentile Queries in Multi-Dimensional Markov Decision Processes - PowerPoint PPT Presentation

Percentile Queries in Multi-Dimensional Markov Decision Processes Mickael Randour 1 cois Raskin 2 Ocan Sankur 2 Jean-Fran 1 LSV - CNRS & ENS Cachan, France 2 ULB, Belgium September 16, 2015 - Highlights 2015, Prague 3rd Highlights of


  1. Percentile Queries in Multi-Dimensional Markov Decision Processes Mickael Randour 1 cois Raskin 2 Ocan Sankur 2 Jean-Fran¸ 1 LSV - CNRS & ENS Cachan, France 2 ULB, Belgium September 16, 2015 - Highlights 2015, Prague 3rd Highlights of Logic, Games and Automata

  2. Illustration: stochastic shortest path Multi-constraint percentile queries In a nutshell Strategy synthesis for Markov Decision Processes (MDPs) Finding good controllers for systems interacting with a stochastic environment. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 1 / 8

  3. Illustration: stochastic shortest path Multi-constraint percentile queries In a nutshell Strategy synthesis for Markov Decision Processes (MDPs) Finding good controllers for systems interacting with a stochastic environment. Good? Performance evaluated through payoff functions . Usual problem is to optimize the expected performance or the probability of achieving a given performance level . Not sufficient for many practical applications. � Reason about trade-offs and interplays . � Several extensions, more expressive but also more complex. . . Multi-Constraint Percentile Queries Randour, Raskin, Sankur 1 / 8

  4. Illustration: stochastic shortest path Multi-constraint percentile queries In a nutshell Strategy synthesis for Markov Decision Processes (MDPs) Finding good controllers for systems interacting with a stochastic environment. Good? Performance evaluated through payoff functions . Usual problem is to optimize the expected performance or the probability of achieving a given performance level . Not sufficient for many practical applications. � Reason about trade-offs and interplays . � Several extensions, more expressive but also more complex. . . Aim of this talk Multi-constraint percentile queries : generalizes the problem to multiple dimensions, multiple constraints. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 1 / 8

  5. Illustration: stochastic shortest path Multi-constraint percentile queries Advertisement Full paper available on arXiv [RRS14]: abs/1410.4801 Featured in CAV’15 [RRS15a] Multi-Constraint Percentile Queries Randour, Raskin, Sankur 2 / 8

  6. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck Two-dimensional weights on actions: time and cost . Payoff: sum of weights up to work. Often necessary to consider trade-offs : e.g., between the probability to reach work in due time and the risks of an expensive journey. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  7. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck Classical problem considers only a single percentile constraint . Single-constraint percentile problem Given MDP M , initial state s init , one-dimension payoff function f , value threshold v ∈ Q , and probability threshold α ∈ [0 , 1] ∩ Q , � � decide if there exists a strategy σ such that P σ f ≥ v ≥ α . M , s init Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  8. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck Classical problem considers only a single percentile constraint . C1 : 80% of runs reach work in at most 40 minutes. � Taxi � ≤ 10 minutes with probability 0 . 99 > 0 . 8. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  9. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck Classical problem considers only a single percentile constraint . C1 : 80% of runs reach work in at most 40 minutes. � Taxi � ≤ 10 minutes with probability 0 . 99 > 0 . 8. C2 : 50% of them cost at most 10$ to reach work. � Bus � ≥ 70% of the runs reach work for 3$. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  10. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck Classical problem considers only a single percentile constraint . C1 : 80% of runs reach work in at most 40 minutes. � Taxi � ≤ 10 minutes with probability 0 . 99 > 0 . 8. C2 : 50% of them cost at most 10$ to reach work. � Bus � ≥ 70% of the runs reach work for 3$. Taxi �| = C2, bus �| = C1. What if we want C1 ∧ C2? Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  11. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck C1 : 80% of runs reach work in at most 40 minutes. C2 : 50% of them cost at most 10$ to reach work. Study of multi-constraint percentile queries . � Sample strategy: bus once, then taxi. Requires memory . � Another strategy: bus with probability 3 / 5, taxi with probability 2 / 5. Requires randomness . Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  12. Illustration: stochastic shortest path Multi-constraint percentile queries Illustration: stochastic shortest path problem home 0 . 3 bus, 30, 3 taxi, 10, 20 0 . 7 0 . 01 0 . 99 car work wreck C1 : 80% of runs reach work in at most 40 minutes. C2 : 50% of them cost at most 10$ to reach work. Study of multi-constraint percentile queries . In general, both memory and randomness are required. � = classical problems (single constraint, expected value, etc) Multi-Constraint Percentile Queries Randour, Raskin, Sankur 3 / 8

  13. Illustration: stochastic shortest path Multi-constraint percentile queries Multi-constraint percentile problem Multi-constraint percentile problem Given d -dimensional MDP M , initial state s init , payoff function f , and q ∈ N percentile constraints described by dimensions l i ∈ { 1 , . . . , d } , value thresholds v i ∈ Q and probability thresholds α i ∈ [0 , 1] ∩ Q , where i ∈ { 1 , . . . , q } , decide if there exists a strategy σ such that query Q holds, with q � Q := � � P σ f l i ≥ v i ≥ α i . M , s init i =1 Very general framework allowing for: multiple constraints related to � = or = dimensions, � = value and probability thresholds. � For SP, even � = targets for each constraint. � Great flexibility in modeling applications. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 4 / 8

  14. Illustration: stochastic shortest path Multi-constraint percentile queries Results overview (1/2) Wide range of payoff functions � multiple reachability, � inf, sup, lim inf, lim sup, � mean-payoff (MP, MP), � shortest path (SP), � discounted sum (DS). Multi-Constraint Percentile Queries Randour, Raskin, Sankur 5 / 8

  15. Illustration: stochastic shortest path Multi-constraint percentile queries Results overview (1/2) Wide range of payoff functions � multiple reachability, � inf, sup, lim inf, lim sup, � mean-payoff (MP, MP), � shortest path (SP), � discounted sum (DS). Several variants : � multi-dim. multi-constraint, � single-dim. multi-constraint, � single-constraint. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 5 / 8

  16. Illustration: stochastic shortest path Multi-constraint percentile queries Results overview (1/2) Wide range of payoff functions � multiple reachability, � inf, sup, lim inf, lim sup, � mean-payoff (MP, MP), � shortest path (SP), � discounted sum (DS). Several variants : � multi-dim. multi-constraint, � single-dim. multi-constraint, � single-constraint. For each one: � algorithms, � lower bounds, � memory requirements. � Complete picture for this new framework. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 5 / 8

  17. Illustration: stochastic shortest path Multi-constraint percentile queries Results overview (2/2) Single-dim. Multi-dim. Single-constraint Multi-constraint Multi-constraint Reachability P [Put94] P( M ) · E( Q ) [EKVY08], PSPACE-h — P( M ) · E( Q ) f ∈ F P [CH09] P PSPACE-h. MP P [Put94] P P MP P [Put94] P( M ) · E( Q ) P( M ) · E( Q ) P( M ) · P ps ( Q ) [HK15] P( M ) · P ps ( Q ) (one target) P( M ) · E( Q ) SP PSPACE-h. [HK15] PSPACE-h. [HK15] PSPACE-h. [HK15] P ps ( M , Q , ε ) P ps ( M , ε ) · E( Q ) P ps ( M , ε ) · E( Q ) ε -gap DS NP-h. NP-h. PSPACE-h. � F = { inf , sup , lim inf , lim sup } � M = model size, Q = query size � P( x ), E( x ) and P ps ( x ) resp. denote polynomial, exponential and pseudo-polynomial time in parameter x . All results without reference are new. Multi-Constraint Percentile Queries Randour, Raskin, Sankur 6 / 8

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend