Heuristic Search Planning With Multi-Objective Probabilistic LTL - PowerPoint PPT Presentation

Heuristic Search Planning With Multi-Objective Probabilistic LTL Constraints Peter Baumgartner, Sylvie Thiébaux, Felipe Trevizan Data61/CSIRO and Research School of Computer Science, ANU Australia

Planning Under Uncertainty Goal Actions: move le fu , move right, enter, get Eve, exit 2

Planning Under Uncertainty Goal action ⟹ stochastic environment response Actions: move le fu , move right, enter, get Eve, exit Environment: door possibly jams, … 2

Planning Under Uncertainty 0.9 0.5 Goal action ⟹ stochastic environment response Actions: move le fu , move right, enter, get Eve, exit Environment: door possibly jams, … 2

Planning Under Uncertainty 0.9 0.5 Goal action ⟹ stochastic environment response Actions: move le fu , move right, enter, get Eve, exit Environment: door possibly jams, … Stochastic Shortest Path Problem (SSP) Problem : What action to take in what state to reach the goal with minimal costs? Solution : S tochastic policy : probability distribution on actions   “When at door 1 enter the room 3 out of 10 times,…”   2

Planning Under Uncertainty 0.9 0.5 Goal action ⟹ stochastic environment response Actions: move le fu , move right, enter, get Eve, exit Add constraints for better expressivity (C-SSP) Environment: door possibly jams, … - well-known: “fuel < 5” - here: PLTL Stochastic Shortest Path Problem (SSP) Problem : What action to take in what state to reach the goal with minimal costs? Solution : S tochastic policy : probability distribution on actions   “When at door 1 enter the room 3 out of 10 times,…”   2

Multi-Objective Probabilistic LTL (MO-PLTL) ψ := ⊤ | A | ψ ∧ ψ | ψ ∨ ψ | ¬ ψ 0 0 | X ψ | ψ U ψ | F ψ | G ψ (LTL) 0 0 ϕ := P >z ψ | P ≥ z ψ (PLTL) Eve stays in a room until Eve and Wall-E are together eve_in_a_room U together ( ψ 1 ) Once together, eventually together forever G (together ⇒ F G together) ( ψ 2 ) Wall-E never visits room1 twice G (wall-E_room1 ⇒ (wall-E_room1 U G ¬wall-E_room1) ( ψ 3 ) Additional Multi-Objective PLTL Constraint ϕ = P ≥ 0.8 ψ 1 ∧ P ≥ 1.0 ψ 2 ∧ P ≥ 0.5 ψ 3 (MO-PLTL) Task : compute a cost-minimal stochastic policy for reaching the goal (with probability 1)   such that ϕ is satisfied 3

Multi-Objective Probabilistic LTL (MO-PLTL) ψ := ⊤ | A | ψ ∧ ψ | ψ ∨ ψ | ¬ ψ 0 0 | X ψ | ψ U ψ | F ψ | G ψ (LTL) 0 0 ϕ := P >z ψ | P ≥ z ψ (PLTL) Eve stays in a room until Eve and Wall-E are together eve_in_a_room U together ( ψ 1 ) Once together, eventually together forever G (together ⇒ F G together) ( ψ 2 ) Wall-E never visits room1 twice G (wall-E_room1 ⇒ (wall-E_room1 U G ¬wall-E_room1) ( ψ 3 ) Not as used in “optimisation” Additional Multi-Objective PLTL Constraint ϕ = P ≥ 0.8 ψ 1 ∧ P ≥ 1.0 ψ 2 ∧ P ≥ 0.5 ψ 3 (MO-PLTL) Task : compute a cost-minimal stochastic policy for reaching the goal (with probability 1)   such that ϕ is satisfied 3

  Solving MO-PLTL Methods Based on Probabilistic Verification • State of the art method, implemented in PRISM probabilistic model checker • Needs infinite runs   (1) add self-loop at Goal   (2) add Goal constraint : ϕ = P 1 ψ 1 ∧ ⋯ ∧ P k ψ k ∧ P ≥ 1 F Goal • Compute cross-product automaton   A = DRA( ψ 1 ) × ⋯ × DRA( ψ k ) × DRA( F Goal) × S (S is given state transition system, MDP). • Obtain policy for ϕ as a solution of a certain linear program obtained from A Complexity • |DRA( ψ )| is double exponential in | ψ | • | S | is usually huge for planning problems - cannot a ff ord to generate in full • Upfront DRA-computation/crossproduct is problematic even for small examples • The verification/synthesis problem is 2EXPTIME complete • Complicated algorithms (see also [deGiacomo&Vardi IJCAI2013, IJCAI2015]) 4

  Solving MO-PLTL Methods Based on Probabilistic Verification • State of the art method, implemented in PRISM probabilistic model checker • Needs infinite runs   DRA NBA ψ (1) add self-loop at Goal   (2) add Goal constraint : ϕ = P 1 ψ 1 ∧ ⋯ ∧ P k ψ k ∧ P ≥ 1 F Goal • Compute cross-product automaton   A = DRA( ψ 1 ) × ⋯ × DRA( ψ k ) × DRA( F Goal) × S (S is given state transition system, MDP). • Obtain policy for ϕ as a solution of a certain linear program obtained from A Complexity • |DRA( ψ )| is double exponential in | ψ | • | S | is usually huge for planning problems - cannot a ff ord to generate in full • Upfront DRA-computation/crossproduct is problematic even for small examples • The verification/synthesis problem is 2EXPTIME complete • Complicated algorithms (see also [deGiacomo&Vardi IJCAI2013, IJCAI2015]) 4

  Solving MO-PLTL Methods Based on Probabilistic Verification • State of the art method, implemented in PRISM probabilistic model checker • Needs infinite runs   DRA NBA ψ (1) add self-loop at Goal   (2) add Goal constraint : ϕ = P 1 ψ 1 ∧ ⋯ ∧ P k ψ k ∧ P ≥ 1 F Goal • Compute cross-product automaton   A = DRA( ψ 1 ) × ⋯ × DRA( ψ k ) × DRA( F Goal) × S (S is given state transition system, MDP). • Obtain policy for ϕ as a solution of a certain linear program obtained from A Complexity • |DRA( ψ )| is double exponential in | ψ | • | S | is usually huge for planning problems - cannot a ff ord to generate in full • Upfront DRA-computation/crossproduct is problematic even for small examples • The verification/synthesis problem is 2EXPTIME complete • Complicated algorithms (see also [deGiacomo&Vardi IJCAI2013, IJCAI2015]) We have a specific problem - all BSCCs are self-loops at goals - and can do better 4

Contributions Verification Based Our Method General Yes No (Requires Goal) (1) Formula progression, Tseitin Approach Automata (DRA) (2) NBA State Space Upfront On-the-fly Complexity Double exponential in ϕ Single exponential in ϕ for (1) Heuristics No Yes (i 2 Dual) Baier&McIlraith ICAPS 2006: non-stochastic planning w/ LTL, heuristics, NFA, by compilation 5

Contributions Verification Based Our Method General Yes No (Requires Goal) (1) Formula progression, Tseitin Approach Automata (DRA) (2) NBA State Space Upfront On-the-fly Complexity Double exponential in ϕ Single exponential in ϕ for (1) Heuristics No Yes (i 2 Dual) Baier&McIlraith ICAPS 2006: non-stochastic planning w/ LTL, heuristics, NFA, by compilation Rest of this talk : approach, complexity, heuristics, experiments 5

How to Check a Policy π for Satisfying a PLTL Formula s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A 6

How to Check a Policy π for Satisfying a PLTL Formula s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A i ff Pr{p | p is a path from s 0 and p ⊨ F A} > 0.6 6

How to Check a Policy π for Satisfying a PLTL Formula s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A i ff Pr{p | p is a path from s 0 and p ⊨ F A} > 0.6 Non-probabilistic LTL Ignore finiteness of paths on this slide 6

How to Check a Policy π for Satisfying a PLTL Formula s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A i ff Pr{p | p is a path from s 0 and p ⊨ F A} > 0.6 i ff Non-probabilistic LTL Pr{ s 0 s a , s 0 s c } > 0.6 Ignore finiteness of paths on this slide 6

How to Check a Policy π for Satisfying a PLTL Formula s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A i ff Pr{p | p is a path from s 0 and p ⊨ F A} > 0.6 i ff Non-probabilistic LTL Pr{ s 0 s a , s 0 s c } > 0.6 Ignore finiteness of paths on this slide i ff 0.6 ⋅ 0.6 + 0.4 ⋅ 0.7 = 0.64 > 0.6 6

How to Check a Policy π for Satisfying a PLTL Formula Synthesize s 0 Given policy π = β α s 0 : [ α → 0.6, β → 0.4 ] 0.6 0.7 0.3 0.4 It follows s 0 ⊨ P >0.6 F A s a s b s c s d Proof {A} {A} The probability of all paths from s 0 satisfying F A is > 0.6 s 0 ⊨ P >0.6 F A i ff Pr{p | p is a path from s 0 and p ⊨ F A} > 0.6 i ff Non-probabilistic LTL Pr{ s 0 s a , s 0 s c } > 0.6 Ignore finiteness of paths on this slide i ff 0.6 ⋅ 0.6 + 0.4 ⋅ 0.7 = 0.64 > 0.6 6

Heuristic Search Planning With Multi-Objective Probabilistic LTL - PowerPoint PPT Presentation

Heuristic Search Planning With Multi-Objective Probabilistic LTL Constraints Peter Baumgartner, Sylvie Thibaux, Felipe Trevizan Data61/CSIRO and Research School of Computer Science, ANU Australia Planning Under Uncertainty Goal Actions: move

Heuristic Search Lucia Moura Winter 2018 Heuristic Search Lucia Moura Heuristic Search Intro

Heuristic Search Heuristic Search Best-First A * Heuristic Functions Some material

Heuristic Search CPSC 322 Lecture 6 September 17, 2007 Textbook 3.5 Heuristic Search CPSC

Heuristic search Weighted A Kustaa Kangas October 17, 2013 K. Kangas () Heuristic search

Search Overview Introduction to Search Blind Search Techniques Heuristic Search

Heuristic Search: A* and beyond Heuristic Search: A* and beyond Course: CS40002 Course: CS40002

Heuristic (Informed) search strategy Search Algorithm #2 Search SEARCH#2 1.

TDDC17 Separation property of graph search Systematic Search through the state space Seminar 3

An Empirical Study Meta and Hyper Heuristic Search for Multi-Objective Release Planning Yuanyuan

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Heuristic Search for Planning Sheila McIlraith University of Toronto Fall 2010 S. McIlraith

Heuristic Search for Planning Sheila McIlraith University of Toronto Fall 2010 S. McIlraith

Today CSE 473: Artificial Intelligence Autumn 2018 A* Search Heuristic Search and A*

Learning Goals 1 5 A* Search with the Misplaced Tile Heuristic 5 3 A* Search with the

ECE 4524 Artificial Intelligence and Engineering Applications Lecture 4: Heuristic Search

Today CSE 473: Artificial Intelligence Spring 2018 A* Search Heuristic Search and A*

Limit-Deterministic Bchi Automata for Probabilistic Model Checking Jan Ketnsk Javier

Complexity of domain-independent planning Jos Luis Ambite 1 Decidability Decision problem: a

Rendering: Monte Carlo Integration I Bernhard Kerbl Research Division of Computer Graphics

Doubly Truncated Generalized Entropy Mohammadreza Nourbakhsh, Gholamhossein Yari School of

The doubly-exponential problem in equation/inequality solving James Davenport 1 University of

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Qingyuan Zhao

Specifications & motivation UPPER LIMIT lower much better ! Nr of bits to code a hit :

New Bayesian features: Predictions, multiple chains, and more Yulia Marchenko StataCorp LLC

Heuristic Search Planning With Multi-Objective Probabilistic LTL - PowerPoint PPT Presentation

Heuristic Search Planning With Multi-Objective Probabilistic LTL Constraints Peter Baumgartner, Sylvie Thibaux, Felipe Trevizan Data61/CSIRO and Research School of Computer Science, ANU Australia Planning Under Uncertainty Goal Actions: move

Heuristic Search Lucia Moura Winter 2018 Heuristic Search Lucia Moura Heuristic Search Intro

Heuristic Search Heuristic Search Best-First A * Heuristic Functions Some material

Heuristic Search CPSC 322 Lecture 6 September 17, 2007 Textbook 3.5 Heuristic Search CPSC

Heuristic search Weighted A Kustaa Kangas October 17, 2013 K. Kangas () Heuristic search

Search Overview Introduction to Search Blind Search Techniques Heuristic Search

Heuristic Search: A* and beyond Heuristic Search: A* and beyond Course: CS40002 Course: CS40002

Heuristic (Informed) search strategy Search Algorithm #2 Search SEARCH#2 1.

TDDC17 Separation property of graph search Systematic Search through the state space Seminar 3

An Empirical Study Meta and Hyper Heuristic Search for Multi-Objective Release Planning Yuanyuan

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Heuristic Search for Planning Sheila McIlraith University of Toronto Fall 2010 S. McIlraith

Heuristic Search for Planning Sheila McIlraith University of Toronto Fall 2010 S. McIlraith

Today CSE 473: Artificial Intelligence Autumn 2018 A* Search Heuristic Search and A*

Learning Goals 1 5 A* Search with the Misplaced Tile Heuristic 5 3 A* Search with the

ECE 4524 Artificial Intelligence and Engineering Applications Lecture 4: Heuristic Search

Today CSE 473: Artificial Intelligence Spring 2018 A* Search Heuristic Search and A*

Limit-Deterministic Bchi Automata for Probabilistic Model Checking Jan Ketnsk Javier

Complexity of domain-independent planning Jos Luis Ambite 1 Decidability Decision problem: a

Rendering: Monte Carlo Integration I Bernhard Kerbl Research Division of Computer Graphics

Doubly Truncated Generalized Entropy Mohammadreza Nourbakhsh, Gholamhossein Yari School of

The doubly-exponential problem in equation/inequality solving James Davenport 1 University of

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Qingyuan Zhao

Specifications &amp; motivation UPPER LIMIT lower much better ! Nr of bits to code a hit :

New Bayesian features: Predictions, multiple chains, and more Yulia Marchenko StataCorp LLC

Specifications & motivation UPPER LIMIT lower much better ! Nr of bits to code a hit :