Applying Search Based Probabilistic Inference Algorithms to - PowerPoint PPT Presentation

Applying Search Based Probabilistic Inference Algorithms to Probabilistic Conformant Planning: Preliminary Results Junkyu Lee * , Radu Marinescu ** and Rina Dechter * * University of California, Irvine ** IBM Research, Ireland ISAIM 2016

Overview  Probabilistic Conformant Planning  Agent, Example, Problem, and Task  Graphical Model and Probabilistic Inference  Probabilistic Conformant Planning as Marginal MAP Inference  AND/OR Search Algorithms for Marginal MAP Inference  Compiling Graphical Models from Planning Problems  Example Domain: Blocks World  Compiling Probabilistic PDDL into 2 stage DBN  Compiling Finite Domain Representation (SAS+) into 2 stage DBN  Experiment Results (Blocks World Domain) 2

Probabilistic Conformant Planning - Agent Stuart Russell and Peter Norvig. Artificial Intelligence: A Modern Approach (3rd Ed.)  No observation  Uncertain environment  Uncertain initial states: Probability distribution over possible states  Uncertain action effects: Probability distribution over possible effects  Find a sequence of actions that reach goal with desired criteria  given plan length, maximize the probability of reaching goal, etc 3

Probabilistic Conformant Planning - Example  Spacecraft Recovery*  Complex systems could fail  Observation is sometimes limited  Diagnosis yields plausible states with scores (probability)  Generate a fail-safe recovery plan that can be applied to all plausible states. *Fragment-based Conformant Planning, J. Kurien, P. Nayak, and D. Smith AIPS 2002 4

Probabilistic Conformant Planning - Problem and Task  Probabilistic Conformant Planning Problem P = <S, A, I, G, T> • S : a set of possible states • A : a set of actions • I : initial belief state (probability distribution over initial states) • G: a set of goal states T: Markovian state transition function (T: S x A x S  [0, 1]) •  Probabilistic Conformant Planning Task <P, L>: Maximize probability of reaching goal given fixed plan length L <P, θ >: A plan of arbitrary length reaching goal with a probability higher than θ 5

Graphical Models  A graphical model ( X , D , F )  X = {X 1 , … , X n } variables  D= {D 1 , … , D n } domains  F= {f 1 , … , f m } functions • Constraints, CPTs, CNFs, …  Operators  Combination (product)  Elimination (max/sum)  Tasks  Probability of Evidence (PR) All these tasks are NP-hard  Most Probable Explanation (MPE) Exploit problem structure (primal graph)  Marginal MAP (Maximum A Posteriori) 6

Conformant Planning as Marginal MAP  Finite Horizon Probabilistic Conformant Planning <S, A, I, G, T, L>  Random variables  State transition function  Joint probability distribution given a plan that satisfying the goal  Optimal Plan as MMAP 7

AND/OR Search Algorithm for MMAP AND/OR Search Graph AND/OR Branch and Bound Search [Kask, Dechter 2001] [Decther and Mateescue 2006] [Marinescue, Dechter 2005-2009] Graphical Model Breadth Rotate Search [Otten, Dechter 2011] Mini-bucket Elimination with Moment Matching [Dechter and Rish 1997, 2003] [Flerova, Ihler 2011] 8

Example Domain: Blocks World b2 b1 State: OnTable (b1) and On(b2, b1) and Clear(b2) and EmptyHand Table action: pick-up-from-block(b2, b1) b2 b1 State: OnTable (b1) and Clear(b1) and Holding(b2) Table action: put-down-to-table(b2) b2 b1 State: OnTable (b1) and OnTable(b2) and Clear(b1) and Clear (b2) Table and EmptyHand 9

Example Domain: Blocks World action: pick-up-from-table(b1) b1 b2 Table State: OnTable (b2) and Clear(b2) and Holding(b1) action: put-on-block(b1, b2) b1 b2 State: OnTable (b2) and On(b1, b2) and Clear(b1) and EmptyHand Table 10

Blocks World in PDDL (deterministic)  Predicates for describing states  Clear(?b block), OnTable(?b block),  On(?b1, ?b2 block), Holding(?b block), EmptyHand  Initial State  On(b2, b1) and OnTable(b1) and Clear(b2) and EmptyHand  Goal State  On(b1, b2) and OnTable(b2) and Clear(b1) and EmptyHand  Action Schema for describing actions  Pick-up-from-block (?b1 ?b2 - block)  Pick-up-from-Table (?b – block)  Put-on-block(?b1 ?b2 – block)  Put-down-to-table(?b – block) 11

Blocks World in PDDL (deterministic)  Action schema for describing deterministic state transitions  Pick-up-from-block(?b1, ?b2 - block) • Precondition: EmptyHand and Clear(?b1) and On(?b1, ?b2) • Effect: Holding(?b1) and Clear(?b2) and (Not EmptyHand) and (Not Clear(?b1)) and (Not On(?b1, ?b2))  Pick-up-from-table(?b - block) • Precondition: EmptyHand and Clear(?b) and OnTable(?b) • Effect: Holding(?b) and (Not EmptyHand) and (Not OnTable(?b)) and (Not Clear(?b)) 12

Compiling Graphical Models from Planning Domains IPC-1998, 2000 IPC- 2004 McDermott et al 1998 Younes and Littman 2004 Two Encoding Schemes Planning Probabilistic Domain Planning Definition Domain Language Definition 2 Stage DBN Language -standard language for “classical planning problems” - influenced by STRIPS and ADL formalism Extension of PDDL 2.1 & to support “Probabilistic Actions” Replicate it over L finite horizon Finite [Helmert 2006, 2009] Domain Representation Finite (SAS+) Domain with Representation Probabilistic (SAS+) Effects - Multi-valued state variables - Simplified Action Structure+ (SAS+) (Backstrom 1995) 13

Blocks World in PPDDL (Probabilistic)  Action schema for describing probabilistic state transitions  Pick-up-from-block(?b1, ?b2 - block) Precondition: EmptyHand and Clear(?b1) and On(?b1, ?b2) • • Effect1: 0.75 Holding(?b1) and Clear(?b2) and (Not EmptyHand) and (Not Clear(?b1)) and (Not On(?b1, ?b2)) • Effect2: 0.25 Clear(?b2) and OnTable(?b1) and (Not (On(?b1, ?b2))  Pick-up-from-table(?b - block) • Precondition: EmptyHand and Clear(?b) and OnTable(?b) • Effect1: 0.75 Holding(?b) and (Not EmptyHand) and (Not OnTable(?b)) and (Not Clear(?b)) 14

Compiling PPDDL into 2 stage DBN  Convert each ground action schema into 2TDBN Pre-state variable effect variable post state variable (probabilistic) Clear b1 Clear b1 pickupfromtable b1 OnTable b1 OnTable b1 On b1 b2 On b1 b2 Holding b1 Holding b1 EmptyHand EmptyHand Clear b2 Clear b2 OnTable b2 OnTable b2 On b2 b1 On b2 b1 Holding b2 Holding b2 as shown in PPDDL 1.0 Specification 15

Compiling PPDDL into 2 stage DBN  Introduce additional variables to bound scope  Precondition, Add effect, Del effect, Action Action variable Pickupfromtable b1 Del state variable precondition variable Del Clear b1 precondition pickupfromtable b1 Clear b1 Clear b1 Add state variable Add Clear b1 Del OnT able b1 OnTable b1 OnTable b1 as serial encoding of SATPLAN Add OnT able b1 16

Compiling PPDDL into 2 stage DBN  Combine all 2TDBNs into Single 2TDBN  If scope size needs to be bounded, introduce hidden variables s1 hidden s2 precondition s3 s4 17

Compiling PPDDL into 2TDBN  Slippery Gripper Domain Example 18

Complexity of Translation from PPDDL  Input PPDDL parameters  Number of ground objects =  Number of action schemata = • Maximum number of object parameters = • Maximum number of probabilistic effects =  Number of predicates =  Number of Variables at each time  Number of action variables  Number of state variables  Number of effect variables  Number of Add/Del state variables  19

Compiling Graphical Models from Planning Domains IPC-1998, 2000 IPC- 2004 McDermott et al 1998 Younes and Littman 2004 Two Encoding Schemes Planning Probabilistic Domain Planning Definition Domain Language Definition 2 Stage DBN Language -standard language for “classical planning problems” - influenced by STRIPS and ADL formalism Extension of PDDL 2.1 & to support “Probabilistic Actions” Replicate it over L finite horizon Finite [Helmert 2006, 2009] Domain Representation Finite (SAS+) Domain with Representation Probabilistic (SAS+) Effects - Multi-valued state variables - Simplified Action Structure+ (SAS+) (Backstrom 1995) 20

Blocks World in FDR (SAS+)  Simplified Action Structure+ (Backstrom 1995)  Multi-valued state variables State variable is an aggregate of mutually exclusive ground predicates •  Operators (collection of changes of values in state variables) • Prevail condition: Value of a variable remains same • Pre-condition: Value of a variable before state transition • Post-condition: Value of a variable after state transition  Translate PDDL  FDR (Helmert 2009)  Generalize SAS+ with conditional effects and derived predicates  Automated translator from PDDL 2.2 to SAS+ 21

Blocks World in FDR (SAS+)  Multi-Valued State Variables  9 binary state variables clear b1, OnTable b1, On b1, b2, Holdig b1, Emptyhand, • clear b2, OnTable b2, Onb2 b1, Holding b2 translated as  5 multi-valued state variables • Var0 = {Clear(b1), Not Clear(b1)} • Var1 = {Clear(b2), Not Clear(b2)} • Var2 = {EmptyHand, Not EmptyHand} • Var3 = {Holding(b1), On(b1, b2), OnTable(b1)} • Var4 = {Holding(b2), On(b2, b1), OnTable(b2)} 22

Applying Search Based Probabilistic Inference Algorithms to - PowerPoint PPT Presentation

Applying Search Based Probabilistic Inference Algorithms to Probabilistic Conformant Planning: Preliminary Results Junkyu Lee * , Radu Marinescu ** and Rina Dechter * * University of California, Irvine ** IBM Research, Ireland ISAIM 2016

Search Algorithms 3 AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 1 3 Search Algorithms

Informed search algorithms Outline Best-first search Greedy best-first search A *

Probabilistic model Probabilistic model c Probabilistic model Probabilistic model c c

Probabilistic Graphical Models Probabilistic Graphical Models MAP inference Siamak Ravanbakhsh

15-780 Graduate Artificial Intelligence: Probabilistic inference J. Zico Kolter (this

On Computational and Probabilistic Inference Rajat Mani Thomas Objectives: Revisiting Bayesian

Search Problems and Algorithms T79.4201 Search Problems and Algorithms (4 ECTS) T-79.4201

Local search algorithms AIMA sections 4.1,4.2 Summary Local search algorithms Hill-climbing

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

CS 4110 Probabilistic Programming Probabilistic Programming It's not about writing software.

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

CS325 Artificial Intelligence Ch 14b Probabilistic Inference Cengiz Gnay Spring 2013

Inference in Bayesian networks Chapter 14.45 Chapter 14.45 1 Outline Exact inference

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Search 3 AI Slides (5e) c Lin Zuoquan@PKU 2003-2019 3 1 3 Search 3.1 Problem-solving

Introduction to other file types Importing Data in Python I Other file types Excel

ZFONE ZRTP Philip Zimmermanns new secure VOIP Hello(As Zid, Hash, Enc, SAS.)

SAS + -Planung Vorlesung Handlungsplanung Tilman Mehler 1 Uberblick Wir haben u.A.

S IMULATIONS (SAS) Lars Davidson, www.tfd.chalmers.se/lada L ARGE E DDY S IMULATIONS SGS SGS

Tensor Product State (TPS) and Projected Entangled Pair State (PEPS), these terms are quite

Invariance of Conjunctions of Polynomial Equalities for Algebraic Differential Equations Khalil

CLIC MODULE Nominal Operation Mode Temperature Results Daskalaki Elena Vamvakas Alex Xydou

Empirical Evaluation of Search Algorithms for Satisficing Planning Patrick von Reth Department