landmarks revisited
play

Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias - PowerPoint PPT Presentation

Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias Westphal 2 1 Griffith University & NICTA, Australia 2


  1. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias Westphal 2 1 Griffith University & NICTA, Australia 2 Albert-Ludwigs-Universit¨ at Freiburg, Germany AAAI 2008

  2. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  3. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  4. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation SAS + planning task SAS + planning task: Π = �V , A , s 0 , s ⋆ � V : state variables with finite domain D v Fact: variable-value pair v �→ d ( v ∈ V , d ∈ D v ) State: variable assignment for all v ∈ V A : actions � pre , eff � , with pre , eff fact sets Action a = � pre , eff � applicable in state s if pre ⊆ s Applying a in s updates s s 0 : initial state s ⋆ : partial variable assignment called the goal Sequence of actions π a plan iff s ⋆ ⊆ s 0 [ π ].

  5. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  6. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  7. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  8. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  9. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  10. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E } o-at-E

  11. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task cont’d A p B C E o D t load-o-t-B : � Pre = { v o �→ B , v t �→ B } , Eff = { v o �→ t }�

  12. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Preferred Operators t-at-D o-at-B ... drive-t-D-B t-at-B ... o-at-B ... load-o t-at-B o-in-t ... Improvement of heuristic search approaches (Helmert 2006) Idea: prefer actions that are likely to improve heuristic value E. g. those which are part of plan for simplified problem

  13. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  14. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work Facts that must be true in every plan (Porteous et al. 2001 & 2002; Hoffmann et al. 2004) Intuitively helpful to direct seach Automatically found, incl. orderings o-at-B t-at-B o-in-t A p t-at-C B C E p-at-C o-at-C o D o-in-p t o-at-E

  15. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work cont’d o-at-B t-at-B Find landmarks by backchaining o-in-t Every goal is a landmark If B is landmark and all actions that first t-at-C achieve B have A as precondition, then p-at-C o-at-C A is a landmark Approximation with RPGs: consider o-in-p all achievers “possibly before” B o-at-E (Porteous et al. 2002) Disjunctive landmarks also possible: (o-in-p 1 ∨ o-in-p 2 )

  16. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work cont’d o-at-B t-at-B Use as subgoals, then simply concatenate plans of subtasks o-in-t (“LM-local”) t-at-C Greatly speeds up search p-at-C o-at-C in many domains But: bad-quality plans, incomplete o-in-p (dead ends) o-at-E Any base planner possible for subtasks

  17. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  18. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmark Heuristic + Preferred Operators Novel usage of landmarks Pseudo-Heuristic = #landmarks that still need to be achieved Take orderings into account (see paper for details) Preferred operators = landmark-achieving operators or operators in relaxed plan to nearest landmark Combination with other heuristics through multi-heuristic BFS (Helmert 2006) Experiments with several heuristics (FF, CG, blind) on all tasks from past planning competitions

  19. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: %Tasks solved (Average) Algorithm Base Heuristic base LM-local LM-heur FF heuristic 87 82 88 CG heuristic 74 66 87 blind heuristic 25 52 84 Note: updated results for LM-local With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

  20. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: %Tasks solved (Average) Algorithm Base Heuristic base LM-local LM-heur FF heuristic 82 87 88 CG heuristic 74 66 87 blind heuristic 25 52 84 With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

  21. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: #Tasks solved exclusively (FF heuristic) FF heuristic Domain base LM-heur Airport (50) 2 6 Depot (22) 0 2 Freecell (80) 1 3 Logistics-1998 (35) 0 2 Miconic-FullADL (150) 2 0 MPrime (35) 0 3 Mystery (30) 0 1 Pathways (30) 1 2 Philosophers (48) 0 2 Pipesworld-NoTankage (50) 0 2 Pipesworld-Tankage (50) 1 5 Schedule (150) 0 1 Storage (30) 1 0 Total 12 25 LM-heur solves twice as many tasks exclusively as base

  22. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  23. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Extended Landmark Generation Adapted previous procedures to SAS + planning Admit disjunctive landmarks Find additional landmarks through DTGs

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend