Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias - - PowerPoint PPT Presentation

landmarks revisited
SMART_READER_LITE
LIVE PREVIEW

Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias - - PowerPoint PPT Presentation

Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias Westphal 2 1 Griffith University & NICTA, Australia 2


slide-1
SLIDE 1

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Landmarks Revisited

Silvia Richter1 Malte Helmert2 Matthias Westphal2

1Griffith University & NICTA, Australia 2Albert-Ludwigs-Universit¨

at Freiburg, Germany

AAAI 2008

slide-2
SLIDE 2

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Outline

1

Introduction to SAS+ Planning

2

Landmarks in Previous Work

3

Using Landmarks as Pseudo-Heuristic

4

Extended Landmark Generation

slide-3
SLIDE 3

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Outline

1

Introduction to SAS+ Planning

2

Landmarks in Previous Work

3

Using Landmarks as Pseudo-Heuristic

4

Extended Landmark Generation

slide-4
SLIDE 4

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

SAS+ planning task

SAS+ planning task: Π = V, A, s0, s⋆ V: state variables with finite domain Dv Fact: variable-value pair v → d (v ∈ V, d ∈ Dv) State: variable assignment for all v ∈ V A: actions pre, eff, with pre, eff fact sets

Action a = pre, eff applicable in state s if pre ⊆ s Applying a in s updates s

s0: initial state s⋆: partial variable assignment called the goal Sequence of actions π a plan iff s⋆ ⊆ s0[π].

slide-5
SLIDE 5

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

slide-6
SLIDE 6

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

slide-7
SLIDE 7

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

slide-8
SLIDE 8

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

slide-9
SLIDE 9

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

slide-10
SLIDE 10

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task

A B C D

  • t

E p V = {vo, vt, vp} Dvo = {A, B, C, D, E, t, p} Dvt = {A, B, C, D}, Dvp = {C, E} A = {drive-t-D-B, load-o-t-B, . . . } s0 = {vo → B, vt → D, vp → E} s⋆ = {vo → E}

  • -at-E
slide-11
SLIDE 11

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Encoding of example task cont’d

A B C D

  • t

E p load-o-t-B : Pre = {vo → B, vt → B}, Eff = {vo → t}

slide-12
SLIDE 12

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Preferred Operators

t-at-D

  • -at-B

... t-at-B

  • -at-B

... ... t-at-B

  • -in-t

... drive-t-D-B load-o Improvement of heuristic search approaches (Helmert 2006) Idea: prefer actions that are likely to improve heuristic value

  • E. g. those which are part of plan for simplified problem
slide-13
SLIDE 13

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Outline

1

Introduction to SAS+ Planning

2

Landmarks in Previous Work

3

Using Landmarks as Pseudo-Heuristic

4

Extended Landmark Generation

slide-14
SLIDE 14

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Landmarks in Previous Work

Facts that must be true in every plan (Porteous et al. 2001 & 2002; Hoffmann et al. 2004) Intuitively helpful to direct seach Automatically found, incl. orderings A B C D

  • t

E p

  • -at-B
  • -in-t
  • -at-E

t-at-B t-at-C

  • -at-C

p-at-C

  • -in-p
slide-15
SLIDE 15

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Landmarks in Previous Work cont’d

Find landmarks by backchaining Every goal is a landmark If B is landmark and all actions that first achieve B have A as precondition, then A is a landmark Approximation with RPGs: consider all achievers “possibly before” B (Porteous et al. 2002)

  • -at-B
  • -in-t
  • -at-E

t-at-B t-at-C

  • -at-C

p-at-C

  • -in-p

Disjunctive landmarks also possible: (o-in-p1 ∨ o-in-p2)

slide-16
SLIDE 16

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Landmarks in Previous Work cont’d

Use as subgoals, then simply concatenate plans of subtasks (“LM-local”) Greatly speeds up search in many domains But: bad-quality plans, incomplete (dead ends) Any base planner possible for subtasks

  • -at-B
  • -in-t
  • -at-E

t-at-B t-at-C

  • -at-C

p-at-C

  • -in-p
slide-17
SLIDE 17

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Outline

1

Introduction to SAS+ Planning

2

Landmarks in Previous Work

3

Using Landmarks as Pseudo-Heuristic

4

Extended Landmark Generation

slide-18
SLIDE 18

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Landmark Heuristic + Preferred Operators

Novel usage of landmarks Pseudo-Heuristic = #landmarks that still need to be achieved Take orderings into account (see paper for details) Preferred operators = landmark-achieving operators

  • r operators in relaxed plan to nearest landmark

Combination with other heuristics through multi-heuristic BFS (Helmert 2006) Experiments with several heuristics (FF, CG, blind) on all tasks from past planning competitions

slide-19
SLIDE 19

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Results: %Tasks solved (Average)

Algorithm Base Heuristic base LM-local LM-heur FF heuristic 87 82 88 CG heuristic 74 66 87 blind heuristic 25 52 84

Note: updated results for LM-local

With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

slide-20
SLIDE 20

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Results: %Tasks solved (Average)

Algorithm Base Heuristic base LM-local LM-heur FF heuristic 87 82 88 CG heuristic 74 66 87 blind heuristic 25 52 84

With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

slide-21
SLIDE 21

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Results: #Tasks solved exclusively (FF heuristic)

FF heuristic Domain base LM-heur Airport (50) 6 2 Depot (22) 2 Freecell (80) 1 3 Logistics-1998 (35) 2 Miconic-FullADL (150) 2 MPrime (35) 3 Mystery (30) 1 Pathways (30) 1 2 Philosophers (48) 2 Pipesworld-NoTankage (50) 2 Pipesworld-Tankage (50) 1 5 Schedule (150) 1 Storage (30) 1 Total 12 25

LM-heur solves twice as many tasks exclusively as base

slide-22
SLIDE 22

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Outline

1

Introduction to SAS+ Planning

2

Landmarks in Previous Work

3

Using Landmarks as Pseudo-Heuristic

4

Extended Landmark Generation

slide-23
SLIDE 23

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Extended Landmark Generation

Adapted previous procedures to SAS+ planning Admit disjunctive landmarks Find additional landmarks through DTGs

slide-24
SLIDE 24

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Domain Transition Graphs (DTGs)

The domain transition graph of v ∈ V (DTGv) represents how the value of v can change Given: a SAS+ task V, A, s0, s⋆ DTGv is a directed graph with nodes Dv that has arc d, d′ iff d = d′, and ∃ action with v → d′ as effect, and either

v → d as precondition, or no precondition on v

slide-25
SLIDE 25

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

DTG Example

A B C D

  • t

E p DTG for vo: t B A C D p E

slide-26
SLIDE 26

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

DTG Example

A B C D

  • t

E p DTG for vo: t B A C D p E

slide-27
SLIDE 27

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

DTG Example

A B C D

  • t

E p DTG for vo: t B A C D p E load-o-t-B

slide-28
SLIDE 28

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Extended Landmark Generation

t B A C D p E Find additional landmarks through DTGs: if

s0(v) = d0, v → d landmark, and every path from d0 to d passes through d′,

then v → d′ landmark No further improvement in % solved, but shorter plans

slide-29
SLIDE 29

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Extended Landmark Generation

t B A C D p E Find additional landmarks through DTGs: if

s0(v) = d0, v → d landmark, and every path from d0 to d passes through d′,

then v → d′ landmark No further improvement in % solved, but shorter plans

slide-30
SLIDE 30

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Extended landmark generation – Plan length

Local: plans 6% longer than with base Heur: plans 1% shorter Heur with extended LMs: plans 3% shorter

10 20 30 40 50 60 70 80 90 100 20 40 60 80 100 120 140 160 Plan length Problems Plan length in Schedule base local-HPS heur-RHW 20 40 60 80 100 120 140 160 180 2 4 6 8 10 12 14 16 18 20 Plan length Problems Plan length in Gripper base local-HPS heur-RHW

slide-31
SLIDE 31

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Remarks on Runtime

LM generation usually < 1 sec. (max. 2 min.) During search: slight overhead through landmarks (≤ 18 %) Overhead typically outweighed by benefit in larger problems

0.01 0.1 1 2 4 6 8 10 12 14 16 18 20 Runtime (secs.) Problems Runtime in Gripper base local-HPS heur-RHW 0.01 0.1 1 10 100 20 40 60 80 100 120 140 160 Runtime (secs.) Problems Runtime in Schedule base local-HPS heur-RHW

slide-32
SLIDE 32

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Summary

Landmark heuristic significantly improves existing heuristics More tasks solved Better quality of solutions (plan lengths) Complete, unlike previous local search approach First approach that handles disjunctive landmarks

slide-33
SLIDE 33

Introduction to SAS+ Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation

Summary

Landmark heuristic significantly improves existing heuristics More tasks solved Better quality of solutions (plan lengths) Complete, unlike previous local search approach First approach that handles disjunctive landmarks Thank you!