LP-based Heuristics for Cost-optimal Planning Florian Gabriele - - PowerPoint PPT Presentation

lp based heuristics for cost optimal planning
SMART_READER_LITE
LIVE PREVIEW

LP-based Heuristics for Cost-optimal Planning Florian Gabriele - - PowerPoint PPT Presentation

LP Heuristics Theoretical Results Empirical Results Conclusion LP-based Heuristics for Cost-optimal Planning Florian Gabriele Malte Blai Pommerening 1 oger 1 R Helmert 1 Bonet 2 1 University of Basel, Switzerland 2 Universidad Sim on


slide-1
SLIDE 1

LP Heuristics Theoretical Results Empirical Results Conclusion

LP-based Heuristics for Cost-optimal Planning

Florian Pommerening1 Gabriele R¨

  • ger1

Malte Helmert1 Blai Bonet2

1University of Basel, Switzerland 2Universidad Sim´

  • n Bol´

ıvar, Venezuela

June 24, 2014

slide-2
SLIDE 2

LP Heuristics Theoretical Results Empirical Results Conclusion

Recent interest in heuristics based on linear programming

Certified “hot topic” (AAAI 2013 Spotlight Talk: What’s Hot at ICAPS?) Landmarks, state equation, PDBs, optimal cost partitioning

Contributions

Common framework Combination of heuristic values beyond the maximum Theoretical tool to show dominance

slide-3
SLIDE 3

LP Heuristics Theoretical Results Empirical Results Conclusion

A framework for LP-based heuristics

slide-4
SLIDE 4

LP Heuristics Theoretical Results Empirical Results Conclusion

Background

Classical planning tasks

States assign values to variables Operators allow to manipulate states Implicitly defined transition system

Finding optimal solutions

Cheapest sequence of operators from initial state to a goal Common approach: A∗ + admissible heuristic

slide-5
SLIDE 5

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-6
SLIDE 6

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans Landmark

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-7
SLIDE 7

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans Landmark PDB

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-8
SLIDE 8

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans Landmark PDB Net change

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-9
SLIDE 9

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans Landmark PDB Net change

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-10
SLIDE 10

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator occurrences in potential plans Landmark PDB Net change

(0,0,1) (3,0,2) (1,1,2) (3,2,2) (1,2,0) (2,2,0) (1,3,1) (1,2,1) (3,1,0) (2,1,0) (0,0,0) · · · (2,2,1)

slide-11
SLIDE 11

LP Heuristics Theoretical Results Empirical Results Conclusion

Operator-counting Constraints

Operator-counting constraint

Linear constraints Operator-counting variable Yo for each operator Satisfied by occurrences in any plan Example: Yo1 ≥ 2Yo2

IP/LP heuristics

Minimize

  • ∈O

cost(o) · Yo subject to some operator-counting constraints LP relaxation solvable in polynomial time Admissible heuristics

slide-12
SLIDE 12

LP Heuristics Theoretical Results Empirical Results Conclusion

How do existing heuristics fit?

slide-13
SLIDE 13

LP Heuristics Theoretical Results Empirical Results Conclusion

Example 1: Disjunctive Action Landmarks

Disjunctive action landmarks

Set of operators At least one has to be used in any plan

Landmarks constraints

  • ∈L

Yo ≥ 1 Existing heuristic

Optimal cost partitioning for landmarks (Karpas and Domshlak 2009) Extended by Keyder, Richter, and Helmert (2010) Formulation by Bonet and Helmert (2010) fits the framework

slide-14
SLIDE 14

LP Heuristics Theoretical Results Empirical Results Conclusion

Example 2: Pattern Databases

Pattern databases

Admissible Only subset of operators is relevant

Post-hoc optimization constraints hP (s) ≤

  • relevant for P

cost(o) · Yo Existing heuristic

Post-hoc optimization (Pommerening, R¨

  • ger, and Helmert 2013)

Minor reformulation fits the framework

slide-15
SLIDE 15

LP Heuristics Theoretical Results Empirical Results Conclusion

Example 3: Net Change

Net change for a value of a variable

Operators produce or consume the value

Net change constraints

Number of producers and consumers must balance out Lower bound estimation for operators that sometimes produce/consume.

Existing heuristic

State-equation heuristic (van den Briel et al. 2007, Bonet 2013, Bonet and van den Briel 2014) Fits the framework

slide-16
SLIDE 16

LP Heuristics Theoretical Results Empirical Results Conclusion

Example 4: Explicit State Abstractions

Explicit State Abstractions

PDBs, Merge&Shrink, CEGAR, . . .

Existing heuristic

Optimal cost partitioning heuristic (Katz and Domshlak 2010) Dual LP: new perspective on same problem Dual constraints are operator-counting constraints

slide-17
SLIDE 17

LP Heuristics Theoretical Results Empirical Results Conclusion

Theoretical Results

slide-18
SLIDE 18

LP Heuristics Theoretical Results Empirical Results Conclusion

Combination of Heuristic Values

Theorem The LP heuristic for a set of operator-counting constraints dominates the maximum

  • ver LP heuristics for the individual constraints

Better way to combine different sources of information Dominance can be strict Example: Positive interaction between constraints State-equation heuristic Landmark constraint 1

  • 1
  • 2

L = {o1}

slide-19
SLIDE 19

LP Heuristics Theoretical Results Empirical Results Conclusion

Dominance of heuristics

LP heuristics as analytic tool General scheme to show dominance of h1 over h2

1

h1 is the LP heuristic with constraints C1

2

h2 is the LP heuristic with constraints C2

3

Every solution of C1 satisfies constraints in C2

4

h1 ≥ h2

slide-20
SLIDE 20

LP Heuristics Theoretical Results Empirical Results Conclusion

Dominance of heuristics

Theorem hOCP

Sys1 ≤ hSEQ

hOCP

Sys1

Optimal cost partitioning heuristic Abstractions: one projection to each goal variable

hSEQ

State-equation heuristic

A counter example shows hSEQ ≤ hOCP

Sys1

slide-21
SLIDE 21

LP Heuristics Theoretical Results Empirical Results Conclusion

Implied constraints

Safety-based improvement of the state-equation heuristic (Bonet 2013)

Net change constraints contain lower bound estimation Corresponding upper bound estimation can be added Some inequalities become equalities

Theorem The safety-based improvement cannot increase the heuristic value of the state-equation heuristic.

slide-22
SLIDE 22

LP Heuristics Theoretical Results Empirical Results Conclusion

Empirical Results

slide-23
SLIDE 23

LP Heuristics Theoretical Results Empirical Results Conclusion

Results

Individual Constraints SEQ PhO-Sys1 PhO-Sys2 LMC OPT-Sys1 630 587 631 744 443 Combination of Constraints LMC LMC LMC PhO-Sys2 + PhO-Sys2 + PhO-Sys2 + SEQ + SEQ + SEQ hLM-cut 758 788 672 763 763

slide-24
SLIDE 24

LP Heuristics Theoretical Results Empirical Results Conclusion

Interaction of Constraints

Comparing combination in LP with maximum Coverage is unchanged Stronger heuristic estimates (synergy)

Fewer expansions More tasks solved with perfect heuristic

100 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 uns. unsolved LMC+ SEQ (123/788) max(LMC, SEQ) (109/788) Expansions

slide-25
SLIDE 25

LP Heuristics Theoretical Results Empirical Results Conclusion

Conclusion

slide-26
SLIDE 26

LP Heuristics Theoretical Results Empirical Results Conclusion

Conclusion

Common framework for LP-based heuristics

Operator-counting constraints IP/LP heuristics Fits many existing heuristics

Can be used to prove properties of heuristics Combination of information from different sources

Stronger estimates than through maximization Synergy effects

Poster presentation today in the second session (17:30)

slide-27
SLIDE 27

LP Heuristics Theoretical Results Empirical Results Conclusion

Conclusion

Common framework for LP-based heuristics

Operator-counting constraints IP/LP heuristics Fits many existing heuristics

Can be used to prove properties of heuristics Combination of information from different sources

Stronger estimates than through maximization Synergy effects

Poster presentation today in the second session (17:30)