Planning and Optimization D3. Abstractions: Additive Abstractions - - PowerPoint PPT Presentation

planning and optimization
SMART_READER_LITE
LIVE PREVIEW

Planning and Optimization D3. Abstractions: Additive Abstractions - - PowerPoint PPT Presentation

Planning and Optimization D3. Abstractions: Additive Abstractions Malte Helmert and Thomas Keller Universit at Basel October 30, 2019 Additivity Outlook Summary Content of this Course Foundations Logic Classical Heuristics


slide-1
SLIDE 1

Planning and Optimization

  • D3. Abstractions: Additive Abstractions

Malte Helmert and Thomas Keller

Universit¨ at Basel

October 30, 2019

slide-2
SLIDE 2

Additivity Outlook Summary

Content of this Course

Planning Classical Foundations Logic Heuristics Constraints Probabilistic Explicit MDPs Factored MDPs

slide-3
SLIDE 3

Additivity Outlook Summary

Content of this Course: Heuristics

Heuristics Delete Relaxation Abstraction Abstractions in General Pattern Databases Merge & Shrink Constraints Landmarks Network Flows Potential Heuristics

slide-4
SLIDE 4

Additivity Outlook Summary

Additivity

slide-5
SLIDE 5

Additivity Outlook Summary

Orthogonality of Abstractions

Definition (Orthogonal) Let α1 and α2 be abstractions of transition system T . We say that α1 and α2 are orthogonal if for all transitions s

− → t

  • f T , we have αi(s) = αi(t) for at least one i ∈ {1, 2}.
slide-6
SLIDE 6

Additivity Outlook Summary

Affecting Transition Labels

Definition (Affecting Transition Labels) Let T be a transition system, and let ℓ be one of its labels. We say that ℓ affects T if T has a transition s

− → t with s = t. Theorem (Affecting Labels vs. Orthogonality) Let α1 and α2 be abstractions of transition system T . If no label of T affects both T α1 and T α2, then α1 and α2 are orthogonal. (Easy proof omitted.)

slide-7
SLIDE 7

Additivity Outlook Summary

Orthogonal Abstractions: Example

2 6 5 7 3 4 1 9 12 14 13 11 15 10 8 Are the abstractions orthogonal?

slide-8
SLIDE 8

Additivity Outlook Summary

Orthogonal Abstractions: Example

2 6 5 7 3 4 1 9 12 14 13 11 15 10 8 Are the abstractions orthogonal?

slide-9
SLIDE 9

Additivity Outlook Summary

Orthogonality and Additivity

Theorem (Additivity for Orthogonal Abstractions) Let hα1, . . . , hαn be abstraction heuristics of the same transition system such that αi and αj are orthogonal for all i = j. Then n

i=1 hαi is a safe, goal-aware, admissible and consistent

heuristic for Π.

slide-10
SLIDE 10

Additivity Outlook Summary

Orthogonality and Additivity: Example

LLR LLL ILL LIL IIL IIR RIR IRR RRR RRL ILR RLR RLL RIL LIR LRR LRL IRL

transition system T state variables: first package, second package, truck

slide-11
SLIDE 11

Additivity Outlook Summary

Orthogonality and Additivity: Example

LLR LLL ILL LIL IIL IIR RIR IRR RRR RRL ILR RLR RLL RIL LIR LRR LRL IRL

LLR LLL LIL LIR LRR LRL ILR ILL IIL IIR IRR IRL RLR RLL RIL RIR RRR RRL abstraction α1 abstraction: only consider value of first package

slide-12
SLIDE 12

Additivity Outlook Summary

Orthogonality and Additivity: Example

LLR LLL ILL LIL IIL IIR RIR IRR RRR RRL ILR RLR RLL RIL LIR LRR LRL IRL

LLR LLL LIL LIR LRR LRL ILR ILL IIL IIR IRR IRL RLR RLL RIL RIR RRR RRL abstraction α1 abstraction: only consider value of first package

slide-13
SLIDE 13

Additivity Outlook Summary

Orthogonality and Additivity: Example

LLR LLL ILL LIL IIL IIR RIR IRR RRR RRL ILR RLR RLL RIL LIR LRR LRL IRL

LLR LLL ILL ILR RLR RLL LIR LIL IIL IIR RIR RIL LRR LRL IRL IRR RRR RRL abstraction α2 (orthogonal to α1) abstraction: only consider value of second package

slide-14
SLIDE 14

Additivity Outlook Summary

Orthogonality and Additivity: Example

LLR LLL ILL LIL IIL IIR RIR IRR RRR RRL ILR RLR RLL RIL LIR LRR LRL IRL

LLR LLL ILL ILR RLR RLL LIR LIL IIL IIR RIR RIL LRR LRL IRL IRR RRR RRL abstraction α2 (orthogonal to α1) abstraction: only consider value of second package

slide-15
SLIDE 15

Additivity Outlook Summary

Orthogonality and Additivity: Proof (1)

Proof. We prove goal-awareness and consistency; the other properties follow from these two. Let T = S, L, c, T, s0, S⋆ be the concrete transition system. Let h = n

i=1 hαi.

Goal-awareness: For goal states s ∈ S⋆, h(s) = n

i=1 hαi(s) = n i=1 0 = 0 because all individual

abstraction heuristics are goal-aware. . . .

slide-16
SLIDE 16

Additivity Outlook Summary

Orthogonality and Additivity: Proof (1)

Proof. We prove goal-awareness and consistency; the other properties follow from these two. Let T = S, L, c, T, s0, S⋆ be the concrete transition system. Let h = n

i=1 hαi.

Goal-awareness: For goal states s ∈ S⋆, h(s) = n

i=1 hαi(s) = n i=1 0 = 0 because all individual

abstraction heuristics are goal-aware. . . .

slide-17
SLIDE 17

Additivity Outlook Summary

Orthogonality and Additivity: Proof (2)

Proof (continued). Consistency: Let s

→ t ∈ T. We must prove h(s) ≤ c(o) + h(t). Because the abstractions are orthogonal, αi(s) = αi(t) for at most one i ∈ {1, . . . , n}. Case 1: αi(s) = αi(t) for all i ∈ {1, . . . , n}. Then h(s) = n

i=1 hαi(s)

= n

i=1 h∗ T αi (αi(s))

= n

i=1 h∗ T αi (αi(t))

= n

i=1 hαi(t)

= h(t) ≤ c(o) + h(t). . . .

slide-18
SLIDE 18

Additivity Outlook Summary

Orthogonality and Additivity: Proof (2)

Proof (continued). Consistency: Let s

→ t ∈ T. We must prove h(s) ≤ c(o) + h(t). Because the abstractions are orthogonal, αi(s) = αi(t) for at most one i ∈ {1, . . . , n}. Case 1: αi(s) = αi(t) for all i ∈ {1, . . . , n}. Then h(s) = n

i=1 hαi(s)

= n

i=1 h∗ T αi (αi(s))

= n

i=1 h∗ T αi (αi(t))

= n

i=1 hαi(t)

= h(t) ≤ c(o) + h(t). . . .

slide-19
SLIDE 19

Additivity Outlook Summary

Orthogonality and Additivity: Proof (2)

Proof (continued). Consistency: Let s

→ t ∈ T. We must prove h(s) ≤ c(o) + h(t). Because the abstractions are orthogonal, αi(s) = αi(t) for at most one i ∈ {1, . . . , n}. Case 1: αi(s) = αi(t) for all i ∈ {1, . . . , n}. Then h(s) = n

i=1 hαi(s)

= n

i=1 h∗ T αi (αi(s))

= n

i=1 h∗ T αi (αi(t))

= n

i=1 hαi(t)

= h(t) ≤ c(o) + h(t). . . .

slide-20
SLIDE 20

Additivity Outlook Summary

Orthogonality and Additivity: Proof (2)

Proof (continued). Consistency: Let s

→ t ∈ T. We must prove h(s) ≤ c(o) + h(t). Because the abstractions are orthogonal, αi(s) = αi(t) for at most one i ∈ {1, . . . , n}. Case 1: αi(s) = αi(t) for all i ∈ {1, . . . , n}. Then h(s) = n

i=1 hαi(s)

= n

i=1 h∗ T αi (αi(s))

= n

i=1 h∗ T αi (αi(t))

= n

i=1 hαi(t)

= h(t) ≤ c(o) + h(t). . . .

slide-21
SLIDE 21

Additivity Outlook Summary

Orthogonality and Additivity: Proof (3)

Proof (continued). Case 2: αi(s) = αi(t) for exactly one i ∈ {1, . . . , n}. Let k ∈ {1, . . . , n} such that αk(s) = αk(t). Then h(s) = n

i=1 hαi(s)

=

i∈{1,...,n}\{k} h∗ T αi (αi(s)) + hαk(s)

i∈{1,...,n}\{k} h∗ T αi (αi(t)) + c(o) + hαk(t)

= c(o) + n

i=1 hαi(t)

= c(o) + h(t), where the inequality holds because αi(s) = αi(t) for all i = k and hαk is consistent.

slide-22
SLIDE 22

Additivity Outlook Summary

Orthogonality and Additivity: Proof (3)

Proof (continued). Case 2: αi(s) = αi(t) for exactly one i ∈ {1, . . . , n}. Let k ∈ {1, . . . , n} such that αk(s) = αk(t). Then h(s) = n

i=1 hαi(s)

=

i∈{1,...,n}\{k} h∗ T αi (αi(s)) + hαk(s)

i∈{1,...,n}\{k} h∗ T αi (αi(t)) + c(o) + hαk(t)

= c(o) + n

i=1 hαi(t)

= c(o) + h(t), where the inequality holds because αi(s) = αi(t) for all i = k and hαk is consistent.

slide-23
SLIDE 23

Additivity Outlook Summary

Outlook

slide-24
SLIDE 24

Additivity Outlook Summary

Using Abstraction Heuristics in Practice

In practice, there are conflicting goals for abstractions: we want to obtain an informative heuristic, but want to keep its representation small. Abstractions have small representations if there are few abstract states and there is a succinct encoding for α.

slide-25
SLIDE 25

Additivity Outlook Summary

Counterexample: One-State Abstraction

LRR LLR LLL LRL ALR ALL BLL BRL ARL ARR BRR BLR RRR RRL RLR RLL

LRR LLR LLL LRL ALR ALL BLL BRL ARL ARR BRR BLR RRR RRL RLR RLL One-state abstraction: α(s) := const. + very few abstract states and succinct encoding for α − completely uninformative heuristic

slide-26
SLIDE 26

Additivity Outlook Summary

Counterexample: Identity Abstraction

LRR LLL LLR LRL ALR ALL BLL BRL ARL ARR BRR BLR RRR RRL RLR RLL

Identity abstraction: α(s) := s. + perfect heuristic and succinct encoding for α − too many abstract states

slide-27
SLIDE 27

Additivity Outlook Summary

Counterexample: Perfect Abstraction

LRR LLR LLL LRL

LLR LRL LLL

ALR ALL BLL BRL

ALR BRL ALL BLL

ARL ARR BRR BLR

ARL BLR ARR BRR

RRR RRL RLR RLL

RLL RRL RLR RRR Perfect abstraction: α(s) := h∗(s). + perfect heuristic and usually few abstract states − usually no succinct encoding for α

slide-28
SLIDE 28

Additivity Outlook Summary

Automatically Deriving Good Abstraction Heuristics

Abstraction Heuristics for Planning: Main Research Problem Automatically derive effective abstraction heuristics for planning tasks. we will study two state-of-the-art approaches in Chapters D4–D8

slide-29
SLIDE 29

Additivity Outlook Summary

Summary

slide-30
SLIDE 30

Additivity Outlook Summary

Summary

Abstraction heuristics from orthogonal abstractions can be added without losing admissibility or consistency. One sufficient condition for orthogonality is that all abstractions are affected by disjoint sets of labels. Practically useful abstractions are those which give informative heuristics, yet have a small representation. Coming up with good abstractions automatically is the main research challenge when applying abstraction heuristics in planning.