P LANNING WITH I NCOMPLETE U SER P REFERENCES AND D OMAIN M ODELS - - PowerPoint PPT Presentation

p lanning with i ncomplete u ser p references and d omain
SMART_READER_LITE
LIVE PREVIEW

P LANNING WITH I NCOMPLETE U SER P REFERENCES AND D OMAIN M ODELS - - PowerPoint PPT Presentation

P LANNING WITH I NCOMPLETE U SER P REFERENCES AND D OMAIN M ODELS Tuan Anh Nguyen Graduate Committee Members: Subbarao Kambhampati (Chair) Chitta Baral Minh B. Do Joohyung Lee David E. Smith M OTIVATION Automated Planning In practice


slide-1
SLIDE 1

PLANNING WITH INCOMPLETE USER PREFERENCES AND DOMAIN MODELS

Tuan Anh Nguyen Graduate Committee Members: Subbarao Kambhampati (Chair) Chitta Baral Minh B. Do Joohyung Lee David E. Smith

slide-2
SLIDE 2

2

MOTIVATION

Automated Planning Research:

 Actions  Preconditions  Effects

 Deterministic  Non-deterministic  Stochastic

 Initial situation  Goal conditions  What a user wants

about plans

 Find a (best) plan!

In practice…

 Action models are not

available upfront

 Cost of modeling  Error-prone  Users usually don’t

exactly know what they want

 Always want to see

more than one plan Planning with incomplete user preferences and domain models

slide-3
SLIDE 3

Preferences in Planning – Traditional View

 Classical Model: “Closed world” assumption

about user preferences.

 All preferences assumed to be fully

specified/available

Two possibilities

 If no preferences specified —then user is

assumed to be indifferent. Any single feasible plan considered acceptable.

 If preferences/objectives are specified, find a plan

that is optimal w.r.t. specified objectives. Either way, solution is a single plan

Full Knowledge

  • f Preferences

3 3

slide-4
SLIDE 4

Preferences in Planning—Real World

Real World: Preferences not fully known

Full Knowledge

  • f Preferences is

lacking

4

Unknown preferences

 For all we know, user may care about every thing

  • -- the flight carrier, the arrival and departure

times, the type of flight, the airport, time of travel and cost of travel… Partially known

 We know that users cares only about travel time

and cost. But we don’t know how she combines them…

4

slide-5
SLIDE 5

Domain Models in Planning – Traditional View

 Classical Model: “Closed world” assumption

about action descriptions.

 Fully specified preconditions and effects  Known exact probabilities of outcomes

Full Knowledge

  • f domain models

5

pick-up

:parameters (?b – ball ?r – room) :precondition (and (at ?b ?r) (at-robot ?r) (free-gripper)) :effect (and (carry ?b) (not (at ?b ?r)) (not (free-gripper)))

5

slide-6
SLIDE 6

Domain Models in Planning – (More) Practical View

6

 Completely modeling the domain dynamics  Time consuming  Error-prone  Sometimes impossible  What does it mean by planning with incompletely

specified domain models?

 Plan could fail! Prefer plans that are more likely to

succeed…

 How to define such a solution concept?

6

slide-7
SLIDE 7

Problems and Challenges

7

 Incompleteness representation

7

 Solution concepts  Planning techniques

slide-8
SLIDE 8

DISSERTATION OVERVIEW

8

“Model-lite” Planning

Preference incompleteness Domain incompleteness

 Representation  Solution concept  Solving techniques  Representation  Solution concept  Solving techniques

slide-9
SLIDE 9

DISSERTATION OVERVIEW

9

“Model-lite” Planning

Preference incompleteness Domain incompleteness

 Representation: two levels

  • f incompleteness

 User preferences exist, but

totally unknown

 Partially specified  Complete set of plan

attributes

 Parameterized value

function, unknown trade-off values

 Representation  Actions with possible

preconditions / effects

 Optionally with weights

for being the real ones

 Solution concept: plan sets  Solving techniques:

synthesizing high quality plan sets

 Solution concept: “robust”

plans

 Solving techniques:

synthesizing robust plans

slide-10
SLIDE 10

DISSERTATION OVERVIEW

10

“Model-lite” Planning

Preference incompleteness

 Representation: two levels

  • f incompleteness

 User preferences exist, but

totally unknown

 Partially specified  Full set of plan

attributes

 Parameterized value

function, unknown trade-off values

 Solution concept: plan sets

with quality

 Solving techniques:

synthesizing quality plan sets

 Distance measures w.r.t.

base-level features of plans (actions, states, causal links)

 CSP-based and local-search

based planners

 IPF/ICP measure  Sampling, ICP and Hybrid

approaches

slide-11
SLIDE 11

DISSERTATION OVERVIEW

11

“Model-lite” Planning

Preference incompleteness Domain incompleteness

 Representation: two levels

  • f incompleteness

 User preferences exist, but

totally unknown

 Partially specified  Full set of plan

attributes

 Parameterized value

function, unknown trade-off values

 Representation  Actions with possible

preconditions / effects

 Optionally with weights

for being the real ones

 Solution concept: plan sets  Solving techniques:

synthesizing high quality plan sets

 Solution concept: “robust”

plans

 Solving techniques:

synthesizing robust plans

Publication

 Domain independent approaches

for finding diverse plans. IJCAI (2007)

 Planning with partial preference

  • models. IJCAI (2009)

 Generating diverse plans to

handle unknown and partially known user preferences. AIJ 190 (2012) (with Biplav Srivastava, Subbarao Kambhampati, Minh Do, Alfonso Gerevini and Ivan Serina)

Publication

 Assessing and Generating

Robust Plans with Partial Domain Models. ICAPS-WS (2010)

 Synthesizing Robust Plans under

Incomplete Domain Models. AAAI-WS(2011), NIPS (2013)

 A Heuristic Approach to

Planning with Incomplete STRIPS Action Models. ICAPS (2014) (with Subbarao Kambhampati, Minh Do)

slide-12
SLIDE 12

PLANNING WITH INCOMPLETE DOMAIN MODELS

12

slide-13
SLIDE 13

REVIEW: STRIPS

Predicate set R: clear(x – object), on-

table(x – object), on(x – object, y – object), holding(x – object), hand-empty

Operators O:  Name (signature): pick-up(x – object)  Preconditions: hand-empty, clear(x)  Effects: ~hand-empty, holding(x), ~clear(x) A single complete model!

13

slide-14
SLIDE 14

PLANNING PROBLEM WITH STRIPS

Set of typed objects {𝑝1, … , 𝑝𝑙}  Together with predicate set 𝑄, we have a set of

grounded propositions 𝐺

 Together with operators 𝑃, we have a set of

grounded actions 𝐵

Initial state: 𝐽 ∈ 𝐺 Goals: 𝐻 ⊆ 𝐺

14

slide-15
SLIDE 15

PLANNING PROBLEM WITH STRIPS (2)

 Find: a plan 𝜌 achieves 𝐻 starting from 𝐽:

𝛿 𝜌, 𝐽 ⊇ 𝐻.

 Transition function:

 𝛿

𝑏 , 𝑡 = 𝑡 ∪ 𝐵𝑒𝑒 𝑏 ∖ 𝐸𝑓𝑚(𝑏) for applying 𝑏 ∈ 𝐵 in 𝑡 ⊆ 𝐺 s.t. 𝑄𝑠𝑓 𝑏 ⊆ 𝑡.

 Applying 𝜌 = 〈𝑏1, … , 𝑏𝑜〉 at state 𝑡: 𝛿 𝜌, 𝑡 =

𝛿( 𝑏𝑜 , 𝛿( 𝑏2, … , 𝑏𝑜−1 , 𝑡))

15

slide-16
SLIDE 16

INCOMPLETE DOMAIN MODELS

 Predicate set 𝑺: clear(x – object), on-table(x – object),

  • n(x – object, y – object), holding(x – object), hand-

empty, light(x – object), dirty(x – object)

 Operators 𝑷

 Name (signature): pick-up(x – object)  Preconditions: hand-empty, clear(x)  Possible preconditions: light(x)  Effects: ~hand-empty, holding(x), ~clear(x)  Possible effects: dirty(x)  Incomplete domain 𝑬

= 〈𝑺, 𝑷〉

 At “schema” level with typed variables (no objects)  With K “annotations”, we have 2𝐿 possible complete models,

  • ne of which is the true model.

16

Incompleteness in deterministic domains Stochastic domains

slide-17
SLIDE 17

PLANNING PROBLEM WITH INCOMPLETE

DOMAIN

Set of typed objects {𝑝1, … , 𝑝𝑙}  Together with predicate set 𝑄, we have a

set of grounded propositions 𝐺

 Together with operators 𝑃, we have a set of

grounded actions 𝐵

Initial state: 𝐽 ∈ 𝐺 Goals: 𝐻 ⊆ 𝐺 Find: a plan 𝜌 “achieves” 𝐻 starting

from 𝐽

 An ill-defined solution concept!  Need a definition for “goal achievement”

17

slide-18
SLIDE 18

TRANSITION FUNCTION

18

 Under 𝑬

, applying 𝝆 in s results in a set of possible states: 𝜹 𝝆, 𝒕 = 𝜹𝑬𝒋(𝝆, 𝒕)

𝑬𝒋∈≪𝑬 ≫

 The probability of ending up in 𝒕′ ∈ 𝜹(𝝆, 𝒕) is equal

to 𝑸𝒔(𝑬𝒋)

𝑬𝒋∈≪𝑬 ≫, 𝒕′=𝜹𝑬𝒋(𝝆,𝒕)

where 𝑸𝒔 (𝑬𝒋) is the probability of 𝑬𝒋 being the true model.

slide-19
SLIDE 19

TRANSITION FUNCTION

20

 STRIPS Execution (SE):  Generous Execution (GE):

𝜹𝑬( 𝒃 , 𝒕): 𝛿𝐻𝐹

𝐸 (〈𝑏〉, 𝑡) = 𝑡 ∖ 𝐸𝑓𝑚𝐸 𝑏 ∪ 𝐵𝑒𝑒𝐸 𝑏 ,

𝑗𝑔 𝑄𝑠𝑓𝐸 𝑏 ⊆ 𝑡 𝑡, 𝑝𝑢ℎ𝑓𝑠𝑥𝑗𝑡𝑓 𝛿𝑇𝐹

𝐸 (〈𝑏〉, 𝑡) = 𝑡 ∖ 𝐸𝑓𝑚𝐸 𝑏 ∪ 𝐵𝑒𝑒𝐸 𝑏 ,

𝑗𝑔 𝑄𝑠𝑓𝐸 𝑏 ⊆ 𝑡 𝑡⊥ = {⊥} , 𝑝𝑢ℎ𝑓𝑠𝑥𝑗𝑡𝑓 ⊥ ∉ 𝐺 𝑄𝑠𝑓𝐸 𝑏 ⊈ 𝑡⊥, 𝐻 ⊈ 𝑡⊥

slide-20
SLIDE 20

21  Proposition set 𝐺 = {𝑞1, 𝑞2, 𝑞3}  Initial state 𝐽 = {𝑞2}  Goal 𝐻 = {𝑞3}

slide-21
SLIDE 21

A MEASURE FOR PLAN ROBUSTNESS

 Naturally, we prefer plan that succeeds in as

many complete models as possible

22

𝑆 𝜌 = |Π| 2𝐿

RGE 𝜌 = 6/8 RGE 𝜌 = 4/8 𝑺𝑻𝑭 𝝆 ≤ 𝑺𝑯𝑭(𝝆)

slide-22
SLIDE 22

A BIT MORE GENERAL…

 Predicate set 𝑺: clear(x – object), on-table(x –

  • bject), on(x – object, y – object), holding(x – object),

hand-empty, light(x – object), dirty(x – object)

 Operators 𝑷

 Name (signature): pick-up(x – object)  Preconditions: hand-empty, clear(x)  Possible preconditions: light(x) with a weight of 0.8  Effects: ~hand-empty, holding(x), ~clear(x)  Possible effects: dirty(x) with an unspecified weight

 Treat weights as probabilities with random

variables

 Robustness measure:

23

𝑺 𝝆 ≝ 𝐐𝐬 (𝑬𝒋)

𝑬𝒋 ∈ 〈〈𝑬 〉〉:𝜹𝑬𝒋 𝝆,𝑱 ⊨𝑯

slide-23
SLIDE 23

CONTENT

A measure for plan quality  Robustness of plan 𝑆 𝜌 ∈ [0,1] Plan robustness assessment  Reduced to weighted model counting  Complexity Synthesizing robust plans  Compilation approach  Heuristic search approach

24

slide-24
SLIDE 24

PLAN ROBUSTNESS ASSESSMENT

Computation:  Given 𝐸

, 𝑄 = 〈𝐺, 𝐵, 𝐽, 𝐻〉, a plan 𝜌

 Construct a set of correctness constraints 𝚻(𝝆) for

the execution of 𝜌:

 State transitions caused by actions are correct.  The goal 𝐻 is satisfied in the last state.

 Then: 𝑆(𝜌) is computed from the weighted

model count of 𝚻(𝝆)

25

slide-25
SLIDE 25

26

𝐽 𝐻 𝜌

𝚻(𝝆)

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒋

𝒒𝒔𝒇 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒏

𝒆𝒇𝒎 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒋

𝒒𝒔𝒇 ⇒ (𝒒𝒃𝒏 𝒆𝒇𝒎 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

)

slide-26
SLIDE 26

PLAN ROBUSTNESS ASSESSMENT

 Complexity

The problem of computing 𝑆(𝜌) for a plan π to a problem 〈D , I, G〉 is #P-complete.

 Membership:

 Have a Counting TM non-deterministically guess a

complete model, and check the correctness of the plan.

 The number of accepting branches output: the number

  • f complete models under which the plan succeeds.

 Completeness:

 There exists a counting reduction from the problem of

counting satisfying assignments for Monotone-2-SAT problem to Robustness-Assessment (RA) problem

27

slide-27
SLIDE 27

CONTENT

A measure for plan quality  Robustness of plan 𝑆 𝜌 ∈ [0,1] Plan robustness assessment  Reduced to weighted model counting  Complexity Synthesizing robust plans  Compilation approach  Heuristic search approach

28

slide-28
SLIDE 28

COMPILATION APPROACH

 The realization of possible preconditions / effects is

determined by unknown variables 𝑞𝑏

𝑞𝑠𝑓, 𝑞𝑏 𝑏𝑒𝑒, 𝑞𝑏 𝑒𝑓𝑚

 Thus, can be compiled away using “conditional effects”

 If 𝑞𝑏

𝑞𝑠𝑓 = 𝑢𝑠𝑣𝑓 then 𝑞 is a precondition of 𝑏.

 Domain incompleteness  State incompleteness

 Conformant probabilistic planning problem!

29

slide-29
SLIDE 29

COMPILATION EXAMPLE

Compiled “pick-up”

30

slide-30
SLIDE 30

COMPILATION: EXPERIMENTAL RESULTS

 Using Probabilistic-FF planner (Domshlak &

Hoffmann, 2006)

Synthesizing Robust Plans under Incomplete Domain Models (NIPS 2013)

 Normally fails with large problem instances

31

Incomplete Logistics domain

slide-31
SLIDE 31

CONTENT

A measure for plan quality  Robustness of plan 𝑆 𝜌 ∈ [0,1] Plan robustness assessment  Reduced to weighted model counting  Complexity Synthesizing robust plans  Compilation approach  Heuristic search approach

32

slide-32
SLIDE 32

APPROXIMATE TRANSITION FUNCTION

33

 Not explicitly maintain set of resulting states  Successor state:

𝛿 𝑇𝐹 𝑏 , 𝑡 = 𝑡 ∪ 𝐵𝑒𝑒 𝑏 ∪ 𝐵𝑒𝑒 𝑏 ∖ 𝐸𝑓𝑚 𝑏 , 𝑗𝑔 𝑄𝑠𝑓 𝑏 ⊆ 𝑡

 Possible delete effects might not take effects!

 Recursive definition for 𝛿

𝑇𝐹(𝜌, 𝑡)

Completeness: Any solution in the complete STRIPS action model exists in the solution space of the problem with incomplete domain. Soundness: For any plan returned under incomplete STRIPS domain semantics, there is one complete STRIPS model under which the plan succeeds.

𝜹 𝝆, 𝒕 = 𝜹𝑬𝒋(𝝆, 𝒕)

𝑬𝒋∈≪𝑬 ≫

slide-33
SLIDE 33

ANYTIME APPROACH FOR GENERATING ROBUST PLANS

1.

Initialize: 𝜀 = 0

2.

Repeat

Find plan 𝜌 s.t. 𝑆 𝜌 > 𝜀 (Stochastic)

If plan found: 𝜀 = 𝑆(𝜌)

Until time bound reaches

3.

Return 𝜌 and 𝑆(𝜌) if plan found

34

slide-34
SLIDE 34

USE OF UPPER BOUND

 Reduce exact weighted model counting

35

𝐽 𝑡

𝜌

If (𝐻 ⊆ 𝑡) and 𝑉 𝜌 > 𝜀 then 𝑥𝑛𝑑(𝜌) 𝑽 𝝆 ≥ 𝒙𝒏𝒅(𝝆) (Upper bound for 𝑆(𝜌))

slide-35
SLIDE 35

USE OF LOWER BOUND

 How to…  Compute ℎ 𝑡, 𝜀 = |𝜌

|

37

𝐽 𝑡 𝑡G 𝜌𝑙 𝜌

Find: 𝝆 s.t. wmc 𝜌𝑙 ∘ 𝜌 > 𝜀 Avoid invoking 𝑥𝑛𝑑 ∘ during the construction of 𝜌 ! Find: 𝝆 s.t. L 𝜌𝑙 ∘ 𝜌 > 𝜀 𝐌 𝝆 ≤ 𝒙𝒏𝒅(𝝆)

(Lower bound for 𝑆(𝜌))

slide-36
SLIDE 36

LOWER BOUND FOR 𝑺(𝝆)

38

𝚻(𝝆)

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒋

𝒒𝒔𝒇 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒏

𝒆𝒇𝒎 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒒𝒃𝒋

𝒒𝒔𝒇 ⇒ (𝒒𝒃𝒏 𝒆𝒇𝒎 ⇒

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

)

slide-37
SLIDE 37

LOWER BOUND FOR 𝑺(𝝆)

39

𝚻(𝝆)

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

¬𝒒𝒃𝒋

𝒒𝒔𝒇 ∨

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

¬𝒒𝒃𝒏

𝒆𝒇𝒎 ∨

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

¬𝒒𝒃𝒋

𝒒𝒔𝒇 ∨ ¬𝒒𝒃𝒏 𝒆𝒇𝒎 ∨

𝒒𝒃𝒍

𝒃𝒆𝒆 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝒙 ¬𝒒 = 𝟐 − 𝒙(𝒒)

Σ(𝜌) as a set of clauses with positive literals.

slide-38
SLIDE 38

LOWER BOUND FOR 𝑺(𝝆)

40

Given positive clauses 𝑑, 𝑑′ : Pr 𝑑 𝑑′) ≥ Pr(𝑑) Given Σ 𝜌 = {𝑑1, 𝑑2, … , 𝑑𝑜} : 𝑥𝑛𝑑 Σ 𝜌 = Pr 𝑑1 ∧ 𝑑2 ∧ ⋯ ∧ 𝑑𝑜 = Pr 𝑑1 Pr 𝑑2 𝑑1 … Pr 𝑑𝑜 𝑑𝑜−1, … , 𝑑1 ≥ Pr(𝑑𝑗)

𝑑𝑗∈Σ(𝜌)

𝑀 𝜌 = Pr (𝑑𝑗)

𝑑𝑗∈Σ(𝜌)

≤ 𝑆(𝜌)

(Equality holds when all clauses are independent)

slide-39
SLIDE 39

UPPER BOUND FOR 𝑆(𝜌)

41

Σ 𝜌 = {𝑑1, … , 𝑑𝑜} An (trivial) upper bound:  𝑉 𝜌 ≥ min

𝑑𝑗∈Σ(𝜌) Pr

(𝑑𝑗)

A much tighter bound:

¬𝑞 𝑞 𝑏𝑗

𝑞𝑏𝑙

𝑏𝑒𝑒 𝐷𝑞

𝑗 ≤𝑙≤𝑗−1,𝑞∈𝐵𝑒𝑒

(𝑏𝑙)

𝑏𝑙 𝑞 𝐽 𝐻

  • Clauses contains variables

corresponding to one specific predicate

  • Thus, Σ(𝜌) is highly

decomposable into “connected components”

𝑉 𝜌 = min

𝑑∈Σ𝑘 Pr

(𝑑)

𝑘

{𝒚𝟐, 𝒚𝟑} {𝒚𝟑, 𝒚𝟒} {𝒚𝟓, 𝒚𝟔}

slide-40
SLIDE 40

FIND: 𝝆 S.T. 𝐌 𝝆𝒍 ∘ 𝝆 > 𝜺

 Build relaxed planning graph

 Ignoring known & possible delete effects

 Propagate clauses for propositions and actions  Extract relaxed plan

42

𝐽 𝑡 𝑡G 𝜌𝑙 𝜌

slide-41
SLIDE 41

RELAXED PLANNING GRAPH

PROPOSITIONAL LAYER 𝑴𝟐

43

𝑞𝑘 ¬𝑞𝑘

𝑱 𝑀1 = 𝑡𝑙+1 = 𝛿 𝑇𝐹(𝜌𝑙, 𝐽)

𝜌𝑙

𝑏1 𝑏𝑙

Establishment constraints (if needed) and protection constraints for 𝑞𝑘 at state 𝑡𝑙+1

𝚻𝒒𝒌(𝟐)

slide-42
SLIDE 42

RELAXED PLANNING GRAPH

ACTION LAYER 𝑩𝒖

44

𝑴𝒖 𝐵𝑢 = {𝑏 𝑏 ∈ 𝐵, 𝑄𝑠𝑓 𝑏 ⊆ 𝑀𝑢} ∪ {𝑜𝑝𝑝𝑞𝑞 𝑞 ∈ 𝑀𝑢} 𝑞𝑗 𝑞𝑘 𝑩𝒖 𝑏𝑛

𝚻𝒒𝒋(𝒖) 𝚻𝒒𝒌(𝒖)

𝑞𝑘 𝑏𝑛

𝑞𝑠𝑓

𝚻𝒃𝒏 𝒖 = 𝚻𝐪𝐣 𝐮 ∧ (𝒒𝒌 𝒃𝒏

𝒒𝒔𝒇 ⇒ 𝚻𝐪𝐤(𝐮))

slide-43
SLIDE 43

RELAXED PLANNING GRAPH

ACTION LAYER 𝑩𝒖

45

𝑴𝒖 𝐵𝑢 = {𝑏 𝑏 ∈ 𝐵, 𝑄𝑠𝑓 𝑏 ⊆ 𝑀𝑢} ∪ {𝑜𝑝𝑝𝑞𝑞 𝑞 ∈ 𝑀𝑢} 𝑞𝑗 𝑞𝑘 𝑩𝒖

𝚻𝒒𝒋(𝒖) 𝚻𝒒𝒌(𝒖)

𝒐𝒑𝒑𝒒𝒒𝒋

𝚻𝒒𝒋(𝒖)

𝒐𝒑𝒑𝒒𝒒𝒌

𝚻𝒒𝒌(𝒖)

slide-44
SLIDE 44

RELAXED PLANNING GRAPH

PROPOSITIONAL LAYER 𝑴𝒖+𝟐

46

𝑀𝑢+1 = {𝑞 |𝑏 ∈ 𝐵𝑢, 𝑞 ∈ 𝐵𝑒𝑒 𝑏 ∪ 𝐵𝑒𝑒 𝑏 } 𝑩𝒖 𝑏𝑛 𝑏𝑚 𝑞𝑗

𝚻𝒃𝒏(𝒖) 𝚻𝒃𝒎(𝒖)

𝑴𝒖+𝟐

Σ𝑞𝑗 𝑢 + 1 = 𝑏𝑠𝑕𝑛𝑏𝑦Σ 𝑚(Σ ∧ Σ𝑙) Σ ∈ {Σ𝑏𝑛 𝑢 , 𝑞𝑗 𝑏𝑚

𝑞𝑠𝑓 ⇒ Σ𝑏𝑚(𝑢)}

𝚻𝒒𝒏(𝒖 + 𝟐)

slide-45
SLIDE 45

RELAXED PLAN EXTRACTION

OVERVIEW

47

𝒉

𝑀𝑈 𝑀1 𝐵1 𝐵𝑈−1

𝑏5 𝒒𝟗 𝑞6

𝑀𝑈−1

𝑏1 𝑞4 𝑞2

Best supporting action for 𝑕 at layer T 𝚻𝒒𝒋 𝒖 + 𝟐 = 𝒃𝒔𝒉𝒏𝒃𝒚𝚻 𝒎(𝚻 ∧ 𝚻𝒍)

𝑏3

 𝜌

in total order

 Succeed when:  All know preconditions are supported  𝑚 Σ𝑙 ∧ Σ𝜌′ > 𝜀

𝑀2

slide-46
SLIDE 46

RELAXED PLAN EXTRACTION

WHEN TO INSERT ACTIONS?

 A supporting action 𝑏𝑐𝑓𝑡𝑢 is inserted only if

needed

 Depending on:  Relation between: subgoal and “relaxed plan state”  Robustness of the current 𝜌

and 𝜌 ∪ {𝑏𝑐𝑓𝑡𝑢}

48

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

slide-47
SLIDE 47

RELAXED PLAN EXTRACTION

SUBGOAL V.S RP STATE

49

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒 𝒒

slide-48
SLIDE 48

RELAXED PLAN EXTRACTION

 𝑞 ∈ 𝑄𝑠𝑓 𝑏 , 𝑞 ∉ 𝑡→𝑏: insert 𝑏𝑐𝑓𝑡𝑢 into 𝜌  This type of subgoal makes the relaxed plan

“incomplete”

50

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒 No actions in 𝜌𝑙 and 𝜌 supporting this subgoal

slide-49
SLIDE 49

RELAXED PLAN EXTRACTION

51

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒 𝒕→𝒃

+

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒒

 For these subgoals, supporting actions inserted if the

insertion increases the robustness of the current relaxed plan.

𝑚 Σ𝜌 ∧ Σ𝜌

∪ 𝑏𝑐𝑓𝑡𝑢

> 𝑚(Σ𝜌 ∧ Σ𝜌

)

slide-50
SLIDE 50

RELAXED PLAN EXTRACTION

52

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

𝑏

𝒕→𝒃 + 𝒕→𝒃

+

𝒒

 For these subgoals, no supporting actions needed!

slide-51
SLIDE 51

FIND 𝝆 S.T. 𝑺 𝝆 > 𝜺: SEARCH ALGORITHM

 Stochastic local search with failed bounded restarts (Coles et al.,

2007)

53

h s, 𝜀 = 100

𝜀 ∈ (0,1]

Depth bound

  • reached. Failed.

ℎ 𝑡′, 𝜀 = 55 𝑔𝑏𝑗𝑚𝑑𝑝𝑣𝑜𝑢 = 0 𝑞𝑠𝑝𝑐𝑓𝑑𝑝𝑣𝑜𝑢 = 1 𝑔𝑏𝑗𝑚𝑑𝑝𝑣𝑜𝑢 = 0 If 𝑔𝑏𝑗𝑚𝑑𝑝𝑣𝑜𝑢 = 𝑔𝑏𝑗𝑚𝑐𝑝𝑣𝑜𝑒 then double depth bound

ℎ 𝑡′′, 𝜀 = 0

Goal reached 𝑔𝑏𝑗𝑚𝑐𝑝𝑣𝑜𝑒 = 32, 64,128, … 𝒃𝟐𝟏 𝒃𝟐𝟑 𝒃𝟑𝟏 𝑞𝑠𝑝𝑐𝑓𝑑𝑝𝑣𝑜𝑢 = 2 If probecount = 𝑞𝑠𝑝𝑐𝑓𝑐𝑝𝑣𝑜𝑒 then increment 𝑔𝑏𝑗𝑚𝑑𝑝𝑣𝑜𝑢 𝑞𝑠𝑝𝑐𝑓𝑑𝑝𝑣𝑜𝑢 = 0 Better state found.

𝜀 = 𝑆(𝜌)

ℎ(𝑡, 𝜀): how far it is approximately from s to a goal state so that the resulting plan has approximate robustness > 𝜀.

slide-52
SLIDE 52

EXPERIMENTAL RESULTS

54

Number of instances for which PISA produces better, equal and worse robust plans compared to DeFault.

 Domains:

 Zenotravel, Freecell, Satellite, Rover (215 domains x 10 problems =

2150 instances)

 Parc Printer (300 instances)

slide-53
SLIDE 53

EXPERIMENTAL RESULTS

55

Total time in seconds (log scale) to generate plans with the same robustness by PISA and DeFault.

slide-54
SLIDE 54

DISSERTATION OVERVIEW

56

“Model-lite” Planning

Preference incompleteness Domain incompleteness

 Representation: two levels

  • f incompleteness

 User preferences exist, but

totally unknown

 Partially specified  Full set of plan

attributes

 Parameterized value

function, unknown trade-off values

 Representation  Actions with possible

preconditions / effects

 Optionally with weights

for being the real ones

 Solution concept: plan sets  Solving techniques:

synthesizing high quality plan sets

 Solution concept: “robust”

plans

 Solving techniques:

synthesizing robust plans

slide-55
SLIDE 55

DISSERTATION OVERVIEW

57

“Model-lite” Planning

Preference incompleteness Domain incompleteness

 Representation: two levels

  • f incompleteness

 User preferences exist, but

totally unknown

 Partially specified  Full set of plan

attributes

 Parameterized value

function, unknown trade-off values

 Representation  Actions with possible

preconditions / effects

 Optionally with weights

for being the real ones

 Solution concept: plan sets  Solving techniques:

synthesizing high quality plan sets

 Solution concept: “robust”

plans

 Solving techniques:

synthesizing robust plans

Publication

 Domain independent approaches

for finding diverse plans. IJCAI (2007)

 Planning with partial preference

  • models. IJCAI (2009)

 Generating diverse plans to

handle unknown and partially known user preferences. AIJ 190 (2012) (with Biplav Srivastava, Subbarao Kambhampati, Minh Do, Alfonso Gerevini and Ivan Serina)

Publication

 Assessing and Generating

Robust Plans with Partial Domain Models. ICAPS-WS (2010)

 Synthesizing Robust Plans under

Incomplete Domain Models. AAAI-WS (2011), NIPS (2013)

 A Heuristic Approach to

Planning with Incomplete STRIPS Action Models. ICAPS (2014) (with Subbarao Kambhampati, Minh Do)

slide-56
SLIDE 56

THANK YOU! Q & A

58