On representing planning domains under uncertainty

  1. On representing planning domains under uncertainty • Felipe Meneguzzi – CMU • Yuqing Tang – CUNY • Simon Parsons – CUNY • Katia Sycara – CMU

  2. Outline • Planning – Markov Decision Processes – Hierarchical Task Networks • States and State-Space • Using HTNs to represent MDPs • Increasing Efficiency • Future Work • Conclusions 2

  3. Planning • Planning algorithms are broadly divided into: – Deterministic – Probabilistic • The two formalisms differ significantly in: – Domain representation – Concept of solution • Plan • Policy 3

  4. Blogohar Scenario (Burnett) • Map of the East Blogohar region: Party B military force and base, the Haram region (missile range) with a covert-mission missile site (Day 2), the Aria and Rina regions, towns (Tersa, Surina), roads (NR1, NR2, SR1, SR2, HW), bridges and a minefield • Legend gives, per road segment, the force required and the cost per vehicle for a one-way escorted trip 4

  5. Blogohar Scenario • The original scenario consists of two players planning for concurrent goals – NGO – Military • Here, we consider a (simplified) planning task for the military planner – Select forces to attack militant strongholds – Move forces to the strongholds and attack them 5

  6. Hierarchical Task Networks • Offshoot of classical planning • Domain representation more intuitive to human planners – Actions (state modification operators) – Tasks (goals and subgoals) – Methods (recipes for refining tasks) • Problem comprises – Initial State – Task 6
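The domain elements above can be pictured as data. A minimal sketch in Python follows; the class and field names are illustrative assumptions, not the authors' implementation:

```python
# Hypothetical encoding of an HTN domain (illustrative only).
from dataclasses import dataclass
from typing import Callable, FrozenSet, Tuple

State = FrozenSet[Tuple[str, ...]]        # a state: a set of ground facts

@dataclass(frozen=True)
class Action:
    """Primitive state-modification operator, e.g. attack(Vehicle, Target)."""
    name: str
    precond: Callable[[State], bool]      # is the action applicable in a state?
    effects: Callable[[State], State]     # state after executing the action

@dataclass(frozen=True)
class Method:
    """Recipe for refining a compound task into ordered subtasks."""
    task: str                             # compound task it decomposes
    precond: Callable[[State], bool]
    subtasks: Tuple[str, ...]             # ordered tasks replacing `task`

@dataclass(frozen=True)
class Problem:
    """An HTN problem: an initial state plus a task to accomplish."""
    initial_state: State
    task: str                             # e.g. "defeatInsurgents(a)"
```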

  7. HTN Domain – Actions • attack(Vehicle, Target), written a_a(V, T) • move(Vehicle, From, To, Road), written a_mv(V, F, T, R) 7

  8. HTN Methods • Defeat Insurgents at Stronghold A, t_DI(T) – Precondition: Target = A – Task to decompose: defeatInsurgents(A) – Tasks replacing defeatInsurgents(A): • attackWithHumvee(A) • attackWithAPC(A) 8

  9. HTN Methods • Attack T with Humvee, t_AHu(T) – Precondition: vehicle(humvee, V) ∧ ¬committed(V) – Task to decompose: attackWithHumvee(T) – Tasks replacing attackWithHumvee(T): • move(V, T) • attack(V, T) – this is an action 9

  10. HTN Methods • Attack T with APC, t_AA(T) – Precondition: vehicle(apc, V) ∧ ¬committed(V) – Task to decompose: attackWithAPC(T) – Tasks replacing attackWithAPC(T): • move(V, T) • attack(V, T) – this is an action 10

  11. HTN Methods • Move (Route 1), t_Mv(V, T) – Precondition: Target = A – Task to decompose: move(V, T) – Tasks replacing move(V, T): • move(V, base, tersa, nr1) • move(V, tersa, haram, nr2) • move(V, haram, a, sr2) – these are basic moves 11

  12. HTN Methods • Move (Route 2), t_Mv(V, T) – Precondition: Target = A – Task to decompose: move(V, T) – Tasks replacing move(V, T): • move(V, base, haram, sr1) • move(V, haram, a, sr2) – these are basic moves 12

  13. Methods Summary • Each method is a tuple ⟨precondition, task, subtasks, ordering⟩: • Defeat Insurgents: m_DI(T) = ⟨ {T = a}, t_DI(T), {t_AHu(T), t_AA(T)}, {t_AHu(T) ≺ t_AA(T)} ⟩ • Attack with Humvee: m_AHu(T) = ⟨ {vehicle(humvee, V) ∧ ¬committed(V)}, t_AHu(T), {t_Mv(V, T), t_a(V, T)}, {t_Mv(V, T) ≺ t_a(V, T)} ⟩ • Attack with APC: m_AA(T) = ⟨ {vehicle(apc, V) ∧ ¬committed(V)}, t_AA(T), {t_Mv(V, T), t_a(V, T)}, {t_Mv(V, T) ≺ t_a(V, T)} ⟩ • Move (Route 1): m_Mv1(V, T) = ⟨ {T = a}, t_Mv(V, T), {t_mv(V, base, tersa, nr1), t_mv(V, tersa, haram, nr2), t_mv(V, haram, a, sr2)}, {t_mv(V, base, tersa, nr1) ≺ t_mv(V, tersa, haram, nr2) ≺ t_mv(V, haram, a, sr2)} ⟩ • Move (Route 2): m_Mv2(V, T) = ⟨ {T = a}, t_Mv(V, T), {t_mv(V, base, haram, hw), t_mv(V, haram, a, sr2)}, {t_mv(V, base, haram, hw) ≺ t_mv(V, haram, a, sr2)} ⟩ 13
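For concreteness, the methods summarized above can also be written as plain data, assuming the hypothetical `Method` structure sketched earlier. The grounding of the vehicles to h1 (Humvee) and a2 (APC) follows the state example used later in the deck, and the preconditions are simplified to membership tests:

```python
# The methods from the summary slide as data (a sketch, not the authors' code).
methods = (
    Method(task="defeatInsurgents(a)",
           precond=lambda s: True,                         # T = a is already ground
           subtasks=("attackWithHumvee(a)", "attackWithAPC(a)")),
    Method(task="attackWithHumvee(a)",
           precond=lambda s: ("vehicle", "humvee", "h1") in s
                             and ("committed", "h1") not in s,
           subtasks=("move(h1,a)", "attack(h1,a)")),
    Method(task="attackWithAPC(a)",
           precond=lambda s: ("vehicle", "apc", "a2") in s
                             and ("committed", "a2") not in s,
           subtasks=("move(a2,a)", "attack(a2,a)")),
    Method(task="move(h1,a)",                               # Route 1, Humvee
           precond=lambda s: True,
           subtasks=("move(h1,base,tersa,nr1)",
                     "move(h1,tersa,haram,nr2)",
                     "move(h1,haram,a,sr2)")),
    Method(task="move(h1,a)",                               # Route 2, Humvee
           precond=lambda s: True,
           subtasks=("move(h1,base,haram,hw)",
                     "move(h1,haram,a,sr2)")),
    # Analogous Route 1 / Route 2 methods would exist for the APC a2.
)
```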

  14. HTN Problem • How to execute the task defeatInsurgents(a), i.e. t_DI(a)? – Decompose the task through the methods in the domain until only actions remain – The resulting ordered sequence of actions is the solution 14
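A minimal, depth-first decomposition loop in the spirit of this slide could look as follows. This is a sketch built on the hypothetical structures above; a real HTN planner would also handle variable bindings, state updates from action effects, and backtracking over method choices:

```python
# Illustrative HTN decomposition: refine a task until only primitive actions remain.
def decompose(task, state, primitive_names, methods):
    """Return an ordered list of primitive actions achieving `task` (or raise)."""
    if task in primitive_names:                  # already primitive: emit it
        return [task]
    for m in methods:
        if m.task == task and m.precond(state):  # first applicable method wins here
            plan = []
            for sub in m.subtasks:               # refine subtasks in the given order
                plan.extend(decompose(sub, state, primitive_names, methods))
            return plan
    raise ValueError(f"no applicable method for task {task!r}")
```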

  15. Decomposed Problem • Decomposition tree for t_DI(a): it branches into t_AHu(a) and t_AA(a); each of these refines into t_Mv(V, T) followed by t_a(V, T); each t_Mv(V, T) refines (via Route 2) into t_mv(V, base, haram, hw) and t_mv(V, haram, a, sr2); the leaves are the primitive actions a_mv(V, F, T, R) and a_a(V, T) 15

  16. HTN Solution • The same decomposition tree; its leaves, read in order, give the HTN solution: the primitive move actions a_mv(V, F, T, R) followed by the attack actions a_a(V, T) 16

  17. Markov Decision Processes • Mathematical model for decision-making in a partially controllable environment • Domain is represented as a tuple Σ = ( S , A , P ) where: – S is the entire state space – A is the set of available actions – P is a state transition function 17
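A direct, hedged rendering of the tuple Σ = (S, A, P) as a Python structure follows; the field names are assumptions, and a reward function is included because the policy computation two slides later needs one:

```python
# Hypothetical container for an MDP domain Σ = (S, A, P) plus a reward function.
from dataclasses import dataclass

@dataclass
class MDP:
    states: frozenset    # S: the entire state space
    actions: dict        # A(s): state -> set of actions applicable in that state
    transitions: dict    # P: (s, a) -> {s': Pr_a(s' | s)}
    rewards: dict        # u(a, s), keyed here as (s, a); used by the solver sketched later
```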

  18. MDP Domain • Represented as a hypergraph • Connections are not necessarily structured • All reachable states are represented • State transition function specifies how actions relate states 18

  19. Computing an MDP policy • An MDP policy is computed using the expected value of a state: V*(s) = max_{a ∈ A(s)} [ u(a, s) + Σ_{s' ∈ S} Pr_a(s' | s) V*(s') ] • The expected value comes from a reward function • An optimal policy is a policy that maximizes the expected value of every state: π*(s) = argmax_{a ∈ A(s)} [ u(a, s) + Σ_{s' ∈ S} Pr_a(s' | s) V*(s') ] 19
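These two equations are exactly what value iteration computes. A compact sketch follows, assuming the hypothetical `MDP` container above; the discount factor gamma is an added assumption that the slide's formulation leaves implicit:

```python
# Value iteration for V*(s) and the greedy policy π*(s) (illustrative sketch).
def value_iteration(mdp, gamma=0.95, eps=1e-6):
    def q(s, a, V):
        # u(a, s) + Σ_{s'} Pr_a(s' | s) · V(s')
        return (mdp.rewards.get((s, a), 0.0)
                + gamma * sum(p * V[s2]
                              for s2, p in mdp.transitions.get((s, a), {}).items()))

    V = {s: 0.0 for s in mdp.states}
    while True:
        delta = 0.0
        for s in mdp.states:
            acts = mdp.actions.get(s) or ()
            new_v = max((q(s, a, V) for a in acts), default=0.0)
            delta = max(delta, abs(new_v - V[s]))
            V[s] = new_v
        if delta < eps:                      # stop when values have converged
            break
    policy = {s: max(mdp.actions[s], key=lambda a: q(s, a, V))
              for s in mdp.states if mdp.actions.get(s)}
    return V, policy
```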

  20. MDP Solution • The solution for an MDP is a policy • A policy associates an optimal action with every state • Instead of a sequential plan, a policy provides contingencies for every state: state0 → actionB, state1 → actionD, state2 → actionA 20

  21. States • Hierarchical Task Network: – States are not enumerated exhaustively – A state consists of properties of the environment, e.g. vehicle(humvee, h1) ∧ vehicle(apc, a2) – Each action modifies properties of the environment – The set of properties induces a very large state space • Markov Decision Process: – The MDP domain explicitly enumerates all relevant states – Formally speaking, MDP states are monolithic entities – They implicitly represent the same properties expressed in the HTN state – Large state spaces make the algorithm flounder 21

  22. State Space Size • Hierarchical Task Network: – The set of actions induces a smaller state space (still quite large) – The set of methods induces a smaller still state space – HTN planning consults this latter state space • Markov Decision Process: – An MDP solver must consult the entire state space – State-space reduction techniques include: factorization, ϵ-homogeneous aggregation 22

  23. HTNs to represent MDPs • We propose using HTNs to represent MDPs • Advantages are twofold: – HTNs are more intuitive to SMEs – Resulting MDP state-space can be reduced using HTN methods as heuristic 23

  24. Fully Expanded HTN • The decomposition tree for t_DI(a) with both routes expanded: under each t_Mv(V, T) appear both Route 1 (t_mv(V, base, tersa, nr1), t_mv(V, tersa, haram, nr2), t_mv(V, haram, a, sr2)) and Route 2 (t_mv(V, base, haram, hw), t_mv(V, haram, a, sr2)), with the primitive actions a_mv(V, F, T, R) and a_a(V, T) at the leaves 24

  25. Reachable States • The same fully expanded tree; the primitive actions at its leaves determine the reachable states 25

  26. Conversion in a nutshell • The state space comes from the reachable primitive actions induced by the HTN methods • Probabilities are uniformly distributed over the planner's choices • The reward function can be computed using the target states at the end of a plan (Simari's approach) 26
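In the same hedged spirit, the conversion can be sketched as follows. The helper names `applicable` and `successors` are assumptions standing in for the HTN machinery (they return, respectively, the primitive actions the methods allow in a state and the states a planner's choice can lead to), and the reward assignment is a crude stand-in for Simari's construction:

```python
# Illustrative HTN-to-MDP conversion (simplified; not the authors' algorithm).
def htn_to_mdp(initial_state, applicable, successors, goal_states, goal_reward=1.0):
    states, frontier = set(), [initial_state]
    actions, transitions, rewards = {}, {}, {}
    while frontier:
        s = frontier.pop()
        if s in states:
            continue
        states.add(s)
        acts = frozenset(applicable(s))                 # HTN-reachable actions only
        actions[s] = acts
        for a in acts:
            nxt = list(successors(s, a))
            if not nxt:                                 # dead end: no transition
                continue
            p = 1.0 / len(nxt)                          # uniform over planner choices
            transitions[(s, a)] = {s2: p for s2 in nxt}
            # Reward actions that lead into target (plan-final) states.
            rewards[(s, a)] = goal_reward if any(s2 in goal_states for s2 in nxt) else 0.0
            frontier.extend(nxt)
    return MDP(frozenset(states), actions, transitions, rewards)
```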

  27. Reachable States • State-transition graph rooted at s0: the reachable states are connected by the primitive actions a_mv(V, F, T, R) and a_a(V, T) along each route 27

  28. Conversion example 28

  29. Increasing Efficiency • State aggregation using Binary Decision Diagrams (BDDs): – BDDs are a compact way of representing multiple logic properties – One BDD can represent multiple (factored) states 29
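As a rough, self-contained illustration of the idea (not the authors' implementation; a real system would use a BDD package), the toy reduced ordered BDD below shows the aggregation effect: two factored states that differ only in one proposition merge into a single diagram that no longer tests that proposition at all.

```python
# Toy reduced ordered BDD, only for illustrating factored-state aggregation.
FALSE, TRUE = 0, 1                 # terminal node ids
_unique = {}                       # (var, lo, hi) -> node id
_info = {FALSE: None, TRUE: None}  # node id -> (var, lo, hi)
VAR_ORDER = ["vehicle(humvee,h1)", "committed(h1)", "at(h1,haram)"]  # example props

def _rank(var):
    return VAR_ORDER.index(var)

def mk(var, lo, hi):
    """Create (or reuse) a node testing `var`; apply the BDD reduction rules."""
    if lo == hi:                               # redundant test: skip the node
        return lo
    key = (var, lo, hi)
    if key not in _unique:
        node_id = len(_info)
        _unique[key] = node_id
        _info[node_id] = key
    return _unique[key]

def from_state(state):
    """Encode one factored state (dict: proposition -> bool) as a BDD."""
    node = TRUE
    for var in reversed(VAR_ORDER):            # build bottom-up in variable order
        node = mk(var, FALSE, node) if state[var] else mk(var, node, FALSE)
    return node

def bdd_or(u, v):
    """Disjunction: the BDD for the union of the state sets that u and v represent."""
    if u == TRUE or v == TRUE:
        return TRUE
    if u == FALSE:
        return v
    if v == FALSE:
        return u
    uvar, ulo, uhi = _info[u]
    vvar, vlo, vhi = _info[v]
    if _rank(uvar) == _rank(vvar):
        return mk(uvar, bdd_or(ulo, vlo), bdd_or(uhi, vhi))
    if _rank(uvar) < _rank(vvar):
        return mk(uvar, bdd_or(ulo, v), bdd_or(uhi, v))
    return mk(vvar, bdd_or(u, vlo), bdd_or(u, vhi))

# Two states differing only in at(h1,haram) merge into one diagram that no
# longer tests that proposition at all.
s1 = from_state({"vehicle(humvee,h1)": True, "committed(h1)": False, "at(h1,haram)": False})
s2 = from_state({"vehicle(humvee,h1)": True, "committed(h1)": False, "at(h1,haram)": True})
both = bdd_or(s1, s2)   # encodes vehicle(humvee,h1) ∧ ¬committed(h1)
```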

  30. Limitations and Future Work • Limitations – The current conversion models only uncertainty from the human planner – Probabilities are uniformly distributed among choices • Future Work – Evaluate the quality of compression through ϵ-homogeneity – Compute probabilities from the world 30

  31. Conclusions • Planning in coalitions is important • Automated tools for planning need a representation amenable to SMEs • Our technique offers advantages over either of the single approaches: – Representation using HTNs for SMEs – Underlying stochastic model for military planning using MDPs 31

  32. QUESTIONS? 32
