On representing planning domains under uncertainty
Felipe Meneguzzi – CMU Yuqing Tang – CUNY Simon Parsons – CUNY Katia Sycara - CMU
BR OOKLYN COLLE GE
On representing planning domains under uncertainty Felipe Meneguzzi - - PowerPoint PPT Presentation
On representing planning domains under uncertainty Felipe Meneguzzi CMU Yuqing Tang CUNY Simon Parsons CUNY Katia Sycara - CMU BR OOKLYN COLLE GE Outline Planning Markov Decision Processes Hierarchical Task Networks
BR OOKLYN COLLE GE
2
3
4
NR1
HW
SR2
Map - East Blogohar Region
B1 Surina
Aria Region Rina Region Haram Region
R2 Major road Bridge Town
Legend W S N E
Haram
B2 Tersa
NR2
SR1
Party B - Military Force
A B C 10 (12) 40 (50) 30 (38)
Base
Minefield
(Missile Range)
Missile Site
20 40
Cost
Force
10 5 10 30 10 + 10 = 20 30 +
Escort Cost per one-way trip Covert mission (Day 2)
70 300
Cost
Force
10 20 5 40 120 130 70 500
Cost
Force
10 20 5 30 90 100
Cost of each vehicle Force Required
50
5
6
7
8
9
10
11
12
13
mDI(T ) = T = a,t DI (T ), t AHu(T ),t AA(T )
mAHu(T ) = vehicle(humvee,V)"¬committed(V), t AHu(T ), t Mv(V ,T ),t a(V ,T )
# $ % % & ' ( ( mAA(T ) = vehicle(apc,V)"¬committed(V), t AA(T ), t Mv(V ,T ),t a(V ,T )
# $ % % & ' ( ( m1
Mv(V ,T ) =
T = a,t Mv(V ,T ), t mv(V ,base,tersa,nr1),t mv(V ,tersa,haram,nr2),t mv(V ,haram,a,sr2)
t mv(V ,base,tersa,nr1) ! t mv(V ,tersa,haram,nr2) ! t mv(V ,haram,a,sr2)
" # $ $ $ $ % & ' ' ' ' m2
Mv(V ,T ) =
T = a,t Mv(V ,T ), t mv(V ,base,haram,hw),t mv(V ,tersa,a,sr2)
t mv(V ,base,haram,hw) t mv(V ,tersa,a,sr2)
⎛ ⎝ ⎜ ⎜ ⎜ ⎜ ⎞ ⎠ ⎟ ⎟ ⎟ ⎟
14
15
tDI(a) tAHu(a) tAA(a)
tmv(V,base,haram,hw) tmv(V,haram,a,sr2)
aa(V,T) tMv(V,T) ta(V,T) tMv(V,T) ta(V,T)
tmv(V,base,haram,hw) tmv(V,haram,a,sr2) amv(V,F,T,R) amv(V,F,T,R)
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R)
16
tDI(a) tAHu(a) tAA(a)
tmv(V,base,haram,hw) tmv(V,haram,a,sr2)
aa(V,T) tMv(V,T) ta(V,T) tMv(V,T) ta(V,T)
tmv(V,base,haram,hw) tmv(V,haram,a,sr2) amv(V,F,T,R) amv(V,F,T,R)
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R)
17
18
19
π*(s) = argmaxa∈A(s) u(a,s) + Pr
a (s'| s)V *(s') s'∈S
⎡ ⎣ ⎢ ⎤ ⎦ ⎥ V *(s) = max
a∈A(s) u(a,s) +
Pr
a (s'| s)V *(s') s'∈S
⎡ ⎣ ⎢ ⎤ ⎦ ⎥
20
21
vehicle(humvee,h1)∧vehicle(apc,a2)
– Factorization – ϵ-homogeneous aggregation
22
23
24
tDI(a) tAHu(a) tAA(a)
tmv(V,base,tersa,nr1) tmv(V,tersa,haram,nr2) tmv(V,haram,a,sr2) tmv(V,base,haram,hw) tmv(V,haram,a,sr2)
aa(V,T) tMv(V,T) ta(V,T) tMv(V,T) ta(V,T)
tmv(V,base,tersa,nr1) tmv(V,tersa,haram,nr2) tmv(V,haram,a,sr2) tmv(V,base,haram,hw) tmv(V,haram,a,sr2) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
tDI(a) tAHu(a) tAA(a)
tmv(V,base,tersa,nr1) tmv(V,tersa,haram,nr2) tmv(V,haram,a,sr2) tmv(V,base,haram,hw) tmv(V,haram,a,sr2)
aa(V,T) tMv(V,T) ta(V,T) tMv(V,T) ta(V,T)
tmv(V,base,tersa,nr1) tmv(V,tersa,haram,nr2) tmv(V,haram,a,sr2) tmv(V,base,haram,hw) tmv(V,haram,a,sr2) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
25
26
27
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
aa(V,T)
amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R) amv(V,F,T,R)
s0
28
29
30
31
32