Proactive Dynamic DCOPs
Khoi Hoang
Ferdinando Fioretto Ping Hou Makoto Yokoo William Yeoh Roie Zivan
Proactive Dynamic DCOPs Khoi Hoang Ferdinando Fioretto Ping Hou - - PowerPoint PPT Presentation
Proactive Dynamic DCOPs Khoi Hoang Ferdinando Fioretto Ping Hou Makoto Yokoo William Yeoh Roie Zivan Proactive Dynamic DCOPs Khoi Hoang Ferdinando Fioretto Ping Hou Makoto Yokoo William Yeoh Roie Zivan Content Electric Vehicle
Ferdinando Fioretto Ping Hou Makoto Yokoo William Yeoh Roie Zivan
Ferdinando Fioretto Ping Hou Makoto Yokoo William Yeoh Roie Zivan
times
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2 Vehicle B Vehicle C Utility 1 1 7 … … … 23 23 Vehicle C Vehicle A Utility 2 1 10 … … … 23 23 5
1
X = {x }n
i i=
{ }
x x X
D D
∈
=
1
{ }m
i i
F f
=
=
1
{ }p
i i
A a
=
=
: { }
fi
i x x X
f D
+ ∈
× → ∪ ⊥ °
,
( ) ( )
f
f F x x
F f
σ
σ σ
∈ ⊆
=∑ arg max ( )
x
x F x
∗ =
, , , , A X D F α
α
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
where each followed DCOP changes based on stochastic events:
sequence
1 2
n
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
starting time
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
, , , , , , , , ,
Y
A X D F T h c p γ α
1
X = {x }n
i i= 1
{ }p
i i
A a
=
=
Y X ⊆
{ }
x x X
D D
∈
= { }
y y Y
D
∈
Ω = Ω ⊆
1
{ }m
i i
F f
=
=
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
, , , , , , , , ,
Y
A X D F T h c p γ α
c∈°
[ )
0,1 γ ∈
{ }
y y Y
T T
∈
=
: [0,1]
y y y
T Ω ×Ω → ⊆ °
y Y ∈
{ }
Y y y Y
p p
∈
=
α
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
, , , , , , , , ,
Y
A X D F T h c p γ α
Objective functions Sum of utility functions over first h time steps Switching cost Long-term utility at last time step Vehicle A Vehicle B Utility 1 3 … … … 23 23 2
, , , , , , , , ,
Y
A X D F T h c p γ α
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2 Sum of constraints with random variables Sum of constraints w/o random variables Probability of random variable taking a value
, , , , , , , , ,
Y
A X D F T h c p γ α
Vehicle A Vehicle B Utility 1 3 … … … 23 23 2 Long-term utility w/o random variables Long-term utility with random variables Long-term expected utility (Bellman equation)
Ø Collapse h+1 DCOPs into a single DCOP Ø Use any off-the-shelf exact DCOP algorithm
Ø Start with initial assignments: random or heuristics Ø Use local search approach Ø Reuse information and heuristic building pseudo-tree
t=0 x1 x2 Utility u11 1 u12 1 u13 1 1 u14 t=1 x1 x2 Utility u21 1 U22 1 u23 1 1 u24 t=2 x1 x2 Utility u31 1 u32 1 u33 1 1 u34 Collapsed table x1 x2 Aggregated utility 0,0,0 0,0,0 u11 + u21 + u31 0,0,0 0,0,1 u11 + u21 + u32 … … … 1,1,1 1,1,1 u14 + u24 + u34
DCOP algorithms
Collapsed table x1 x2 Aggregated utility 0,0,0 0,0,0 u11 + u21 + u31 0,0,0 0,0,1 u11 + u21 + u32 … … … 1,1,1 1,1,1 u14 + u24 + u34
Ø Changes in random variables’ values Ø Take advantage of information (initial probabilities, transition functions) Ø Decision variables incur switching costs
Magazine 33(3):53–65.
constraint reasoning. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 1466–1469.