ER-DCOPS: A FRAMEWORK FOR DCOP WITH UNCERTAINTY IN CONSTRAINT UTILITIES
Tiep Le, Ferdinando Fioretto, William Yeoh, Tran Cao Son, Enrico Pontelli Computer Science Department New Mexico State University
1
ER-DCOPS: A FRAMEWORK FOR DCOP WITH UNCERTAINTY IN CONSTRAINT - - PowerPoint PPT Presentation
1 ER-DCOPS: A FRAMEWORK FOR DCOP WITH UNCERTAINTY IN CONSTRAINT UTILITIES Tiep Le, Ferdinando Fioretto, William Yeoh, Tran Cao Son, Enrico Pontelli Computer Science Department New Mexico State University 2 OUTLINE BACKGROUND &
1
2
3
4
x3 x1 x2
5
x3 x1 x2
6
x3 x1 x2
7
x3 x1 x2
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
8
x3 x1 x2
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
Worker x1 owns variable x1 Worker x2 owns variable x2 Assistant robot x3 owns variable x3
9
x3 x1 x2
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
Worker x1 owns variable x1 Worker x2 owns variable x2 Assistant robot x3 owns variable x3
10
x2 x3 U23 (Fail) (Success) 40 1 (Fail) (Success) 50
11
x2 x3 U23 (Fail) (Success) 40 1 (Fail) (Success) 50
Good 50% 50% 50% 50% Bad 90% 10% 90% 10%
12
x2 x3 U23 (Fail) (Success) 40 1 (Fail) (Success) 50
Good 50% 50% 20% 80% Bad 90% 10% 50% 50%
13
x2 x3 U23 (Fail) (Success) 40 1 (Fail) (Success) 50
Good 50% 50% 20% 80% Bad 90% 10% 50% 50%
14
15
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
16
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x2 x3 U23 (Fail) (Success) 40 1 (Fail) (Success) 50
17
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50
18
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% Bad 90% 10% 50% 50%
19
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% Bad 90% 10% 50% 50% x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70% Bad 30% 70% 50% 50%
20
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70%
21
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70%
EU 45 21 EU 20 40
22
x1 x2 x3 EU 39 1 40
x1 x2 x3 EU 65 1 41
Optimal assignment if bad weather (EU = 40) Optimal assignment if good weather (EU = 65)
23
Regret 40-39=1 40-40=0 Regret 65-65=0 65-61=4
Assignment x1 = x2 = x3 = 0 has regret of 1 if bad weather regret of 0 if good weather x1 x2 x3 EU 39 1 40 x1 x2 x3 EU 65 1 41
24
Regret 40-39=1 40-40=0 Regret 65-65=0 65-61=4 x1 x2 x3 ER 0 12%*0 + 88%*1 = 0.88 1 12%*4 + 88%*0 = 0.48 12% 88%
x1 x2 x3 EU 39 1 40 x1 x2 x3 EU 65 1 41
25
x1 x3 EU 35 1 15
x2 x3 EU 4 1 25 x1 x3 EU 45 1 21
x2 x3 EU 20 1 40 Regret 40-39=1 40-40=0 Regret 65-65=0 65-61=4 x1 x2 x3 ER 0 12%*0 + 88%*1 = 0.88 1 12%*4 + 88%*0 = 0.48 12% 88%
The solution minimizes the expected-regret
26
27
x3 x1 x2
28
x3 x1 x2
x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70% Bad 30% 70% 50% 50% x3 1 EU(Good) 45 21 EU(Bad) 35 15 EU = Expected Utility
29
x3 x1 x2
x2 x3 r2 U23 0 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% Bad 90% 10% 50% 50% x3 1 EU(Good) 20 40 EU(Bad) 4 25 EU = Expected Utility
30
x3 x1 x2
x3 1 EU(Good) 45+20=65 21+40=61 EU(Bad) 35+4=39 15+25=40 EU = Expected Utility
31
x1 x3 Expected-Regret 0 12%*(45-45) + 88%*(15-35) = -17.6 1 12%*(45-21) + 88%*(15-15) = 2.88 x1 x3 1 EU(Good) 45 21 EU(Bad) 35 15 x2 x3 1 EU(Good) 20 40 EU(Bad) 4 25 x2 x3 Expected-Regret 0 12%*(20-20) + 88%*(25-4) = 18.48 1 12%*(20-40) + 88%*(25-25) = 2.4
32
33
34
35
|X| ASP-ER-DPOP GPU-ER-DPOP FRODO-ER 8 3.1 0.1 0.3 13 9.4 0.2 61.1 18 44.1 N/A N/A 23 120.8 N/A N/A
runtime in second N/A: not available Random Graphs
36
|D| ASP-ER-DPOP GPU-ER-DPOP FRODO-ER 4 4.5 0.1 1.8 6 8.9 0.1 33.6 8 22.2 1.2 143.2 10 80.4 4.8 N/A 12 121.2 15.4 N/A
Power Network Problems
37
102 103 104 105 106 3 5 7 9 11 13 15 Simulated Runtime (ms) Domain Size |A| = 13, |X| = 74, |F| = 51 ASP-ER Frodo-ER 102 103 104 105 106 3 5 7 9 11 13 15 Simulated Runtime (ms) Domain Size |A| = 37, |X| = 218, |F| = 147 ASP-ER Frodo-ER 102 103 104 105 106 3 5 7 9 11 13 15 Simulated Runtime (ms) Domain Size |A| = 124, |X| = 748, |F| = 497 ASP-ER Frodo-ER
decision variables;
38
Belief Space’s Size Better Worse Equal 5 45% 20% 35% 10 36% 28% 36% 15 47% 20% 33%
Compare Actual Regret ER-DCOP solution vs UR-DCOP solution
39
utilities.
DCOPs.
rules to prune the search space.
actual regret (belief space exhibits normal distribution).
40
41
42
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
43
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 50 1 30
44
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 50 1 30 x3 Umax 40 1 50
45
x1 x3 U13 1 1 1 1 1 1 2 Dx1 = Dx2 = [0,1]. U(X1,X2) = X1 + X2 domain_x1(0..1). domain_x2(0..1). utility1_2(U,X1,X2) ß domain_x1(X1), domain_x2(X2), U = X1 + X2. Implicit Representation
46
x1 x2 U12 1
1
1 1
Dx1 = Dx2 = [0,1]. The message U(X1,X) = 0 if X1 = X2 = 0; otherwise, -∞ domain_x1(0..1). domain_x2(0..1). utility1_2(0,X1,X2) ß domain_x1(X1), domain_x2(X2), X1 = 0, X2= 0.
47
48
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% Good: 12% Bad: 88% x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70%
util1_3(5,0,0). (facts or rules)
x1 x3 x2
49
table_max_a3(16,0,0) table_max_a3(25,0,1) table_max_a3(25,1,0) table_max_a3(40,1,1) table_info(a3,a2,x2,0,1) table_info(a3,a1,x1,0,1)
solution(a1,x1,1)
table_max_a2(33,0) table_max_a2(45,1) table_info(a2,a1,x1,0,1)
From agent a3
table_row_a2(V0+V1,X1) ← x1_cons_x2(V0,X1,X2), table_max_a3(V1,X1,X2). table_max_a2(U,X1) ← U = #max[ table_row_a2(V,X1)=V ]
To agent a1
0 { row(U,X2) } ← table_max_a2(U,X1), solution(a1,x1,X1), x1_cons_x2(V0,X1,X2), table_max_a3(V1,X1,X2), U == V0+V1 ← not 1 {row(U,X2) } 1 solution(a2,x2,X2) ← row(U,X2)
From agent a1
solution(a2,x2,0) solution(a1,x1,1)
To agent a3
Ia2 Ma3 I’a2
x1 x3 x2
Agent Controller in Agent 2
50
51
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80%
52
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% Bad 90% 10% 50% 50%
53
x2 x3 r2 U23 0 (Fail) 1 (Success) 40 1 0 (Fail) 1 (Success) 50 Good 50% 50% 20% 80% x1 x3 r1 U13 0 0 (Fail) 1 (Success) 50 1 0 (Fail) 1 (Success) 30 Good 10% 90% 30% 70%
EU 45 21 EU 20 40
54
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
55
56
x3 x1 x2
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
57
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
1: (A. Petcu et al. IJCAI 2005)
58
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 50 1 30
59
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 40 1 50 x3 Umax 50 1 30
60
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 40 1 50 x3 Umax 50 1 30 x3 Umax 50+40=90 1 30+50=80
61
x1 x3 U13 50 1 30 x2 x3 U23 40 1 50
x3 x1 x2
x3 Umax 40 1 50 x3 Umax 50 1 30 x3 Umax 90 1 80