An Algorithm better than AO*?
Blai Bonet
Universidad Simón Bolívar
Caracas, Venezuela

Héctor Geffner
ICREA and Universitat Pompeu Fabra
Barcelona, Spain

7/2005
An Algorithm Better than AO*? B. Bonet and H. Geffner; 7/05 1
${}_{s' \in F(a,s)}\, V(s')$
$\sum_{s' \in F(a,s)} P_a(s' \mid s)\, V(s')$
$\sum_{s} [V^*(s) - V(s)]$ iterations (provided
ldfs-driver(s0)
begin
    repeat solved := ldfs(s0) until solved
    return (V, π)
end

ldfs(s)
begin
    if s is solved or terminal then
        if s is terminal then V(s) := cT(s)
        Mark s as solved
        return true
    flag := false
    foreach a ∈ A(s) do
        if QV(a, s) > V(s) then continue
        flag := true
        foreach s′ ∈ F(a, s) do
            flag := ldfs(s′) & [QV(a, s) ≤ V(s)]
            if ¬flag then break
        if flag then break
    if flag then
        π(s) := a
        Mark s as solved
    else
        V(s) := min a∈A(s) QV(a, s)
    return flag
end
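The pseudocode above can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. It assumes an acyclic state space, integer costs, an initial V that is a lower bound on the optimal values, and a Max-style backup QV(a, s) = c(a, s) + max of V over F(a, s); the cost term c(a, s) and the choice of max are assumptions, since the slides as extracted do not spell them out. The dictionary-based model (`actions`, `succ`, `cost`, `terminal_cost`, `h`) and the toy problem are hypothetical.

```python
def q_value(V, a, s, cost, succ):
    """Assumed Max-model backup: c(a, s) + max of V over F(a, s)."""
    return cost[(a, s)] + max(V[t] for t in succ[(a, s)])


def ldfs(s, V, policy, solved, actions, succ, cost, terminal_cost):
    """One LDFS pass from s; returns True if s ends up labeled solved."""
    if s in solved or s in terminal_cost:
        if s in terminal_cost:
            V[s] = terminal_cost[s]
        solved.add(s)
        return True

    flag = False
    chosen = None
    for a in actions[s]:
        # Skip actions that are not greedy with respect to the current V.
        if q_value(V, a, s, cost, succ) > V[s]:
            continue
        flag = True
        for t in succ[(a, s)]:
            # Recurse, then re-check that a is still greedy after any
            # value updates made deeper in the search.
            flag = (ldfs(t, V, policy, solved, actions, succ, cost,
                         terminal_cost)
                    and q_value(V, a, s, cost, succ) <= V[s])
            if not flag:
                break
        if flag:
            chosen = a
            break

    if flag:
        policy[s] = chosen
        solved.add(s)
    else:
        # No greedy action could be verified: raise V(s) with a backup.
        V[s] = min(q_value(V, a, s, cost, succ) for a in actions[s])
    return flag


def ldfs_driver(s0, h, actions, succ, cost, terminal_cost):
    """Repeat LDFS passes from s0 until s0 is labeled solved."""
    V = dict(h)  # start from the heuristic (assumed to be a lower bound)
    policy, solved = {}, set()
    while not ldfs(s0, V, policy, solved, actions, succ, cost, terminal_cost):
        pass
    return V, policy


# Toy acyclic problem, made up for illustration only.
actions = {"s0": ["a", "b"], "s1": ["x"], "s2": ["y"]}
succ = {("a", "s0"): ["s1", "s2"], ("b", "s0"): ["d"],
        ("x", "s1"): ["g"], ("y", "s2"): ["g"]}
cost = {("a", "s0"): 1, ("b", "s0"): 1, ("x", "s1"): 2, ("y", "s2"): 5}
terminal_cost = {"g": 0, "d": 10}
h = {s: 0 for s in ["s0", "s1", "s2", "g", "d"]}

V, policy = ldfs_driver("s0", h, actions, succ, cost, terminal_cost)
print(V["s0"], policy["s0"])
```

On this toy problem the driver needs four passes from s0: the first three each end in a value update somewhere along the greedy path, and the fourth verifies action a at s0 with V(s0) = 6, since 1 + max(2, 5) = 6 is cheaper than the alternative 1 + 10 = 11.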
problem        |S|       V*     N_vi   |A|     |F|    |π*|
coins-10       43        3      2      172     3      9
coins-60       1,018     5      2      315K    3      12
mts-5          625       17     14     4       4      156
mts-35         1.5M      573    322    4       4      220K
mts-40         2.5M      684    –      4       4      304K
diag-60-10     29,738    6      8      10      2      119
diag-60-28     > 15M     6      –      28      2      119
rules-5000     5,000     156    158    50      50     4,917
rules-20000    20,000    592    594    50      50     19,889
[Figure: time in seconds (log scale) for Value Iteration, LDFS, Bounded LDFS, AO* and Min-Max LRTA*, each with h = 0, h = h1(#vi/2) and h = h2(#vi/2).
Row 1: coins, time vs. number of coins.
Row 2: mts, time vs. size of maze (AO*/CFC in place of AO*).
Row 3: rules systems, max rules = 50, max body = 50, time vs. number of atoms.]
[Figure: time in seconds (log scale) for Value Iteration, LDFS, Bounded LDFS, AO* and Min-Max LRTA*, each with h = 0, h = h1(#vi/2) and h = h2(#vi/2).
Row 1: diagnosis, #tests = 10, time vs. number of states.
Row 2: diagnosis, #states = 60, time vs. number of tests.]