SLIDE 10 POMDP application
- UAI paper persented on Friday:
Marc Toussaint, Laurent Charlin, Pascal Poupart: Hierarchical POMDP Controller Optimization by Likelihood Maximization
N′1 N2 N1 N0 O S S′ O′ A N′0 E0 E1 N′2 N′2 N2 N1 N0 O S A S′ O′ N′0 N′1 N2 N1 N0 S S′ N′0 N′1 N′2
N2N1N0SS′ N2N1N0S′ N2N′2N1N0S′ N′2N1N0S′ N′2N1N′1N0S′ N′2N′1N0S′ N′2N′1N0N′0S′
Problem |S|, |A|, |O| V ∗ HSVI2 Best results from [1] ML approach (avg. over 10 runs) V nodes t(s) V nodes t(s) V paint 4, 4, 2 3.28 3.29±0.04 (1,3) <1 3.29 (5,3) 0.96±0.3 3.26±0.004 shuttle 8, 3, 5 32.7 32.9±0.8 (1,3) 2 31.87 (5,3) 2.81±0.2 31.6±0.5 4x4 maze 16, 4, 2 3.7 3.75±0.1 (1,2) 30 3.73 (3,3) 2.8±0.8 3.72±8e−5 chain-of-chains 10, 4, 1 157.1 157.1±0 (3,3) 10 0.0 (10,3) 6.4±0.2 151.6±2.6 handwashing 84, 7, 12 1052 N/A N/A (10,5) 655±2 984±1 cheese-taxi 33, 7, 10 5.3 2.53±0.3 N/A (10,3) 311±14 −9±11(2.25∗)