Fast approximate planning in POMDPs
Geoff Gordon
ggordon@cs.cmu.edu
Joelle Pineau, Geoff Gordon, Sebastian Thrun. Point-based value iteration: an anytime algorithm for POMDPs
Fast approximate planning in POMDPs – p.1/37
$\max_{v \in V} \; ((b T_a) \times w_z) \cdot v$
$= \max_{v \in V} \; b \cdot T_a (w_z \times v)$
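The two expressions above agree because $((b T_a) \times w_z) \cdot v = b \cdot T_a (w_z \times v)$ holds for any single $v$. The sketch below checks this numerically on a toy model, taking the outer operator to be a max over $V$. Everything in it is an assumption made for illustration: the 2-state numbers, the reading of $b$ as a row vector, $T_a$ as a row-stochastic transition matrix, $w_z$ as a vector of observation likelihoods, and the helper names.

```python
# Hypothetical toy model: 2 states, one action a, one observation z.
b = [0.6, 0.4]                     # belief over states (row vector)
T_a = [[0.9, 0.1],                 # T_a[s][s2] = P(s2 | s, a), assumed layout
       [0.2, 0.8]]
w_z = [0.7, 0.1]                   # w_z[s2] = P(z | s2), assumed layout
V = [[1.0, 0.0], [0.0, 2.0], [0.8, 0.9]]   # candidate value vectors


def dot(x, y):
    return sum(xi * yi for xi, yi in zip(x, y))


def hadamard(x, y):
    # Elementwise product, written as "x" in the slide notation.
    return [xi * yi for xi, yi in zip(x, y)]


def row_times_matrix(x, M):
    # (x M)[j] = sum_i x[i] * M[i][j]
    return [sum(x[i] * M[i][j] for i in range(len(x))) for j in range(len(M[0]))]


def matrix_times_col(M, y):
    # (M y)[i] = sum_j M[i][j] * y[j]
    return [dot(row, y) for row in M]


def value_forward(b, T_a, w_z, V):
    # Push the belief through the transition, weight by the observation,
    # then score the result against every v.
    pushed = hadamard(row_times_matrix(b, T_a), w_z)
    return max(dot(pushed, v) for v in V)


def value_backward(b, T_a, w_z, V):
    # Pull every v back through the observation and the transition,
    # then score the pulled-back vectors at the original belief b.
    return max(dot(b, matrix_times_col(T_a, hadamard(w_z, v))) for v in V)


print(value_forward(b, T_a, w_z, V), value_backward(b, T_a, w_z, V))
```

One reason the second form can be convenient: the pulled-back vectors $T_a (w_z \times v)$ do not depend on $b$, so they can be computed once and reused for many beliefs.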
above representation due to [Cassandra et al]
$\frac{d}{db} V(b)$
$v_1 \in V_{a z_1}$
$v_2 \in V_{a z_2}$
$\max_{b' \in \Delta} \, \min_{b \in B} \, \| b - b' \|_1$
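The quantity above measures how well a finite belief set $B$ covers the belief simplex $\Delta$: for the worst-placed belief $b'$, how far away (in L1 distance) is the nearest point of $B$. A minimal sketch follows, assuming the outer operator is a max over $b'$ and replacing that max by a max over a finite grid of probe beliefs, so the result is only a lower bound on the true value. The function names `coverage_gap` and `simplex_grid` are hypothetical.

```python
def l1(x, y):
    # L1 distance between two belief vectors.
    return sum(abs(xi - yi) for xi, yi in zip(x, y))


def coverage_gap(B, probes):
    # Max over probe beliefs b' of the L1 distance to the nearest b in B.
    return max(min(l1(b, bp) for b in B) for bp in probes)


def simplex_grid(n_states, steps):
    # All beliefs whose entries are multiples of 1/steps:
    # a finite stand-in for the full simplex (an approximation).
    def compositions(remaining, dims):
        if dims == 1:
            yield [remaining]
            return
        for k in range(remaining + 1):
            for rest in compositions(remaining - k, dims - 1):
                yield [k] + rest

    return [[k / steps for k in ks] for ks in compositions(steps, n_states)]


probes = simplex_grid(2, 4)
corners = [[1.0, 0.0], [0.0, 1.0]]
print(coverage_gap(corners, probes))                  # corners only
print(coverage_gap(corners + [[0.5, 0.5]], probes))   # one extra belief point
```

Adding the midpoint belief to the two corners halves the gap on this grid, which illustrates why denser belief sets give a smaller value of this quantity.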