SLIDE 35
Basins of Convergence of the Max Bellman Error
Plotting $p_i \mapsto \|BE_{\hat{V}}(T)\|_\infty$ for each $p_i \in \{c_1, c_2, c_3, \tau\}$ (for $j \neq i$, the $p_j$ are held fixed at their default values)
[Figure: four panels, each plotting Max Bellman Error (y-axis) against one LWR parameter (x-axis, 2 to 20). Panels: LWR Bandwidth (τ) Parameter; LWR T_in Scaling Parameter; LWR T… (label truncated); LWR Time Scaling Parameter.]
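The one-parameter-at-a-time sweep behind these plots can be sketched as follows. This is a minimal illustration, not the authors' code: `sweep_one_at_a_time`, `sup_norm`, `toy_bellman_errors`, the default values, and the 2 to 20 grid (taken from the axis ticks) are all assumptions, and the toy error function merely stands in for the real per-state Bellman errors of a fitted value function.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Params = Dict[str, float]


def sup_norm(errors: Sequence[float]) -> float:
    """Max-norm of a vector of per-state Bellman errors."""
    return max(abs(e) for e in errors)


def sweep_one_at_a_time(
    error_fn: Callable[[Params], Sequence[float]],
    defaults: Params,
    grid: Sequence[float],
) -> Dict[str, List[Tuple[float, float]]]:
    """For each parameter p_i, vary it over `grid` while every other
    p_j (j != i) stays at its default; record (value, max error)."""
    curves: Dict[str, List[Tuple[float, float]]] = {}
    for name in defaults:
        curve = []
        for value in grid:
            params = dict(defaults)  # copy so the defaults stay untouched
            params[name] = value     # perturb only p_i
            curve.append((value, sup_norm(error_fn(params))))
        curves[name] = curve
    return curves


# Hypothetical stand-in: one "error" per parameter, zero at an assumed optimum.
TARGETS: Params = {"c1": 4.0, "c2": 8.0, "c3": 12.0, "tau": 10.0}


def toy_bellman_errors(p: Params) -> List[float]:
    return [p[k] - TARGETS[k] for k in TARGETS]


DEFAULTS: Params = dict(TARGETS)                # assumed default values
GRID = [float(v) for v in range(2, 21, 2)]      # 2, 4, ..., 20

curves = sweep_one_at_a_time(toy_bellman_errors, DEFAULTS, GRID)
best_tau = min(curves["tau"], key=lambda vc: vc[1])
print("tau with lowest max Bellman error:", best_tau)
```

Each entry of `curves` corresponds to one panel: the list of (parameter value, max Bellman error) pairs for that parameter with the others held at their defaults.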
Daniel Urieli, Peter Stone Model-Selection for Non-Parametric Function Approximation