SLIDE 16 Introduction Data-Dependent Error Analysis Applications and Comparisons of Bounds Summary
Standard Bounds vs. Theorems 1 & 2 / Exploration Case
Markov chain: 200 states; k = 50; ξ: uniform
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 20 40 60 80 100 120 140 160 180 α=0.99, ||αP||=0.995, λ ∈ [0,1] standard I standard II
- Thm. 1, S=S1: e ¬⊥ S1
- Thm. 1, S=S2: e ⊥ S2
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 α=0.99, ||αP||=0.995, λ ∈ [0,1] standard II
- Thm. 2, S=S1: e ¬⊥ S1
- Thm. 1, S=S2: e ⊥ S2
- Thm. 2, S=S2: e ⊥ S2
Bound λ Bound λ
❄
❄
❄
STD II
❄
❄
❄
- Thm. 2/⊥
- In general, ΠA is not necessarily a contraction.
need to choose λ properly; TD(0) can always be safely applied.
- The first bound needs A, so do the standard bounds for the
contraction case.