Following the Flattened Leader
Wojciech Kot lowski1 Peter Gr¨ unwald1 Steven de Rooij2
1National Research Institute for Mathematics and Computer Science (CWI)
The Netherlands
2University of Cambridge
COLT 2010
1 / 14
Following the Flattened Leader lowski 1 unwald 1 Steven de Rooij 2 - - PowerPoint PPT Presentation
Following the Flattened Leader lowski 1 unwald 1 Steven de Rooij 2 Wojciech Kot Peter Gr 1 National Research Institute for Mathematics and Computer Science (CWI) The Netherlands 2 University of Cambridge COLT 2010 1 / 14 Outline 1
1National Research Institute for Mathematics and Computer Science (CWI)
2University of Cambridge
1 / 14
2 / 14
2 / 14
2 / 14
2 / 14
3 / 14
3 / 14
3 / 14
3 / 14
3 / 14
3 / 14
4 / 14
4 / 14
µi(·)
4 / 14
µi(·) ⇐
4 / 14
µi(·) ⇐
µ◦
i (·), ˆ
i = #1+1 n+2 (Laplace’s rule of succesion).
i : 1 2, 2 3, 1 2, 3 5, 1 2, 4 7, 5 8, 5 9, 3 5, 7 11, 7 12.
4 / 14
µi(·) ⇐
µ◦
i (·), ˆ
i = #1+1 n+2 (Laplace’s rule of succesion).
i : 1 2, 2 3, 1 2, 3 5, 1 2, 4 7, 5 8, 5 9, 3 5, 7 11, 7 12.
4 / 14
5 / 14
5 / 14
5 / 14
2 log n + O(1) (asympt. optimal).
5 / 14
2 log n + O(1) (asympt. optimal).
5 / 14
2 log n + O(1) (asympt. optimal).
n:
n = n0x0 + n i=1 xi
5 / 14
2 log n + O(1) (asympt. optimal).
n:
n = n0x0 + n i=1 xi
2 log n + O(1), worst case: c ≫ 1.
5 / 14
6 / 14
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes 2 4 6 8 10 12 14 2 4 6 8 10 n regret [nats] ML Bayes
7 / 14
8 / 14
µ◦
n(xn+1)n + n0 + 1
2(xn+1 − ˆ
n)T I(ˆ
n)(xn+1 − ˆ
n)
2
9 / 14
µ◦
n(xn+1)n + n0 + 1
2(xn+1 − ˆ
n)T I(ˆ
n)(xn+1 − ˆ
n)
2
n
µ X
9 / 14
10 / 14
11 / 14
12 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes −2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
Flattened Bayes
12 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes −2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
Flattened Bayes
12 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes −2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
Flattened Bayes
12 / 14
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
ML Bayes −2 −1 1 2 0.0 0.1 0.2 0.3 0.4 x p(
Flattened Bayes
12 / 14
13 / 14
14 / 14