New Perspectives for Multi-Armed Bandits and Their Applications
Vianney Perchet
Workshop Learning & Statistics IHES, January 19 2017 CMLA, ENS Paris-Saclay
New Perspectives for Multi-Armed Bandits and Their Applications - - PowerPoint PPT Presentation
New Perspectives for Multi-Armed Bandits and Their Applications Vianney Perchet Workshop Learning & Statistics IHES, January 19 2017 CMLA, ENS Paris-Saclay Motivations & Objectives Classical Examples of Bandits Problems Size of
Workshop Learning & Statistics IHES, January 19 2017 CMLA, ENS Paris-Saclay
3
3
3
3
3
4
4
4
4
4
4
4
4
4
4
4
4
4
5
t ∈ R (sub-)Gaussian,
1, Xi 2, . . . , ∼ N
1 , Xπ2 2 , . . . , Xπt−1 t−1
t=1 EXπt t = ∑T t=1 µπt
i∈{1,2} T
t=1
T
t=1
T
t=1
7
i
i t +
t=1 1{πt = i} and X i t = 1 Ti
t
s:is=i Xi s.
k log(T) ∆k
∆
8
⋆ t +
i t +
i
∆2
i ; each mistake costs ∆i.
i log(T) ∆2
i
i log(T) ∆i
i log(T) ∆i 9
k log(T∆k) ∆k
∆min
T ε ≤ T2/3 10
11
12
13
14
t ∈ [0, 1] ∼ νk(ωt), E[Xk|ω] = µk(ω)
t=1 µπ⋆(ωt)(ωt) − µπt(ωt) 15
k
k and
k
k
16
16
17
17
17
17
17
17
18
18
18
18
18
18
19
19
19
K log(K) T
2β+d , bin side
K log(K) T
1 2β+d .
20
21
21
K log(K) T
2β+d .
22
β 2β+d
23
24
25
26
27
28
29
30
31
5%T ?? 32
33