Internship Defense
David Taralla
University of Liège
Internship Defense David Taralla University of Lige Thursday 19 - - PowerPoint PPT Presentation
Internship Defense David Taralla University of Lige Thursday 19 December 2013 Contents Introduction Context Basic idea From the idea to the theoretical implementation Conclusion Internship Defense David Taralla University of Lige
University of Liège
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
For example:
3 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
4 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
5 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
i.e. the algorithm performing the best on average 6 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
One cannot pull half an arm!
Existing methods not really adapted to big cardinality with finite budget
Length up to 5 → #AlgoSpace = 3155: this method is not easily scalable 7 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
⇒ Perform some kind of information transfer from a (set of) arm(s) to another ⇒ This internship was about this problem 8 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get maximal information → Reduce required samples amount!
9 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Create sampling plan Add resulting data to memory Get a regressor using RLS on data gathered so far Get best arm a∗ using predictions Are we confident enough for a∗? Return a∗ Prune arm space Get lower & upper confidence bounds No Yes 10 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Create sampling plan
(Erratum — Report says we maximize J(γ). That is incorrect, we minimize J(γ)).
11 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far
φa, ˆ θ + η 12 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far
φa, ˆ θ + η
12 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far
φa, ˆ θ + η
12 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far
φa, ˆ θ + η
ra = φa, ˆ θ 12 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far
φa, ˆ θ + η
ra = φa, ˆ θ
12 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far — Kernels —
13 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far — Kernels —
13 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far — Kernels —
13 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far — Regularization parameter λ —
⇒ Minimize e(λ) = 1 n
n
(fD−i ,λ(ai ) − ri )2
1. Get ˆ α — O(n3) (1 matrix inversion) 2. Do it for n different datasets — O(n) ⇒ If M evaluations of e(λ), total complexity of O(Mn4)!
⇒ If M evaluations of e(λ), achievable total complexity of O(n3 + Mn2) 14 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get a regressor using RLS on data gathered so far — Regularization parameter λ —
Mean error when predicting the mean reward of an algorithm
2 4 6 8 10 12 14 16 18 20 22 24 1.00E-06 1.00E-05 0.0001 0.001 0.01 0.1 1 10 100 1000 10000 Mean errors using GCV Lambda
15 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Get lower & upper confidence bounds
16 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Prune arm space
17 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
18 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
Create sampling plan Add resulting data to memory Get a regressor using RLS on data gathered so far Get best arm a∗ using predictions Are we confident enough for a∗? Return a∗ Prune arm space Get lower & upper confidence bounds No Yes 19 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
20 / 21
Internship Defense David Taralla University of Liège 1st Master in Engineering Sciences
21 / 21