SLIDE 1
Multi-armed bandits
S. Bubeck, N. Cesa-Bianchi. Foundations and Trends in Machine Learning, 2012
* Real title: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
[Diagram: arms and their rewards. Math symbols were lost in extraction.]
Reward of arm i: sampled i.i.d. from the distribution of arm i
Best action: i*
Rewards lie in [0,1]
Reward very low.
Reward ????
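The stochastic setting on this slide, where each pull of an arm returns a reward drawn i.i.d. from that arm's distribution with values in [0,1], can be sketched as follows. This is a minimal illustration, not the authors' code: the class name `BernoulliBandit`, the helper `empirical_means`, the choice of Bernoulli rewards, and the example means are all assumptions made for the sketch.

```python
import random


class BernoulliBandit:
    """Hypothetical stochastic bandit: arm i pays 1 with probability means[i], else 0."""

    def __init__(self, means, seed=0):
        # Assumption: every arm mean lies in [0, 1], matching rewards in [0, 1].
        assert all(0.0 <= m <= 1.0 for m in means)
        self.means = list(means)
        self.rng = random.Random(seed)

    def pull(self, arm):
        # Each reward is an independent draw from the chosen arm's distribution.
        return 1.0 if self.rng.random() < self.means[arm] else 0.0

    def best_arm(self):
        # Best action: the arm with the highest mean reward.
        return max(range(len(self.means)), key=lambda i: self.means[i])


def empirical_means(bandit, pulls_per_arm):
    # Pull every arm the same number of times and average the observed rewards.
    estimates = []
    for arm in range(len(bandit.means)):
        total = sum(bandit.pull(arm) for _ in range(pulls_per_arm))
        estimates.append(total / pulls_per_arm)
    return estimates


# Illustrative arm means (assumed values, not taken from the slides).
bandit = BernoulliBandit(means=[0.2, 0.7, 0.5])
estimates = empirical_means(bandit, pulls_per_arm=2000)
```

The learner never sees `means` directly, only the sampled rewards. An arm that has rarely been pulled has an unreliable estimate, which is why its true reward remains an open question.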