SLIDE 1
Overview
- Bandit problem
- Contextual bandits
- Epoch-Greedy algorithm
The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits - - PowerPoint PPT Presentation
The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits Authors: John Langford, Tom Zhang Presented by: Ben Flora Overview Bandit problem Contextual bandits Epoch-Greedy algorithm Overview Bandit problem Contextual
Exploration (unbiased input) Black Box: Transforms Input to hypotheses Hypotheses (best arm)
Context
T n steps Exploration T-n Steps Exploitation
T Regret n T T Regret n T T Regret n T
t
ε ε