Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Branislav Kveton, Google Research; Csaba Szepesvári, DeepMind and University of Alberta; Sharan Vaswani, Mila, University of Montreal; Zheng Wen, Adobe Research; Mohammad Ghavamzadeh, Facebook AI Research; Tor Lattimore, DeepMind