An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting
Cem Kalkanlı, Ayfer ¨ Ozg¨ ur
Stanford University
ISIT, June 2020
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 1 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian - - PowerPoint PPT Presentation
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting Cem Kalkanl, Ayfer Ozg ur Stanford University ISIT, June 2020 An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 1
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 1 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 2 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 3 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 4 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 5 / 13
1
2
t u
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 6 / 13
1
2
3
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 7 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 8 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 9 / 13
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 10 / 13
1 Use of the earlier proposition:
2 uT
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 11 / 13
1 Use the lemma:
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 12 / 13
2 Overall bound on the Bayesian regret:
3 Show that T
t Kt−1ut
An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 13 / 13