A Preference-Based Bandit Framework for Personalized Recommendation
Maryam Tavakol and Ulf Brefeld
Paderborn, Nov 8, 2016
A Preference-Based Bandit Framework for Personalized Recommendation - - PowerPoint PPT Presentation
A Preference-Based Bandit Framework for Personalized Recommendation Maryam Tavakol and Ulf Brefeld Paderborn, Nov 8, 2016 Introduction Personalized Recommendation Preference Learning Multi-armed bandits 2 Recommendation 3 Recommendation
Maryam Tavakol and Ulf Brefeld
Paderborn, Nov 8, 2016
2
Personalized Recommendation
Preference Learning Multi-armed bandits
3
4
Item i ≻ Item k:
{Shirt-Polo shirt, Blue-White, Women-Women, Cheap-Expensive}
5
6
User 1 User 2 … User m User 1 + User 2 + … + User m
E[rt,ik|ut = uj] = β>
t zik + θ>zik
confidence interval (General case of LinUCB)
7
8
9
α θ βj
max
α
− 1 2C α>α + r>α
1 2α>[ZZ> + 1 µ( X
j
φj ⌦ φ>
j ) ZZ>]α
10
t zik + θ>zik
c q z>
ik(Z>Z + λI)1zik
11
12
Questions?
Thanks for your attention
Email: tavakol@leuphana.de