recommendations as treatments
play

Recommendations as Treatments: Debiasing Learning and Evaluation - PowerPoint PPT Presentation

Recommendations as Treatments: Debiasing Learning and Evaluation ICML 2016, NYC Schnabel , Adith Swaminathan , Tob obias Sc Ashudeep Singh , Navin Chandak , Thorsten Joachims Cornell University, Google Funded in part


  1. Recommendations as Treatments: Debiasing Learning and Evaluation ICML 2016, NYC Schnabel † , Adith Swaminathan † , Tob obias Sc Ashudeep Singh † , Navin Chandak § , Thorsten Joachims † † Cornell University, § Google Funded in part through NSF Awards IIS-1247637, IIS-1217686, IIS-1513692.

  2. Recommendations as Treatments: Debiasing Learning and Evaluation Movie recommendation Horror Romance Drama 5 5 1 3 5 1 3 5 5 Horror Lovers 5 5 1 3 5 5 5 3 O 5 5 1 3 3 Observed Y/N 1 1 5 5 3 3 1 5 3 5 5 Romance Lovers 5 5 5 5 3 Y 1 5 5 3 True Rating ⇒ Data is Missing Not At Random (MNAR) Example adapted from (Steck et al., 2010) 2

  3. Recommendations as Treatments: Debiasing Learning and Evaluation Selection Bias in Recommendation  Why is there selection bias? o User-induced bias (e.g., browsing) o System-induced bias (e.g., advertising)  Question: What happens if we ignore selection bias? (Marlin et al., 2007; Steck, 2011; Hernándandez-Lobato et al., 2014) 3

  4. Recommendations as Treatments: Debiasing Learning and Evaluation Evaluating Recommendations under Selection Bias Horror Romance Drama ෠ 𝑍 5 5 5 5 1 1 3 3 Recommend 5 1 3 5 5 5 5 Horror Lovers 5 5 1 3 5 5 1 3 5 5 5 5 5 5 3 3 O 5 5 5 5 1 1 3 3 3 3 Observed Y/N 1 1 5 5 3 3 1 1 5 5 3 3 1 5 3 5 5 5 5 Romance Lovers 5 5 3 3 5 5 5 3 5 5 5 3 Y 1 1 5 5 5 5 3 3 True Rating ⇒ Observed ratings are misleading due to selection bias 4

  5. Recommendations as Treatments: Debiasing Learning and Evaluation Evaluating Predicted Ratings under Selection Bias ෠ ෠ 𝑍 𝑍 1 2 Pred Ratings (worse) Pred Ratings (better) Horror Romance Drama Horror Romance Drama 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 1 3 5 1 3 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 Horror Horror Lovers Lovers 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 1 5 3 1 5 3 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 Romance Romance Lovers Lovers 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5

  6. Recommendations as Treatments: Debiasing Learning and Evaluation Evaluating Predicted Ratings under Selection Bias ෠ ෠ 𝑍 𝑍 1 2 Pred Ratings (worse) Pred Ratings (better) Horror Romance Drama Horror Romance Drama 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 1 3 5 1 3 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 Horror Lovers Horror Lovers 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 1 5 3 1 5 3 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 Romance Romance Lovers Lovers 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 6

  7. Recommendations as Treatments: Debiasing Learning and Evaluation Evaluating Predicted Ratings under Selection Bias ෠ ෠ 𝑍 𝑍 1 2 Pred Ratings (worse) Pred Ratings (better) Horror Romance Drama Horror Romance Drama 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 1 3 5 1 3 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 Horror Horror Lovers Lovers 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 5 5 5 5 5 1 1 1 1 5 5 5 5 5 1 1 1 1 1 5 5 5 5 5 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 5 3 1 5 3 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 Romance 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 Romance Lovers Lovers 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 3 3 3 3 5 3 3 3 3 3 1 1 1 1 1 5 5 5 5 1 1 1 1 1 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 ⇒ Observed losses are misleading due to selection bias 7

  8. Recommendations as Treatments: Debiasing Learning and Evaluation Recommendations as Treatments  Question : How can we fix the effects of selection bias? o Connection to potential outcomes framework Counterfactual Outcomes 𝑍 Observed Outcomes ෨ 𝑍 treatments movies treatments 5 5 1 3 5 5 5 1 3 5 5 1 3 5 5 5 3 patients users 5 5 1 3 3 patients 1 1 5 5 3 3 5 5 1 5 3 5 5 5 5 3 1 5 5 3 ⇒ Understand assignment mechanism (Imbens & Ruben, 2015) 8

  9. Recommendations as Treatments: Debiasing Learning and Evaluation Debiasing Evaluation Propensities P Horror Romance Drama  Assignment mechansim for recommendation: o 𝑄 𝑣,𝑗 = 𝑄 𝑃 𝑣,𝑗 = 1 p p/10 p/2  Use Inverse-Propensity-Scoring Estimator (IPS) to obtain unbiased estimate: p/10 p p/2 1 1 2 𝑆 𝐽𝑄𝑇 ෠ ෠ 𝑣,𝑗 − ෠ 𝑍|𝑄 = ෍ 𝑍 𝑍 𝑣,𝑗 𝑉 ⋅ 𝐽 𝑄 𝑣,𝑗 𝑣,𝑗 :𝑃 𝑣𝑗 =1 (Little & Rubin, 2002; Cortes et al., 2008; Bickel et al., 2009; Sugiyama & Kawanabe, 2012). 9

  10. Recommendations as Treatments: Debiasing Learning and Evaluation Propensity estimation  Two settings: o Experimental ̶- Propensities are under our control; known by design (e.g., ad placement) o Observational ̶- Users self-select; need to Observations O estimate 𝑄 𝑣,𝑗 Horror Romance Drama 1 0 1 0 0 1 0 0 0 0 0 0 1 0 0  Estimate parameter of binary random 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0 variables : 0 1 0 1 0 0 0 0 1 0 1 0 0 0 1 1 0 0 0 1 1 0 0 1 0 0 1 0 0 1 𝑄 𝑣,𝑗 = 𝑄 𝑃 𝑣,𝑗 = 1 | 𝑌, ෨ 𝑍 0 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 1 1 0 0 0 0  Variety of models: Logistic Regression, 1 0 0 0 0 0 1 1 0 0 0 0 0 1 0 Naïve Bayes, etc. 10

  11. Recommendations as Treatments: Debiasing Learning and Evaluation Debiasing Evaluation  Robustness to selection bias: Severity of Selection Bias Severity of Selection Bias 11

  12. Recommendations as Treatments: Debiasing Learning and Evaluation Debiasing Evaluation  Robustness to inaccurate propensities: IPS-est More accurate propensities More accurate propensities 12

  13. Recommendations as Treatments: Debiasing Learning and Evaluation Debiasing Learning  Empirical Risk Minimization (ERM) successful in many settings (Cortes & Vapnik, 1995)  Use ERM together with Inverse-Propensity-Scoring Estimator (IPS) 𝑍 𝐹𝑆𝑁 = argmin ෠ 𝑆 𝐽𝑄𝑇 ෠ ෠ 𝑍 | 𝑄 ෠ 𝑍∈ℋ  For matrix factorization with MSE loss: 1 2 + 𝜇 2 + 𝑋 𝐺 𝑍 𝐹𝑆𝑁 = argmin ෠ 2 ෍ 𝑍 𝑣,𝑗 − 𝑊 𝑣 𝑋 𝑊 𝐺 𝑗 𝑄 𝑣,𝑗 𝑊,𝑋 𝑃 𝑣,𝑗 =1 propensity weight 13

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend