support vector machines
play

SUPPORT VECTOR MACHINES FOR DIFFERENTIAL PREDICTION Finn Kuusisto 1 - PowerPoint PPT Presentation

SUPPORT VECTOR MACHINES FOR DIFFERENTIAL PREDICTION Finn Kuusisto 1 , Vitor Santos Costa 2 , Houssam Nassif 3 , Elizabeth Burnside 1 , David Page 1 , and Jude Shavlik 1 1 University of Wisconsin Madison 2 University of Porto 3 Amazon 9/18/2014


  1. SUPPORT VECTOR MACHINES FOR DIFFERENTIAL PREDICTION Finn Kuusisto 1 , Vitor Santos Costa 2 , Houssam Nassif 3 , Elizabeth Burnside 1 , David Page 1 , and Jude Shavlik 1 1 University of Wisconsin – Madison 2 University of Porto 3 Amazon 9/18/2014 Support Vector Machines for Differential Prediction

  2. DIFFERENTIAL PREDICTION Goal Use modeling techniques to gain insight about the differences between two subgroups of a population. 9/18/2014 Support Vector Machines for Differential Prediction

  3. UPLIFT MODELING ( R A D C L I F F E & S I M P S O N , 2 0 0 8 ) How do we choose which customers to target with some marketing activity? 9/18/2014 Support Vector Machines for Differential Prediction

  4. UPLIFT MODELING ( R A D C L I F F E & S I M P S O N , 2 0 0 8 ) How do we choose which customers to target with some marketing activity? Persuadables Customers who respond positively to marketing activity. Sure Things Customers who respond positively regardless. Lost Causes Customers who respond negatively regardless. Sleeping Dogs Customers who respond negatively to marketing activity. 9/18/2014 Support Vector Machines for Differential Prediction

  5. UPLIFT MODELING ( R A D C L I F F E & S I M P S O N , 2 0 0 8 ) True customer groups are unknown. Target Control Response No Response Response No Response Persuadables, Sleeping Dogs, Sleeping Dogs, Persuadables, Sure Things Lost Causes Sure Things Lost Causes 9/18/2014 Support Vector Machines for Differential Prediction

  6. UPLIFT MODELING Lift The number of true positives that a classifier achieves at a given proportion of the population labeled positive. Uplift The difference in lift produced by a classifier between target and control subgroups. 𝑩𝑽𝑽 = 𝑩𝑽𝑴 𝑼 βˆ’ 𝑩𝑽𝑴 𝑫 9/18/2014 Support Vector Machines for Differential Prediction

  7. TASK: ADVERSE COX-2 INHIBITOR EFFECTS ο‚‘ Non-steroidal anti-inflammatory drug (NSAID) ο‚‘ Significantly reduced occurrence of adverse gastrointestinal effects common to other NSAIDs (e.g. ibuprofen) ο‚‘ Rapid and widespread acceptance for treatment of ailments such as arthritis ο‚‘ Later clinical trials showed increased risk of myocardial infarction (MI), or β€œheart attack” Identify patients who are susceptible to an increased risk of MI as a direct result of taking COX-2 inhibitors. 9/18/2014 Support Vector Machines for Differential Prediction

  8. UPLIFT MODELING TO MEDICINE: COX-2 INHIBITORS Want Identify patients who demonstrate an increased risk of MI as a direct result of being treated with COX-2 inhibitors. Main Assumption Patients with an increased risk of MI due to treatment with COX-2 inhibitors are directly analogous to customers with an increased chance of buying due to targeting – the persuadables. 9/18/2014 Support Vector Machines for Differential Prediction

  9. METHODS ο‚‘ Compared SVM Upl against 4 alternate SVM methods ο‚‘ 10-fold cross-validation for evaluation ο‚‘ Cost parameters selected from 10 through 10 βˆ’6 ο‚‘ Mann-Whitney test at 95% confidence for per-fold AUU comparison 9/18/2014 Support Vector Machines for Differential Prediction

  10. RESULTS: COX-2 INHIBITORS COX-2 No COX-2 SVM Upl Model AUU AUL AUL 𝒒 -value π‘‡π‘Šπ‘ π‘‰π‘žπ‘š 50.7 123.4 72.7 - 0.004 * Two-Cost 20.0 126.2 106.3 0.002 * COX-2-Only 13.8 151.5 137.7 0.002 * Standard 1.2 147.7 146.5 0.037 * Flipped 28.5 102.2 73.6 0.002 * Baseline 0.0 0.0 0.0 9/18/2014 Support Vector Machines for Differential Prediction

  11. RESULTS: COX-2 INHIBITORS 9/18/2014 Support Vector Machines for Differential Prediction

  12. HOW Extend previous SVM work maximizing AUC (Joachims, 2005) to maximize AUU instead. 9/18/2014 Support Vector Machines for Differential Prediction

  13. SVM FOR UPLIFT Let the positive skew of data be: 𝑄 𝜌 = 𝑄 + 𝑂 Then ( Tuffery, 2011) : 𝐡𝑉𝑀 = 𝑄 Γ— 𝜌 2 + 1 βˆ’ 𝜌 𝐡𝑉𝐷 9/18/2014 Support Vector Machines for Differential Prediction

  14. SVM FOR UPLIFT 𝐡𝑉𝑉 = 𝐡𝑉𝑀 π‘ˆ βˆ’ 𝐡𝑉𝑀 𝐷 = 𝑄 π‘ˆ Γ— 𝜌 π‘ˆ 2 + 1 βˆ’ 𝜌 π‘ˆ 𝐡𝑉𝐷 π‘ˆ βˆ’ 𝑄 𝐷 Γ— 𝜌 𝐷 2 + 1 βˆ’ 𝜌 𝐷 𝐡𝑉𝐷 𝐷 𝑛𝑏𝑦 𝐡𝑉𝑉 ≑ 𝑛𝑏𝑦 𝑄 π‘ˆ Γ— 1 βˆ’ 𝜌 π‘ˆ 𝐡𝑉𝐷 π‘ˆ βˆ’ 𝑄 𝐷 Γ— 1 βˆ’ 𝜌 𝐷 𝐡𝑉𝐷 𝐷 ∝ 𝑛𝑏𝑦 𝐡𝑉𝐷 π‘ˆ βˆ’ 𝑄 𝐷 Γ— 1 βˆ’ 𝜌 𝐷 𝐡𝑉𝐷 𝐷 𝑄 π‘ˆ Γ— 1 βˆ’ 𝜌 π‘ˆ πœ‡ 𝑛𝑏𝑦 𝐡𝑉𝑉 ≑ 𝑛𝑏𝑦 𝐡𝑉𝐷 π‘ˆ βˆ’ πœ‡π΅π‘‰π· 𝐷 9/18/2014 Support Vector Machines for Differential Prediction

  15. TASK: IN SITU BREAST CANCER ο‚‘ Most common cancer in women ο‚‘ Two basic stages: In situ and invasive  In situ cancer cells are localized  Invasive cancer cells have infiltrated surrounding tissue ο‚‘ Younger women tend to have more aggressive in situ cancer ο‚‘ Older women sometimes have indolent in situ cancer Identify older patients with indolent in situ breast cancer. 9/18/2014 Support Vector Machines for Differential Prediction

  16. UPLIFT MODELING TO MEDICINE: BREAST CANCER Want Identify older patients with in situ breast cancer that is distinct from that of younger patients. Main Assumption Older patients with in situ breast cancer that is distinct from that of younger patients, who tend to have aggressive cancer, have a decreased risk of invasive progression. 9/18/2014 Support Vector Machines for Differential Prediction

  17. RESULTS: BREAST CANCER Older Younger SVM Upl Model AUU AUL AUL 𝒒 -value π‘‡π‘Šπ‘ π‘‰π‘žπ‘š 19.2 64.3 45.1 - Two-Cost 13.5 74.3 60.8 0.432 0.037 * Older-Only 5.9 67.7 61.9 0.049 * Standard 11.0 75.4 64.3 0.020 * Flipped 4.8 53.9 49.1 0.004 * Baseline 11.0 66.0 55.0 9/18/2014 Support Vector Machines for Differential Prediction

  18. RESULTS: BREAST CANCER 9/18/2014 Support Vector Machines for Differential Prediction

  19. UPLIFT MODELING SIMULATION ο‚‘ Generated synthetic customer population ο‚‘ Subjected customer population randomly to simulated marketing activity ο‚‘ Measured uplift as usual ο‚‘ Measured ROC with Persuadables as the positive class, others as negative 9/18/2014 Support Vector Machines for Differential Prediction

  20. UPLIFT MODELING SIMULATION: UPLIFT CURVE 9/18/2014 Support Vector Machines for Differential Prediction

  21. UPLIFT MODELING SIMULATION: PERSUADABLE ROC 9/18/2014 Support Vector Machines for Differential Prediction

  22. CONCLUSIONS & FUTURE WORK ο‚‘ Extended previous SVM work on AUC maximization to AUU ο‚‘ Results suggest SVM Upl achieves better uplift than many alternate SVM methods ο‚‘ May want to make performance guarantees for control group ο‚‘ May want to interpret learned model ο‚‘ Better verification that maximizing uplift is appropriate goal 9/18/2014 Support Vector Machines for Differential Prediction

  23. THANKS Questions? 9/18/2014 Support Vector Machines for Differential Prediction

  24. SELECTED REFERENCES Radcliffe, N. and Simpson. R.: Identifying who can be saved and who will be driven away by retention activity. Journal of Telecommunications Management (2008). Tuffery, S.: Data Mining and Statistics for Decision Making. John Wiley & Sons, 2 nd edn. (2011). Joachims, T.: A support vector method for multivariate performance measuers. In: Proceedings of the 22 nd International Conference on Machine Learning (2005). 9/18/2014 Support Vector Machines for Differential Prediction

  25. APPENDIX 9/18/2014 Support Vector Machines for Differential Prediction

  26. SVM FOR ROC ( J O A C H I M S , 2 0 0 5 ) β€² as a predicted label on pairs of positive ( i ) and negative ( j ) examples, Define 𝑧 π‘—π‘˜ β€² = 1 if 𝑧 𝑗 β€² > 𝑧 π‘˜ β€² , and -1 otherwise. where 𝑧 π‘—π‘˜ βˆ— that maximizes: Joachims ’ algorithm to maximize AUC corresponds to finding 𝑧 π‘—π‘˜ 𝐱 π‘ˆ Ξ¨ 𝑦 , 𝑧 β€² , 𝑧 β€² + Ξ” 𝐡𝑉𝐷 𝑧 Where: 𝑄 𝑂 𝑄 𝑂 = 1 β€² = 𝐱 π‘ˆ 1 β€² 𝐲 𝑗 βˆ’ 𝐲 π‘˜ β€² β€² , 𝑧 𝐱 π‘ˆ Ξ¨ 𝐲 , 𝑧 Ξ” 𝐡𝑉𝐷 𝑧 2 1 βˆ’ 𝑧 π‘—π‘˜ 2 𝑧 π‘—π‘˜ 𝑗=1 π‘˜=1 𝑗=1 π‘˜=1 9/18/2014 Support Vector Machines for Differential Prediction

  27. SVM FOR UPLIFT Let the positive skew of data be: 𝑄 𝜌 = 𝑄 + 𝑂 Then: 𝐡𝑉𝑀 = 𝑄 𝜌 2 + 1 βˆ’ 𝜌 𝐡𝑉𝐷 9/18/2014 Support Vector Machines for Differential Prediction

  28. SVM FOR UPLIFT 𝜌 𝐡 𝜌 𝐢 𝐡𝑉𝑉 = 𝑄 2 + 1 βˆ’ 𝜌 𝐡 𝐡𝑉𝐷 𝐡 βˆ’ 𝑄 𝐢 2 + 1 βˆ’ 𝜌 𝐢 𝐡𝑉𝐷 𝐢 𝐡 𝑛𝑏𝑦 𝐡𝑉𝑉 ≑ 𝑛𝑏𝑦 𝑄 𝐡 1 βˆ’ 𝜌 𝐡 𝐡𝑉𝐷 𝐡 βˆ’ 𝑄 𝐢 1 βˆ’ 𝜌 𝐢 𝐡𝑉𝐷 𝐢 ∝ 𝑛𝑏𝑦 𝐡𝑉𝐷 𝐡 βˆ’ 𝑄 𝐢 1 βˆ’ 𝜌 𝐢 𝐡𝑉𝐷 𝐢 𝑄 𝐡 1 βˆ’ 𝜌 𝐡 Then: Defining: πœ‡ = 𝑄 𝐢 1 βˆ’ 𝜌 𝐢 𝑛𝑏𝑦 𝐡𝑉𝑉 ≑ 𝑛𝑏𝑦 𝐡𝑉𝐷 𝐡 βˆ’ πœ‡π΅π‘‰π· 𝐢 𝑄 𝐡 1 βˆ’ 𝜌 𝐡 9/18/2014 Support Vector Machines for Differential Prediction

  29. SVM FOR UPLIFT Now simply redefine the AUC optimization: 𝑄 𝐡 𝑂 𝐡 𝑄 𝐢 𝑂 𝐢 β€² = 𝐱 π‘ˆ 1 + πœ‡π± π‘ˆ 1 β€² 𝐲 𝑗 βˆ’ 𝐲 π‘˜ β€² 𝐱 π‘ˆ Ξ¨ 𝐲 , 𝑧 2 𝑧 π‘—π‘˜ 2 𝑧 π‘šπ‘™ 𝐲 π‘š βˆ’ 𝐲 𝑙 𝑗=1 π‘˜=1 𝑙=1 π‘š=1 𝑄 𝐡 𝑂 𝐡 𝑄 𝐢 𝑂 𝐢 = 1 + πœ‡ 1 β€² , 𝑧 β€² β€² Ξ” 𝐡𝑉𝑉 𝑧 2 1 βˆ’ 𝑧 π‘—π‘˜ 2 1 βˆ’ 𝑧 π‘šπ‘™ 𝑗=1 π‘˜=1 𝑙=1 π‘š=1 9/18/2014 Support Vector Machines for Differential Prediction

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend