predicting customer purchase to improve bank marketing
play

Predicting Customer Purchase to Improve Bank Marketing - PowerPoint PPT Presentation

Predicting Customer Purchase to Improve Bank Marketing Effectiveness Group 6 Sandy Wu, Andy Hsu, Wei-Zhu Chen, Samantha Chien Business Goal Problem Business Goal Stakeholders Re-calling wrong customers Improve marketing


  1. Predicting Customer Purchase to Improve Bank Marketing Effectiveness Group 6 Sandy Wu, Andy Hsu, Wei-Zhu Chen, Samantha Chien

  2. Business Goal Problem Business Goal Stakeholders ● Re-calling “wrong” customers Improve marketing effectiveness Bank Marketing Team ● by targeting the right customers Bank Service Employees ● High labor costs ● Customers ● Harming customer relationship Opportunities Challenges ● ● Gain revenues and lower costs by having Using credit-scores ● more efficient marketing results Worsen the poor and rich disparity ● Very harmful for the mispredicted ones

  3. Data Mining Goal Data Mining Goal Predict whether a certain customer will subscribe a term deposit or not ● Predictive, Forward-looking ● Supervised task ● Outcome variable : Subscribe/Not Subscribe ● Ranking (Find most likely subscribers) Methods Performance Unbalanced Data Classification ● ROC curves ● Lift Charts ● SMOTE Oversampling ● Na ï ve Bayes ● Sensitivity/Specificity ● Logistic Regression ● F1-score ● Decision Tree ● Random Forest

  4. Data Description & Preparation Demographic data Customer Current Campaign data Previous Campaign Social & Economic data Credit data data Data Source : UCI Machine Learning Repository Partition : training/test = 0.7/0.3 Data Size : 41,188 Rows, 21 Columns Data Prep : Input Features : 'age', 'job', 'marital', 'education', 'default', 'housing', 1. normalization 2. dummies 'loan', 'contact', 'month', 'DOW, 'campaign', 'pdays', 'previous', 3. pdays 4. duration 'poutcome', 'emp.var.rate', 'cons.price.idx', 'cons.conf.idx', 'euribor3m', Training set SMOTE Oversampling 'nr.employed' (imbalance ratio = #0 / #1 = 790.27%) Output Variable : y (subscribed: yes/no)

  5. Data Visualization DOW / Output Bar Chart ttttttttt Duration / Output Box Plot ttttttttt Age / Output Box Plot Previous / Campaign Scatter Plot

  6. Method Results No Oversampled Oversampled Methods Accuracy Sensitivity Specificity Methods Accuracy Sensitivity Specificity AUC F1 Logistic 64.47% 0.22 0.98 Logistic 81.57% 0.63 0.84 0.79 0.87 Regression Regression Decision Tree 66.04% 0.16 0.99 Decision Tree 85.60% 0.58 0.84 0.77 0.87 Naïve 88.73% 0 1 (Benchmark) Random Forest 78.96% 0.64 0.81 0.79 0.87 Lift Chart of DT Naïve Bayes 63.45% 0.75 0.62 0.76 0.80 Naïve 88.73% 0 1 (Benchmark)

  7. Logistic Regression Coefficienttttttttt Variable Coefficient Method Results (Oversampled) Intercept -0.019621 Pdays / Output Pie Chart ttttttttt Nr.employed / Output Box Plot ttttttttt Contact / Output Pie Chart ttttttttt Pdays / Previous Scatter Plot ttttttttt Random Forest Variable weightstttttttt Age -0.146709 1) Age 0.303402 campaign 0.273018 2) campaign 0.220551 pdays -0.133803 3) pdays 0.093758 previous -3.050539 4) previous 0.068632 emp.var.rate 1.627592 employment rate contact 5) emp.var.rate 0.055829 cons.price.idx 0.179927 pdays campaign 6) cons.price.idx 0.041118 cons.conf.idx 0.460359 7) cons.conf.idx 0.028743 euribor3m 0.869257 8) euribor3m 0.02847 nr.employed -0.004862

  8. Performance Evaluation

  9. Other Findings & Comparisons RandomForest in Different Conditions No SMOTE v.s. SMOTE

  10. Recommendations ● Features might have low correlations among them ○ Ask domain experts and include more related financial record columns ● More data samples may be better (~40,000 rows now) ● Including ordinal columns may bring about improvement in predictions

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend