
Visualization for Classification: ROC, AUC, Confusion Matrix



  1. CX4242: Visualization for Classification: ROC, AUC, Confusion Matrix. Mahdi Roozbahani, Lecturer, Computational Science and Engineering, Georgia Tech

  2. Visualizing Classification Performance: Confusion matrix. https://en.wikipedia.org/wiki/Confusion_matrix

  3. [Two figures compared] First: "Hard to spot trends and patterns". Second: "Much easier!" Source: http://research.microsoft.com/en-us/um/redmond/groups/cue/publications/CHI2009-EnsembleMatrix.pdf

  4. Very important: Find out what “positive” means

                     Predicted
                     Cat   Dog
     Actual   Cat     5     3
              Dog     2     4

  5. Very important: Find out what “positive” means. A false positive is a “False Alarm”, a name that is easy to remember in security applications. https://en.wikipedia.org/wiki/Confusion_matrix
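To make the "what does positive mean" point concrete, here is a minimal Python sketch that uses the cat/dog counts from slide 4 (read as rows = actual, columns = predicted). The helper name `rates` and the dictionary layout are illustrative assumptions, not something from the slides. It shows that the true positive rate and false alarm rate change depending on which class is treated as positive.

```python
# Counts from the slide's table, keyed as (actual, predicted).
confusion = {
    ("cat", "cat"): 5, ("cat", "dog"): 3,
    ("dog", "cat"): 2, ("dog", "dog"): 4,
}

def rates(confusion, positive):
    """Hypothetical helper: TP/FN/FP/TN and rates for a chosen positive class."""
    tp = fn = fp = tn = 0
    for (actual, predicted), count in confusion.items():
        if actual == positive and predicted == positive:
            tp += count          # positive, correctly flagged
        elif actual == positive:
            fn += count          # positive, missed
        elif predicted == positive:
            fp += count          # negative flagged as positive: a false alarm
        else:
            tn += count          # negative, correctly left alone
    return {
        "TP": tp, "FN": fn, "FP": fp, "TN": tn,
        "TPR": tp / (tp + fn),   # true positive rate
        "FPR": fp / (fp + tn),   # false positive (false alarm) rate
    }

cat_positive = rates(confusion, "cat")
dog_positive = rates(confusion, "dog")
print(cat_positive["TPR"], round(cat_positive["FPR"], 3))  # prints 0.625 0.333
print(round(dog_positive["TPR"], 3), dog_positive["FPR"])  # prints 0.667 0.375
```

The same four counts give different rates under the two conventions, which is why a reported "false alarm rate" is only meaningful once the positive class is stated.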

  6. Visualizing Classification Performance using ROC curve (Receiver Operating Characteristic)

  7. Polonium’s ROC Curve. Positive class: malware. Negative class: benign.
     Plot annotations: "Ideal"; "85% True Positive Rate, 1% False Alarms".
     Axes: True Positive Rate (% of bad correctly labeled) against False Positive Rate, i.e. False Alarms (% of good labeled as bad).
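An ROC curve like the one on this slide can be traced by sweeping a decision threshold over a classifier's scores and recording the two rates at each step. The sketch below is a plain-Python illustration under assumptions: the function name `roc_points` and the six labels and scores are invented for the example and are not Polonium's data.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) points obtained by lowering the threshold step by step.

    labels: 1 = positive (e.g. malware), 0 = negative (e.g. benign).
    scores: higher means "more likely positive".
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])

    points = [(0.0, 0.0)]        # threshold above every score: nothing flagged
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Flag every item whose score equals the current threshold (handles ties).
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points

# Hypothetical toy data: 3 positives and 3 negatives.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
curve = roc_points(labels, scores)
for fpr, tpr in curve:
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Each printed pair is one point on the curve; a curve that climbs quickly toward the top-left corner catches most positives while raising few false alarms.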

  8. Measuring Classification Performance using AUC (Area under the ROC curve). Plot annotations: "Ideal"; "85% True Positive Rate, 1% False Alarms".
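As a rough illustration of what the area measures, the sketch below integrates an ROC curve with the trapezoidal rule. The function name `auc_trapezoid` is an assumption, and the three-point curve is a hypothetical simplification that passes through the slide's annotated point (1% false alarms, 85% true positive rate); it is not the actual Polonium curve.

```python
def auc_trapezoid(points):
    """Area under a piecewise-linear ROC curve given as (FPR, TPR) points."""
    ordered = sorted(points)
    area = 0.0
    for (x0, y0), (x1, y1) in zip(ordered, ordered[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0   # one trapezoid per segment
    return area

# Hypothetical curve: origin, the annotated operating point, top-right corner.
sketch_curve = [(0.0, 0.0), (0.01, 0.85), (1.0, 1.0)]
sketch_auc = auc_trapezoid(sketch_curve)
print(round(sketch_auc, 4))                   # prints 0.92

# A classifier that guesses at random follows the diagonal.
diagonal_auc = auc_trapezoid([(0.0, 0.0), (1.0, 1.0)])
print(diagonal_auc)                           # prints 0.5
```

The single number summarizes the whole curve, so two curves with very different behavior at low false alarm rates can still share a similar AUC.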

  9. If a machine learning algorithm achieves 0.9 AUC (out of 1.0), that’s a great algorithm, right?

  10. Be Careful with AUC!

  11. Weights in combined models
      • Bagging / Random forests
        o Majority voting
      • Let people play with the weights?

  12. EnsembleMatrix http://research.microsoft.com/en-us/um/redmond/groups/cue/publications/CHI2009-EnsembleMatrix.pdf

  13. Improving performance
      • Adjust the weights of the individual classifiers
      • Data partition to separate problem areas
        o Adjust weights just for these individual parts
      • Caveat: evaluation used one dataset
      http://research.microsoft.com/en-us/um/redmond/groups/cue/publications/CHI2009-EnsembleMatrix.pdf
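The idea of majority voting and of adjusting per-classifier weights (slides 11 and 13) can be sketched in a few lines of Python. This is only an illustration of weighted voting under assumptions: the function `weighted_vote`, the labels, and the weights are hypothetical, and the sketch does not reproduce how EnsembleMatrix itself combines classifiers.

```python
from collections import defaultdict

def weighted_vote(predictions, weights=None):
    """Combine one predicted label per classifier into a single label.

    With no weights, every classifier counts equally (plain majority voting).
    On a tie, the label that was seen first wins.
    """
    if weights is None:
        weights = [1.0] * len(predictions)
    totals = defaultdict(float)
    for label, weight in zip(predictions, weights):
        totals[label] += weight
    return max(totals, key=totals.get)

# Three hypothetical classifiers vote on one example.
votes = ["cat", "dog", "dog"]
majority = weighted_vote(votes)                    # equal weights
reweighted = weighted_vote(votes, [0.6, 0.2, 0.2]) # trust the first classifier more
print(majority)    # prints dog
print(reweighted)  # prints cat
```

Changing the weights flips the combined decision here, which is the kind of effect a person adjusting weights interactively would be exploring.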
