Introduction to Machine Learning
Evaluation: Measures for Binary Classification: ROC Visualization
compstat-lmu.github.io/lecture_i2ml
LABELS: ROC SPACE
Plot True Positive Rate and False Positive Rate:
[Figure: ROC space comparing classifiers C1, C2, C3; one dominates, the winner among the others is unclear. Axes: FPR (x) vs. TPR (y).]

                 True class y
                   +      −
Pred. ŷ    +      TP     FP
           −      FN     TN

TPR = TP / (TP + FN)
FPR = FP / (FP + TN)
- Introduction to Machine Learning – 1 / 16
LABELS: ROC SPACE
The best classifier lies in the top-left corner.
The diagonal corresponds to random labeling (with different positive proportions):
Assign a positive x as "pos" with 25% probability → TPR = 0.25.
Assign a negative x as "pos" with 25% probability → FPR = 0.25.
[Figure: ROC space with random classifiers Pos−0%, Pos−25%, Pos−75%, Pos−100% on the diagonal and "Best" in the top-left corner. Axes: FPR vs. TPR.]
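A quick simulation (a hypothetical sketch in Python, not part of the lecture material) illustrates why random labeling lands on the diagonal: predicting "pos" with probability 0.25 regardless of the true class yields TPR ≈ FPR ≈ 0.25.

```python
import random

random.seed(1)

# Simulate true labels (half positive, half negative) and a "classifier"
# that predicts "pos" with probability 0.25, ignoring the input entirely.
n = 100_000
truth = [1] * (n // 2) + [0] * (n // 2)
pred = [1 if random.random() < 0.25 else 0 for _ in truth]

tp = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 1)
fn = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 0)
fp = sum(1 for t, p in zip(truth, pred) if t == 0 and p == 1)
tn = sum(1 for t, p in zip(truth, pred) if t == 0 and p == 0)

tpr = tp / (tp + fn)  # ≈ 0.25
fpr = fp / (fp + tn)  # ≈ 0.25
print(tpr, fpr)
```

Both rates converge to the labeling probability, which is exactly the point (0.25, 0.25) on the diagonal.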
LABELS: ROC SPACE
In practice, we should never obtain a classifier below the diagonal: inverting its predicted labels (0 → 1 and 1 → 0) reflects it across the diagonal, so any below-diagonal classifier can be turned into an above-diagonal one.
[Figure: classifier C1 below the diagonal and C2, its label-inverted counterpart, mirrored above the diagonal. Axes: FPR vs. TPR.]
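The reflection can be checked directly with confusion-matrix arithmetic (a hypothetical sketch; the counts are made up for illustration): inverting the predictions turns FNs into TPs and TNs into FPs, mapping a classifier at (FPR, TPR) to (1 − FPR, 1 − TPR).

```python
def rates(tp, fp, fn, tn):
    """Return (FPR, TPR) from confusion-matrix counts."""
    return fp / (fp + tn), tp / (tp + fn)

# A classifier below the diagonal: TPR = 0.2 < FPR = 0.6.
tp, fp, fn, tn = 10, 30, 40, 20
fpr, tpr = rates(tp, fp, fn, tn)
assert (fpr, tpr) == (0.6, 0.2)

# Inverting the predicted labels swaps the roles of the cells:
# old FN becomes TP, old TN becomes FP, old TP becomes FN, old FP becomes TN.
inv_fpr, inv_tpr = rates(fn, tn, tp, fp)
assert abs(inv_fpr - (1 - fpr)) < 1e-12  # 0.4: now above the diagonal
assert abs(inv_tpr - (1 - tpr)) < 1e-12  # 0.8
```

The inverted classifier sits at (0.4, 0.8), above the diagonal, as the slide's reflection argument predicts.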
LABEL DISTRIBUTION IN TPR AND FPR
TPR and FPR are insensitive to the class distribution: they are not affected by changes in the ratio n+/n− (at prediction time).

Example 1: proportion n+/n− = 1

                  Actual Positive   Actual Negative
Pred. Positive          40                25
Pred. Negative          10                25

MCE = 35/100 = 0.35, TPR = 40/50 = 0.8, FPR = 25/50 = 0.5

Example 2: proportion n+/n− = 2

                  Actual Positive   Actual Negative
Pred. Positive          80                25
Pred. Negative          20                25

MCE = 45/150 = 0.3, TPR = 80/100 = 0.8, FPR = 25/50 = 0.5

Note: if class proportions differ during training, the above does not hold, since the estimated posterior probabilities can change!
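The two examples above can be checked in a few lines (a Python sketch): doubling the number of positives leaves TPR and FPR unchanged, while the misclassification error (MCE) shifts.

```python
def metrics(tp, fp, fn, tn):
    """MCE, TPR, and FPR from confusion-matrix counts."""
    n = tp + fp + fn + tn
    mce = (fp + fn) / n          # misclassification error
    tpr = tp / (tp + fn)
    fpr = fp / (fp + tn)
    return mce, tpr, fpr

# Example 1: n+/n- = 1 (50 positives, 50 negatives)
mce1, tpr1, fpr1 = metrics(tp=40, fp=25, fn=10, tn=25)

# Example 2: n+/n- = 2 (100 positives, 50 negatives)
mce2, tpr2, fpr2 = metrics(tp=80, fp=25, fn=20, tn=25)

print(mce1, tpr1, fpr1)  # 0.35 0.8 0.5
print(mce2, tpr2, fpr2)  # 0.3 0.8 0.5
```

TPR and FPR stay at 0.8 and 0.5 in both settings; only the MCE moves with the class ratio.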
FROM PROBABILITIES TO LABELS: ROC CURVE
Remember: both probabilistic and scoring classifiers can output class labels by thresholding:

h(x) := [π(x) ≥ c]   (probabilistic classifier, threshold on the posterior probability)
h(x) := [f(x) ≥ c]   (scoring classifier, threshold on the score)

To draw a ROC curve: iterate through all possible thresholds c
→ visual inspection of all possible thresholds / results.
ROC CURVE
 #   Truth   Score
 1   Pos     0.95
 2   Pos     0.86
 3   Pos     0.69
 4   Neg     0.65
 5   Pos     0.59
 6   Neg     0.52
 7   Pos     0.51
 8   Neg     0.39
 9   Neg     0.28
10   Neg     0.18
11   Pos     0.15
12   Neg     0.06

c = 0.9 → TPR = 0.167, FPR = 0
ROC CURVE
(Same data as above.)

c = 0.85 → TPR = 0.333, FPR = 0
ROC CURVE
(Same data as above.)

c = 0.66 → TPR = 0.5, FPR = 0
ROC CURVE
(Same data as above.)

c = 0.6 → TPR = 0.5, FPR = 0.167
ROC CURVE
(Same data as above.)

c = 0.55 → TPR = 0.667, FPR = 0.167
ROC CURVE
(Same data as above.)

c = 0.3 → TPR = 0.833, FPR = 0.5
ROC CURVE
[Figure: the complete ROC curve traced by sweeping the threshold c over all 12 scores. Axes: FPR vs. TPR.]
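The threshold sweep on the preceding slides can be reproduced with a short sketch (Python, using the 12 observations from the table): for each threshold c, predict "Pos" when score ≥ c and record the resulting (TPR, FPR).

```python
truth = ["Pos", "Pos", "Pos", "Neg", "Pos", "Neg",
         "Pos", "Neg", "Neg", "Neg", "Pos", "Neg"]
scores = [0.95, 0.86, 0.69, 0.65, 0.59, 0.52,
          0.51, 0.39, 0.28, 0.18, 0.15, 0.06]

def roc_point(c):
    """TPR and FPR when predicting 'Pos' for score >= c."""
    tp = sum(1 for t, s in zip(truth, scores) if t == "Pos" and s >= c)
    fp = sum(1 for t, s in zip(truth, scores) if t == "Neg" and s >= c)
    return tp / truth.count("Pos"), fp / truth.count("Neg")

# Matches the slides: e.g. c = 0.9 -> TPR = 0.167, FPR = 0.000
for c in [0.9, 0.85, 0.66, 0.6, 0.55, 0.3]:
    tpr, fpr = roc_point(c)
    print(f"c = {c}: TPR = {tpr:.3f}, FPR = {fpr:.3f}")
```

Plotting these (FPR, TPR) pairs, plus the trivial endpoints (0, 0) and (1, 1), traces the ROC curve.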
ROC CURVE
The closer the curve is to the top-left corner, the better. If ROC curves cross, a different model can be better in different parts of the ROC space.
[Figure: ROC curves for models k1 and k2, ranging from very good to bad. Axes: FPR vs. TPR.]
AUC: AREA UNDER ROC CURVE
The AUC (in [0, 1]) is a single metric to evaluate scoring classifiers:
AUC = 1: perfect classifier
AUC = 0.5: randomly ordered (no better than chance)
[Figure: ROC curve; the AUC is the area beneath it. Axes: FPR vs. TPR.]
AUC: AREA UNDER ROC CURVE
Interpretation: the AUC is the probability that the classifier ranks a randomly chosen positive observation higher than a randomly chosen negative one.
Truth   Score
  1     0.9
  1     0.76
  1     0.7
  0     0.5
  1     0.45
  0     0.3
  0     0.1

AUC = 0.9167

Example: choose a random positive (say, score 0.76) and a random negative (say, score 0.3). The classifier ranks the positive higher than the negative with probability 0.9167.
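This interpretation can be verified by brute-force counting (a Python sketch, assuming the seven truth/score pairs read off the slide): the AUC equals the fraction of (positive, negative) pairs in which the positive outscores the negative.

```python
from itertools import product

truth  = [1, 1, 1, 0, 1, 0, 0]
scores = [0.9, 0.76, 0.7, 0.5, 0.45, 0.3, 0.1]

pos = [s for t, s in zip(truth, scores) if t == 1]
neg = [s for t, s in zip(truth, scores) if t == 0]

# Count pairs where the positive outscores the negative (ties count 1/2).
wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
           for p, n in product(pos, neg))
auc = wins / (len(pos) * len(neg))
print(round(auc, 4))  # 0.9167
```

With 4 positives and 3 negatives there are 12 pairs; the positive wins 11 of them, giving AUC = 11/12 ≈ 0.9167 as on the slide.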
PARTIAL AUC
Sometimes it can be useful to look at a specific region under the ROC curve ⇒ partial AUC (pAUC). Examples: focus on a region with low FPR or a region with high TPR:
[Figure: two shaded ROC plots; restricting to a low-FPR region gives pAUC = 0.086, restricting to a high-TPR region gives pAUC = 0.128. Axes: fpr vs. tpr.]
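A generic way to compute a partial AUC over an FPR window is trapezoidal integration of the piecewise-linear ROC curve, clipped at the window boundary. The sketch below is hypothetical: the toy ROC points are invented for illustration and do not reproduce the curves (or the values 0.086 and 0.128) from the figure.

```python
def partial_auc(fpr, tpr, fpr_max):
    """Area under a piecewise-linear ROC curve for FPR in [0, fpr_max].

    fpr, tpr: ROC points sorted by increasing FPR.
    """
    area = 0.0
    for (x0, y0), (x1, y1) in zip(zip(fpr, tpr), zip(fpr[1:], tpr[1:])):
        if x0 >= fpr_max:
            break
        if x1 > fpr_max:  # clip the last segment at fpr_max
            y1 = y0 + (y1 - y0) * (fpr_max - x0) / (x1 - x0)
            x1 = fpr_max
        area += (x1 - x0) * (y0 + y1) / 2  # trapezoid rule
    return area

# Toy ROC curve (illustrative values only)
fpr = [0.0, 0.1, 0.4, 1.0]
tpr = [0.0, 0.6, 0.9, 1.0]
print(partial_auc(fpr, tpr, fpr_max=0.2))  # ≈ 0.095
print(partial_auc(fpr, tpr, fpr_max=1.0))  # full AUC, ≈ 0.825
```

With fpr_max = 1.0 the function reduces to the ordinary AUC, so the pAUC is a strict generalization.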