of true positives true positive rate of known positives
play

# of true positives true positive rate = # of known positives - PowerPoint PPT Presentation

True positive rate (Sensitivity) # of true positives true positive rate = # of known positives (Proportion of actual positives that are correctly identified) True negative rate (Specificity) # of true negatives true negative rate = # of known


  1. True positive rate (Sensitivity) # of true positives true positive rate = # of known positives (Proportion of actual positives that are correctly identified)

  2. True negative rate (Specificity) # of true negatives true negative rate = # of known negatives (Proportion of actual negatives that are correctly identified)

  3. False positive rate (1 – Specificity) # of false positives false positive rate = # of known negatives (Proportion of actual negatives that are incorrectly identified)

  4. Sensitivity and specificity depend on a chosen cutoff malignant false positives cutoff false negatives benign

  5. Sensitivity and specificity depend on a chosen cutoff false positives malignant cutoff false negatives benign

  6. Do Part 1 of the worksheet now

  7. We usually plot the true pos. rate vs. the false pos. rate for all possible cutoffs ROC curve Receiver Operating Characteristic curve

  8. Image from: http://en.wikipedia.org/wiki/Receiver_operating_characteristic

  9. The area under the curve tells us how good a model’s predictions are perfect good worst case

  10. Let’s look at the performance of several different models for the biopsy data set

  11. Predictor M1 clump_thickness ✔ normal_nucleoli marg_adhesion bare_nuclei uniform_cell_shape bland_chromatin

  12. Predictor M1 M2 clump_thickness ✔ ✔ normal_nucleoli ✔ marg_adhesion bare_nuclei uniform_cell_shape bland_chromatin

  13. Predictor M1 M2 M3 clump_thickness ✔ ✔ ✔ normal_nucleoli ✔ ✔ marg_adhesion ✔ bare_nuclei uniform_cell_shape bland_chromatin

  14. Predictor M1 M2 M3 M4 clump_thickness ✔ ✔ ✔ ✔ normal_nucleoli ✔ ✔ ✔ marg_adhesion ✔ ✔ bare_nuclei ✔ uniform_cell_shape bland_chromatin

  15. Predictor M1 M2 M3 M4 M5 clump_thickness ✔ ✔ ✔ ✔ ✔ normal_nucleoli ✔ ✔ ✔ ✔ marg_adhesion ✔ ✔ ✔ bare_nuclei ✔ ✔ uniform_cell_shape ✔ bland_chromatin ✔

  16. Model Area Under Curve (AUC) M1 0.909 M2 0.968 M3 0.985 M4 0.995 M5 0.996

  17. Things usually look much worse in real life Best AUC (solid line): 0.70 Keller, Mis, Jia, Wilke. Genome Biol. Evol. 4:80-88, 2012

  18. Calculating ROC curves in R

  19. Using geom_roc() from the plotROC package

  20. Using geom_roc() from the plotROC package # fit a logistic regression model glm_out <- glm(outcome ~ clump_thickness, data = biopsy, family = binomial)

  21. Using geom_roc() from the plotROC package # fit a logistic regression model glm_out <- glm(outcome ~ clump_thickness, data = biopsy, family = binomial) # prepare data for ROC plotting df <- data.frame(predictor = predict(glm_out, biopsy), known_truth = biopsy$outcome, model = 'M1')

  22. Using geom_roc() from the plotROC package # fit a logistic regression model glm_out <- glm(outcome ~ clump_thickness, data = biopsy, family = binomial) # prepare data for ROC plotting df <- data.frame(predictor = predict(glm_out, biopsy), known_truth = biopsy$outcome, model = 'M1') # the aesthetic names are not the most intuitive # `d` (disease) holds the known truth # `m` (marker) holds the predictor values p <- ggplot(df, aes(d = known_truth, m = predictor)) + geom_roc(n.cuts = 0) + coord_fixed() p # make plot

  23. Calculating the area under the curve (AUC) # the function calc_auc needs to be called on a plot object # that uses geom_roc(): calc_auc(p) # PANEL group AUC # 1 1 -1 0.908878 # Warning message: # In verify_d(data$d) : # D not labeled 0/1, assuming benign = 0 and malignant = 1!

  24. Do Part 2 of the worksheet now

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend