week 2 video 5
play

Week 2 Video 5 Cross-Validation and Over-Fitting Over-Fitting Ive - PowerPoint PPT Presentation

Week 2 Video 5 Cross-Validation and Over-Fitting Over-Fitting Ive mentioned over-fitting a few times during the last few weeks Fitting to the noise as well as the signal Over-Fitting 25 25 20 20 15 15 10 10 5 5 0 0 0 5 10


  1. Week 2 Video 5 Cross-Validation and Over-Fitting

  2. Over-Fitting ¨ I’ve mentioned over-fitting a few times during the last few weeks ¨ Fitting to the noise as well as the signal

  3. Over-Fitting 25 25 20 20 15 15 10 10 5 5 0 0 0 5 10 15 20 25 0 5 10 15 20 25 Good fit Over fit

  4. Reducing Over-Fitting ¨ Use simpler models ¤ Fewer variables (BiC, AIC, Occam’s Razor) ¤ Less complex functions (MDL)

  5. Eliminating Over-Fitting? ¨ Every model is over-fit in some fashion ¨ The questions are: ¤ How bad? ¤ What is it over-fit to?

  6. Assessing Generalizability ¨ Does your model transfer to new contexts? ¨ Or is it over-fit to a specific context?

  7. Training Set/Test Set ¨ Split your data into a training set and test set

  8. Notes ¨ Model tested on unseen data ¨ But uses data unevenly

  9. Cross-validation 9 ¨ Split data points into N equal-size groups

  10. Cross-validation 10 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  11. Cross-validation 11 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  12. Cross-validation 12 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  13. Cross-validation 13 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  14. Cross-validation 14 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  15. Cross-validation 15 ¨ Train on all groups but one, test on last group ¨ For each possible combination

  16. You can do both! ¨ Use cross-validation to tune algorithm parameters or select algorithms ¨ Use held-out test set to get less over-fit final estimate of model goodness

  17. How many groups? ¨ K-fold ¤ Pick a number K, split into this number of groups ¨ Leave-out-one ¤ Every data point is a fold

  18. How many groups? ¨ K-fold ¤ Pick a number K, split into this number of groups ¤ Quicker; preferred by some theoreticians ¨ Leave-out-one ¤ Every data point is a fold ¤ More stable ¤ Avoids issue of how to select folds (stratification issues)

  19. Cross-validation variants ¨ Flat Cross-Validation ¤ Each point has equal chance of being placed into each fold ¨ Stratified Cross-Validation ¤ Biases fold selection so that some variable is equally represented in each fold ¤ The variable you’re trying to predict ¤ Or some variable that is thought to be an important context

  20. Student-level cross-validation ¨ Folds are selected so that no student’s data is represented in two folds ¨ Allows you to test model generalizability to new students ¨ As opposed to testing model generalizability to new data from the same students

  21. Student-level cross-validation ¨ Usually seen as the minimum cross-validation needed, in the EDM conference ¨ Papers that don’t pay attention to this issue are usually rejected ¤ OK to explicitly choose something else and discuss that choice ¤ Not OK to just ignore the issue and do what’s easiest

  22. Student-level cross-validation ¨ Easy to do with Batch X-Validation in RapidMiner

  23. Other Levels Sometimes Used for Cross-Validation ¨ Lesson/Content ¨ School ¨ Demographic (Urban/Rural/Suburban, Race, Gender) ¨ Software Package ¨ Session (in MOOCs, behavior in later sessions differs from behavior in earlier sessions – Whitehill et al., 2017)

  24. Important Consideration ¨ Where do you want to be able to use your model? ¤ New students? ¤ New schools? ¤ New populations? ¤ New software content? ¨ Make sure to cross-validate at that level

  25. Next Lecture ¨ More on Generalization and Validity

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend