Lecture 7: Cross-Validation (Instructor: Prof. Shuai Huang)

SLIDE 1

Lecture 7: Cross-Validation

Instructor: Prof. Shuai Huang, Industrial and Systems Engineering, University of Washington

SLIDE 2

Underfit, Good fit, and Overfit

𝑔 π’š = 𝛾0 + 𝛾1𝑦1 + 𝛾2𝑦2 𝑔 π’š = 𝛾0 + 𝛾1𝑦1 + 𝛾2𝑦2 + 𝛾11𝑦1

2

+ 𝛾22𝑦2

2 + 𝛾12𝑦1𝑦2

𝑔 π’š = 𝛾0 + 𝛾1𝑦1 + 𝛾2𝑦2 + 𝛾11𝑦1

2

+ 𝛾22𝑦2

2 + 𝛾12𝑦1𝑦2 + 𝛾112𝑦1 2𝑦2

+ 𝛾122𝑦1𝑦2

2 + β‹―

SLIDE 3

Danger of R-squared

  • When the number of variables increases, in theory the R-squared cannot decrease; in practice, it almost always increases. Thus, it is not a good metric for taking model complexity into account.

  • This is because SST is always fixed, while SSE can only decrease as more variables are put into the model, even if these newly added variables have no relationship with the outcome variable.

R² = 1 − SSE / SST
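The nesting argument can be checked numerically. A minimal sketch (NumPy assumed; data and variable names are illustrative): fit ordinary least squares with and without a pure-noise predictor and compare R² = 1 − SSE/SST.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 50
x1 = rng.normal(size=n)
pure_noise = rng.normal(size=n)            # unrelated to the outcome
y = 3.0 * x1 + rng.normal(size=n)

def r_squared(predictors, y):
    """R^2 = 1 - SSE/SST for an OLS fit with an intercept."""
    X = np.column_stack([np.ones(len(y))] + list(predictors))
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    sse = np.sum((y - X @ beta) ** 2)
    sst = np.sum((y - y.mean()) ** 2)
    return 1.0 - sse / sst

r2_without = r_squared([x1], y)
r2_with = r_squared([x1, pure_noise], y)
print(r2_without, r2_with)  # the second value is never smaller
```

Because the smaller model is nested inside the larger one, the larger model's SSE can only be equal or lower, so its R² can only be equal or higher.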

SLIDE 4

Danger of R-squared (cont’d)

  • Further, the R-squared is also affected by the variance of the predictors. Suppose the underlying regression model is y = βx + ε.

  • The variance of y is Var(y) = β²Var(x) + Var(ε), so the R-squared takes the form

R² = β²Var(x) / (β²Var(x) + Var(ε))

  • Thus, the R-squared is impacted not only by how well x can predict y, but also by the variance of x.
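The dependence on Var(x) can be seen in simulation. A sketch (parameter values are illustrative, NumPy assumed): same slope β and same noise variance, but two different predictor variances.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100_000
beta, sigma_eps = 2.0, 1.0

empirical, theoretical = {}, {}
for sd_x in (0.5, 2.0):
    x = rng.normal(scale=sd_x, size=n)
    y = beta * x + rng.normal(scale=sigma_eps, size=n)
    # Empirical R^2 of a simple regression equals the squared correlation.
    empirical[sd_x] = np.corrcoef(x, y)[0, 1] ** 2
    # The population formula from the slide.
    theoretical[sd_x] = beta**2 * sd_x**2 / (beta**2 * sd_x**2 + sigma_eps**2)
    print(f"sd(x)={sd_x}: empirical R^2={empirical[sd_x]:.3f}, "
          f"formula={theoretical[sd_x]:.3f}")
```

The model and its noise are identical in both runs; only the spread of x changes, yet the R² roughly doubles.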

SLIDE 5

The truth about training error

  • Just like the R-squared, the training error will continue to decrease as the model becomes mathematically more complex (and therefore more able to shape itself to make correct predictions on data points whose variation is due to noise).
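This can be demonstrated by fitting polynomials of increasing degree to data generated from a simple linear model; a sketch assuming NumPy (data and degrees are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 20)
y = 1.0 + 2.0 * x + rng.normal(scale=0.3, size=x.size)  # truth is linear

training_mse = {}
for degree in (1, 3, 9):
    coefs = np.polyfit(x, y, degree)          # least-squares polynomial fit
    training_mse[degree] = np.mean((np.polyval(coefs, x) - y) ** 2)
    print(f"degree {degree}: training MSE = {training_mse[degree]:.4f}")
```

Because the models are nested, the training MSE is non-increasing in the degree, even though the degree-9 fit is mostly chasing noise.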

SLIDE 6

Fix R-squared: AIC/BIC/?IC…

  • The definition of AIC (Akaike Information Criterion)

AIC = 2k − 2 ln(L̂), where k is the number of model parameters and L̂ is the maximized likelihood

  • The definition of BIC (Bayesian Information Criterion)

BIC = k ln(n) − 2 ln(L̂), where n is the sample size
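For a Gaussian linear regression the maximized log-likelihood has a closed form in terms of SSE, so both criteria can be computed directly. A sketch (NumPy assumed; here k counts the regression coefficients plus the noise variance, a common convention):

```python
import math
import numpy as np

rng = np.random.default_rng(3)
n = 60
x = rng.normal(size=n)
y = 1.0 + 2.0 * x + rng.normal(size=n)

def aic_bic(predictors, y):
    """AIC = 2k - 2 ln(L) and BIC = k ln(n) - 2 ln(L) for Gaussian OLS."""
    n = len(y)
    X = np.column_stack([np.ones(n)] + list(predictors))
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    sse = np.sum((y - X @ beta) ** 2)
    k = X.shape[1] + 1                        # coefficients + noise variance
    # Maximized Gaussian log-likelihood with sigma^2 = SSE/n plugged in.
    loglik = -0.5 * n * (math.log(2 * math.pi * sse / n) + 1)
    return 2 * k - 2 * loglik, k * math.log(n) - 2 * loglik

aic, bic = aic_bic([x], y)
print(f"AIC={aic:.1f}  BIC={bic:.1f}")   # lower is better for both
```

Note that for n ≥ 8, ln(n) > 2, so BIC penalizes each extra parameter more heavily than AIC does.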

SLIDE 7

Training and testing data

  • A simple strategy: if a model is good, then it should perform well on unseen testing data (which represents the future data, and is of course unseen at the model-training stage).
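A minimal holdout sketch (NumPy assumed; the 70/30 split ratio is an arbitrary illustrative choice):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100
x = rng.normal(size=n)
y = 2.0 * x + rng.normal(size=n)

# Shuffle the row indices, then hold out 30% as unseen "future" data.
idx = rng.permutation(n)
test_idx, train_idx = idx[:30], idx[30:]

# Fit on the training rows only; evaluate on the held-out rows.
coefs = np.polyfit(x[train_idx], y[train_idx], 1)
test_mse = np.mean((np.polyval(coefs, x[test_idx]) - y[test_idx]) ** 2)
print(f"test MSE = {test_mse:.3f}")
```

The key point is that the test rows never touch the fitting step, so the test MSE is an honest estimate of future performance.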

SLIDE 8

K-Fold cross-validation

  • For example, K = 4: the data are split into 4 folds; each fold serves once as the testing set while the other 3 folds are used for training, and the 4 test errors are averaged.
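A sketch of K = 4 fold construction and the resulting cross-validated error (NumPy assumed; the simple linear model is illustrative):

```python
import numpy as np

def kfold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k disjoint folds."""
    idx = np.random.default_rng(seed).permutation(n)
    return np.array_split(idx, k)

rng = np.random.default_rng(5)
x = rng.normal(size=40)
y = 1.0 + 2.0 * x + rng.normal(scale=0.5, size=40)

folds = kfold_indices(len(x), k=4)
fold_mse = []
for fold in folds:
    train = np.setdiff1d(np.arange(len(x)), fold)   # the other 3 folds
    coefs = np.polyfit(x[train], y[train], 1)
    fold_mse.append(np.mean((np.polyval(coefs, x[fold]) - y[fold]) ** 2))
cv_mse = float(np.mean(fold_mse))
print(f"4-fold CV estimate of MSE: {cv_mse:.3f}")
```

Every observation is used for testing exactly once, which is what distinguishes K-fold from a single holdout split.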
SLIDE 9

Random sampling method

  • How do we implement the training/testing data scheme when we only have access to a single dataset (which we usually treat as the "training data", a concept often taken for granted)?
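One common answer is repeated random sub-sampling: draw many independent random train/test splits from the single dataset and average the test errors. A sketch (NumPy assumed; split sizes and repetition count are illustrative):

```python
import numpy as np

rng = np.random.default_rng(6)
n = 80
x = rng.normal(size=n)
y = 2.0 * x + rng.normal(size=n)

errors = []
for _ in range(50):                        # 50 independent 70/30 splits
    idx = rng.permutation(n)
    test, train = idx[:24], idx[24:]
    coefs = np.polyfit(x[train], y[train], 1)
    errors.append(np.mean((np.polyval(coefs, x[test]) - y[test]) ** 2))
mean_error = float(np.mean(errors))
print(f"mean test MSE over 50 random splits: {mean_error:.3f}")
```

Averaging over many random splits reduces the variance that a single lucky or unlucky split would introduce.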

SLIDE 10

Other dimensions of β€œerror”

  • The four outcomes of a binary classification: true positives (TP), false positives (FP), false negatives (FN), and true negatives (TN)
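A small self-contained sketch of counting the four outcomes (the labels below are hypothetical):

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, FP, FN, TN) for binary labels coded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn

y_true = [1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
tp, fp, fn, tn = confusion_counts(y_true, y_pred)
print(f"TP={tp} FP={fp} FN={fn} TN={tn}")  # TP=3 FP=1 FN=1 TN=3
```

Derived rates such as the true positive rate TP/(TP+FN) and false positive rate FP/(FP+TN) follow directly from these counts.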
SLIDE 11

The ROC curve (Receiver Operating Characteristics)

  • Consider a logistic regression model: its predicted probability is thresholded at a cut-off to produce class labels, and varying the cut-off traces out the curve
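The ROC curve is obtained by sweeping the classification threshold over the model's predicted probabilities and recording the (FPR, TPR) pair at each threshold. A self-contained sketch with hypothetical scores:

```python
def roc_points(scores, labels):
    """(FPR, TPR) pairs from sweeping the threshold over the scores."""
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]                    # threshold above every score
    for thr in sorted(set(scores), reverse=True):
        pred = [1 if s >= thr else 0 for s in scores]
        tpr = sum(1 for p, t in zip(pred, labels) if p == 1 and t == 1) / pos
        fpr = sum(1 for p, t in zip(pred, labels) if p == 1 and t == 0) / neg
        points.append((fpr, tpr))
    return points

scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]   # predicted P(class = 1)
labels = [1,   1,   0,   1,   0,    1,   0,   0]
curve = roc_points(scores, labels)
for fpr, tpr in curve:
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

As the threshold drops, more cases are predicted positive, so both rates can only increase; the curve therefore runs monotonically from (0, 0) to (1, 1).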
SLIDE 12

R lab

  • Download the R Markdown code from the course website
  • Conduct the experiments
  • Interpret the results
  • Repeat the analysis on other datasets