Agenda
Regularization: Ridge Regression and the LASSO
Statistics 305: Autumn Quarter 2006/2007
Wednesday, November 29, 2006
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Regularization: Ridge Regression and the LASSO Statistics 305: - - PowerPoint PPT Presentation
Agenda Regularization: Ridge Regression and the LASSO Statistics 305: Autumn Quarter 2006/2007 Wednesday, November 29, 2006 Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO Agenda Agenda 1 The
Agenda
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Agenda
1 The Bias-Variance Tradeoff 2 Ridge Regression
3 Cross Validation
4 The LASSO 5 Model Selection, Oracles, and the Dantzig Selector 6 References Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
ls has well known properties (e.g., Gauss-Markov, ML)
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part I: The Bias-Variance Tradeoff
Model Complexity Squared Error
Bias−Variance Tradeoff
Prediction Error Bias^2 Variance
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
ridge λ=∞ = 0 (intercept-only model)
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
DF Coefficient 2 4 6 8 10 age sex bmi map tc ldl hdl tch ltg glu
Ridge Regression Coefficient Paths
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
ridge λ
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
1 , v⊤ 2 , . . . , v⊤ p ) is a p × p matrix orthogonal matrix
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part II: Ridge Regression
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
−k (z) to the training
k
−k (z))2
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
30 35 40 45 50 55 0.16 0.18 0.20 0.22 0.24
CV Bands from a Ridge Regression on Spam Data
df Squared Error
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part III: Cross Validation
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
j=1 |ˆ
j | (equivalently, λ = 0), we obtain no shrinkage
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
1
2
3
j r).
4
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part IV: The LASSO
* * * * * ** * ** * * * 0.0 0.2 0.4 0.6 0.8 1.0 −500 500 |beta|/max|beta| Standardized Coefficients * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * * * * * * * ** * ** * * *
LASSO
5 2 1 4 9 2 4 7 10 12 * * * * * ** * ** * 0.0 0.2 0.4 0.6 0.8 1.0 −500 500 |beta|/max|beta| Standardized Coefficients * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** * * * * * * ** * ** *
LAR
5 2 1 4 9 2 4 7 10 * * * * * ** * * * * * * * 0.0 0.2 0.4 0.6 0.8 1.0 −500 500 |beta|/max|beta| Standardized Coefficients * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * * * * * * * ** * * * * * * *
Forward Stagewise
5 2 1 4 9 2 4 7 14
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part V: Model Selection, Oracles, and the Dantzig Selector
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part VI: References
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part VI: References
http://www.acm.caltech.edu/~emmanuel/papers/DantzigSelector.pdf.
http://cran.r-project.org/src/contrib/Descriptions/lars.html. Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO
Part VI: References
Statistics 305: Autumn Quarter 2006/2007 Regularization: Ridge Regression and the LASSO