SLIDE 1

MACHINE LEARNING TOOLBOX

Random forests and wine

SLIDE 2

Machine Learning Toolbox

Random forests

  • Popular type of machine learning model
  • Good for beginners
  • Robust to overfitting
  • Yield very accurate, non-linear models
SLIDE 3

Machine Learning Toolbox

Random forests

  • Unlike linear models, they have hyperparameters
  • Hyperparameters require manual specification
  • Can impact model fit and vary from dataset to dataset
  • Default values often OK, but occasionally need adjustment
SLIDE 4

Machine Learning Toolbox

Random forests

  • Start with a simple decision tree
  • Decision trees are fast, but not very accurate
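For a concrete starting point, here is a minimal sketch of a single decision tree, using the rpart package and the Sonar data that appears later in these slides (both the package and the data choice are assumptions for illustration, not part of the original deck):

# A minimal sketch: one decision tree on the Sonar data (assumed example)
library(rpart)    # CART-style decision trees
library(mlbench)  # provides the Sonar dataset
data(Sonar)

# Fit a single tree: fast to train, but typically less accurate
# than the ensembles discussed next
tree <- rpart(Class ~ ., data = Sonar)
print(tree)  # inspect the learned splits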
SLIDE 5

Machine Learning Toolbox

Random forests

  • Improve accuracy by fitting many trees
  • Fit each one to a bootstrap sample of your data
  • Called bootstrap aggregation or bagging
  • Randomly sample columns at each split
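To make the idea concrete, here is a rough, hand-rolled sketch of bagging with rpart trees (an illustration only; in practice ranger and randomForest do this for you):

# Hand-rolled bagging sketch (illustration; not how you'd do it in practice)
library(rpart)
library(mlbench)
data(Sonar)

set.seed(42)
n_trees <- 25
trees <- lapply(seq_len(n_trees), function(i) {
  boot_rows <- sample(nrow(Sonar), replace = TRUE)  # bootstrap sample
  rpart(Class ~ ., data = Sonar[boot_rows, ])       # one tree per sample
})

# Aggregate: majority vote over the trees' class predictions
# (predicting on the training data here, purely for illustration)
votes <- sapply(trees, function(tr) {
  as.character(predict(tr, newdata = Sonar, type = "class"))
})
bagged <- apply(votes, 1, function(v) names(which.max(table(v))))

Note that a true random forest also re-samples the candidate columns at each split, which this sketch omits.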
SLIDE 6

Machine Learning Toolbox

Random forests

# Load some data
> library(caret)
> library(mlbench)
> data(Sonar)

# Set seed
> set.seed(42)

# Fit a model
> model <- train(Class ~ ., data = Sonar, method = "ranger")

# Plot the results
> plot(model)

SLIDE 7

MACHINE LEARNING TOOLBOX

Let’s practice!

SLIDE 8

MACHINE LEARNING TOOLBOX

Explore a wider model space

SLIDE 9

Machine Learning Toolbox

Random forests require tuning

  • Hyperparameters control how the model is fit
  • Selected "by hand" before the model is fit
  • Most important is mtry
  • Number of randomly selected variables used at each split

  • Lower value = more random
  • Higher value = less random
  • Hard to know the best value in advance
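As a rough illustration of what mtry controls, here is a direct ranger call with a hand-picked value (the value 5 is arbitrary; the following slides show how caret searches for a good one):

# Setting mtry by hand in ranger (value chosen arbitrarily for illustration)
library(ranger)
library(mlbench)
data(Sonar)

set.seed(42)
fit <- ranger(Class ~ ., data = Sonar, mtry = 5, num.trees = 500)
fit$prediction.error  # out-of-bag estimate of the error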
SLIDE 10

Machine Learning Toolbox

caret to the rescue!

  • Not only does caret do cross-validation…
  • It also does grid search
  • Select hyperparameters based on out-of-sample error
SLIDE 11

Machine Learning Toolbox

Example: sonar data

# Load some data
> library(caret)
> library(mlbench)
> data(Sonar)

# Fit a model with a deeper tuning grid
> model <- train(Class ~ ., data = Sonar, method = "ranger",
                 tuneLength = 10)

# Plot the results
> plot(model)

  • tuneLength argument to caret::train()
  • Tells caret how many values of each tuning parameter to try
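Once the search finishes, the selected hyperparameters are stored on the train object (standard caret fields):

# Inspect the grid search results
> model$results   # out-of-sample performance for each candidate
> model$bestTune  # the winning hyperparameter combination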
SLIDE 12

Machine Learning Toolbox

Plot the results

SLIDE 13

MACHINE LEARNING TOOLBOX

Let’s practice!

SLIDE 14

MACHINE LEARNING TOOLBOX

Custom tuning grids

SLIDE 15

Machine Learning Toolbox

Pros and cons of custom tuning

  • Pass custom tuning grids to tuneGrid argument
  • Advantages
  • Most flexible method for fitting caret models
  • Complete control over how the model is fit
  • Disadvantages
  • Requires some knowledge of the model
  • Can dramatically increase run time
SLIDE 16

Machine Learning Toolbox

Custom tuning example

# Define a custom tuning grid
> myGrid <- data.frame(mtry = c(2, 3, 4, 5, 10, 20))

# Fit a model with a custom tuning grid
> set.seed(42)
> model <- train(Class ~ ., data = Sonar, method = "ranger",
                 tuneGrid = myGrid)

# Plot the results
> plot(model)

SLIDE 17

Machine Learning Toolbox

Custom tuning

SLIDE 18

MACHINE LEARNING TOOLBOX

Let’s practice!

SLIDE 19

MACHINE LEARNING TOOLBOX

Introducing glmnet

SLIDE 20

Machine Learning Toolbox

Introducing glmnet

  • Extension of glm models with built-in variable selection
  • Helps deal with collinearity and small sample sizes
  • Two primary forms
  • Lasso regression: penalizes the absolute magnitude of coefficients (L1 penalty), shrinking some exactly to zero
  • Ridge regression: penalizes the squared magnitude of coefficients (L2 penalty), shrinking all coefficients toward zero
  • Attempts to find a parsimonious (i.e. simple) model
  • Pairs well with random forest models
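For a concrete sense of the two forms, here is a minimal sketch calling glmnet directly on the Sonar data from earlier (caret wraps these calls for you; the data choice is an assumption for illustration):

# Lasso and ridge fits with glmnet (sketch on the Sonar data)
library(glmnet)
library(mlbench)
data(Sonar)

x <- as.matrix(Sonar[, -ncol(Sonar)])  # predictors as a numeric matrix
y <- Sonar$Class                       # two-class outcome

lasso <- glmnet(x, y, family = "binomial", alpha = 1)  # pure lasso
ridge <- glmnet(x, y, family = "binomial", alpha = 0)  # pure ridge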

SLIDE 21

Machine Learning Toolbox

Tuning glmnet models

  • Combination of lasso and ridge regression
  • Can fit a mix of the two models
  • alpha [0, 1]: 0 = pure ridge, 1 = pure lasso
  • lambda (0, infinity): size of the penalty
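For reference, these two knobs enter glmnet's elastic-net penalty as (standard glmnet parameterization):

penalty = lambda * ( (1 - alpha) / 2 * ||beta||_2^2 + alpha * ||beta||_1 )

so alpha blends the ridge and lasso terms, and lambda scales the whole penalty.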
SLIDE 22

Machine Learning Toolbox

Example: "don't overfit"

# Load data
> overfit <- read.csv(
    "http://s3.amazonaws.com/assets.datacamp.com/production/course_1048/datasets/overfit.csv"
  )

# Make a custom trainControl
> myControl <- trainControl(
    method = "cv",
    number = 10,
    summaryFunction = twoClassSummary,
    classProbs = TRUE,  # Super important!
    verboseIter = TRUE
  )

SLIDE 23

Machine Learning Toolbox

Try the defaults

# Fit a model
> set.seed(42)
> model <- train(y ~ ., overfit, method = "glmnet",
                 trControl = myControl)

# Plot results
> plot(model)

  • 3 values of alpha
  • 3 values of lambda
SLIDE 24

Machine Learning Toolbox

Plot the results

SLIDE 25

MACHINE LEARNING TOOLBOX

Let’s practice!

SLIDE 26

MACHINE LEARNING TOOLBOX

glmnet with custom tuning grid

SLIDE 27

Machine Learning Toolbox

Custom tuning glmnet models

  • 2 tuning parameters: alpha and lambda
  • For a single alpha, all values of lambda are fit simultaneously
  • Many models for the "price" of one
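This is built into glmnet itself: a single call returns the whole lambda path (a sketch reusing the x and y from the earlier glmnet example):

# One call fits the entire lambda sequence
fit <- glmnet(x, y, family = "binomial", alpha = 1)
length(fit$lambda)   # how many lambda values came out of one fit
coef(fit, s = 0.01)  # coefficients at any lambda along the path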
SLIDE 28

Machine Learning Toolbox

Example: glmnet tuning

# Make a custom tuning grid
> myGrid <- expand.grid(
    alpha = 0:1,
    lambda = seq(0.0001, 0.1, length = 10)
  )

# Fit a model
> set.seed(42)
> model <- train(y ~ ., overfit, method = "glmnet",
                 tuneGrid = myGrid,
                 trControl = myControl)

# Plot results
> plot(model)

SLIDE 29

Machine Learning Toolbox

Compare models visually

SLIDE 30

Machine Learning Toolbox

Full regularization path

> plot(model$finalModel)
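plot.glmnet also takes an xvar argument if you prefer the path against log(lambda), with curves labeled by variable index:

> plot(model$finalModel, xvar = "lambda", label = TRUE)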

SLIDE 31

MACHINE LEARNING TOOLBOX

Let’s practice!