Introduction to Random Forest
TREE-BASED MODELS IN R
Erin LeDell
Instructor
- Better performance than a single tree
- Samples a subset of the features at each split
- An improved version of bagging
- Reduced correlation between the sampled trees
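The bullets above can be sketched in code. This is a minimal illustration (not from the slides, using the built-in iris data): bagging considers all predictors at every split, while a random forest considers a random subset, whose size for classification defaults to the square root of the number of predictors.

```r
library(randomForest)

set.seed(1)
p <- ncol(iris) - 1  # 4 predictors

# Bagging: all p predictors are candidates at every split
bagged <- randomForest(Species ~ ., data = iris, mtry = p)

# Random forest: a random subset of predictors at every split,
# which decorrelates the trees
rf <- randomForest(Species ~ ., data = iris, mtry = floor(sqrt(p)))
```

Because each tree in the forest sees a different random subset of features, the trees are less correlated, and averaging them reduces variance more effectively than plain bagging.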
library(randomForest)
?randomForest
library(randomForest)

# Train a default RF model (500 trees)
model <- randomForest(formula = response ~ ., data = train)
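The `response` and `train` objects in the snippet above are placeholders from the slides. A runnable version of the same call, using the built-in iris data instead, looks like this:

```r
library(randomForest)

set.seed(1)
# Train a default RF model (500 trees) on iris
model <- randomForest(formula = Species ~ ., data = iris)

# Predict classes for new observations
pred <- predict(model, newdata = iris[1:5, ])
```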
# Print the credit_model output
print(credit_model)

Call:
 randomForest(formula = default ~ ., data = credit_train)
               Type of random forest: classification
                     Number of trees: 500

        OOB estimate of error rate: 24.12%
Confusion matrix:
     no yes class.error
no  516  46  0.08185053
yes 147  91  0.61764706
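The per-class errors in the confusion matrix above are just the off-diagonal count divided by the row total. A quick arithmetic check, with the counts copied from the slide:

```r
# "no" row: 516 correct, 46 misclassified
no_error  <- 46 / (516 + 46)

# "yes" row: 91 correct, 147 misclassified
yes_error <- 147 / (147 + 91)
```

Note how unbalanced the two class errors are: the model misclassifies most of the "yes" (default) cases, which the overall 24.12% OOB error rate alone would hide.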
# Grab OOB error matrix & take a look
err <- credit_model$err.rate
head(err)

           OOB        no       yes
[1,] 0.3414634 0.2657005 0.5375000
[2,] 0.3311966 0.2462908 0.5496183
[3,] 0.3232831 0.2476636 0.5147929
[4,] 0.3164933 0.2180294 0.5561224
[5,] 0.3197756 0.2095808 0.5801887
[6,] 0.3176944 0.2115385 0.5619469
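The `err.rate` matrix has one row per tree, so it can be plotted to see how the OOB error evolves as trees are added. A self-contained sketch (iris data instead of the slides' `credit_train`, which is not available here):

```r
library(randomForest)

set.seed(1)
model <- randomForest(Species ~ ., data = iris)

# One row per tree: overall OOB error plus one column per class
err <- model$err.rate

# Plot OOB (and per-class) error vs. number of trees
plot(model)

# Number of trees at which the OOB error is lowest
best_ntree <- which.min(err[, "OOB"])
```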
# Look at final OOB error rate (last row of the err matrix)
oob_err <- err[nrow(err), "OOB"]
print(oob_err)

    OOB
0.24125

# Matches the error rate in the printed model output
print(credit_model)

Call:
 randomForest(formula = default ~ ., data = credit_train)
               Type of random forest: classification
                     Number of trees: 500

        OOB estimate of error rate: 24.12%
Advantages of OOB error:
- Can evaluate your model without a separate test set
- Computed automatically by the randomForest() function

Limitations of OOB error:
- Only estimates error (not AUC, log-loss, etc.)
- Can't be used to compare Random Forest performance to other types of models
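The first advantage can be checked directly: the OOB estimate usually lands close to the error measured on a held-out test set. A sketch (iris data, hypothetical 100/50 split; none of these object names come from the slides):

```r
library(randomForest)

set.seed(1)
idx   <- sample(nrow(iris), 100)
train <- iris[idx, ]
test  <- iris[-idx, ]

model <- randomForest(Species ~ ., data = train)

# OOB error: computed automatically, no test set needed
oob_err <- model$err.rate[nrow(model$err.rate), "OOB"]

# Test error: requires a separate labelled test set
pred     <- predict(model, newdata = test)
test_err <- mean(pred != test$Species)
```

If you need AUC or log-loss, or want to compare against a non-forest model, you still need the held-out set and predicted probabilities; the OOB error alone is not enough.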
- ntree: number of trees
- mtry: number of variables randomly sampled as candidates at each split
- sampsize: number of samples to train on
- nodesize: minimum size (number of samples) of the terminal nodes
- maxnodes: maximum number of terminal nodes
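All five hyperparameters are arguments to `randomForest()`. A sketch with hypothetical values (the numbers are illustrative, not recommendations):

```r
library(randomForest)

set.seed(1)
model <- randomForest(Species ~ ., data = iris,
                      ntree    = 250,  # number of trees
                      mtry     = 2,    # variables tried at each split
                      sampsize = 100,  # rows sampled per tree
                      nodesize = 5,    # min terminal-node size
                      maxnodes = 10)   # max terminal nodes per tree
```

Larger `nodesize` and smaller `maxnodes` both produce shallower trees, which reduces variance at the cost of some bias.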
# Execute the tuning process
set.seed(1)
res <- tuneRF(x = train_predictor_df,
              y = train_response_vector,
              ntreeTry = 500)

# Look at results
print(res)

      mtry OOBError
2.OOB    2   0.2475
4.OOB    4   0.2475
8.OOB    8   0.2425
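`tuneRF()` returns a matrix like the one above, so the best `mtry` can be pulled out programmatically and used to retrain. A self-contained sketch (iris data instead of the slides' `train_predictor_df`/`train_response_vector`, which are not defined here):

```r
library(randomForest)

set.seed(1)
res <- tuneRF(x = iris[, -5], y = iris$Species, ntreeTry = 500)

# Pick the mtry value with the lowest OOB error and retrain with it
best_mtry <- res[which.min(res[, "OOBError"]), "mtry"]
model <- randomForest(Species ~ ., data = iris, mtry = best_mtry)
```

Alternatively, `tuneRF(..., doBest = TRUE)` returns the retrained forest directly instead of the error matrix.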