Additive Logistic Regression: a Statistical View of Boosting
Jerome Friedman, Trevor Hastie, Rob Tibshirani (Stanford University)

SLIDE 1. Additive Logistic Regression: a Statistical View of Boosting

Jerome Friedman, Trevor Hastie, Rob Tibshirani
Stanford University

Thanks to Bogdan Popescu for helpful and very lively discussions on the history of boosting, and for help in preparing that part of this talk.

  • Email: trevor@stat.stanford.edu
  • Ftp: stat.stanford.edu, pub/hastie
  • WWW: http://www-stat.stanford.edu/~trevor

These transparencies are available via ftp: ftp.stat.stanford.edu, pub/hastie/boost.ps.

Classification Problem

[Figure: scatterplot of the two-class training sample, points labeled 0 and 1.]

Data: (X, Y), X ∈ R^p, Y ∈ {−1, 1}.
  • X is the predictor (feature); Y is the class label (response).
  • (X, Y) have a joint probability distribution D.

Goal: Based on N training pairs (X_i, Y_i) drawn from D, produce a classifier Ĉ(X) ∈ {−1, 1}.

Goal: choose Ĉ to have low generalization error

    R(Ĉ) = P_D(Ĉ(X) ≠ Y) = E_D[ 1(Ĉ(X) ≠ Y) ].
SLIDE 2. Deterministic Concepts

[Figure: two-class training sample for a deterministic concept, points labeled 0 and 1.]

  • X ∈ R^p has distribution D.
  • C(X) is a deterministic function, a member of a concept class.

Goal: Based on N training pairs (X_i, Y_i = C(X_i)) drawn from D, produce a classifier Ĉ(X) ∈ {−1, 1}.

Goal: choose Ĉ to have low generalization error

    R(Ĉ) = P_D(Ĉ(X) ≠ C(X)) = E_D[ 1(Ĉ(X) ≠ C(X)) ].

Classification Trees

[Figure: a CART classification tree with splits on x.1 and x.2 (root split x.2 < −1.06711; misclassification counts such as 94/200 shown at each node; leaves labeled 0 or 1).]

SLIDE 3. Decision Boundary: Tree

[Figure: decision boundary of a single classification tree on the nested-spheres data.]

When the nested spheres are in R^10, CART produces a rather noisy and inaccurate rule Ĉ(X), with quite high error rates.

Bagging and Boosting

Classification trees can be simple, but often produce noisy (bushy) or weak (stunted) classifiers.
  • Bagging (Breiman, 1996): Fit many large trees to bootstrap-resampled versions of the training data, and classify by majority vote. (A code sketch follows at the end of this slide.)
  • Boosting (Freund & Schapire, 1996): Fit many large or small trees to reweighted versions of the training data. Classify by weighted majority vote.

In general: Boosting > Bagging > Single Tree.

"AdaBoost ... best off-the-shelf classifier in the world" (Leo Breiman, NIPS workshop).
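
Below is a minimal sketch (not from the talk) of the bagging procedure just described, using scikit-learn decision trees; names such as bagged_trees and n_trees are illustrative.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def bagged_trees(X, y, n_trees=50, rng=None):
    """Fit many large trees to bootstrap resamples of the training data."""
    rng = np.random.default_rng(rng)
    N = len(y)
    trees = []
    for _ in range(n_trees):
        idx = rng.integers(0, N, size=N)       # bootstrap resample (with replacement)
        tree = DecisionTreeClassifier()        # large (unpruned) tree
        tree.fit(X[idx], y[idx])
        trees.append(tree)
    return trees

def predict_majority(trees, X):
    """Classify by unweighted majority vote; labels assumed in {-1, +1}."""
    votes = np.stack([t.predict(X) for t in trees])
    return np.sign(votes.sum(axis=0))
```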
SLIDE 4. Boosting (schematic)

[Figure: training sample → weighted sample → weighted sample → ...; weak classifiers f1(x), f2(x), f3(x), ..., fB(x); final classifier sign[Σ_b α_b f_b(x)].]

The weighting in boosting can be achieved by weighted importance sampling.

Bagging and Boosting: points from nested spheres in R^10

  • The Bayes error rate is zero (the concept is deterministic).
  • Trees are grown best-first, without pruning.
  • The leftmost iteration is a single tree.
SLIDE 5. Decision Boundary: Boosting

[Figure: decision boundary produced by boosting on the nested-spheres data.]

Bagging and Boosting average many trees, and produce smoother decision boundaries.

AdaBoost (Freund & Schapire)

1. Start with weights w_i = 1/N, i = 1, ..., N; y_i ∈ {−1, 1}.
2. Repeat for m = 1, 2, ..., M:
   (a) Estimate the weak learner f_m(x) ∈ {−1, 1} from the training data with weights w_i.
   (b) Compute e_m = E_w[ 1(y ≠ f_m(x)) ] and c_m = log[(1 − e_m)/e_m].
   (c) Set w_i ← w_i exp[ c_m · 1(y_i ≠ f_m(x_i)) ], i = 1, ..., N, and renormalize so that Σ_i w_i = 1.
3. Output the weighted majority classifier C(x) = sign[ Σ_{m=1}^M c_m f_m(x) ].
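
A minimal sketch (not from the talk) of the algorithm above, using decision stumps as the weak learner; it assumes labels in {-1, +1} and 0 < e_m < 1.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def discrete_adaboost(X, y, M=100):
    """Discrete AdaBoost with stumps; y must take values in {-1, +1}."""
    N = len(y)
    w = np.full(N, 1.0 / N)                      # 1. uniform starting weights
    learners, cs = [], []
    for _ in range(M):                           # 2. boosting iterations
        f = DecisionTreeClassifier(max_depth=1)  # (a) weak learner (stump)
        f.fit(X, y, sample_weight=w)
        miss = (f.predict(X) != y)
        e = np.sum(w * miss)                     # (b) weighted error e_m ...
        c = np.log((1.0 - e) / e)                #     ... and vote weight c_m
        w = w * np.exp(c * miss)                 # (c) up-weight the mistakes
        w /= w.sum()                             #     renormalize
        learners.append(f)
        cs.append(c)
    return learners, np.array(cs)

def predict(learners, cs, X):
    """3. Weighted majority vote."""
    F = sum(c * f.predict(X) for f, c in zip(learners, cs))
    return np.sign(F)
```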
SLIDE 6. History of Boosting

[History diagram, approximately:]

  • L.G. Valiant (1984): the PAC learning model.
  • R.E. Schapire (1990), Y. Freund (1995): genesis of boosting; the concept of boosting a weak learner appears.
  • Schapire & Freund (1995, 1997): AdaBoost is born.
  • Freund & Schapire (1996), Breiman (1996, 1997), R. Quinlan (1996): experiments with AdaBoost.
  • L. Breiman (1996): bagging.
  • Schapire, Freund, P. Bartlett, Lee (1997); Schapire, Y. Singer (1998): attempts to explain why AdaBoost works.
  • Friedman, Hastie, Tibshirani (1998); Schapire, Singer, Freund, Iyer (1998): improvements.

PAC Learning Model

  • X, D: instance space X, with distribution D.
  • C: X → {−1, 1}: concept, a member of a concept class C.
  • h: X → {−1, 1}: hypothesis, a member of a hypothesis space H.
  • error(h) = P_D(C(X) ≠ h(X)).

Definition: Consider a concept class C defined over a set X of instances of length N. L is a learning algorithm using hypothesis space H. C is PAC learnable by L using H if, for all C ∈ C, all distributions D over X, and all ε, δ > 0, learner L will with Pr ≥ 1 − δ output an h ∈ H such that error_D(h) ≤ ε, in time polynomial in 1/ε, 1/δ, N, and size(C).

Such an L is called a strong learner.
SLIDE 7. Boosting a Weak Learner

A weak learner L produces an h with error rate β < 1/2, with Pr ≥ 1 − δ, for any D.

L has access to a continuous stream of training data and to a class oracle.

1. L learns h_1 on the first N training points.
2. L randomly filters the next batch of training points, extracting N/2 points correctly classified by h_1 and N/2 incorrectly classified, and produces h_2.
3. L builds a third training set of N points for which h_1 and h_2 disagree, and produces h_3.
4. L outputs h = Majority Vote(h_1, h_2, h_3).

THEOREM (Schapire, "The Strength of Weak Learnability"): error_D(h) ≤ 3β² − 2β³. (A quick plausibility check follows below.)
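
As a heuristic check (not Schapire's actual proof, where the three hypotheses are not independent): if h_1, h_2, h_3 erred independently with probability β, the majority vote would err exactly when at least two of them err,

```latex
\Pr[\text{majority errs}]
  = \binom{3}{2}\beta^2(1-\beta) + \beta^3
  = 3\beta^2 - 2\beta^3 \;<\; \beta
  \quad\text{for } 0 < \beta < \tfrac{1}{2},
```

so the combined classifier is strictly better than each weak component.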
Boosting: Training Error

Nested spheres in R^10; the Bayes error is zero.

Boosting drives the training error to zero. Further iterations continue to improve the test error in many examples.
SLIDE 8. Boosting Noisy Problems

Nested Gaussians in R^10; the Bayes error is nonzero.

Here the test error does increase, but quite slowly.
Bagging and Boosting: Smaller Trees

Points in R^10; the Bayes error rate is zero. Each tree is grown best-first to a small fixed number of terminal nodes. The Bagging/Boosting gap is wider.
SLIDE 9. Bagging and Boosting: Stumps

Points in R^10; the Bayes error rate is zero. Each tree is a stump (two terminal nodes), grown best-first. Bagging fails; boosting does its best ever.
Prediction Games

Results of Freund & Schapire, and of Breiman.

Idea: Start with fixed learners f_1(x), f_2(x), ..., f_M(x). Play a two-person game:
  • Player 1 picks observation weights w_i.
  • Player 2 picks learner weights c_m, i.e. he will use learner f_m(x) with probability c_m.

Player 1 tries to make the prediction problem as hard as possible, while Player 2 does the best he can on the weighted problem. We judge difficulty by the approximate margin, a smooth version of misclassification loss.
SLIDE 10. Prediction Games (continued)

Result: this is a zero-sum game, and the minimax theorem gives the best strategy for each player. Furthermore, AdaBoost converges to this optimal strategy.

However, the link with the statistical properties of actual AdaBoost is tenuous:
  • why should minimizing the hardest weighted problem make the test error smaller?
  • actual AdaBoost does not use a random choice of learner;
  • actual AdaBoost finds the learners f_m(x).
Stagewise Additive Modeling

Boosting builds an additive model F(x) = Σ_{m=1}^M f_m(x), and then C(x) = sign[F(x)]. We do things like that in statistics:

  • GAMs: F(x) = Σ_j f_j(x_j)
  • Basis expansions: F(x) = Σ_{m=1}^M β_m h_m(x)

Traditionally, each of the terms f_m(x) is different in nature, and they are fit jointly (i.e. by least squares or maximum likelihood). With boosting, each term is equivalent in nature, and they are fit in a stagewise fashion.

Simple example: stagewise least squares. Fix the past M − 1 functions, and update the Mth using a tree (see the sketch below):

    min_{f_M ∈ Tree(x)}  E[ Y − Σ_{m=1}^{M−1} f_m(x) − f_M(x) ]².
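
A minimal sketch (not from the talk) of stagewise least-squares boosting with small regression trees: each new tree is fit to the current residual, and the past trees are never refit. Names like n_stages and max_leaf_nodes are illustrative.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def stagewise_least_squares(X, y, n_stages=100, max_leaf_nodes=4):
    """At stage M, fit f_M to the residual y - sum of the fixed past trees."""
    residual = y.astype(float).copy()
    trees = []
    for _ in range(n_stages):
        tree = DecisionTreeRegressor(max_leaf_nodes=max_leaf_nodes)
        tree.fit(X, residual)            # least-squares fit to current residual
        residual -= tree.predict(X)      # past stages stay fixed
        trees.append(tree)
    return trees

def predict_F(trees, X):
    """F(x) = sum_m f_m(x)."""
    return sum(t.predict(X) for t in trees)
```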
SLIDE 11. Boosting and Additive Models
Discrete AdaBoost builds an additive model F(x) = Σ_{m=1}^M c_m f_m(x) by stagewise optimization of

    J(F) = E[ e^{−yF(x)} ].

Given an imperfect F_{M−1}(x), the updates in Discrete AdaBoost correspond to a Newton step towards minimizing

    J(F_{M−1}(x) + c_M f_M(x)) = E[ e^{−y(F_{M−1}(x) + c_M f_M(x))} ]

over f_M(x) ∈ {−1, 1}, with step length c_M.

E[ e^{−yF(x)} ] is minimized at

    F(x) = (1/2) log [ P(y = 1 | x) / P(y = −1 | x) ].

Hence AdaBoost is fitting an additive logistic regression model.
Details

  • f_M: f_M(x) = arg min_{g(x) ∈ {−1,1}} E_w[ 1(y ≠ g(x)) ], with weights w(x, y) = e^{−yF_{M−1}(x)}.
  • c_M: c_M = arg min_c E_w[ e^{−c y f_M(x)} ] = (1/2) log [(1 − e)/e], with e = E_w[ 1(y ≠ f_M(x)) ].

Empirical version: at each stage, f_M(x) is estimated by the classification at the terminal nodes of a tree grown to appropriately weighted versions of the training data.

At the Mth stage of the Discrete AdaBoost iterations, the weights are such that f_{M−1} has weighted training error 1/2.
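
For completeness, here is the short calculation behind the minimizer claim (standard, though not spelled out on the slide): minimize the conditional criterion pointwise in F(x).

```latex
E\!\left[e^{-yF(x)} \mid x\right]
  = P(y{=}1\mid x)\,e^{-F(x)} + P(y{=}{-}1\mid x)\,e^{F(x)},
\qquad
\frac{\partial}{\partial F(x)}\,E\!\left[e^{-yF(x)} \mid x\right] = 0
\;\Longrightarrow\;
F(x) = \frac{1}{2}\,\log\frac{P(y{=}1\mid x)}{P(y{=}{-}1\mid x)}.
```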
SLIDE 12. Real AdaBoost

1. Start with weights w_i = 1/N, i = 1, ..., N; y_i ∈ {−1, 1}.
2. Repeat for m = 1, 2, ..., M:
   (a) Fit the class probability estimate p_m(x) = P_w(y = 1 | x) using weights w_i on the training data.
   (b) Set f_m(x) = (1/2) log [ p_m(x) / (1 − p_m(x)) ] ∈ R.
   (c) Set w_i ← w_i exp[ −y_i f_m(x_i) ], i = 1, ..., N, and renormalize so that Σ_i w_i = 1.
3. Output the classifier sign[ Σ_{m=1}^M f_m(x) ]. (A code sketch follows below.)
Real AdaBoost (continued)

Real AdaBoost also builds an additive logistic regression model,

    (1/2) log [ P(y = 1 | x) / P(y = −1 | x) ] = Σ_{m=1}^M f_m(x),

by stagewise optimization of J(F) = E[ e^{−yF(x)} ].

Given an imperfect F_{M−1}(x), Real AdaBoost minimizes

    J(F_{M−1}(x) + f_M(x)) = E[ e^{−y(F_{M−1}(x) + f_M(x))} ]

over f_M(x) ∈ R, with solution

    f_M(x) = (1/2) log [ P_w(y = 1 | x) / P_w(y = −1 | x) ],

where the weights are w(x, y) = e^{−yF_{M−1}(x)}.

Empirical version: at each stage, P_w(· | x) is estimated by averages at the terminal nodes of a tree grown to appropriately weighted versions of the training data.
SLIDE 13. Why J(F) = E[ e^{−yF(x)} ]?

  • e^{−yF(x)} is a monotone, smooth upper bound on the misclassification loss at x.
  • J(F) is an expected loss with the same minimizer as the binomial log-likelihood, and equivalent to it to second order.
  • Stagewise binomial maximum-likelihood estimation of additive models based on trees works at least as well.
Stagewise Maximum Likelihood

Consider the model

    F_M(x) = (1/2) log [ P(y = 1 | x) / P(y = −1 | x) ] = Σ_{m=1}^M f_m(x),

or

    P(y = 1 | x) = p(x) = e^{F(x)} / (e^{F(x)} + e^{−F(x)}).

The binomial log-likelihood is

    ℓ(F) = E[ y* log p(x) + (1 − y*) log(1 − p(x)) ]
         = E[ 2 y* F(x) − log(1 + e^{2F(x)}) ],    where y* = (y + 1)/2 ∈ {0, 1}.

Stagewise maximum likelihood: given an imperfect F_{M−1}(x), maximize ℓ(F_{M−1}(x) + f_M(x)) over f_M(x) ∈ R.

The LogitBoost algorithm takes a single Newton step at each stage; the step is sketched below.
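
The Newton step, reconstructed from the log-likelihood above (a standard computation, with the algebra not shown on the slide): with ℓ(F) = E[ 2y*F − log(1 + e^{2F}) ] and p = e^F/(e^F + e^{−F}),

```latex
\frac{\partial \ell}{\partial F} = 2\,E[\,y^* - p\,],
\qquad
\frac{\partial^2 \ell}{\partial F^2} = -4\,E[\,p(1-p)\,],
\qquad\Longrightarrow\qquad
F \;\leftarrow\; F + \tfrac{1}{2}\,\frac{E[\,y^* - p\,]}{E[\,p(1-p)\,]}
  \;=\; F + \tfrac{1}{2}\,E_w\!\left[\frac{y^* - p}{p(1-p)}\right],
```

with weights w = p(1 − p); the last term is the weighted average of the working response z = (y* − p)/(p(1 − p)), which explains steps (a)-(c) of the algorithm that follows.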
SLIDE 14. LogitBoost

1. Start with weights w_i = 1/N, i = 1, ..., N; F(x) = 0; and probability estimates p(x_i) = 1/2.
2. Repeat for m = 1, 2, ..., M:
   (a) Compute the working response and weights:
         z_i = (y*_i − p(x_i)) / [ p(x_i)(1 − p(x_i)) ],
         w_i = p(x_i)(1 − p(x_i)).
   (b) Fit the function f_m(x) by a weighted least-squares regression of z_i on x_i using weights w_i (i.e. a weighted tree).
   (c) Update F(x) ← F(x) + (1/2) f_m(x), and update p(x).
3. Output the classifier sign[F(x)] = sign[ Σ_{m=1}^M f_m(x) ]. (A code sketch follows below.)

We also have a natural generalization of LogitBoost for multiple classes.
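
A minimal LogitBoost sketch (not from the talk) following these steps, with regression stumps; the clip on z is an illustrative safeguard for the division by p(1 − p), not part of the slide.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def logitboost(X, y, M=100, z_max=4.0):
    """LogitBoost for y in {-1, +1}; uses y* = (y + 1)/2 in {0, 1}."""
    N = len(y)
    ystar = (y + 1) / 2.0
    F = np.zeros(N)
    p = np.full(N, 0.5)                               # 1. start at p = 1/2
    trees = []
    for _ in range(M):                                # 2. iterations
        w = p * (1 - p)                               # (a) weights ...
        z = np.clip((ystar - p) / w, -z_max, z_max)   #     ... and working response
        tree = DecisionTreeRegressor(max_depth=1)
        tree.fit(X, z, sample_weight=w)               # (b) weighted least squares
        F += 0.5 * tree.predict(X)                    # (c) update F(x) ...
        p = 1.0 / (1.0 + np.exp(-2.0 * F))            #     ... and p(x)
        trees.append(tree)
    return trees

def predict(trees, X):
    """3. sign[F(x)]."""
    F = sum(0.5 * t.predict(X) for t in trees)
    return np.sign(F)
```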
Additive Logistic Trees

Best-first tree growing allows us to limit the size of each tree, and hence the interaction order. By collecting terms we get

    F(x) = Σ_j f_j(x_j) + Σ_{j,k} f_{jk}(x_j, x_k) + Σ_{j,k,l} f_{jkl}(x_j, x_k, x_l) + ...

Coordinate functions for the Additive Stumps model.

Boosting uses stagewise optimization, as opposed to joint optimization (full least squares / backfitting).
SLIDE 15

[Figure: coordinate functions for the additive stumps model.]
SLIDE 16. Large Real Example: Satimage

[Table: test error on the Satimage data (4,435 training and 2,000 test points, 36 features, 6 classes): CART versus LogitBoost, Real AdaBoost, Gentle AdaBoost and Discrete AdaBoost, each at several settings of terminal nodes and iterations.]

Large Real Example: Letter

[Table: test error on the Letter data (16,000 training and 4,000 test points, 16 features, 26 classes): CART versus LogitBoost, Real AdaBoost, Gentle AdaBoost and Discrete AdaBoost, each at several settings of terminal nodes and iterations, with a column for the fraction of training points used.]
SLIDE 17. Weight Trimming
  • At each iteration, observations with w_i < t(β) are not used for training; t(β) is the βth quantile of the weight distribution. (A small illustration follows below.)
  • Works better for LogitBoost.
  • LogitBoost has weights w_i = p_i(1 − p_i), which are large near the decision boundary.
  • AdaBoost has weights w_i = e^{−y_i F_M(x_i)} (recall y_i ∈ {−1, 1}): large for misclassified points.
  • For multiple-class procedures, if the class-k logit F_{mk} becomes very negative (on the order of −log N), training stops for that class.
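
A tiny illustration (not from the talk) of the trimming step inside a boosting loop, assuming a weight vector w and a trim fraction beta (both names illustrative):

```python
import numpy as np

def trimmed_indices(w, beta=0.1):
    """Keep observations whose weight is at or above the beta-th quantile t(beta)."""
    t = np.quantile(w, beta)       # threshold t(beta)
    return np.where(w >= t)[0]     # indices used for training this iteration

# usage inside an iteration: fit the weak learner only on the kept points
# idx = trimmed_indices(w)
# learner.fit(X[idx], y[idx], sample_weight=w[idx])
```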
Summary and Closing Comments

  • The introduction of boosting by Schapire, Freund and colleagues has brought us an exciting and important set of new ideas.
  • Boosting fits additive logistic models, where each component (base learner) is simple. The complexity needed for the base learner depends on the target function.
  • There is little connection between weighted boosting and bagging: boosting is primarily a bias-reduction procedure, while the goal of bagging is variance reduction. The distinction becomes blurred when weighting is achieved in boosting by importance sampling.
SLIDE 18. Margins

[Figure: training points with their margins relative to the decision boundary.]

margin(X) = M(X): the weighted vote for the correct class minus that for the best wrong class, normalized.

(Freund & Schapire): Boosting generalizes because it pushes the training margins well above zero, while keeping the VC dimension under control (also Vapnik). With Pr ≥ 1 − δ:

    P_Test( M(X) ≤ 0 ) ≤ P_Train( M(X) ≤ θ ) + O( sqrt( log N · log|H| / (N θ²) + log(1/δ) / N ) ).
How does Boosting avoid overfitting?

  • As iterations proceed, the impact of each change is localized.
  • Parameters are not jointly optimized: stagewise estimation slows down the learning process.
  • Classifiers are hurt less by overfitting (Cover and Hart).
  • Margin theory of Schapire and Freund, and of Vapnik; disputed by Breiman.

The jury is still out.