Introduction Examples Trees and Forests Stata approach References
Implementing machine learning methods in Stata
Austin Nichols 6 September 2018
Austin Nichols Implementing machine learning methods in Stata
Implementing machine learning methods in Stata Austin Nichols 6 - - PowerPoint PPT Presentation
Introduction Examples Trees and Forests Stata approach References Implementing machine learning methods in Stata Austin Nichols 6 September 2018 Austin Nichols Implementing machine learning methods in Stata Introduction Examples
Introduction Examples Trees and Forests Stata approach References
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Preliminaries Methods
◮ Methods to derive a rule from data, or reduce the dimension of available
◮ Also known as data mining, data science, statistical learning, or statistics. ◮ Or econometrics, if you are in my tribe.
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Preliminaries Methods
◮ Clustering: cluster kmeans, kmedians ◮ Principal component analysis: pca ◮ Latent class analysis: gsem in Stata 15
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Preliminaries Methods
◮ Regression or linear discriminants: regress, discrim lda ◮ Nonlinear discriminants: discrim knn ◮ Shrinkage: lasso, ridge regression, findit lassopack ◮ Generalized additive models (findit gam), wavelets, splines (mkspline) ◮ Nonparametric regress e.g. lpoly, npregress ◮ Support Vector Machines or kernel machines ◮ “Structural” Equation Models e.g. sem, gsem, irt, fmm ◮ Tree builders such as ID3 (Quinlan, 1986), C4.5 (Quinlan, 1993), CART
◮ Neural Networks (NN), Convolutional NN ◮ Boosting e.g. AdaBoost ◮ Bagging e.g. RandomForest
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Preliminaries Methods
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
14 16 18 20 22 24 Lot size 60 80 100 120 140 Income Nonowner Owner Linear discriminant
Predicting lawnmower ownership
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Netflix kaggle
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Trees Ensembles
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References Code outline Caveats Innovations
◮ Predict (noisily) the probability of treatment in each tree, ◮ Reweight within tree for ATE or ATT, then ◮ Average over trees for overall average impact estimate.
◮ out of sample prediction error rate, ◮ distance from average prediction, or ◮ permutations in one feature.
Austin Nichols Implementing machine learning methods in Stata
Introduction Examples Trees and Forests Stata approach References
Austin Nichols Implementing machine learning methods in Stata