Trees and forests
Econ 2148, fall 2019 Trees, forests, and causal trees
Maximilian Kasy
Department of Economics, Harvard University
1 / 16
Econ 2148, fall 2019 Trees, forests, and causal trees Maximilian - - PowerPoint PPT Presentation
Trees and forests Econ 2148, fall 2019 Trees, forests, and causal trees Maximilian Kasy Department of Economics, Harvard University 1 / 16 Trees and forests Agenda Regression trees: Splitting the covariate space. Random forests: Many
Trees and forests
1 / 16
Trees and forests
2 / 16
Trees and forests
3 / 16
Trees and forests Regression trees
4 / 16
Trees and forests Regression trees
5 / 16
Trees and forests Regression trees
◮ For each Rm, m = 1,...,M, and ◮ for each Xj, j = 1,...,k, ◮ find the xj,m that minimizes the mean squared error,
◮ Then pick the (m,j) that minimizes the mean squared error,
◮ Iterate.
6 / 16
Trees and forests Regression trees
◮ Fit a flexible tree (with large M) using CART. ◮ Then iteratively remove (collapse) nodes. ◮ To minimize the sum of squared errors,
7 / 16
Trees and forests Regression trees
◮ Repeatedly draw bootstrap samples (X b
i ,Y b i )n i=1 from the observed sample.
◮ For each bootstrap sample, fit a regression tree ˆ
◮ Average across bootstrap samples to get the predictor
B
b=1
8 / 16
Trees and forests Regression trees
9 / 16
Trees and forests Regression trees
10 / 16
Trees and forests Regression trees
11 / 16
Trees and forests Regression trees
12 / 16
Trees and forests Regression trees
13 / 16
Trees and forests Regression trees
14 / 16
Trees and forests Regression trees
15 / 16
Trees and forests References
16 / 16