Data Mining Classification Trees (2)
Ad Feelders
Universiteit Utrecht
September 16, 2020
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 1 / 55
Data Mining Classification Trees (2) Ad Feelders Universiteit - - PowerPoint PPT Presentation
Data Mining Classification Trees (2) Ad Feelders Universiteit Utrecht September 16, 2020 Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 1 / 55 Basic Tree Construction Algorithm Construct tree nodelist {{ training
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 1 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 2 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 3 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 4 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 5 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 6 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 7 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 8 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 9 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 10 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 11 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 12 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 13 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 14 / 55
1 Cα(T(α)) = minT≤Tmax Cα(T)
2 If Cα(T) = Cα(T(α)) then T(α) ≤ T.
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 15 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 16 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 17 / 55
Tt(R(t′) + α) Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 18 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 19 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 20 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 21 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 22 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 23 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 24 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 25 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 26 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 27 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 28 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 29 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 30 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 31 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 32 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 33 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 34 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 35 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 36 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 37 / 55
1 Divide data into v folds. 2 Train on v − 1 folds. 3 Predict on the remaining fold. 4 Leave out each of the v folds in turn.
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 38 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 39 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 40 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 41 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 42 / 55
1 Perform cross-validation for different hyper-parameter settings
2 Compute prediction error for each parameter setting. 3 Pick setting with lowest error. 4 Train with selected setting on complete data set. Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 43 / 55
1 Divide the data into v groups G1, . . . , Gv. 2 For each value ci of C 1
1
2
2
3 Select the value c∗ of C with the smallest CV prediction error. 4 Train on the complete training sample with C = c∗ Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 44 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 45 / 55
1 Grow a tree on all data except Gj, and determine the smallest
2 Compute the error of T (j)(βk) (k = 1, . . . , K) on Gj. Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 46 / 55
1 For each βk, sum the errors of T (j)(βk) over Gj (j = 1, . . . , v). 2 Let βh be the one with the lowest overall error.
3 Use the error rate computed with cross-validation as an estimate of
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 47 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 48 / 55
1 Build a tree on all data except G1, and determine the smallest
2 Compute the error of those trees on G1.
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 49 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 50 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 51 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 52 / 55
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 53 / 55
X−val Relative Error 0.6 0.7 0.8 0.9 1.0 1.1 Inf 0.043 0.014 0.0099 0.0068 0.0053 0.0041 0.0026 1 2 3 6 10 13 17 20 24 29 43 46 51 89 97 129 size of tree
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 54 / 55
| plasma< 127.5 bmi< 29.95 plasma>=127.5 bmi>=29.95 500/268 391/94 1 109/174 52/24 1 57/150
Ad Feelders ( Universiteit Utrecht ) Data Mining September 16, 2020 55 / 55