Xiyang Hu Cynthia Rudin Margo Seltzer
Carnegie Mellon University Duke University University of British Columbia
Optimal Sparse Decision Trees Xiyang Hu Cynthia Rudin Margo - - PowerPoint PPT Presentation
Optimal Sparse Decision Trees Xiyang Hu Cynthia Rudin Margo Seltzer Carnegie Mellon Duke University University of British University Columbia Decision Trees Decision Trees Decision Trees Should I click on the link in this email? Do I
Xiyang Hu Cynthia Rudin Margo Seltzer
Carnegie Mellon University Duke University University of British Columbia
Do I recognize the from address? Do the contents seem
Can I see the URL for the link?
Should I click on the link in this email?
Can I see the URL for the link? Do the contents seem
Do I recognize the from address?
Should I click on the link in this email?
ˆ L(tree,{(xi, yi)}i) = 1 n i=1
n
[tree(xi )¹yi] + C(#leaves in tree)
Misclassification error
ˆ L(tree,{(xi, yi)}i) = 1 n i=1
n
[tree(xi )¹yi] + C(#leaves in tree)
Misclassification error Sparsity
ˆ L(tree,{(xi, yi)}i) = 1 n i=1
n
[tree(xi )¹yi] + C(#leaves in tree)
(Broward County Recidivism Data)
Prior offenses > 3
no yes
Predict Arrest Age < 26 Predict No Arrest
yes no
Prior Offenses > 1
no yes
Any juvenile crimes? Predict Arrest Predict No Arrest
yes no
Predict Arrest
Prior offenses > 3
no yes
Predict Arrest Age > 70
yes no
Prior Offenses > 2
Prior offenses > 3
no yes
Predict Arrest Age > 70
yes no
Prior Offenses > 2
Node support insufficient to produce
Prior offenses > 3
no yes
Predict Arrest Age > 70
yes no
Prior Offenses > 2
Node support insufficient to produce
Prior offenses > 3
no yes
Predict Arrest Felony > 5
yes no
Predict Arrest
Prior offenses > 3
no yes
Predict Arrest Felony > 5
yes no
Predict Arrest
Doesn’t classify at least Cn points correctly.
Prior offenses > 3
no yes
Predict Arrest Felony > 5
yes no
Predict Arrest
Doesn’t classify at least Cn points correctly.
Prior offenses > 3
no yes
Age > 18
yes no
Predict Arrest Predict No Arrest Age > 18
yes no
Predict No Arrest Predict Arrest Age > 18
no yes
Prior offenses > 3
yes no
Predict Arrest Predict No Arrest Prior offenses > 3
yes no
Predict No Arrest Predict Arrest
Prior offenses > 3
no yes
Age > 18
yes no
Predict Arrest Predict No Arrest Age > 18
yes no
Predict No Arrest Predict Arrest Age > 18
no yes
Prior offenses > 3
yes no
Predict Arrest Predict No Arrest Prior offenses > 3
yes no
Predict No Arrest Predict Arrest
Prior offenses > 3
no yes
Age > 18
yes no
Predict Arrest Predict No Arrest Age > 18
yes no
Predict No Arrest Predict Arrest Age > 18
no yes
Prior offenses > 3
yes no
Predict Arrest Predict No Arrest Prior offenses > 3
yes no
Predict No Arrest Predict Arrest
Prior offenses > 3
no yes
Age > 18
yes no
Predict Arrest Predict No Arrest Age > 18
yes no
Predict No Arrest Predict Arrest Age > 18
no yes
Prior offenses > 3
yes no
Predict Arrest Predict No Arrest Prior offenses > 3
yes no
Predict No Arrest Predict Arrest
Prior offenses > 3
no yes
Age > 18
yes no
Predict Arrest Predict No Arrest Age > 18
yes no
Predict No Arrest Predict Arrest Age > 18
no yes
Prior offenses > 3
yes no
Predict Arrest Predict No Arrest Prior offenses > 3
yes no
Predict No Arrest Predict Arrest
– Equivalent points bound: Samples with the same features, but different predictions will produce misclassifications regardless of model. – Bound on the number of leaves: Regularization value bounds the number of leaves.
https://github.com/xiyanghu/OSDT