Decision Trees. Petr Pošík, Czech Technical University in Prague

Decision Trees
Petr Pošík
Czech Technical University in Prague, Faculty of Electrical Engineering, Dept. of Cybernetics

This lecture is largely based on the book Artificial Intelligence: A Modern Approach, 3rd ed. by Stuart Russell and Peter Norvig (Prentice Hall, 2010).

P. Pošík © 2013, Artificial Intelligence

Decision Trees
  - What is a decision tree?
  - Attribute description
  - Expressiveness of decision trees
Learning a Decision Tree
Generalization and Overfitting
Broadening the Applicability of Decision Trees
Summary

What is a decision tree?

A decision tree
- is a function that
  - takes a vector of attribute values as its input, and
  - returns a "decision" as its output.
  - Both input and output values can be measured on nominal, ordinal, interval, and ratio scales, and can be discrete or continuous.
- The decision is formed via a sequence of tests:
  - each internal node of the tree represents a test,
  - the branches are labeled with the possible outcomes of the test, and
  - each leaf node represents a decision to be returned by the tree.

Examples of decision trees:
- classification schemata in biology (identification keys),
- diagnostic sections in illness encyclopedias,
- online troubleshooting sections on software web pages,
- ...
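The definition above can be sketched in a few lines of Python. This is only an illustration, not code from the lecture: the names `Node`, `Leaf`, and `decide`, as well as the troubleshooting tree and its attributes, are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass
class Leaf:
    decision: str  # value returned when this leaf is reached


@dataclass
class Node:
    attribute: str                            # attribute tested at this internal node
    branches: Dict[str, Union["Node", Leaf]]  # test outcome -> subtree


def decide(tree, example):
    """Follow the sequence of tests from the root until a leaf is reached."""
    node = tree
    while isinstance(node, Node):
        node = node.branches[example[node.attribute]]
    return node.decision


# Hypothetical troubleshooting tree (attributes and decisions are made up).
troubleshooting = Node("powers_on", {
    "no": Leaf("check the power cable"),
    "yes": Node("shows_error", {
        "yes": Leaf("look up the error code"),
        "no": Leaf("no action needed"),
    }),
})

print(decide(troubleshooting, {"powers_on": "yes", "shows_error": "yes"}))
```

Each call to `decide` performs exactly one test per level of the tree, so the cost of a decision is bounded by the depth of the tree rather than by the number of attributes.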

Attribute description

Example: A computer game.
The main character of the game meets various robots along his way. Some behave like allies, others like enemies.

head    body    smile  neck  holds    class
circle  circle  yes    tie   nothing  ally
circle  square  no     tie   sword    enemy
...     ...     ...    ...   ...      ...

The game engine may use e.g. the following tree to assign the ally or enemy attitude to the generated robots:

neck
  tie -> smile
    yes -> ally
    no -> enemy
  other -> body
    triangle -> ally
    other -> enemy
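As a small illustration, the tree on this slide (test neck first, then smile for robots with a tie and body for all others) can be written as nested conditions. The function name `robot_attitude` and the dictionary encoding of a robot are assumptions made for this sketch; the two robots are the two rows shown in the table.

```python
def robot_attitude(robot):
    """Classify a robot (a dict of attribute values) as 'ally' or 'enemy'."""
    if robot["neck"] == "tie":
        # Robots wearing a tie are judged by their smile.
        return "ally" if robot["smile"] == "yes" else "enemy"
    # All other robots are judged by the shape of their body.
    return "ally" if robot["body"] == "triangle" else "enemy"


# The two fully specified rows of the attribute table.
robots = [
    {"head": "circle", "body": "circle", "smile": "yes", "neck": "tie", "holds": "nothing"},
    {"head": "circle", "body": "square", "smile": "no", "neck": "tie", "holds": "sword"},
]

attitudes = [robot_attitude(r) for r in robots]
print(attitudes)  # ['ally', 'enemy']
```

Note that the attributes head and holds are never tested: a tree uses only the attributes it needs.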

Expressiveness of decision trees

The tree on the previous slide is a Boolean decision tree:
- the decision is a binary variable (true, false), and
- the attributes are discrete.
- It returns ally iff the input attributes satisfy one of the paths leading to an ally leaf:

    ally ⇔ (neck = tie ∧ smile = yes) ∨ (neck ≠ tie ∧ body = triangle),

  i.e. in general
  - Goal ⇔ (Path_1 ∨ Path_2 ∨ ...), where
  - each Path is a conjunction of attribute-value tests, i.e.
  - the tree is equivalent to a DNF (disjunctive normal form) of a function.

Any function in propositional logic can be expressed as a decision tree.
- Trees are a suitable representation for some functions and unsuitable for others.
- What is the cardinality of the set of Boolean functions of n attributes?
  - It is equal to the number of truth tables that can be created with n attributes.
  - The truth table has $2^n$ rows, i.e. there are $2^{2^n}$ different functions.
  - The set of trees is even larger; several trees represent the same function.
- We need a clever algorithm to find good hypotheses (trees) in such a large space.
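To get a feeling for how quickly the hypothesis space grows, the counting argument can be checked numerically. The sketch below assumes n Boolean attributes; the function names are illustrative, not taken from the lecture.

```python
from itertools import product


def count_boolean_functions(n):
    """A truth table over n Boolean attributes has 2**n rows,
    and each row can independently be labeled False or True."""
    rows = 2 ** n
    return 2 ** rows


def enumerate_boolean_functions(n):
    """Explicitly list every truth table (feasible only for very small n)."""
    inputs = list(product([False, True], repeat=n))
    return [
        dict(zip(inputs, outputs))
        for outputs in product([False, True], repeat=len(inputs))
    ]


counts = {n: count_boolean_functions(n) for n in range(1, 7)}
for n, c in counts.items():
    print(n, c)
```

Already for n = 6 there are 18446744073709551616 distinct functions, which is why an exhaustive search over hypotheses is out of the question.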

Learning a Decision Tree
  - A computer game
  - Alternative hypotheses
  - How to choose the best tree?
  - Learning a Decision Tree
  - Attribute importance
  - Choosing the test attribute
  - Choosing the test attribute (special case: binary classification)
  - Choosing the test attribute (example)
  - Choosing subsequent test attribute
  - Decision tree building procedure
  - Algorithm characteristics

A computer game

Example 1: Can you distinguish between allies and enemies after seeing a few of them?

[Figure: example robots, grouped into Allies and Enemies]

Hint: concentrate on the shapes of heads and bodies.

Answer: It seems that allies have the same shape of head and body.
- How would you represent this by a decision tree? (Relation among attributes.)
- How do you know that you are right?
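One possible answer to the representation question can be sketched as follows. A tree that tests a single attribute per node cannot test "head = body" directly; it has to test head first and then compare body against each concrete head value. The set of shapes below is an assumption (only circle, square, and triangle appear on the earlier slides), and all names are hypothetical.

```python
SHAPES = ["circle", "square", "triangle"]  # assumed set of possible shapes


def same_shape_tree(shapes):
    """Build a nested-dict tree: test head at the root, then body in every branch."""
    return {
        "head": {
            h: {"body": {b: ("ally" if b == h else "enemy") for b in shapes}}
            for h in shapes
        }
    }


def decide(tree, robot):
    """Walk from the root to a leaf; internal nodes are single-key dicts."""
    node = tree
    while isinstance(node, dict):
        (attribute, branches), = node.items()
        node = branches[robot[attribute]]
    return node


def count_leaves(tree):
    """Count the leaves (decisions) of the nested-dict tree."""
    if not isinstance(tree, dict):
        return 1
    (_, branches), = tree.items()
    return sum(count_leaves(subtree) for subtree in branches.values())


tree = same_shape_tree(SHAPES)
print(decide(tree, {"head": "square", "body": "square"}))  # ally
print(count_leaves(tree))  # 9
```

With k possible shapes this tree needs k * k leaves to express a concept that takes one sentence to state, which illustrates why relations among attributes are awkward for trees built from single-attribute tests.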

A computer game

Example 2: Some robots changed their attitudes:

[Figure: the robots regrouped into Allies and Enemies]