

SLIDE 1

Lecture 4

Oct 6, 2008

SLIDE 2

Review from last lecture

  • Nearest neighbor classifier

– A lazy learning algorithm
– Decision boundary can be obtained from the Voronoi diagram of the training set
– Complex boundary, sensitive to label noise

  • K-nearest neighbor: how to select k? A model selection problem

– Use training error: bad idea
– Use validation error: better
– Use cross-validation error: even better (see the sketch after this list)

  • Issues with KNN

– Features need to be normalized to the same range
– Computational cost: high
– Irrelevant features: bad for kNN, which assumes all features are equally important
– Last but not least: finding the right distance metric can be difficult
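
A minimal sketch, assuming scikit-learn and its bundled Iris data as a stand-in dataset, of how k can be selected by cross-validation error with features normalized to the same range first (the slides do not prescribe a particular implementation):

```python
# Pick k for kNN by cross-validation error, after normalizing features.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)      # stand-in dataset for illustration
best_k, best_err = None, np.inf
for k in range(1, 16):
    # Normalize features to a common scale, then apply kNN with this k
    model = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=k))
    err = 1.0 - cross_val_score(model, X, y, cv=5).mean()   # cross-validation error
    if err < best_err:
        best_k, best_err = k, err
print(best_k, best_err)
```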

SLIDE 3
  • So far we’ve learned two classifiers

– Perceptron: LTU (linear threshold unit)
– KNN: complex decision boundary

  • We have paid special attention to some of the issues, such as

– Is the learning algorithm robust to outliers?
– Is the learning algorithm sensitive to irrelevant features?
– Is the algorithm computationally scalable?
– We will continue to pay attention to these issues as we introduce more learning algorithms

SLIDE 4

Decision Tree

  • One of the most popular off-the-shelf classifiers
SLIDE 5

What is a decision tree: an example

SLIDE 6

Definition

  • Internal nodes (rectangles)

– Each node represents a test on a particular attribute
– Multiple possible outcomes lead to branches of the tree
– For discrete attributes (e.g., outlook = sunny, overcast, or rain)

  • n possible values -> n branches

– Continuous attributes (temperature = 87 F)?

  • Leaf nodes (ellipses)

– Each assigns a class label to all examples that end up there (a minimal node-structure sketch follows)
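
The slides give no code, but one possible way to represent these two kinds of nodes, assuming discrete attribute values and string class labels, is sketched below. Calling classify(tree, example) walks from the root down the matching branches until it reaches a leaf and returns that leaf's label.

```python
# Minimal sketch (not from the slides): internal nodes test an attribute and
# branch on its value; leaf nodes assign a class label.
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Node:
    attribute: Optional[str] = None                             # internal node: attribute to test
    children: Dict[str, "Node"] = field(default_factory=dict)   # one branch per attribute value
    label: Optional[str] = None                                  # leaf node: class label


def classify(node: Node, example: Dict[str, str]) -> str:
    """Follow the branch matching the example's value for each tested attribute."""
    while node.label is None:
        node = node.children[example[node.attribute]]
    return node.label
```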

SLIDE 7

Decision Tree Decision Boundaries

[Figure: a decision tree with tests X2 < 1.5, X1 < 3.5, X1 < 1.5, and X2 < 3.5, and the corresponding axis-parallel partition of the (X1, X2) plane over a 0 to 4 range on each axis.]

  • Decision trees divide the input space into axis-parallel rectangles and label each rectangle with one of the K classes
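
To make the correspondence concrete, here is a minimal sketch of the tree from the figure written as nested tests; the class labels "A" and "B" are placeholders, since the slide does not name the classes.

```python
# Each root-to-leaf path of axis-parallel tests selects one rectangle of the
# (x1, x2) plane and labels it with a single class ("A"/"B" are placeholders).
def predict(x1: float, x2: float) -> str:
    if x2 < 1.5:
        return "A" if x1 < 3.5 else "B"
    if x1 < 1.5:
        return "A"
    return "B" if x2 < 3.5 else "A"
```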

SLIDE 8

Characteristics of Decision Trees

  • Decision trees have many appealing properties

– Similar to the human decision process, easy to understand
– Handle both discrete and continuous features
– Highly flexible hypothesis space: as the # of nodes (or the depth) of the tree increases, a decision tree can represent increasingly complex decision boundaries

Definition: Hypothesis space is the space of solutions that a learning algorithm can possibly output. For example,

  • For the perceptron: the hypothesis space is the space of all straight lines
  • For nearest neighbor: the hypothesis space is infinitely complex
  • For decision trees: it is a flexible space; as we increase the depth of the tree, the hypothesis space grows larger and larger

SLIDE 9

DT can represent arbitrarily complex decision boundaries

If needed, we can keep growing the tree until all examples are correctly classified, although it may not be the best idea.

SLIDE 10

How to learn decision trees

  • Goal: find a decision tree h that achieves the minimum misclassification error on the training data

  • As our previous slides suggest, we can always achieve this by using large trees

  • In fact, we can achieve this trivially: just create a decision tree with one path from root to leaf for each training example

– Problem: such a tree would just memorize the training data. It would not generalize to new data points, i.e., capture regularities that are applicable to unseen data

  • Alternatively: find the smallest tree h that minimizes training error

– Problem: This is NP-Hard

So far we have looked at what a decision tree is, what kind of decision boundaries decision trees produce, and their appealing properties. We now need to address how to learn a decision tree from data.
SLIDE 11

Greedy Learning For DT

There are different ways to construct trees from data. We will focus on the top-down, greedy search approach: instead of trying to optimize the whole tree at once, we try to find one test at a time. Basic idea (assuming discrete features; relaxed later), with a minimal code sketch after the steps below:

  • 1. Choose the best attribute a* to place at the root of the tree.

  • 2. Separate the training set S into subsets {S1, S2, ..., Sk} where each subset Si contains the examples having the same value for a*.

  • 3. Recursively apply the algorithm to each new subset until the examples have the same class or there are few of them.
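
A minimal sketch of these three steps, assuming discrete features and taking "best" to mean the split whose branches make the fewest training mistakes under a majority vote (the uncertainty measure discussed on the last slide); the slides themselves do not prescribe a particular implementation:

```python
from collections import Counter

def build_tree(examples, attributes):
    """examples: list of (features_dict, label) pairs; attributes: list of attribute names."""
    labels = [y for _, y in examples]
    majority = Counter(labels).most_common(1)[0][0]
    # Stop when the node is pure or there are no attributes left to test
    if len(set(labels)) == 1 or not attributes:
        return {"label": majority}

    # Step 1: greedily choose the attribute whose split leaves the fewest mistakes
    def split_mistakes(a):
        total = 0
        for v in {x[a] for x, _ in examples}:
            branch = [y for x, y in examples if x[a] == v]
            total += len(branch) - Counter(branch).most_common(1)[0][1]
        return total

    a_star = min(attributes, key=split_mistakes)

    # Steps 2 and 3: separate S by the value of a*, then recurse on each subset
    children = {}
    for v in {x[a_star] for x, _ in examples}:
        subset = [(x, y) for x, y in examples if x[a_star] == v]
        children[v] = build_tree(subset, [a for a in attributes if a != a_star])
    return {"attribute": a_star, "children": children}
```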

SLIDE 12

Building DT: an example

[Figure: 28 training examples from two classes plotted against features x and y, both in [0, 1]; 13 examples of one class and 15 of the other.]

The training data contains 13 examples of one class and 15 of the other. If we had to make a decision now, we'd pick the majority class. But there's too much uncertainty: based on the training data, with probability 13/28 I would be wrong. Now, if you are allowed to ask one question about your example to help the decision, which question would you ask?

SLIDE 13

[Figure: the same data split by a decision tree. The root has counts [13,15]; the test x < 0.5 splits it into [5,15] and [8,0], and the [5,15] node is further split by y < 0.5 into [4,0] and [1,15].]

One possible question: is x < 0.5? Now we feel much better, because the uncertainty in each leaf node is much reduced!
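
A small worked check of that reduction, using the counts from the figure and measuring uncertainty by the number of mistakes a majority vote would make (one way to quantify it, as a later slide suggests):

```python
# Counts are [# of one class, # of the other], taken from the figure above.
def mistakes(counts):
    return sum(counts) - max(counts)   # errors made by predicting the majority class

root = [13, 15]
after_x = [[5, 15], [8, 0]]            # leaves after asking "x < 0.5?"
after_xy = [[8, 0], [4, 0], [1, 15]]   # leaves after also asking "y < 0.5?"

print(mistakes(root))                        # 13 mistakes with no question asked
print(sum(mistakes(c) for c in after_x))     # 5 mistakes after one question
print(sum(mistakes(c) for c in after_xy))    # 1 mistake after two questions
```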

SLIDE 14

Building a decision tree

  • 1. Choose the best attribute a* to place at the root of the tree.

What do we mean by “best”? The attribute that reduces the most uncertainty about our decision on the class labels.

  • 2. Separate the training set S into subsets {S1, S2, ..., Sk} where each subset Si contains the examples having the same value for a*

  • 3. Recursively apply the algorithm to each new subset until all examples have the same class label

SLIDE 15

Choosing a split: example

[Figure: a candidate test on binary feature x1, with a table of the # of training examples having y = 0 and y = 1 on each side of the test; e.g., 1 + 3 = the # of training examples with x1 = 0.]

# of training mistakes can be used as a measure of uncertainty