Machine Learning Algorithms for Classification
Rob
Machine Learning

- studies how to automatically learn to make accurate predictions based on past observations
- classification problems:
  - classify examples into given set of categories

[diagram: labeled training examples → machine learning algorithm → classification rule → predicted classification for new example]
Examples of Classification Problems
- text categorization
- e.g.: spam filtering
- e.g.: categorize news articles by topic
- fraud detection
- optical character recognition
- natural-language processing
- e.g.: part-of-speech tagging
- e.g.: spoken language understanding
- market segmentation
- e.g.: predict if customer will respond to promotion
- e.g.: predict if customer will switch to competitor
- medical diagnosis
. . .
Why Use Machine Learning?

- advantages:
  - often much more accurate than human-crafted rules (since data driven)
  - humans often incapable of expressing what they know (e.g., rules of English, or how to recognize letters), but can easily classify examples
  - don’t need a human expert or programmer
  - flexible: can apply to any learning task
  - cheap: can use in applications requiring many classifiers (e.g., one per customer, one per product, one per web page, ...)
- disadvantages:
  - need a lot of labeled data
  - error prone: usually impossible to get perfect accuracy
Machine Learning Algorithms

- this talk:
  - decision trees
  - boosting
  - support-vector machines
  - neural networks
- others not covered:
  - nearest neighbor algorithms
  - Naive Bayes
  - bagging
  . . .
Decision Trees

Example: Good versus Evil
- problem: identify people as good or bad from their appearance

training data:
             sex     mask  cape  tie  ears  smokes  class
  batman     male    yes   yes   no   yes   no      Good
  robin      male    yes   yes   no   no    no      Good
  alfred     male    no    no    yes  no    no      Good
  penguin    male    no    no    yes  no    yes     Bad
  catwoman   female  yes   no    no   yes   no      Bad
  joker      male    no    no    no   no    no      Bad

test data:
  batgirl    female  yes   yes   no   yes   no      ??
  riddler    male    yes   no    no   no    no      ??
Example (cont.)

[decision tree: split on “tie”; no tie → split on “cape” (cape: Good, no cape: Bad); tie → split on “smokes” (smokes: Bad, doesn’t smoke: Good)]
How to Build Decision Trees
- choose rule to split on
- divide data using splitting rule into disjoint subsets
- repeat recursively for each subset
- stop when leaves are (almost) “pure”
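The recipe above can be sketched in a few lines of code. This is a minimal, hypothetical implementation (not C4.5 or CART), using the Gini index as the purity measure and the Good-versus-Evil training data from the earlier slide; all function names are my own:

```python
# Minimal sketch of greedy decision-tree building (illustrative, not C4.5/CART).
from collections import Counter

def gini(labels):
    """Gini impurity; 0 when the subset is pure."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Pick the (feature, value) test giving the greatest decrease in impurity."""
    best, best_score = None, gini(labels)
    for f in range(len(rows[0])):
        for v in set(r[f] for r in rows):
            yes = [y for r, y in zip(rows, labels) if r[f] == v]
            no = [y for r, y in zip(rows, labels) if r[f] != v]
            if not yes or not no:
                continue
            score = (len(yes) * gini(yes) + len(no) * gini(no)) / len(labels)
            if score < best_score:
                best, best_score = (f, v), score
    return best

def build_tree(rows, labels):
    """Split on the best rule, then recurse on each disjoint subset."""
    split = best_split(rows, labels)
    if split is None:  # no split increases purity: leaf predicting majority class
        return Counter(labels).most_common(1)[0][0]
    f, v = split
    yes = [(r, y) for r, y in zip(rows, labels) if r[f] == v]
    no = [(r, y) for r, y in zip(rows, labels) if r[f] != v]
    return (f, v,
            build_tree([r for r, _ in yes], [y for _, y in yes]),
            build_tree([r for r, _ in no], [y for _, y in no]))

def predict(tree, row):
    while isinstance(tree, tuple):
        f, v, yes, no = tree
        tree = yes if row[f] == v else no
    return tree

# Good-versus-Evil training data (features: sex, mask, cape, tie, ears, smokes)
rows = [("male", "yes", "yes", "no", "yes", "no"),   # batman
        ("male", "yes", "yes", "no", "no", "no"),    # robin
        ("male", "no", "no", "yes", "no", "no"),     # alfred
        ("male", "no", "no", "yes", "no", "yes"),    # penguin
        ("female", "yes", "no", "no", "yes", "no"),  # catwoman
        ("male", "no", "no", "no", "no", "no")]      # joker
labels = ["Good", "Good", "Good", "Bad", "Bad", "Bad"]
tree = build_tree(rows, labels)
```

On this data the sketch reaches zero training error and, like the hand-drawn tree, predicts "Good" for batgirl and "Bad" for riddler.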
Choosing the Splitting Rule

- choose rule that leads to greatest increase in “purity”

Choosing the Splitting Rule (cont.)

- (im)purity measures:
  - entropy: −p+ ln p+ − p− ln p−
  - Gini index: p+ p−
  (where p+ / p− = fraction of positive / negative examples)

[plot: impurity as a function of p+, maximized at p+ = 1/2]
Kinds of Error Rates

- training error = fraction of training examples misclassified
- test error = fraction of test examples misclassified
- generalization error = probability of misclassifying new random example
Tree Size versus Accuracy

[plots: accuracy and error on training and test data versus tree size; training error keeps falling as the tree grows, while test error falls and then rises again]

- trees must be big enough to fit training data (so that “true” patterns are fully captured)
- BUT: trees that are too big may overfit (capture noise or spurious patterns in the data)
- significant problem: can’t tell best tree size from training error
Overfitting Example

- fitting points with a polynomial

[plots: underfit (degree = 1), ideal fit (degree = 3), overfit (degree = 20)]
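The same effect is easy to reproduce numerically. In this hypothetical toy setup (the “true” curve, the noise level, and the sample points are all invented for illustration), a polynomial with as many degrees of freedom as data points fits the training set exactly but strays from the true curve between the points:

```python
# Overfitting demo: an exact-degree interpolating polynomial has zero
# training error but larger error away from the training points.
import random

random.seed(0)
true_f = lambda x: x ** 3 - x              # assumed "true" pattern
xs = [i / 4.0 - 1.0 for i in range(9)]     # 9 training inputs in [-1, 1]
ys = [true_f(x) + random.gauss(0, 0.05) for x in xs]

def interpolate(x):
    """Degree-8 Lagrange polynomial passing through all 9 noisy points."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total

# exact on the training points themselves...
train_err = max(abs(interpolate(x) - y) for x, y in zip(xs, ys))
# ...but it can stray from the true curve in between
test_xs = [-0.9, -0.6, -0.1, 0.1, 0.6, 0.9]
test_err = max(abs(interpolate(x) - true_f(x)) for x in test_xs)
```

The interpolant is chasing the noise in the labels, which is exactly what an over-large decision tree does with spurious splits.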
Building an Accurate Classifier

- for good test performance, need:
  - enough training examples
  - good performance on training set
  - classifier that is not too “complex” (“Occam’s razor”)
- measure “complexity” by:
  - number of bits needed to write down
  - number of parameters
  - VC-dimension
Example

training data: [scatter plot of labeled examples]

Good and Bad Classifiers

Good:
- sufficient data
- low training error
- simple classifier

Bad:
- insufficient data
- training error too high
- classifier too complex
Theory

- can prove that, with high probability:
  (generalization error) ≤ (training error) + Õ(√(d/m))
  - d = VC-dimension
  - m = number of training examples
Controlling Tree Size

- typical approach: build very large tree that fully fits training data, then prune back
- pruning strategies:
  - grow on just part of training data, then find pruning with minimum error on held-out part
  - find pruning that minimizes (training error) + constant · (tree size)
Decision Trees
- best known:
- C4.5 (Quinlan)
- CART (Breiman, Friedman, Olshen & Stone)
- very fast to train and evaluate
- relatively easy to interpret
- but: accuracy often not state-of-the-art
Boosting

Example: Spam Filtering

- problem: filter out spam (junk email)
- gather large collection of examples of spam and non-spam:

  From: yoav@att.com       “Rob, can you review a paper...”       → non-spam
  From: xa412@hotmail.com  “Earn money without working!!!! ...”   → spam
  . . .

- main observation:
  - easy to find “rules of thumb” that are “often” correct
    - if ‘buy now’ occurs in message, then predict ‘spam’
  - hard to find single rule that is very highly accurate
The Boosting Approach
- devise computer program for deriving rough rules of thumb
- apply procedure to subset of emails
- obtain rule of thumb
- apply to 2nd subset of emails
- obtain 2nd rule of thumb
- repeat T times
Details

- how to choose examples on each round?
  - concentrate on “hardest” examples (those most often misclassified by previous rules of thumb)
- how to combine rules of thumb into single prediction rule?
  - take (weighted) majority vote of rules of thumb
Boosting

- boosting = general method of converting rough rules of thumb into highly accurate prediction rule
- technically:
  - assume given “weak” learning algorithm that can consistently find classifiers (“rules of thumb”) at least slightly better than random, say, accuracy ≥ 55%
  - given sufficient data, a boosting algorithm can provably construct single classifier with very high accuracy, say, 99%
AdaBoost

- given training examples (xi, yi) where yi ∈ {−1, +1}
- initialize D1 = uniform distribution on training examples
- for t = 1, . . . , T:
  - train weak classifier (“rule of thumb”) ht on Dt
  - choose αt > 0
  - compute new distribution Dt+1: for each example i, multiply Dt(i) by
      e^(−αt) (< 1) if yi = ht(xi)
      e^(+αt) (> 1) if yi ≠ ht(xi)
    then renormalize
- output final classifier: Hfinal(x) = sign(Σt αt ht(x))
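The pseudocode above translates directly into a compact sketch. Here the weak learner is a decision stump (a threshold on a single feature), and the stump learner, helper names, and toy data are my own illustrative choices, not from the talk:

```python
# AdaBoost as pseudocoded above, with decision stumps as weak classifiers.
import math

def train_stump(xs, ys, d):
    """Weak learner: single-feature threshold rule with lowest weighted error."""
    best_h, best_err = None, 1.0
    for f in range(len(xs[0])):
        for thr in set(x[f] for x in xs):
            for s in (+1, -1):
                h = lambda x, f=f, thr=thr, s=s: s if x[f] <= thr else -s
                err = sum(w for x, y, w in zip(xs, ys, d) if h(x) != y)
                if err < best_err:
                    best_h, best_err = h, err
    return best_h, best_err

def adaboost(xs, ys, T):
    m = len(xs)
    d = [1.0 / m] * m                      # D1 = uniform
    hs, alphas = [], []
    for _ in range(T):
        h, eps = train_stump(xs, ys, d)
        if eps >= 0.5:                     # weak learning assumption violated
            break
        if eps == 0:                       # a perfect weak classifier: done
            hs, alphas = [h], [1.0]
            break
        alpha = 0.5 * math.log((1 - eps) / eps)
        # multiply weight by e^(-alpha) if correct, e^(+alpha) if wrong
        d = [w * math.exp(-alpha if h(x) == y else alpha)
             for x, y, w in zip(xs, ys, d)]
        z = sum(d)                         # renormalize
        d = [w / z for w in d]
        hs.append(h)
        alphas.append(alpha)
    # final classifier: sign of the weighted vote
    return lambda x: 1 if sum(a * h(x) for a, h in zip(alphas, hs)) >= 0 else -1

# toy data: two clusters separable by a vertical half-plane
xs = [(0, 0), (1, 0), (0, 1), (3, 2), (4, 2), (3, 3)]
ys = [-1, -1, -1, 1, 1, 1]
H = adaboost(xs, ys, T=5)
```

The choice αt = ½ ln((1 − εt)/εt) used here is the standard AdaBoost weighting; it matches the (ε, α) pairs in the toy example rounds that follow.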
Toy Example

[figure: toy data points under the initial uniform distribution D1]

- weak classifiers = vertical or horizontal half-planes
Round 1

[figure: weak classifier h1 with error ε1 = 0.30, weight α1 = 0.42; misclassified points get larger weight in D2]

Round 2

[figure: weak classifier h2 with error ε2 = 0.21, weight α2 = 0.65; reweighted distribution D3]

Round 3

[figure: weak classifier h3 with error ε3 = 0.14, weight α3 = 0.92]

Final Classifier

[figure: Hfinal = sign(0.42·h1 + 0.65·h2 + 0.92·h3)]
Theory: Training Error

- weak learning assumption: each weak classifier at least slightly better than random
  - i.e., (error of ht on Dt) ≤ 1/2 − γ for some γ > 0
- given this assumption, can prove:
  training error(Hfinal) ≤ e^(−2γ²T)
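To get a feel for the bound, plug in a barely-better-than-random weak learner, say 55% accuracy (γ = 0.05, my example value): the guaranteed training error still decays exponentially in the number of rounds T.

```python
# Evaluate the bound e^(-2 * gamma^2 * T) for gamma = 0.05 (55% weak accuracy).
import math

gamma = 0.05
bound = {T: math.exp(-2 * gamma ** 2 * T) for T in (100, 500, 1000)}
# bound[100] is about 0.61, bound[500] about 0.08, bound[1000] about 0.007
```

So even very weak rules of thumb, boosted for a thousand rounds, are guaranteed (under the weak learning assumption) to nearly fit the training data.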
How Will Test Error Behave? (A First Guess)

[plot: hypothesized train and test error versus number of rounds T; training error drops toward zero while test error eventually rises]

- expect:
  - training error to continue to drop (or reach zero)
  - test error to increase when Hfinal becomes “too complex” (overfitting)
Actual Typical Run

[plot: train and test error versus number of rounds T, boosting C4.5 on the “letter” dataset]

- test error does not increase, even after 1000 rounds
  - (total size > 2,000,000 nodes)
- test error continues to drop even after training error is zero!

  # rounds      5     100    1000
  train error   0.0   0.0    0.0
  test error    8.4   3.3    3.1
The Margins Explanation

- key idea:
  - training error only measures whether classifications are right or wrong
  - should also consider confidence of classifications
- recall: Hfinal is weighted majority vote of weak classifiers
- measure confidence by margin = strength of the vote
- empirical evidence and mathematical proof that:
  - large margins ⇒ better generalization error (regardless of number of rounds)
  - boosting tends to increase margins of training examples (given weak learning assumption)
Boosting

- fast (but not quite as fast as other methods)
- simple and easy to program
- flexible: can combine with any learning algorithm, e.g.
- C4.5
- very simple rules of thumb
- provable guarantees
- state-of-the-art accuracy
- tends not to overfit (but occasionally does)
- many applications
Support-Vector Machines

Geometry of SVM’s

- given linearly separable data
- margin = distance to separating hyperplane
- choose hyperplane that maximizes minimum margin
- intuitively:
  - want to separate +’s from −’s as much as possible
  - margin = measure of confidence
Theoretical Justification

- let γ = minimum margin, R = radius of enclosing sphere
- then VC-dim ≤ (R/γ)²
- so larger margins ⇒ lower “complexity”
  - independent of number of dimensions
- in contrast, unconstrained hyperplanes in Rⁿ have VC-dim = (# parameters) = n + 1
Finding the Maximum Margin Hyperplane

- examples (xi, yi) where yi ∈ {−1, +1}
- find hyperplane v · x = 0 with ‖v‖ = 1
- margin = y(v · x)
- maximize: γ
  subject to: yi(v · xi) ≥ γ and ‖v‖ = 1
- set w ← v/γ ⇒ γ = 1/‖w‖
- minimize: (1/2)‖w‖²
  subject to: yi(w · xi) ≥ 1
Convex Dual

- form Lagrangian, set ∂/∂w = 0
- get quadratic program:
  maximize: Σi αi − (1/2) Σi,j αi αj yi yj (xi · xj)
  subject to: αi ≥ 0
- w = Σi αi yi xi
- αi = Lagrange multiplier; αi > 0 ⇒ support vector
- key points:
  - optimal w is linear combination of support vectors
  - dependence on xi’s only through inner products
  - maximization problem is convex with no local maxima
What If Not Linearly Separable?

- answer #1: penalize each point by distance from margin 1, i.e., minimize:
  (1/2)‖w‖² + constant · Σi max{0, 1 − yi(w · xi)}
- answer #2: map into higher dimensional space in which data becomes linearly separable
Example

- not linearly separable
- map x = (x1, x2) → Φ(x) = (1, x1, x2, x1x2, x1², x2²)
- hyperplane in mapped space has form
  a + bx1 + cx2 + dx1x2 + ex1² + fx2² = 0
  = conic in original space
- linearly separable in mapped space
Higher Dimensions Don’t (Necessarily) Hurt

- may project to very high dimensional space
- statistically, may not hurt since VC-dimension independent of number of dimensions ((R/γ)²)
- computationally, only need to be able to compute inner products Φ(x) · Φ(z)
  - sometimes can do very efficiently using kernels
Example (cont.)

- modify Φ slightly: Φ(x) = (1, √2·x1, √2·x2, √2·x1x2, x1², x2²)
- then
  Φ(x) · Φ(z) = 1 + 2x1z1 + 2x2z2 + 2x1x2z1z2 + x1²z1² + x2²z2²
              = (1 + x1z1 + x2z2)²
              = (1 + x · z)²
- in general, for polynomial of degree d, use (1 + x · z)^d
- very efficient, even though finding hyperplane in O(n^d) dimensions
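The identity is easy to check numerically. The sketch below (with randomly chosen vectors, my own test setup) compares the explicit 6-dimensional inner product Φ(x) · Φ(z) against the two-operation kernel (1 + x · z)²:

```python
# Verify Φ(x)·Φ(z) = (1 + x·z)^2 for the degree-2 feature map above.
import math
import random

def phi(x):
    x1, x2 = x
    return (1.0, math.sqrt(2) * x1, math.sqrt(2) * x2,
            math.sqrt(2) * x1 * x2, x1 ** 2, x2 ** 2)

def poly_kernel(x, z):
    return (1.0 + x[0] * z[0] + x[1] * z[1]) ** 2

random.seed(1)
max_diff = 0.0
for _ in range(1000):
    x = (random.uniform(-2, 2), random.uniform(-2, 2))
    z = (random.uniform(-2, 2), random.uniform(-2, 2))
    explicit = sum(a * b for a, b in zip(phi(x), phi(z)))  # 6-dim inner product
    max_diff = max(max_diff, abs(explicit - poly_kernel(x, z)))
```

The kernel touches only the original two coordinates, yet returns the mapped-space inner product exactly; for degree d the saving grows to the O(n^d) dimensions never constructed.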
Kernels

- kernel = function K for computing K(x, z) = Φ(x) · Φ(z)
- permits efficient computation of SVM’s in very high dimensions
- K can be any symmetric, positive semi-definite function (Mercer’s theorem)
- some kernels:
  - polynomials
  - Gaussian: exp(−‖x − z‖² / 2σ²)
  - defined over structures (trees, strings, sequences, etc.)
- evaluation:
  w · Φ(x) = Σi αi yi Φ(xi) · Φ(x) = Σi αi yi K(xi, x)
  - time depends on # support vectors
SVM’s versus Boosting

- both are large-margin classifiers (although with slightly different definitions of margin)
- both work in very high dimensional spaces (in boosting, dimensions correspond to weak classifiers)
- but different tricks are used:
  - SVM’s use the kernel trick
  - boosting relies on weak learner to select one dimension (i.e., weak classifier) to add to combined classifier
SVM’s

- fast algorithms now available, but not so simple to program (though good packages available)
- state-of-the-art accuracy
- power and flexibility from kernels
- theoretical justification
- many applications
Neural Networks

The Neural Analogy

- perceptron (= linear threshold function) looks a lot like a neuron
  - other neurons fire (inputs)
  - when electrical potential exceeds threshold, fires (output)
- inputs: a1, . . . , an ∈ {0, 1}
- weights: w1, . . . , wn ∈ R
- “activation” = 1 if Σi wi ai > θ, 0 else
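A perceptron unit is one line of code. With hand-picked weights and thresholds (my own illustrative choices, not learned), it computes simple Boolean functions:

```python
# A linear threshold unit: fire (output 1) iff the weighted sum exceeds theta.
def perceptron(a, w, theta):
    return 1 if sum(wi * ai for wi, ai in zip(w, a)) > theta else 0

# hand-chosen weights/thresholds (illustrative): AND and OR of two inputs
AND = lambda a: perceptron(a, (1.0, 1.0), 1.5)
OR = lambda a: perceptron(a, (1.0, 1.0), 0.5)
```

A single unit can only represent linearly separable functions (AND, OR, but famously not XOR), which is why the next slide puts units together in a network.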
A Network of Neurons

- idea: put perceptrons in network

[diagram: inputs x1 . . . x5 feeding a hidden layer, feeding output h, with weights w on the edges]

- weights on every edge
- each unit = perceptron
- dramatic increase in representation power (not necessarily a good thing for learning)
- great flexibility in choice of architecture
Perceptron Units

[diagram: unit computing g(Σi wi ai − θ)]
[plot: g(x) = step function, jumping from 0 to 1 at x = 0]

- problem: overall network computation is horribly discontinuous because of g
- optimizing network weights easier when everything continuous
Smoothed Threshold Functions

- idea: approximate g with smoothed threshold function

[plot: smooth sigmoid-shaped g(x)]

- e.g., use g(x) = 1 / (1 + e^(−x))
- now hw(x) is continuous and differentiable in both inputs x and weights w
Finding Weights

- given (x1, y1), . . . , (xm, ym) where yi ∈ {0, 1}
- how to find weights w?
- want network output hw(xi) “close” to yi
- typical measure of closeness:
  “energy” E(w) = Σi (hw(xi) − yi)²
Minimizing Energy

- E is a continuous and differentiable function of w
- minimize using gradient descent:
  - start with any w
  - repeatedly adjust w by taking tiny steps in direction of steepest descent
- easy to compute gradients
  - turns out to have simple recursive form in which error signal is backpropagated from output to inputs
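Here is a minimal sketch of gradient descent on E(w) for the simplest possible network, a single sigmoid unit with a bias weight. The toy data, step size, and iteration count are invented for illustration; a real multilayer network would compute the same gradients layer by layer via backpropagation:

```python
# Batch gradient descent minimizing E(w) = sum_i (h_w(x_i) - y_i)^2
# for one sigmoid unit with a bias weight.
import math

def g(x):                                  # smoothed threshold function
    return 1.0 / (1.0 + math.exp(-x))

def h(w, x):                               # unit output; w[2] is the bias
    return g(w[0] * x[0] + w[1] * x[1] + w[2])

# toy labeled data (invented): label 1 roughly when x1 + x2 is large
data = [((0.0, 0.0), 0), ((1.0, 0.0), 0), ((0.0, 1.0), 0),
        ((0.1, 0.2), 0), ((1.0, 1.0), 1), ((0.9, 0.9), 1)]

w = [0.0, 0.0, 0.0]
eta = 0.3                                  # step size for the "tiny steps"
for _ in range(10000):
    grad = [0.0, 0.0, 0.0]
    for x, y in data:
        out = h(w, x)
        delta = 2 * (out - y) * out * (1 - out)  # chain rule: dE/d(net input)
        grad[0] += delta * x[0]
        grad[1] += delta * x[1]
        grad[2] += delta                         # bias input is fixed at 1
    w = [wi - eta * gi for wi, gi in zip(w, grad)]

energy = sum((h(w, x) - y) ** 2 for x, y in data)
```

The `delta` term is the error signal that backpropagation would pass backward through earlier layers; with only one unit, the recursion bottoms out immediately.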
Implementation Details

- often do gradient descent step based just on single example (and repeat for all examples in training set)
- often slow to converge
  - speed up using techniques like conjugate gradient descent
- can get stuck in local minima or large flat regions
- can overfit
  - use regularization to keep weights from getting too large:
    E(w) = Σi (hw(xi) − yi)² + β‖w‖²
Neural Nets

- can be slow to converge
- can be difficult to get right architecture, and difficult to tune parameters
- not state-of-the-art as a general method
- with proper care, can do very well on particular problems, often with specialized architecture
Further reading on machine learning in general:
- Luc Devroye, László Györfi and Gábor Lugosi. A Probabilistic Theory of Pattern Recognition. Springer, 1996.
- Richard O. Duda, Peter E. Hart and David G. Stork. Pattern Classification (2nd ed.). Wiley, 2000.
- Trevor Hastie, Robert Tibshirani and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2001.
- Michael J. Kearns and Umesh V. Vazirani. An Introduction to Computational Learning Theory. MIT Press, 1994.
- Tom M. Mitchell. Machine Learning. McGraw Hill, 1997.
- Vladimir N. Vapnik. Statistical Learning Theory. Wiley, 1998.

Decision trees:
- Leo Breiman, Jerome H. Friedman, Richard A. Olshen and Charles J. Stone. Classification and Regression Trees. Wadsworth & Brooks, 1984.
- J. Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.

Boosting:
- Robert E. Schapire. The boosting approach to machine learning: An overview. In MSRI Workshop on Nonlinear Estimation and Classification, 2002. Available from www.cs.princeton.edu/~schapire/boost.html.
- Many more papers, tutorials, etc. available at www.boosting.org.

Support-vector machines:
- Nello Cristianini and John Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, 2000. See www.support-vector.net.
- Many more papers, tutorials, etc. available at www.kernel-machines.org.

Neural nets:
- Christopher M. Bishop. Neural Networks for Pattern Recognition. Oxford University Press, 1995.