Exam Review Introduction to Machine Learning T-529-ITME - PDF document

Exam Review Introduction to Machine Learning T-529-ITME Instructor: Dan Lizotte Exam Logistics  When: Tuesday, 15 May 2007 at 9:00am  Where: Ofanleiti 131a, 131b  Materials/aids: None. No books, no calculators, no laptops.  You don’t need to memorize formulas except as noted in this document , but you should know what they mean. 1

Introduction  What is classification?  What is regression?  What is the difference?  What do these have in common with Reinforcement learning?  They are all prediction problems.  What is different?  RL is Evaluative learning  Classification and Regression are Instructive learning  “Supervised” Learning  What is a “feature”? Decision Trees  Understand the meaning of Entropy  More entropy -> more uncertainty  Understand the meaning of Information Gain  IG = Entropy Before - Entropy After  Know how a tree is constructed  Choose a feature, split, choose a new feature, split…  When do we stop?  Know how to use a tree to classify an instance  Why is pruning important? 2

Decision Trees, General Classifier Stuff  Understand the difference between “Training Error” and “Test Error”  Why do we care about the difference?  Want to avoid overfitting.  Test set error is more representative of future error  How can we avoid overfitting?  Pruning  chi-squared test estimates “what is the probability we would see these data by accident?”  And therefore “Should we maybe just ignore this split?” PAC Learning  PAC Stands for…?  Know what a hypothesis space is  The space of all functions representable by your learning machine.  How to count a simple hypothesis space  Figure out what the independent choices are  e.g. “To include x i or not to include x i .”  Multiply the number of independent choices together 3

PAC Learning  Understand that if we have a hypothesis space of size H , and we want to have test error < ɛ with probability (1 - δ ) then we need R data points to guarantee this, where R � 1 � 1 � � log 2 H + log 2 � � � � �  BIG IDEA: Bigger hypothesis space needs more data. VC Dimension  When do we use VC dimension?  When H = ∞ , but we need to measure complexity.  Understand Shattering  Show how to shatter a given set of points with a given ( simple ) classifier  VC dimension = k if  Can shatter *some* set of k points. (You pick.)  Can not shatter *any* set of k+1 points. 4

VC Dimension  Understand that if we have a particular TRAINERR achieved on R data points, and the VC dimension of our classifier is h, then we know the following is true with probability (1 - η ): h (log(2 R / h ) + 1) � log( � /4) TESTERR � TRAINERR + R  Structural Risk Minimization is picking the classifier with the smallest bound VC Dimension  Again, notice that the more complex a classifier we have, the more data we need to guarantee good performance. 5

Cross-Validation  We want good performance on test data. Cross validation is a good way to estimate this performance.  Training error is too optimistic.  Understand  What a test set is  LOOCV - Leave One Out Cross Validation  k-Fold Cross Validation  Be able to explain how each of these works  Remember the folk-theorem:  You need about 10 times as much data as you have parameters in your model Density Estimators, Bayes Classifiers  Be able to compute simple probabilities  KNOW  0 <= P(A) <= 1  P(A or B) = P(A) + P(B) - P(A and B)  P(A|B) = P(A and B) / P(B)  Bayes Rule: P(A|B) = P(B|A)*P(A) / P(B) 6

Density Estimators, Bayes Classifiers  Be able to produce, given a small amount of data  A Joint Density Estimator or Bayes Classifier  A Naïve Density Estimator or Bayes Classifier  Be able to compute P(class = +) given  Joint Density estimates  Naïve Density estimates  KNOW  For naïve: P(A and B | C) = P(A|C) * P(B|C)  For joint: P(A and B | C) = look it up in your table Density Estimators, Bayes Classifiers  Know that, for m binary variables  Joint Density learns 2 m numbers  Therefore needs lots of data  Naïve Density learns m numbers  Therefore needs little data  But the Naïve Density Estimator is not very powerful  Assumes independence  Cannot capture relationships between variables 7

Support Vector Machines  Know what a linear separator is.  Given a weight vector w and constant b , and a data point x , KNOW how to classify that point.  class = sign( w ·x + b )  If I gave you a picture of some data points, draw the maximum margin separator, along with + and - planes, and indicate the margin.  Know what a support vector is. Support Vector Machines  Know what a slack variable is for  allows training points to be misclassified  Know why sometimes we use kernels  when training data are not linearly separable  Understand why using a kernel is like inventing new features  a.k.a. ‘basis functions’ 8

Reinforcement Learning  Understand the Big Four:  Policy  Reward  Value  Transition Model  Understand what TD learning is trying to do  Learn a good value function in order to learn a good policy  Know the difference between Sarsa and Q-learning  Understand on-policy vs. off-policy learning 9

Exam Review Introduction to Machine Learning T-529-ITME - PDF document

Exam Review Introduction to Machine Learning T-529-ITME Instructor: Dan Lizotte Exam Logistics When: Tuesday, 15 May 2007 at 9:00am Where: Ofanleiti 131a, 131b Materials/aids: None. No books, no calculators, no laptops. You

Exam4 Information and Guidance General Topics General Exam Information Exam types

Quicksort Sorting Lower Bound Exam Exam Exam Exam 2 2 tomorrow evening 2 2 tomorrow

Examination Lydia Love DVM DACVAA 2018 Exam Committee Chair September 2018 Exam Format

Announcements Announcements Final Exam will be a take Final Exam will be a take- -home exam

ICS 101 Final Exam Review Fall 2016 Final Exam information In lab: check final exam schedule

Exam 2 Review CS461/ECE422 Fall 2009 Exam guidelines Same as for first exam A single page

Exam Review 2 Exam Overview Final Exam Friday,

Math 211 Math 211 Review for the Final Exam December 8, 2002 2 The Final Exam The Final Exam

Operating Systems Operating Systems CMPSC 473 CMPSC 473 Exam 1 Review Exam 1 Review February

The Bohr Model of Hydrogen Exam Details The exam will be held Wednesday, October 5th from

The final exam Other finals review Final Exam Review CSH Review November 17 th

Vector Graphics Project Check out FilesAndExceptions from SVN Exam 2 review File I/O and

Review Final exam Final exam will be 11-12 problems, drop any 2 Cumulative up to and including

Exam review, Game of Life work time Turn in your questions on material in preparation for

Exam #2 Review sseshadr Agenda Administrative things Exam tomorrow, shouldve been

Lectur Lecture 20: e 20: DC M DC Motor otors Exam Exam 2 Results 2 Results Most M ost

for Compressed Imaging Chunli Guo, Mike E. Davies Institute for Digital Communications

Computing Equilibria Christos H. Papadimitriou UC Berkeley christos Games 1/3 1/3 1/3

Continuous Time Models of Repeated Games with Imperfect Public Monitoring Drew Fudenberg and

Repeated Games CMPUT 654: Modelling Human Strategic Behaviour S&LB 6.1 Recap: Imperfect

LEARNING IN GAMES WITH NOISY PAYOFF OBSERVATIONS Background and motivation Preliminaries The

Seminar: Automata Theory Timed Automata Jennifer Nist 11 th February 2016 Chair of Software

Counting minimal transversals in -acyclic Hypergraph Benjamin Bergougnoux, Florent Capelli ,

Comparisons of stochastic task-resource systems Bruno Gaujal Jean-Marc Vincent INRIA and LIG

Exam Review Introduction to Machine Learning T-529-ITME - PDF document

Exam Review Introduction to Machine Learning T-529-ITME Instructor: Dan Lizotte Exam Logistics When: Tuesday, 15 May 2007 at 9:00am Where: Ofanleiti 131a, 131b Materials/aids: None. No books, no calculators, no laptops. You

Exam4 Information and Guidance General Topics General Exam Information Exam types

Quicksort Sorting Lower Bound Exam Exam Exam Exam 2 2 tomorrow evening 2 2 tomorrow

Examination Lydia Love DVM DACVAA 2018 Exam Committee Chair September 2018 Exam Format

Announcements Announcements Final Exam will be a take Final Exam will be a take- -home exam

ICS 101 Final Exam Review Fall 2016 Final Exam information In lab: check final exam schedule

Exam 2 Review CS461/ECE422 Fall 2009 Exam guidelines Same as for first exam A single page

Exam Review 2 Exam Overview Final Exam Friday,

Math 211 Math 211 Review for the Final Exam December 8, 2002 2 The Final Exam The Final Exam

Operating Systems Operating Systems CMPSC 473 CMPSC 473 Exam 1 Review Exam 1 Review February

The Bohr Model of Hydrogen Exam Details The exam will be held Wednesday, October 5th from

The final exam Other finals review Final Exam Review CSH Review November 17 th

Vector Graphics Project Check out FilesAndExceptions from SVN Exam 2 review File I/O and

Review Final exam Final exam will be 11-12 problems, drop any 2 Cumulative up to and including

Exam review, Game of Life work time Turn in your questions on material in preparation for

Exam #2 Review sseshadr Agenda Administrative things Exam tomorrow, shouldve been

Lectur Lecture 20: e 20: DC M DC Motor otors Exam Exam 2 Results 2 Results Most M ost

for Compressed Imaging Chunli Guo, Mike E. Davies Institute for Digital Communications

Computing Equilibria Christos H. Papadimitriou UC Berkeley christos Games 1/3 1/3 1/3

Continuous Time Models of Repeated Games with Imperfect Public Monitoring Drew Fudenberg and

Repeated Games CMPUT 654: Modelling Human Strategic Behaviour S&amp;LB 6.1 Recap: Imperfect

LEARNING IN GAMES WITH NOISY PAYOFF OBSERVATIONS Background and motivation Preliminaries The

Seminar: Automata Theory Timed Automata Jennifer Nist 11 th February 2016 Chair of Software

Counting minimal transversals in -acyclic Hypergraph Benjamin Bergougnoux, Florent Capelli ,

Comparisons of stochastic task-resource systems Bruno Gaujal Jean-Marc Vincent INRIA and LIG

Repeated Games CMPUT 654: Modelling Human Strategic Behaviour S&LB 6.1 Recap: Imperfect