Learning from Examples Classification Regression Model Selection & Generalization Recap
Supervised Learning
Steven J Zeil
Old Dominion Univ.
Fall 2010
1 Learning from Examples Classification Regression Model Selection & Generalization Recap
Outline
1
Learning from Examples Motivation Aspects of Supervised Learning
2
Classification Hypotheses Noise Multiple Classes
3
Regression
4
Model Selection & Generalization
5
Recap
2 Learning from Examples Classification Regression Model Selection & Generalization Recap
Motivation
We have a collection of data Some instances of which are known to be examples of an interesting class We wish to learn a rule by which we can determine membership in that class.
3 Learning from Examples Classification Regression Model Selection & Generalization Recap
Example: Spam Detection
Training set: A collection of email messages Humans have flagged some as spam. Knowledge Extraction:
We scan the messages for “typical” spam vocabulary: names
- f drugs, “mortgage”, Nigeria, . . .
We also scan for irregularities in the formal content
- bfuscated URLs
routing via open SMTP relays mismatches between “From:” domain and IP address of
- rigin, etc.
Input Representation: x,
xi, 0 ≤ i < k: count of number of occurrences of spam term wi xi, k ≤ i < n: 1 if irregularity zi−k is present, 0 ow.
4