Inductive Learning Algorithms and Representations for Text - - PowerPoint PPT Presentation

inductive learning algorithms and representations for
SMART_READER_LITE
LIVE PREVIEW

Inductive Learning Algorithms and Representations for Text - - PowerPoint PPT Presentation

Inductive Learning Algorithms and Representations for Text Categorization David Heckerman Susan Dumais John Platt Mehran Sahami Presenter: Haoran Hou Text Categorization real-time sorting emails/files topic identification structured search


slide-1
SLIDE 1

Inductive Learning Algorithms and Representations for Text Categorization

Presenter: Haoran Hou

Susan Dumais John Platt David Heckerman Mehran Sahami

slide-2
SLIDE 2

Text Categorization

real-time sorting emails/files topic identification structured search and/or browsing finding documents that match long-term standing interests

slide-3
SLIDE 3

Old School

Dewey Decimal MeSH(Medical Subject Headings) Yahoo!’s topic hierarchy CyberPatrol

slide-4
SLIDE 4

Inductive Learning Methods Evaluation Results & Others

slide-5
SLIDE 5

Data:

a collection of hand-tagged financial newswire stories from Reuters. http://www.research.att.com/~lewis/reuters21578.html (no longer available)

slide-6
SLIDE 6

Inductive Learning Methods

Classifiers Inductive Learning of Classifiers

Inductive Learning Methods

slide-7
SLIDE 7

Inductive Learning Methods

Classifiers

f(→┬x) = confidence(class) →┬x = (x1,x2,x3…xn)

Classifiers

  • eg. class- interest

if (interest AND rate) OR (quarterly), then confidence(cat interest) = 0.9

confidence(interest cat) = 0.3*interest + 0.4*rate + 0.7*quarterly

slide-8
SLIDE 8

Inductive Learning Methods

Inductive Learning of Classifiers

Find Similar (a variant of Rocchio’s method for relevance feedback) Decision Tree Naive Bayes Naive Nets SVM

*All methods require only on a small amount of labeled training data

The effectiveness of the model is tested on previously unseen instances.

slide-9
SLIDE 9

Inductive Learning Methods

Inductive Learning of Classifiers

Find Similar (a variant of Rocchio’s method for relevance feedback)

  • tf*idf
  • all features used

*no error minimization is applied

slide-10
SLIDE 10

Inductive Learning Methods

Inductive Learning of Classifiers

Feature selection SVM: K = 300 The remaining: K = 50 Only binary feature values are used

slide-11
SLIDE 11

Inductive Learning Methods

Inductive Learning of Classifiers

Decision Tree Recursive greedy splitting Bayesian posterior probability Node class probability

slide-12
SLIDE 12

Inductive Learning Methods

Inductive Learning of Classifiers

Naive Bayes

Assume the features X1,….Xn are conditionally independent

slide-13
SLIDE 13

Inductive Learning Methods

Inductive Learning of Classifiers

Bayes Nets 2-dependence Bayesian classifier

slide-14
SLIDE 14

Inductive Learning Methods

Inductive Learning of Classifiers

SVM Simplest linear version

slide-15
SLIDE 15

Evaluation

Reuters-21578 Summary of Inductive Learning Process

Inductive something something Evaluation

slide-16
SLIDE 16

Inductive something something Evaluation

750 1500 2250 3000 Corn Wheat Ship Interest Trade Crude Grain Money-fx Acquisitions Earn

21578 collection, 200 words in length

118 categories

75% train, 25% test

Reuters-21578

slide-17
SLIDE 17

Inductive something something Evaluation

slide-18
SLIDE 18

Summary of Inductive Learning Process

Inductive something something Evaluation

Average of precision and recall(F measure?) Train/test dataset not optimized

slide-19
SLIDE 19

Results & Others

Training Time Classification Speed for New Instances Classification Accuracy Other Experiments

Something something something Evaluation Results

slide-20
SLIDE 20

Inductive something something Evaluation Results

Training Time 266 MHz Pentium II running Windows NT. Fastest: Find Similar (<1 CUP sec/cat) SVM (<2 CUP sec/cat) Naive Bayes(8 CPU sec/cat) Decision Trees (~70 CUP sec/cat) Bayes Nets(~145 CUP sec/cat) Slowest:

slide-21
SLIDE 21

Inductive something something Evaluation Results

slide-22
SLIDE 22

Inductive something something Evaluation Results

New Instances? All less than 2 sec

slide-23
SLIDE 23

Inductive something something Evaluation Results

Accuracy

slide-24
SLIDE 24

Inductive something something Evaluation Results

Others? Sample Size N-gram Binary vs. 0/1/2 features

slide-25
SLIDE 25

Questions?