Classifica4on Binary classifica,on Given a set of examples ( x i , - - PDF document

classifica4on
SMART_READER_LITE
LIVE PREVIEW

Classifica4on Binary classifica,on Given a set of examples ( x i , - - PDF document

4/8/09 CSCI1950Z Computa4onal Methods for Biology Lecture 17 Ben Raphael April 6, 2009 hJp://cs.brown.edu/courses/csci1950z/ Classifica4on Binary classifica,on Given a set of examples ( x i , y i ) , where y i = + 1, from unknown


slide-1
SLIDE 1

4/8/09 1

CSCI1950‐Z Computa4onal Methods for Biology Lecture 17

Ben Raphael April 6, 2009

hJp://cs.brown.edu/courses/csci1950‐z/

Classifica4on

Binary classifica,on Given a set of examples (xi, yi), where yi = +‐ 1, from unknown distribu4on D. Design func4on f: Rn  {‐1,+1} that op+mally assigns addi4onal samples xi to

  • ne of two classes.

Supervised learning (xi, yi) training data xi(j): feature. Rn: feature space.

slide-2
SLIDE 2

4/8/09 2

Classifica4on in 2D Hard vs. So\ Op4on

slide-3
SLIDE 3

4/8/09 3

Decision Surfaces

Linear Nonlinear

Embedding in Higher Dimension

slide-4
SLIDE 4

4/8/09 4

Cancer Classifica4on

(ben‐Dor et al. JCB 2000)

  • Colon cancer (Alon et al. (1999))

– 62 samples. 6000 measured genes. 2000 selected.

  • Ovary (Schummer et al. (1999))

– 15 cancerous, 13 normal, 4 other 4ssues. 100,000 cDNA clones

  • Leukemia (Golub et al. (1999))

– 25 AML. 47 ALL. 7,129 genes measured genes.

Cancer Classifica4on Results

(ben‐Dor et al. JCB 2000)

Clustering method Select similarity threshold (CAST or hierarchical) to maximize compa+bility with sample labeling. Linear kernel K(x, y) = x . y Quadra4c kernel K(x, y) = (x . y + 1)2 Less than margin

slide-5
SLIDE 5

4/8/09 5

Results

Colon cancer data