SLIDE 1

L15: Microarray analysis (Classification)


SLIDE 2

Silly Quiz

  • Social networking site: how can you find people with interests similar to yours?


SLIDE 3

Gene Expression Data

  • Gene Expression data:
    – Each row corresponds to a gene
    – Each column corresponds to an expression value
  • Can we separate the experiments into two or more classes?
  • Given a training set of two classes, can we build a classifier that places a new experiment in one of the two classes?

[Figure: expression matrix with gene rows g and sample columns s1, s2, …]


SLIDE 4

Formalizing Classification

  • Classification problem: Find a surface (hyperplane) that will separate the classes.
  • Given a new sample point, its class is then determined by which side of the surface it lies on.
  • How do we find the hyperplane? How do we find the side that a point lies on?

        s1   s2   s3   s4   s5   s6
  g1    1    .9   .8   .1   .2   .1
  g2    .1   0    .2   .8   .7   .9


SLIDE 5

Basic geometry

  • What is ||x||²?
  • What is x/||x||?
  • Dot product?

For x = (x1, x2) and y = (y1, y2):

xᵀy = x1y1 + x2y2 = ||x||·||y|| cos θx cos θy + ||x||·||y|| sin θx sin θy = ||x||·||y|| cos(θx − θy)
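A quick numeric check of this identity (a minimal sketch in numpy; the two vectors are arbitrary example values):

```python
import numpy as np

# Two arbitrary 2-D example vectors
x = np.array([3.0, 1.0])
y = np.array([1.0, 2.0])

lhs = x @ y                                    # x1*y1 + x2*y2
theta_x = np.arctan2(x[1], x[0])               # angle of x
theta_y = np.arctan2(y[1], y[0])               # angle of y
rhs = np.linalg.norm(x) * np.linalg.norm(y) * np.cos(theta_x - theta_y)
print(lhs, rhs)                                # both ~5.0, up to floating-point error
```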


SLIDE 6

Dot Product

  • Let β be a unit vector: ||β|| = 1.
  • Recall that βᵀx = ||x|| cos θ.
  • What is βᵀx if x is orthogonal (perpendicular) to β?

[Figure: vectors x and β separated by angle θ; βᵀx = ||x|| cos θ]


SLIDE 7

Hyperplane

  • How can we define a hyperplane L?
  • Find the unit vector β that is perpendicular (normal) to the hyperplane.


SLIDE 8

Points on the hyperplane

  • Consider a hyperplane L defined by unit vector β and distance β0.
  • Notes:
    – For all x ∈ L, xᵀβ must be the same: xᵀβ = β0.
    – For any two points x1, x2 ∈ L, (x1 − x2)ᵀβ = 0.


SLIDE 9

Hyperplane properties

  • Given an arbitrary point x, what is the distance from x to the plane L?
    – D(x, L) = βᵀx − β0
  • When are points x1 and x2 on different sides of the hyperplane?
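These two questions translate directly into code. A minimal sketch, assuming β is a unit normal and β0 the offset defined on the previous slides:

```python
import numpy as np

def signed_distance(x, beta, beta0):
    """D(x, L) = beta^T x - beta0; beta is assumed to be a unit vector."""
    return x @ beta - beta0

def same_side(x1, x2, beta, beta0):
    """x1 and x2 lie on the same side of L iff their signed distances agree in sign."""
    return signed_distance(x1, beta, beta0) * signed_distance(x2, beta, beta0) > 0
```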


SLIDE 10

Separating by a hyperplane

  • Input: A training set of +ve & -ve examples.
  • Goal: Find a hyperplane that separates the two classes.
  • Classification: A new point x is +ve if it lies on the +ve side of the hyperplane, -ve otherwise.
  • The hyperplane is represented by the line {x : −β0 + β1x1 + β2x2 = 0}

SLIDE 11

Error in classification

  • An arbitrarily chosen hyperplane might not separate the two classes. We need to minimize a mis-classification error.
  • Error: sum of distances of the misclassified points.
  • Let yi = −1 for +ve example i, and yi = 1 otherwise. (Other definitions are also possible.)

D(β, β0) = Σi∈M yi (xiᵀβ + β0)

where M is the set of misclassified points.
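A sketch of evaluating this error for a candidate (β, β0), under the slide's sign convention (yi = −1 for +ve examples, so every misclassified point contributes a positive term):

```python
import numpy as np

def misclassification_error(X, y, beta, beta0):
    """D(beta, beta0) = sum over misclassified i of y_i (x_i^T beta + beta0).
    Slide convention: y[i] = -1 for +ve examples, +1 otherwise, so a point is
    misclassified exactly when y_i (x_i^T beta + beta0) > 0."""
    margins = y * (X @ beta + beta0)
    M = margins > 0                  # the misclassified set
    return margins[M].sum()
```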


SLIDE 12

Gradient Descent

  • The function D(β) defines the error.
  • We follow an iterative refinement: in each step, refine β so that the error is reduced.
  • Gradient descent is an approach to such iterative refinement.

Update rule: β ← β − ρ · D′(β)

[Figure: curve D(β) with the gradient D′(β) at the current β]
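A minimal 1-D illustration of the update rule; the objective here is an arbitrary quadratic chosen for the example, not the classification error:

```python
def gradient_descent(D_prime, beta=0.0, rho=0.1, steps=100):
    """Iteratively refine beta by stepping against the gradient D'(beta)."""
    for _ in range(steps):
        beta = beta - rho * D_prime(beta)
    return beta

# Example: D(beta) = (beta - 3)^2, so D'(beta) = 2*(beta - 3); minimum at beta = 3
print(gradient_descent(lambda b: 2 * (b - 3)))   # ~3.0
```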


SLIDE 13

Rosenblatt’s perceptron learning algorithm

D(β, β0) = Σi∈M yi (xiᵀβ + β0)

∂D(β, β0)/∂β = Σi∈M yi xi

∂D(β, β0)/∂β0 = Σi∈M yi

⇒ Update rule: (β, β0) ← (β, β0) − ρ (Σi∈M yi xi, Σi∈M yi)

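A sketch of the batch update rule above (numpy; the step size rho and iteration cap are arbitrary choices, and the yi convention is the one from the "Error in classification" slide):

```python
import numpy as np

def perceptron(X, y, rho=0.1, max_iter=1000):
    """Rosenblatt's perceptron. X: (n, d) samples, one per row;
    y[i] = -1 for +ve examples, +1 for -ve examples (slide convention).
    Returns the hyperplane (beta, beta0)."""
    beta, beta0 = np.zeros(X.shape[1]), 0.0
    for _ in range(max_iter):
        margins = y * (X @ beta + beta0)
        M = margins >= 0                     # misclassified (or boundary) points
        if not M.any():
            break                            # all points separated: converged
        beta = beta - rho * (y[M][:, None] * X[M]).sum(axis=0)
        beta0 = beta0 - rho * y[M].sum()
    return beta, beta0
```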

SLIDE 14

Classification based on perceptron learning

  • Use Rosenblatt’s algorithm to compute the hyperplane L = (β, β0).
  • Assign x to class 1 if f(x) = βᵀx + β0 ≥ 0, and to class 2 otherwise.
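Usage of the perceptron sketch above on hypothetical toy data:

```python
import numpy as np

X = np.array([[2.0, 2.0], [3.0, 3.0],      # +ve examples (y = -1)
              [0.0, 0.0], [-1.0, 0.5]])    # -ve examples (y = +1)
y = np.array([-1, -1, 1, 1])
beta, beta0 = perceptron(X, y)
x_new = np.array([2.5, 2.0])
print("class 1" if x_new @ beta + beta0 >= 0 else "class 2")
```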


SLIDE 15

Perceptron learning

  • If many solutions are possible, it does not choose between them.
  • If the data is not linearly separable, it does not terminate, and this is hard to detect.
  • Time to convergence is not well understood.


SLIDE 16

Linear Discriminant analysis

  • Provides an alternative approach to classification with a linear function.
  • Project all points, including the means, onto a vector β.
  • We want to choose β such that:
    – the difference of the projected means is large;
    – the variance within each group is small.

[Figure: two-class data in (x1, x2) with a candidate projection direction β]


SLIDE 17

Choosing the right β

[Figure: the same two-class data projected onto two candidate directions, β1 and β2]

  • β1 is a better choice than β2, as the variance within a group is small and the difference of means is large.
  • How do we compute the best β?


SLIDE 18

Linear Discriminant analysis

maxβ (difference of projected means) / (sum of projected variances)

  • Fisher Criterion


SLIDE 19

LDA cont’d

  • What is the projection of a point x onto β?
    – Ans: βᵀx
  • What is the distance between the projected means?

(m̃1 − m̃2)² = (βᵀ(m1 − m2))²

where m̃i = βᵀmi is the projection of the class mean mi.

[Figure: two-class data with means m1, m2 projected onto β]


SLIDE 20

LDA Cont’d

|m̃1 − m̃2|² = (βᵀ(m1 − m2))²
            = βᵀ(m1 − m2)(m1 − m2)ᵀβ = βᵀSBβ

Scatter within sample: s̃1² + s̃2²

where s̃1² = Σx∈D1 (x̃ − m̃1)² = Σx∈D1 (βᵀ(x − m1))² = βᵀS1β

s̃1² + s̃2² = βᵀ(S1 + S2)β = βᵀSWβ

Fisher Criterion: maxβ (βᵀSBβ) / (βᵀSWβ)


SLIDE 21

LDA

Let maxβ (βᵀSBβ) / (βᵀSWβ) = λ. Then

βᵀ(SBβ − λSWβ) = 0
⇒ λSWβ = SBβ
⇒ λβ = SW⁻¹SBβ
⇒ β = SW⁻¹(m1 − m2)   (up to scale, since SBβ always points along m1 − m2)

Therefore, a simple computation (Matrix inverse) is sufficient to compute the ‘best’ separating hyperplane
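A sketch of that computation (numpy; class1 and class2 hold one sample per row, and SW is assumed to be invertible):

```python
import numpy as np

def lda_direction(class1, class2):
    """beta = SW^{-1} (m1 - m2), where SW = S1 + S2 is the within-class scatter."""
    m1, m2 = class1.mean(axis=0), class2.mean(axis=0)
    S1 = (class1 - m1).T @ (class1 - m1)      # scatter of class 1
    S2 = (class2 - m2).T @ (class2 - m2)      # scatter of class 2
    Sw = S1 + S2
    beta = np.linalg.solve(Sw, m1 - m2)       # solve SW beta = m1 - m2
    return beta / np.linalg.norm(beta)        # normalize to a unit vector
```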


SLIDE 22

End of Lecture 15


SLIDE 23

Maximum Likelihood discrimination

  • Consider the simple case of single-dimensional data.
  • Compute a distribution of the values in each class.

[Figure: per-class probability distribution (Pr) over the values]


SLIDE 24

Maximum Likelihood discrimination

  • Suppose we knew the distribution of points in each class ωi.
    – We can compute Pr(x|ωi) for all classes i, and take the maximum.
  • The true distribution is not known, so usually we assume that it is Gaussian.


SLIDE 25

ML discrimination

  • Use a Bayesian approach to identify the class for each sample:

Pr(ωi|x) = Pr(x|ωi) Pr(ωi) / Σj Pr(x|ωj) Pr(ωj)

gi(x) = ln(Pr(x|ωi)) + ln(Pr(ωi)) ≅ −(x − µi)²/(2σi²) + ln(Pr(ωi))

where P(x) = (1/(σ√2π)) e^(−(x−µ)²/(2σ²)) is the Gaussian density.

[Figure: Gaussian density P(x) centered at µ]

SLIDE 26

ML discrimination recipe (1-dimensional case)

  • We know the distribution for each class, but not the parameters.
  • Estimate the mean and variance for each class.
  • For a new point x, compute the discrimination function gi(x) for each class i.
  • Choose argmaxi gi(x) as the class for x.
SLIDE 27

ML discrimination

  • Suppose all the points were in 1 dimension, and all classes were normally distributed.

Pr(ωi|x) = Pr(x|ωi) Pr(ωi) / Σj Pr(x|ωj) Pr(ωj)

gi(x) = ln(Pr(x|ωi)) + ln(Pr(ωi)) ≅ −(x − µi)²/(2σi²) − ln(σi) + ln(Pr(ωi))

Choose argmini [ (x − µi)²/(2σi²) + ln(σi) − ln(Pr(ωi)) ]

[Figure: two class densities with means µ1 and µ2 along the x axis]
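A sketch of the 1-dimensional recipe using this discriminant, with priors estimated from class sizes (the function names are illustrative):

```python
import numpy as np

def fit_1d(classes):
    """classes: list of 1-D arrays, one per class.
    Estimate (mu_i, sigma_i, prior_i) for each class."""
    n_total = sum(len(c) for c in classes)
    return [(c.mean(), c.std(ddof=1), len(c) / n_total) for c in classes]

def classify_1d(x, params):
    """g_i(x) = -(x - mu_i)^2 / (2 sigma_i^2) - ln(sigma_i) + ln(prior_i);
    return argmax_i g_i(x)."""
    g = [-(x - mu) ** 2 / (2 * s ** 2) - np.log(s) + np.log(p)
         for mu, s, p in params]
    return int(np.argmax(g))
```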

SLIDE 28

ML discrimination (multi-dimensional case)

Sample mean: µ̂ = (1/n) Σi xi

Covariance matrix: Σ̂ = (1/(n−1)) Σk (xk − µ̂)(xk − µ̂)ᵀ

SLIDE 29

ML discrimination (multi-dimensional case)

p(x|ωi) = (1 / ((2π)^(d/2) |Σ|^(1/2))) exp(−(1/2)(x − m)ᵀΣ⁻¹(x − m))

gi(x) = ln(p(x|ωi)) + ln(P(ωi))

Compute argmaxi gi(x)
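A sketch combining the estimates from the previous slide with this discriminant, one Gaussian per class (function names are illustrative):

```python
import numpy as np

def fit_gaussian(D):
    """Sample mean and covariance of data D, one sample per row."""
    mu = D.mean(axis=0)
    diff = D - mu
    Sigma = diff.T @ diff / (len(D) - 1)
    return mu, Sigma

def g_i(x, mu, Sigma, prior):
    """g_i(x) = ln p(x | w_i) + ln P(w_i) for a d-dimensional Gaussian."""
    d = len(mu)
    diff = x - mu
    log_p = (-0.5 * diff @ np.linalg.solve(Sigma, diff)   # quadratic form
             - 0.5 * np.linalg.slogdet(Sigma)[1]          # -0.5 ln|Sigma|
             - 0.5 * d * np.log(2 * np.pi))
    return log_p + np.log(prior)

# Classify x as argmax_i g_i(x, mu_i, Sigma_i, prior_i).
```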

SLIDE 30

Supervised classification summary

  • Most techniques for supervised classification are based on the notion of a separating hyperplane.
  • The ‘optimal’ separation can be computed using various combinatorial (perceptron), algebraic (LDA), or statistical (ML) analyses.


SLIDE 31

Review of micro-array analysis


SLIDE 32

The dynamic picture of cellular activity

  • Each cell is continuously active:
    – Genes are being transcribed into RNA.
    – RNA is translated into proteins.
    – Proteins are post-translationally modified and transported.
    – Proteins perform various cellular functions.
  • Can we probe the cell dynamically?
    – Which transcripts are active?
    – Which proteins are active?
    – Which proteins interact?

[Diagram labels: Gene Regulation · Proteomic profiling · Transcript profiling]


SLIDE 33

Other static analyses are possible

[Diagram labels: Protein Sequence Analysis · Sequence Analysis · Gene Finding · Assembly · ncRNA · Genomic Analysis / Pop. Genetics]


SLIDE 34

Silly Quiz

  • Who are these people, and what is the occasion?


SLIDE 35

Genome Sequencing and Assembly


SLIDE 36

DNA Sequencing

  • DNA is double-stranded.
  • The strands are separated, and a polymerase is used to copy the second strand.
  • Special bases terminate this process early.


SLIDE 37

Sequencing

  • A break at T is shown here.
  • Measuring the lengths using electrophoresis allows us to get the position of each T.
  • The same can be done with every nucleotide. Fluorescent labeling can help separate different nucleotides.


SLIDE 38
  • Automated detectors ‘read’ the terminating bases.
  • The signal decays after 1000 bases.


SLIDE 39

Sequencing Genomes: Clone by Clone

  • Clones are constructed to span the entire length of the genome.
  • These clones are ordered and oriented correctly (mapping).
  • Each clone is sequenced individually.


SLIDE 40

Shotgun Sequencing

  • Shotgun sequencing of clones was considered viable.
  • However, researchers in 1999 proposed shotgunning the entire genome.


SLIDE 41

Library

  • Create vectors of the sequence and introduce them into bacteria. As the bacteria multiply, you will have many copies of the same clone.


SLIDE 42

Whole Genome Shotgun

  • Break up the entire genome into pieces.
  • Sequence the ends, and assemble using a computer.
  • Lander-Waterman (LW) statistics & repeats argue against the success of such an approach.
  • Alternative: build a roadmap of the genome, with physical clones mapped for each region. Sequence each of the clones, and put them together.


SLIDE 43

PCA: motivating example

  • Consider the expression values of 2 genes over 6 samples.
  • Clearly, the expression of g1 is not informative, and it suffices to look at the g2 values.
  • Dimensionality can be reduced by discarding the gene g1.

[Figure: scatter of the 6 samples in the (g1, g2) plane]


SLIDE 44

Principal Components Analysis

  • Consider the expression values of 2 genes over 6 samples.
  • Clearly, the expression of the two genes is highly correlated.
  • Projecting all the genes on a single line could explain most of the data.
  • This is a generalization of “discarding the gene”.

SLIDE 45

Projecting

  • Consider the mean m of all points, and a vector β emanating from the mean.
  • Algebraically, this projection onto β means that each sample x can be represented by a single value βᵀ(x − m).

[Figure: mean m, direction β, and the projection βᵀ(x − m) of a point x]
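In code, a minimal sketch (X holds one sample per row):

```python
import numpy as np

def project_1d(X, beta):
    """Represent each sample x by the single value beta^T (x - m)."""
    m = X.mean(axis=0)        # mean of all points
    return (X - m) @ beta     # one scalar per sample
```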

SLIDE 46

Higher dimensions

  • Consider a set of 2 (in general, k) orthonormal vectors β1, β2, …
  • Once projected, each sample x can be represented by a 2 (k) dimensional vector:
    – (β1ᵀ(x − m), β2ᵀ(x − m), …)

[Figure: projection of a point x onto the orthonormal directions β1 and β2]

SLIDE 47

How to project

  • The generic scheme allows us to project an m-dimensional surface onto a k-dimensional one.
  • How do we select the k ‘best’ dimensions?
  • The strategy used by PCA is one that maximizes the variance of the projected points around the mean.

SLIDE 48

PCA

  • Suppose all of the data were to be reduced by projecting onto a single line β from the mean m.
  • How do we select the line β?

SLIDE 49

PCA cont’d

  • Let each point xk map to x’k = m + akβ. We want to minimize the error Σk ||xk − x’k||².
  • Observation 1: Each point xk maps to x’k = m + βᵀ(xk − m)β
    – (that is, ak = βᵀ(xk − m))

[Figure: xk and its projection x’k on the line through m in direction β]

SLIDE 50

Proof of Observation 1

minak ||xk − x’k||²
  = minak ||xk − m + m − x’k||²
  = minak ||xk − m||² + ||m − x’k||² − 2(x’k − m)ᵀ(xk − m)
  = minak ||xk − m||² + ak²βᵀβ − 2akβᵀ(xk − m)
  = minak ||xk − m||² + ak² − 2akβᵀ(xk − m)

Differentiating w.r.t. ak:

2ak − 2βᵀ(xk − m) = 0 ⇒ ak = βᵀ(xk − m) ⇒ ak² = akβᵀ(xk − m)

⇒ ||xk − x’k||² = ||xk − m||² − βᵀ(xk − m)(xk − m)ᵀβ

SLIDE 51

Minimizing PCA Error

  • Summing over all points:

Σk ||xk − x’k||² = C − βᵀ [Σk (xk − m)(xk − m)ᵀ] β = C − βᵀSβ

  • To minimize the error, we must maximize βᵀSβ (subject to ||β|| = 1).
  • Maximizing βᵀSβ with ||β|| = 1 gives Sβ = λβ, with λ = βᵀSβ: λ is an eigenvalue of S, and β the corresponding eigenvector.
  • Therefore, we must choose the eigenvector corresponding to the largest eigenvalue.

SLIDE 52

PCA steps

  • X = starting matrix with n columns (samples xj), m rows (genes). A code sketch follows the steps below.
  • 1. m = (1/n) Σj=1..n xj   (mean column)
  • 2. hᵀ = [1 1 … 1]
  • 3. M = X − mhᵀ   (center the columns)
  • 4. S = MMᵀ = Σj=1..n (xj − m)(xj − m)ᵀ
  • 5. Diagonalize: BᵀSB = Λ
  • 6. Return BᵀM   (coordinates along the principal axes)
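A sketch of these steps in numpy, using a symmetric eigendecomposition for step 5 (samples are columns of X, as on the slide):

```python
import numpy as np

def pca(X, k=2):
    """PCA per the steps above. X: (m, n), one sample per column.
    Returns the k-dimensional projections of the n samples."""
    m_vec = X.mean(axis=1, keepdims=True)       # step 1: mean column
    M = X - m_vec                               # steps 2-3: M = X - m h^T
    S = M @ M.T                                 # step 4: scatter matrix
    evals, B = np.linalg.eigh(S)                # step 5: B^T S B = Lambda
    top = np.argsort(evals)[::-1][:k]           # indices of the k largest eigenvalues
    return B[:, top].T @ M                      # step 6: return B^T M
```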