Linear ¡Discriminant ¡Analysis ¡
Debapriyo Majumdar Data Mining – Fall 2014 Indian Statistical Institute Kolkata
August 28, 2014
The owning house data Can we separate the points with a line? - - PowerPoint PPT Presentation
Linear Discriminant Analysis Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata August 28, 2014 The owning house data Can we separate the points with a line? 200 Income (thousand
Debapriyo Majumdar Data Mining – Fall 2014 Indian Statistical Institute Kolkata
August 28, 2014
2 ¡
30 40 50 60 70 80 50 100 150 200 Age (Years) Income (thousand rupees)
3 ¡
The ¡figures ¡are ¡from ¡Ricardo ¡Gu?errez-‑Osuna’s ¡slides ¡ ¡ ¡ Not ¡same ¡as ¡Latent ¡Dirichlet ¡Alloca?on ¡(also ¡LDA) ¡
2×2 matrix two data points (0.5,0.7) and (1.1,0.8)
4 ¡
1×2 vector norm=1 represents the x axis
Projection onto the x axis Distances from the origin
Projection onto the y axis Distances from the origin
5 ¡
1 2 1 2 ! " # # $ % & &
1×2 vector, norm=1 the x=y line Projection onto the x=y line Distances from the origin
w : some unit vector x : any point distance of projection
w from origin = wTx wTx : a scalar
6 ¡
x∈ci
x∈ci
Better separation of means
7 ¡
µ1 µ2
Example: if w is the unit vector along x
Better separation
8 ¡
µ1 µ2
i =
x∈ci
2
1 + !
2
9 ¡
µ1 µ2
2
1 + !
2
Separation of means and the points as well
10 ¡
x∈ci
T
i =
x∈ci
2 =
x∈ci
2
x∈ci
Tw = wTSiw
1 + !
2 = wTSWw
11 ¡
2 = wTµ1 − wTµ2
2
2
T SB
12 ¡
Dividing by same denominator
WSBw
The generalized eigenvalue problem
13 ¡
µ1=µ2 µ1 µ2
– What if it is mainly in the variance?
14 ¡
µ1=µ2