Discriminative Feature Extraction and Dimension Reduction
- PCA & LDA
Discriminative Feature Extraction and Dimension Reduction - PCA - - PowerPoint PPT Presentation
Discriminative Feature Extraction and Dimension Reduction - PCA & LDA Berlin Chen, 2004 Introduction Goal: discover significant patterns or features from the input data Salient feature selection or dimensionality reduction
2004 TCFST - Berlin Chen
2
2004 TCFST - Berlin Chen
3
2004 TCFST - Berlin Chen
4
2004 TCFST - Berlin Chen
5
Pearson, 1901
2004 TCFST - Berlin Chen
6
The patterns show a significant difference from each other in one
2004 TCFST - Berlin Chen
7
= n i i i i
T n i
1
n i
1
x
2004 TCFST - Berlin Chen
8
j T
i
i
i
T i i T i i
i
2 2 i T i i T T i i T T i
i i
T i T i T i i
1
2
1 where , cos
1 1 1 1
1 1
= = = = φ x φ x x x x
T T
y ϕ ϕ θ
1
y
2
y
∑
i T i T
i
2004 TCFST - Berlin Chen
9
i
2 2 i T i i T T i i T T i
i i
T i T i T i i
∑
i T i T
i
i
j
j T i j T T i j T T i T T j T i j i
2004 TCFST - Berlin Chen
10
i
+ = = =
n m j j j m i i n i i
1 1 1
i i
=
m i i
1
i
∑ ∑ ∑ ∑ ∑ ∑ ∑
+ = + = + = + = + = + = + =
n m j j T j n m j j n m j j k T j n m j n m k k j k n m k k T j n m j j
1 1 2 1 2 1 1 1 1 2
⎩ ⎨ ⎧ ≠ = = k j k j
k T
j
if if 1 φ φ Q
2 2 2 2
j j
y E y E y E y E
j j j
= − = = σ
We should discard the bases where the projections have lower variances
2004 TCFST - Berlin Chen
11
i
i
j j j
∑ ∑ ∑ ∑
+ = + = + = + =
n m j j n m j j j T j n m j j T j n m j j
1 1 1 1 2
is real and symmetric, therefore its eigenvectors form a orthonormal set
2004 TCFST - Berlin Chen
12
n m n m j j eigen m
∑
+ =
1 1
i
j
j
j T i j T i j T T i j T T i T T j T i j i
2004 TCFST - Berlin Chen
13
2004 TCFST - Berlin Chen
14
eigen
n m m n m n m n m n n m m n n m n m m n j m n j n j m jn m j T n m k k jk j j n j m n m j n m k jk k T j jk n m j j T j
j
1 1 1 1 1 1 1 1 1 1 1 + − − − − + − + + − − ≤ ≤ + + + = ≤ ≤ + + = + = + =
Take derivation Have a particular solution if is a diagonal matrix and its diagonal elements is the eigenvalues
m n−
n m
1 +
n m
1 +
ϕ ϕ ϕ ϕ R R 2 = ∂ ∂
T
constraints To be minimized y x y x = ∂ ∂
T
2004 TCFST - Berlin Chen
15
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ =
n
x x x . .
2 1
x
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ =
m
y y y . .
2 1
y
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ =
n
x x x ˆ . . ˆ ˆ ˆ
2 1
x
T x
T
m T
2 1
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ =
m
y y y . .
2 1
y
2004 TCFST - Berlin Chen
16
2004 TCFST - Berlin Chen
17
2004 TCFST - Berlin Chen
18
threshold for information content reserved
2004 TCFST - Berlin Chen
19
256 256 8 8
2004 TCFST - Berlin Chen
20
2004 TCFST - Berlin Chen
21
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ = ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ = ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ⎤ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ ⎡ =
n L L L L n n
x x x x x x x x x
. 2 . 1 , . 2 2 . 2 1 , 2 2 . 1 2 . 1 1 , 1 1
. . ,......, . . , . . x x x
2004 TCFST - Berlin Chen
22
K i i i i K i i i i
, 2 , 1 , , 2 , 1 ,
2004 TCFST - Berlin Chen
23
Eigenvoice Eigenvoice space space construction construction
Speaker 1 Data SI HMM Speaker R Data Model Training Model Training Speaker 1 HMM Speaker R HMM D = (M.n)×1
Each new speaker S is represented Each new speaker S is represented by a point by a point P P in in K K-
space
K i i i i
, 2 , 1 ,
2004 TCFST - Berlin Chen
24
2004 TCFST - Berlin Chen
25
2004 TCFST - Berlin Chen
26
2004 TCFST - Berlin Chen
27
Within-class distributions are assumed here to be Gaussians With equal variance in the two-dimensional sample space
2004 TCFST - Berlin Chen
28
i
i
∑ =
= N i i
N
1
1 x x
( )
∑
=
j g i j j
i
x
( )
∑
=
− − =
j g T j i j i j j
i
N
x
x x x x Σ 1 ∑
j j j w
∑
j T j j j b
2004 TCFST - Berlin Chen
29
m
1
2
i T i
T N i i T N i i T
∑ ∑
= = 1 1
( )
j T j g i T j j
i
x
∑
=
( ) ( )
( )
x x x w T j j j T T j g i T j i T j g j g i T j i T j j j w
i i i
∑ ∑ ∑ ∑ ∑
= = =
2004 TCFST - Berlin Chen
30
m
1
2
b T b =
w T b T w b
i w i i b
i
i i i b w
−1
b w S
1 −
2004 TCFST - Berlin Chen
31
i i i b i w i i b i w i i b i w T i b T i i i w T i w i w T i b i w T i b T i w i w T i w T i b i w T i b T i w i w T i b i i i w T i b T i i w T b T w b
w i i i i i i i i i i i i i
W W W
−1 2 2 2 ˆ ˆ ˆ
2
G F G G F G F ′ − ′ = ′ ⎟ ⎠ ⎞ ⎜ ⎝ ⎛
C C x Cx x
T T
+ = d d ) (
2004 TCFST - Berlin Chen
32
Covariance Matrix of the 18-Mel-filter-bank vectors
Calculated using Year-99’s 5471 files
Covariance Matrix of the 18-cepstral vectors
Calculated using Year-99’s 5471 files
i
T i i
x
i
T i i
y
2004 TCFST - Berlin Chen
33
Covariance Matrix of the 18-PCA-cepstral vectors Covariance Matrix of the 18-LDA-cepstral vectors
Calculated using Year-99’s 5471 files Calculated using Year-99’s 5471 files
20.11 23.11 LDA-2 20.17 23.12 LDA-1 22.71 26.32 MFCC WG TC Character Error Rate
2004 TCFST - Berlin Chen
34
2004 TCFST - Berlin Chen
35
2004 TCFST - Berlin Chen
36
∏ ∏
= =
J j Nj j T b T J j Nj j T b T
1 1
b T J j j T j
1
∑
=
2004 TCFST - Berlin Chen
37