EM Algorithm & High Dimensional Data
Ken Kreutz-Delgado (Nuno Vasconcelos)
ECE 175A – Winter 2012 – UCSD
High Dimensional Data Ken Kreutz-Delgado (Nuno Vasconcelos) ECE - - PowerPoint PPT Presentation
EM Algorithm & High Dimensional Data Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175A Winter 2012 UCSD Gaussian EM Algorithm For the Gaussian mixture model, we have Expectation Step (E-Step): Maximization Step (M-Step): 2 EM
ECE 175A – Winter 2012 – UCSD
2
Data Class Assignments Soft Decisions: Hard Decisions: Parameter Updates Soft Updates: Hard Updates:
3
* | | |
( ) argmax | 1, | | , 0,
i Z X i j Z X i Z X i ij
i x P j x P j x P k x k j h
new ( )
i i j j
4
5
into the cheetah and background classes?
6
7
discrete cosine transform
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Gaussian
Bag of DCT vectors
PX|Y (x|cheetah)
8 +
discrete cosine transform
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Bag of DCT vectors
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
cheetah x P W
X
|
|
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
grass x P W
X
|
|
? ?
i *
Y i i
9
discrete cosine transform
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Mixture
Gaussians
Bag of DCT vectors
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
PX|Y(x|cheetah)
10 +
discrete cosine transform
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Bag of DCT vectors
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
grass x P W
X
|
|
? ?
* , , , i
argmax log ( , , ) log ( )
i k i k i k Y k
i G x P i
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
PX|Y(x|cheetah)
11
12
all 64 features
8%
13
14
15
16
a a
17
d 1 2 3 4 5 6 7 fd 1 .785 .524 .308 .164 .08 .037
18
volume of sphere is already smaller
a
a a
19
a a
All the volume of the cube is in the “spikes” (corners)!
20
a a
d p
21
e
S1 S2
a
22
23
n 1 2 3 4 5 6 10 15 20 1-Pn .998 .99 .97 .94 .89 .83 .48 .134 .02
24
problem in n+1 dimensions without increasing the probability of error, and even often decreasing the probability of error.
25
x y x y
26
x y
27
Gaussian, exponential, etc
mixture of several components.
components, what type), the likelihood has local minima, etc.
28
axis in order to get a reasonable quantization
terrible in 3-D (9 out of each10 bins empty) dimension 1 2 3 points/bin 10 1 0.1
29
Discriminant Feature Non-Discriminant Feature
30