Maximum Likelihood Estimation
CS 446
Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract - - PowerPoint PPT Presentation
Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one main meta-algorithm this semester: (Regularized) ERM principle: pick the model that minimizes an average loss over training data. 1 / 70
CS 446
1 / 70
i=1, pick the model that maximized the likelihood
θ∈Θ L(θ) = max θ∈Θ ln n
θ∈Θ n
1 / 70
2 / 70
2 / 70
2 / 70
2 / 70
3 / 70
i xi and T := i(1 − xi) = n − H for convenience,
n
3 / 70
i xi and T := i(1 − xi) = n − H for convenience,
n
H T +H = H N .
3 / 70
2σ2
4 / 70
2σ2
n
n
4 / 70
i=1 xi n estimates a Gaussian µ parameter; but isn’t it useful
5 / 70
i=1 xi n estimates a Gaussian µ parameter; but isn’t it useful
5 / 70
i=1 xi n estimates a Gaussian µ parameter; but isn’t it useful
i=1 xi n is a valid estimate of the mean,
5 / 70
6 / 70
n
n
n
w∈Rd
w∈Rd n
Txi − yi)2. 7 / 70
n
n
n
w∈Rd
w∈Rd n
Txi − yi)2.
7 / 70
y∈Y
8 / 70
y∈Y
8 / 70
y∈Y
j=1 p(Xj = xj|Y = y)
y∈Y
y∈Y
d
8 / 70
y∈Y
y∈Y
d
9 / 70
y∈Y
y∈Y
d
9 / 70
10 / 70
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
5 5 10 15 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0
11 / 70
k
j=1) = n
k
TΣ−1(xi − µj)
12 / 70
0.58 0.60 0.62 0.64 0.66 0.68 5 10 15 20 25
13 / 70
0.58 0.60 0.62 0.64 0.66 0.68 5 10 15 20 25
13 / 70
0.58 0.60 0.62 0.64 0.66 0.68 5 10 15 20 25
14 / 70
0.58 0.60 0.62 0.64 0.66 0.68 5 10 15 20 25
14 / 70
15 / 70
16 / 70
17 / 70