Maximum Likelihood Learning
Stefano Ermon, Aditya Grover
Stanford University
Lecture 4
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 1 / 25
Maximum Likelihood Learning Stefano Ermon, Aditya Grover Stanford - - PowerPoint PPT Presentation
Maximum Likelihood Learning Stefano Ermon, Aditya Grover Stanford University Lecture 4 Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 1 / 25 Learning a generative model We are given a training set of examples, e.g.,
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 1 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 2 / 25
M
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 3 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 4 / 25
1
2
3
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 5 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 6 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 7 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 8 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 9 / 25
Pθ D(Pdata||Pθ) = arg min Pθ −Ex∼Pdata [log Pθ(x)] = arg max Pθ Ex∼Pdata [log Pθ(x)]
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 10 / 25
Pθ
x∈D Pθ(x)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 11 / 25
1 Express the quantity of interest as the expected value of a
2 Generate T samples x1, . . . , xT from the distribution P with respect
3 Estimate the expected value from the samples using:
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 12 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 13 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 14 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 15 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 16 / 25
m
m
n
i
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 17 / 25
m
m
n
i
1 Initialize θ0 at random 2 Compute ∇θℓ(θ) (by back propagation) 3 θt+1 = θt + αt∇θℓ(θ)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 18 / 25
m
n
i
1
2
3
m
n
i
m
n
i
i
i=1 ∇θ log pneural(x(j) i
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 19 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 20 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 21 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 22 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 23 / 25
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 24 / 25
i
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 4 25 / 25