Energy Based Models
Stefano Ermon, Aditya Grover
Stanford University
Lecture 11
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 1 / 21
Energy Based Models Stefano Ermon, Aditya Grover Stanford - - PowerPoint PPT Presentation
Energy Based Models Stefano Ermon, Aditya Grover Stanford University Lecture 11 Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 1 / 21 Summary Story so far Representation: Latent variable vs. fully observed
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 1 / 21
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 2 / 21
1 non-negative: p(x) ≥ 0 2 sum-to-one:
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 3 / 21
1 non-negative: p(x) ≥ 0 2 sum-to-one:
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 4 / 21
1 g(µ,σ)(x) = e− (x−µ)2 2σ2 . Volume is:
2σ2 dx =
2 gλ(x) = e−λx. Volume is:
3 Etc.
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 5 / 21
1
2
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 6 / 21
1
2
3
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 7 / 21
1 extreme flexibility: can use pretty much any function fθ(x) you want
1 Sampling from pθ(x) is hard 2 Evaluating and optimizing likelihood pθ(x) is hard (learning is hard) 3 No feature learning (but can add latent variables)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 8 / 21
1
2
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 9 / 21
X
Y
X
Y
X
Y
cat
“class” noun
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 10 / 21
Y1 X1 Y2 X2 Y3 X3 Y7 X7 Y4 X4 Y5 X5 Y6 X6 Y8 X8 Y9 X9
Xi: noisy pixels Yi: “true” pixels Markov Random Field
i
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 11 / 21
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 12 / 21
1
2
m
Hidden units
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 13 / 21
Deep Boltzmann machine v h(3) h(2) h(1) W(3) W(2) W(1)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 14 / 21
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 15 / 21
1 can plug in pretty much any function fθ(x) you want
1 Sampling is hard 2 Evaluating likelihood (learning) is hard 3 No feature learning
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 16 / 21
1
2
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 17 / 21
Z(θ)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 18 / 21
Z(θ)
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 19 / 21
1
2
1
2
3
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 20 / 21
Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 11 21 / 21