CSC 311: Introduction to Machine Learning
Lecture 8 - Probabilistic Models Pt. II, PCA Roger Grosse Chris Maddison Juhan Bae Silviu Pitis
University of Toronto, Fall 2020
Intro ML (UofT) CSC311-Lec8 1 / 44
CSC 311: Introduction to Machine Learning Lecture 8 - Probabilistic - - PowerPoint PPT Presentation
CSC 311: Introduction to Machine Learning Lecture 8 - Probabilistic Models Pt. II, PCA Roger Grosse Chris Maddison Juhan Bae Silviu Pitis University of Toronto, Fall 2020 Intro ML (UofT) CSC311-Lec8 1 / 44 Recap Last week took a
Intro ML (UofT) CSC311-Lec8 1 / 44
◮ maximum likelihood estimation:
◮ expected Bayesian posterior:
◮ Maximum a-posteriori (MAP) estimation:
◮ Gaussian Discriminant Analysis: Use Gaussian generative model of
◮ Principal Component Analysis: Simplify a Gaussian model by
Intro ML (UofT) CSC311-Lec8 2 / 44
Intro ML (UofT) CSC311-Lec8 3 / 44
Intro ML (UofT) CSC311-Lec8 4 / 44
Intro ML (UofT) CSC311-Lec8 5 / 44
Intro ML (UofT) CSC311-Lec8 6 / 44
1
2
D
◮ This is not true for distributions in general! Intro ML (UofT) CSC311-Lec8 7 / 44
(UofT) CSC311-Lec8 8 / 44
Intro ML (UofT) CSC311-Lec8 9 / 44
◮ The standard multivariate normal has µ = 0 and Σ = I
◮ In the univariate case, the scale factor was the square root of the
◮ But in the multivariate case, the covariance Σ is a matrix!
1 2 exist, and can we scale by it?
Intro ML (UofT) CSC311-Lec8 10 / 44
(optional draw-on slide for intuition)
Intro ML (UofT) CSC311-Lec8 11 / 44
◮ Any positive definite matrix is positive semidefinite. ◮ Positive definite matrices have positive eigenvalues, and positive
◮ For any matrix X, X⊤X and XX⊤ are positive semidefinite.
1 2 B the positive square root of A.
◮ The columns of Q are (unit) eigenvectors of A. Intro ML (UofT) CSC311-Lec8 12 / 44
1 2 .
1 2 is also positive definite, so by the Real Spectral Theorem, it “scales”
1 2 !
◮ Note that if Σ = QΛQT , Σ
1 2 = QΛ 1 2 QT
Intro ML (UofT) CSC311-Lec8 13 / 44
Intro ML (UofT) CSC311-Lec8 14 / 44
Intro ML (UofT) CSC311-Lec8 15 / 44
1
2 Test your intuition: Does Q1 = Q2?
Intro ML (UofT) CSC311-Lec8 16 / 44
1
2 Test your intuition: Does Q1 = Q2? What are λ1 and λ2?
Intro ML (UofT) CSC311-Lec8 17 / 44
Intro ML (UofT) CSC311-Lec8 18 / 44
Intro ML (UofT) CSC311-Lec8 19 / 44
N
N
N
Optional intuition building: why does |Σ|1/2 show up in the Gaussian density p(x)?
Hint: determinant is product of eigenvalues
Intro ML (UofT) CSC311-Lec8 20 / 44
Intro ML (UofT) CSC311-Lec8 21 / 44
Intro ML (UofT) CSC311-Lec8 22 / 44
N
N
N
N
N
N
Intro ML (UofT) CSC311-Lec8 23 / 44
k (x − µk)
Intro ML (UofT) CSC311-Lec8 24 / 44
Intro ML (UofT) CSC311-Lec8 25 / 44
Intro ML (UofT) CSC311-Lec8 26 / 44
N
n=1 ✶[t(n) = k] · x(n)
n=1 ✶[t(n) = k]
n=1 ✶[t(n) = k] N
Intro ML (UofT) CSC311-Lec8 27 / 44
◮ In Scikit-Learn this is called “Linear Discriminant Analysis” (LDA) Intro ML (UofT) CSC311-Lec8 28 / 44
Intro ML (UofT) CSC311-Lec8 29 / 44
Intro ML (UofT) CSC311-Lec8 30 / 44
Intro ML (UofT) CSC311-Lec8 31 / 44
◮ Flexible models, easy to add/remove class. ◮ Handle missing data naturally. ◮ More “natural” way to think about things, but usually doesn’t work
Intro ML (UofT) CSC311-Lec8 32 / 44
◮ You want to understand how a scientific field has changed over time. You
◮ You’re a biologist studying animal behavior, so you want to infer a
◮ You want to reduce your energy consumption, so you take a time series of
Intro ML (UofT) CSC311-Lec8 33 / 44
◮ based on clothing styles, gender, age, etc
Intro ML (UofT) CSC311-Lec8 34 / 44
◮ Save computation/memory ◮ Reduce overfitting, achieve better generalization ◮ Visualize in 2 dimensions
◮ Autoencoders ◮ Matrix factorizations (next week)
Intro ML (UofT) CSC311-Lec8 35 / 44
1
2
D
1
2
D
1
2
D
1
2
D
Intro ML (UofT) CSC311-Lec8 36 / 44
(UofT) CSC311-Lec8 37 / 44
Intro ML (UofT) CSC311-Lec8 38 / 44
◮ Computational benefits ◮ Interpretability, visualization ◮ Generalization Intro ML (UofT) CSC311-Lec8 39 / 44
Intro ML (UofT) CSC311-Lec8 40 / 44
◮ e.g. direction of higher variance ◮ e.g. direction of minimum difference after projection ◮ turns out they are the same! Intro ML (UofT) CSC311-Lec8 41 / 44
u3
<latexit sha1_base64="BJtgPVIz0UB+7ENyLWvfoj6wESs=">AHYnic3VNb9NAEJ20BTz1VBucDBElThFCUF8SAgVcoFL1BTSVmqynY2qdW1Hdlr1Cjqv+AK/4s7P4S3Yzf1khg4dy17Z8dvZt6MZ73uRPqJajR+rqyu3bi5Xtm4Zd2+c/fe/c3qg/0kSmNP9LxIRvGh6yRC+qHoKV9JcTiJhRO4Uhy4Z239/uCriBM/Cr+o6UQcB849Ee+5yio+gMlztUsvThpnWzWGvUGD3tRaOZCjfKxG1XntCAhSRykFJCgkBVmSQwmuI2pSgybQHdMuhiSz+8FXZAF2xQoAYQD7RmeY6yOcm2ItfaZsLWHKBJ3DEubtnPMEPKItdms49sFbFmMGfvWHKeY3dxnAK2iU2j/ZXeJ/F87nZMCw9eciw+eE9boLD0joxFmibUCf/2cAikgDWEVQ/Kgk9BmGh0jxpzVWd+ynV2GCcgaU4WfyXte8A+HhIwGMwZ+7imtEemGRcbMgZzgbDlGskuMqXFdMVLvrdK/VXRPn8hQTXJF7KoV/g8MlAl8ful/oxM3e4ThF31vIKdI0KmHizEonhu1vq4gK+X3KTJbH7xTidwz036J3Sj0VUXpXBOgmvfvOl1i0C7HbBnYxtkXb9sPrdumskOtn3nEe7y75R9cl1zbzqz5y70heaVYcvD9x/lfO+uP93ONjcrE+b9OAG925jv2pf9QY/SZynfhlTd0Ep86b/R4OT9jFoX95/Vmq97qvqjtfMjPnw16RE/pGc6YV7RDH2mXemAR0jf6Tj/Wf1WsSrWylUFXV3KbLTJG5fFvcjVG7Q=</latexit>Intro ML (UofT) CSC311-Lec8 42 / 44
u1 u1 u2 u2
u1
(UofT) CSC311-Lec8 43 / 44
u3
<latexit sha1_base64="BJtgPVIz0UB+7ENyLWvfoj6wESs=">AHYnic3VNb9NAEJ20BTz1VBucDBElThFCUF8SAgVcoFL1BTSVmqynY2qdW1Hdlr1Cjqv+AK/4s7P4S3Yzf1khg4dy17Z8dvZt6MZ73uRPqJajR+rqyu3bi5Xtm4Zd2+c/fe/c3qg/0kSmNP9LxIRvGh6yRC+qHoKV9JcTiJhRO4Uhy4Z239/uCriBM/Cr+o6UQcB849Ee+5yio+gMlztUsvThpnWzWGvUGD3tRaOZCjfKxG1XntCAhSRykFJCgkBVmSQwmuI2pSgybQHdMuhiSz+8FXZAF2xQoAYQD7RmeY6yOcm2ItfaZsLWHKBJ3DEubtnPMEPKItdms49sFbFmMGfvWHKeY3dxnAK2iU2j/ZXeJ/F87nZMCw9eciw+eE9boLD0joxFmibUCf/2cAikgDWEVQ/Kgk9BmGh0jxpzVWd+ynV2GCcgaU4WfyXte8A+HhIwGMwZ+7imtEemGRcbMgZzgbDlGskuMqXFdMVLvrdK/VXRPn8hQTXJF7KoV/g8MlAl8ful/oxM3e4ThF31vIKdI0KmHizEonhu1vq4gK+X3KTJbH7xTidwz036J3Sj0VUXpXBOgmvfvOl1i0C7HbBnYxtkXb9sPrdumskOtn3nEe7y75R9cl1zbzqz5y70heaVYcvD9x/lfO+uP93ONjcrE+b9OAG925jv2pf9QY/SZynfhlTd0Ep86b/R4OT9jFoX95/Vmq97qvqjtfMjPnw16RE/pGc6YV7RDH2mXemAR0jf6Tj/Wf1WsSrWylUFXV3KbLTJG5fFvcjVG7Q=</latexit>u3
<latexit sha1_base64="BJtgPVIz0UB+7ENyLWvfoj6wESs=">AHYnic3VNb9NAEJ20BTz1VBucDBElThFCUF8SAgVcoFL1BTSVmqynY2qdW1Hdlr1Cjqv+AK/4s7P4S3Yzf1khg4dy17Z8dvZt6MZ73uRPqJajR+rqyu3bi5Xtm4Zd2+c/fe/c3qg/0kSmNP9LxIRvGh6yRC+qHoKV9JcTiJhRO4Uhy4Z239/uCriBM/Cr+o6UQcB849Ee+5yio+gMlztUsvThpnWzWGvUGD3tRaOZCjfKxG1XntCAhSRykFJCgkBVmSQwmuI2pSgybQHdMuhiSz+8FXZAF2xQoAYQD7RmeY6yOcm2ItfaZsLWHKBJ3DEubtnPMEPKItdms49sFbFmMGfvWHKeY3dxnAK2iU2j/ZXeJ/F87nZMCw9eciw+eE9boLD0joxFmibUCf/2cAikgDWEVQ/Kgk9BmGh0jxpzVWd+ynV2GCcgaU4WfyXte8A+HhIwGMwZ+7imtEemGRcbMgZzgbDlGskuMqXFdMVLvrdK/VXRPn8hQTXJF7KoV/g8MlAl8ful/oxM3e4ThF31vIKdI0KmHizEonhu1vq4gK+X3KTJbH7xTidwz036J3Sj0VUXpXBOgmvfvOl1i0C7HbBnYxtkXb9sPrdumskOtn3nEe7y75R9cl1zbzqz5y70heaVYcvD9x/lfO+uP93ONjcrE+b9OAG925jv2pf9QY/SZynfhlTd0Ep86b/R4OT9jFoX95/Vmq97qvqjtfMjPnw16RE/pGc6YV7RDH2mXemAR0jf6Tj/Wf1WsSrWylUFXV3KbLTJG5fFvcjVG7Q=</latexit>Intro ML (UofT) CSC311-Lec8 44 / 44