Final Exam Review Readings: Matt Gormley Murphy (all chapters) - PowerPoint PPT Presentation

10-‑601 ¡Introduction ¡to ¡Machine ¡Learning Machine ¡Learning ¡Department School ¡of ¡Computer ¡Science Carnegie ¡Mellon ¡University Final ¡Exam ¡Review Readings: Matt ¡Gormley Murphy ¡(all ¡chapters) Bishop ¡(all ¡chapters) Lecture ¡29 HTF ¡(all ¡chapters) May ¡3, ¡2016 Mitchell ¡(all ¡chapters) 1

Reminders • Homework 9: ¡Applications of ¡ML – Release: ¡Mon, ¡Apr. ¡24 – Due: ¡Wed, ¡May 3 ¡at ¡11:59pm • Final ¡Exam (Evening Exam) – Mon, ¡May 08 ¡at ¡5:30pm ¡– 8:30pm – See Piazza ¡for details about location 2

Outline 1. Exam ¡Logistics 2. Sample ¡Questions 3. Overview 3

EXAM ¡LOGISTICS 4

Final ¡Exam • Time ¡/ ¡Location – Time: ¡ Evening ¡Exam Mon, ¡May ¡8 ¡at ¡5:30pm ¡– 8:30pm – Room : ¡We ¡will ¡contact ¡each ¡student ¡individually ¡with your ¡room ¡ assignment . ¡The ¡rooms ¡are ¡ not based ¡on ¡section. ¡ – Seats: ¡ There ¡will ¡be ¡ assigned ¡seats . ¡Please ¡arrive ¡early. ¡ – Please ¡watch ¡Piazza ¡carefully ¡for ¡announcements regarding ¡room ¡/ ¡seat ¡ assignments. • Logistics – 8-‑9 ¡Sections – Format ¡of ¡questions: • Multiple ¡choice • True ¡/ ¡False ¡(with ¡justification) • Derivations • Short ¡answers • Interpreting ¡figures – No ¡electronic ¡devices – You ¡are ¡allowed ¡to ¡ bring one ¡8½ ¡x ¡11 ¡sheet ¡of ¡notes ¡(front ¡and ¡back) 5

Final ¡Exam • How ¡to ¡Prepare – Attend ¡(or ¡watch) ¡this ¡final ¡exam ¡review ¡session – Review ¡prior ¡year’s ¡exams ¡and ¡solutions • We ¡will ¡post ¡them ¡shortly • Disclaimer: ¡This ¡year’s ¡10-‑601 ¡is ¡not ¡the ¡same ¡as ¡prior ¡ offerings – Review ¡this ¡year’s ¡homework ¡problems – Attend ¡the ¡ Mock ¡Final ¡Exam • Thu, ¡May ¡4, ¡6:30pm ¡ • Section ¡A ¡should ¡go ¡to ¡PH100 • Section ¡B ¡and ¡C ¡should ¡go ¡to ¡DH2210 • Disclaimer: ¡The ¡Mock ¡will ¡be ¡much ¡shorter ¡and ¡not ¡ exhaustive, ¡but ¡great ¡practice! 6

Final ¡Exam • How ¡to ¡Prepare – Attend ¡the ¡final ¡recitation ¡session: ¡ Tue, ¡Dec. ¡6 th at ¡5:30pm ¡ – Review ¡prior ¡year’s ¡exams ¡and ¡solutions (we ¡will ¡post ¡them) – Review ¡this ¡year’s ¡homework ¡problems – Flip ¡through ¡the ¡“What ¡you ¡should ¡know” ¡points ¡ (see ¡‘More’ ¡links ¡on ¡‘Schedule’ ¡page ¡of ¡course ¡ website) 7

Final ¡Exam • Advice ¡(for ¡during ¡the ¡exam) – Solve ¡the ¡easy ¡problems ¡first ¡ (e.g. ¡multiple ¡choice ¡before ¡derivations) • if ¡a ¡problem ¡seems ¡extremely ¡complicated ¡you’re ¡likely ¡ missing ¡something – Don’t ¡leave ¡any ¡answer ¡blank! – If ¡you ¡make ¡an ¡assumption, ¡write ¡it ¡down – If ¡you ¡look ¡at ¡a ¡question ¡and ¡don’t ¡know ¡the ¡ answer: • we ¡probably ¡haven’t ¡told ¡you ¡the ¡answer • but ¡we’ve ¡told ¡you ¡enough ¡to ¡work ¡it ¡out • imagine ¡arguing ¡for ¡some ¡answer ¡and ¡see ¡if ¡you ¡like ¡it 8

Final ¡Exam • Exam ¡Contents – 10-‑20% ¡of ¡material ¡comes ¡from ¡topics ¡covered ¡ before ¡ the ¡midterm ¡exam – 80-‑90% ¡of ¡material ¡comes ¡from ¡topics ¡covered ¡ after ¡ the ¡midterm ¡exam 9

Topics ¡covered ¡ before ¡ Midterm • Foundations • Regression – Probability – Linear ¡Regression – MLE, ¡MAP • Important ¡Concepts – Optimization – Kernels • Classifiers – Regularization ¡and ¡ Overfitting – KNN – Experimental ¡Design – Naïve ¡Bayes – Logistic ¡Regression – Perceptron – SVM 10

Topics ¡covered ¡ after ¡ Midterm • Unsupervised ¡Learning • Graphical ¡Models – K-‑means ¡/ ¡Lloyd’s ¡method – Bayesian ¡Networks – PCA – HMMs – EM ¡/ ¡GMMs – Learning ¡and ¡Inference • Neural ¡Networks • Learning ¡Theory – Feedforward ¡Neural ¡Nets – Statistical ¡Estimation ¡ (covered ¡right ¡before ¡ – Basic ¡architectures midterm) – Backpropagation – PAC ¡Learning – CNNs • Other ¡Learning ¡ Paradigms – Matrix ¡Factorization – Reinforcement ¡Learning – Information ¡Theory 11

SAMPLE ¡QUESTIONS 12

Samples ¡Questions 2 K-Means Clustering (a) [3 pts] We are given n data points, x 1 , ..., x n and asked to cluster them using K-means. If we choose the value for k to optimize the objective function how many clusters will be used (i.e. what value of k will we choose)? No justification required. (i) 1 (ii) 2 (iii) n (iv) log( n ) 13

Samples ¡Questions 2.2 Lloyd’s algorithm 3.5 3 2.5 Circle the image which depicts the cluster center positions after 1 2 iteration of Lloyd’s algorithm. 1.5 1 0.5 0 − 0.5 − 1 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 Figure 2: Initial data and cluster centers 14

Samples ¡Questions 2.2 Lloyd’s algorithm 3.5 3.5 3 3 2.5 2.5 2 2 1.5 1.5 Circle the image which depicts 1 1 the cluster center positions after 1 0.5 0.5 0 0 iteration of Lloyd’s algorithm. − 0.5 − 0.5 − 1 − 1 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 3.5 3.5 3.5 3 3 3 2.5 2.5 2.5 2 2 2 1.5 1.5 1.5 1 1 1 0.5 0.5 0.5 0 − 0.5 0 0 − 1 − 0.5 − 0.5 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 − 1 − 1 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 − 1 − 0.5 0 0.5 1 1.5 2 2.5 3 Figure 2: Initial data and cluster centers 15

Sample ¡Questions Question 4: Expectation Maximization Given a set of observed variables X , a set of latent variables Z , and a set of model parameters with the current estimate being θ , a single iteration of the EM algorithm updates the parameters estimate θ as follows: Q ( θ 0 | θ ) ≡ E P ( Z | X, θ ) [log P ( X, Z | θ 0 )] θ ← arg max θ 0 where log P ( X, Z | θ 0 ) = log Q n i =1 P ( X i , Z i | θ 0 ) is known as the complete log likelihood of the data. (a) [2 pts] True or False: In the case of fully observed data, i.e. when Z is an empty set, the EM algorithm reduces to a maximum likelihood estimate. (b) [2 pts] True or False: Since the EM algorithm guarantees that the value of its objective function will increase on each iteration, it is guaranteed to eventually reach a global maximum. 18

Sample ¡Questions 4 Principal Component Analysis [16 pts.] (a) In the following plots, a train set of data points X belonging to two classes on R 2 are given, where the original features are the coordinates ( x, y ). For each, answer the following questions: (i) [3 pt.] Draw all the principal components. (ii) [6 pts.] Can we correctly classify this dataset by using a threshold function after projecting onto one of the principal components? If so, which principal component should we project onto? If not, explain in 1–2 sentences why it is not possible. Dataset 2: Dataset 1: 19

Sample ¡Questions 4 Principal Component Analysis [ (i) T or F The goal of PCA is to interpret the underlying structure of the data in terms of the principal components that are best at predicting the output variable. (ii) T or F The output of PCA is a new representation of the data that is always of lower dimensionality than the original feature representation. (iii) T or F Subsequent principal components are always orthogonal to each other. 21

Sample ¡Questions Neural Networks Can the neural network in Figure (b) correctly classify the dataset given in Figure (a)? 5 y S 2 4 w 31 w 32 3 S 1 S 3 x2 h 1 h 2 2 w 12 w 11 w 21 w 22 1 x 1 x 2 0 0 1 2 3 4 5 x1 (b) The neural network architecture (a) The dataset with groups S 1 , S 2 , and S 3 . 22

Sample ¡Questions Neural Networks y w 31 w 32 Apply the backpropagation algorithm to obtain the partial derivative of the mean-squared error h 1 h 2 of y with the true value y * with respect to the weight w 22 assuming a sigmoid nonlinear w 12 w 11 w 21 w 22 activation function for the hidden layer. x 1 x 2 (b) The neural network architecture 23

Sample ¡Questions (a) [2 pts.] Write the expression for the joint distribution. 5 Graphical Models [16 pts.] We use the following Bayesian network to model the relationship between studying (S), being well-rested (R), doing well on the exam (E), and getting an A grade (A). All nodes are binary, i.e., R, S, E, A ∈ { 0 , 1 } . S R E A Figure 5: Directed graphical model for problem 5. 24

Sample ¡Questions (b) [2 pts.] How many parameters, i.e., entries in the CPT tables, are necessary to describe the joint distribution? 5 Graphical Models [16 pts.] We use the following Bayesian network to model the relationship between studying (S), being well-rested (R), doing well on the exam (E), and getting an A grade (A). All nodes are binary, i.e., R, S, E, A ∈ { 0 , 1 } . S R E A Figure 5: Directed graphical model for problem 5. 25

Sample ¡Questions (d) [2 pts.] Is S marginally independent of R ? Is S conditionally independent of R given E ? Answer yes or no to each questions and provide a brief explanation why. 5 Graphical Models [16 pts.] We use the following Bayesian network to model the relationship between studying (S), being well-rested (R), doing well on the exam (E), and getting an A grade (A). All nodes are binary, i.e., R, S, E, A ∈ { 0 , 1 } . S R E A Figure 5: Directed graphical model for problem 5. 26

Final Exam Review Readings: Matt Gormley Murphy (all chapters) - PowerPoint PPT Presentation

10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Final Exam Review Readings: Matt Gormley Murphy (all chapters) Bishop (all

Math 211 Math 211 Review for the Final Exam December 8, 2002 2 The Final Exam The Final Exam

Final Review Drawing on the Web Final exam on Thursday, May 14 at 2:00 p.m. (EST) Final Review

ICS 101 Final Exam Review Fall 2016 Final Exam information In lab: check final exam schedule

Final Review Introduction to Web Design Final exam on Thursday, December 19 at 12:00 p.m. Final

Final exam effects Textures I Final exam effects Final exam effects Lighting Grads

Announcements Announcements Final Exam will be a take Final Exam will be a take- -home exam

The final exam Other finals review Final Exam Review CSH Review November 17 th

Did I happen to mention? Final exam Final Exam Review The date for the Final has been

Exam4 Information and Guidance General Topics General Exam Information Exam types

Quicksort Sorting Lower Bound Exam Exam Exam Exam 2 2 tomorrow evening 2 2 tomorrow

Review Final exam Final exam will be 11-12 problems, drop any 2 Cumulative up to and including

Final Exam Details The final exam will be posted on Blackboard by 7am on April 26th It will be

Final exam on Thursday, May 16 Drawing on the Web Final CSCI-UA 380 Review Multiple choice

FINAL EXAM REVIEW PACKET ANSWERS All answers can be found on my website! Final Exam Review 1.

Exam Review 2 Exam Overview Final Exam Friday,

Examination Lydia Love DVM DACVAA 2018 Exam Committee Chair September 2018 Exam Format

Midterm Exam Review Matt Gormley Lecture 14 March 6, 2017 1 Reminders Midterm

Making Multiple Choice Tests More Effective Linda Suskie Assessment & Accreditation

Underwater communications using acoustic parametric arrays M. C AMPO *, M. A RDID , D. D. I T

EE359 Lecture 2 Outline TX and RX Signal Models Path Loss Models Free-space and

Pythagorean Theorem, Distance, and Midpoint Multiple Choice Questions Slide 2 / 31 1 What is the

Successfully Using the Tests & Quizzes Tool in Courses .Pepperdine.edu No ve mb e r 2, 2011,

Session 15 Session 15 Tool Time Tuesday Tool Time Tuesday Office 365 Forms: Surveys, Quizzes,

CSE140L: Components and Design Techniques for Digital Systems Lab Final Review Pietro Mercati

Final Exam Review Readings: Matt Gormley Murphy (all chapters) - PowerPoint PPT Presentation

10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Final Exam Review Readings: Matt Gormley Murphy (all chapters) Bishop (all

Math 211 Math 211 Review for the Final Exam December 8, 2002 2 The Final Exam The Final Exam

Final Review Drawing on the Web Final exam on Thursday, May 14 at 2:00 p.m. (EST) Final Review

ICS 101 Final Exam Review Fall 2016 Final Exam information In lab: check final exam schedule

Final Review Introduction to Web Design Final exam on Thursday, December 19 at 12:00 p.m. Final

Final exam effects Textures I Final exam effects Final exam effects Lighting Grads

Announcements Announcements Final Exam will be a take Final Exam will be a take- -home exam

The final exam Other finals review Final Exam Review CSH Review November 17 th

Did I happen to mention? Final exam Final Exam Review The date for the Final has been

Exam4 Information and Guidance General Topics General Exam Information Exam types

Quicksort Sorting Lower Bound Exam Exam Exam Exam 2 2 tomorrow evening 2 2 tomorrow

Review Final exam Final exam will be 11-12 problems, drop any 2 Cumulative up to and including

Final Exam Details The final exam will be posted on Blackboard by 7am on April 26th It will be

Final exam on Thursday, May 16 Drawing on the Web Final CSCI-UA 380 Review Multiple choice

FINAL EXAM REVIEW PACKET ANSWERS All answers can be found on my website! Final Exam Review 1.

Exam Review 2 Exam Overview Final Exam Friday,

Examination Lydia Love DVM DACVAA 2018 Exam Committee Chair September 2018 Exam Format

Midterm Exam Review Matt Gormley Lecture 14 March 6, 2017 1 Reminders Midterm

Making Multiple Choice Tests More Effective Linda Suskie Assessment &amp; Accreditation

Underwater communications using acoustic parametric arrays M. C AMPO *, M. A RDID , D. D. I T

EE359 Lecture 2 Outline TX and RX Signal Models Path Loss Models Free-space and

Pythagorean Theorem, Distance, and Midpoint Multiple Choice Questions Slide 2 / 31 1 What is the

Successfully Using the Tests &amp; Quizzes Tool in Courses .Pepperdine.edu No ve mb e r 2, 2011,

Session 15 Session 15 Tool Time Tuesday Tool Time Tuesday Office 365 Forms: Surveys, Quizzes,

CSE140L: Components and Design Techniques for Digital Systems Lab Final Review Pietro Mercati

Making Multiple Choice Tests More Effective Linda Suskie Assessment & Accreditation

Successfully Using the Tests & Quizzes Tool in Courses .Pepperdine.edu No ve mb e r 2, 2011,