outline
play

Outline Administrivia Introduction to Machine Learning Greg Mori - - PDF document

Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Outline Administrivia Introduction to Machine Learning Greg Mori - CMPT 419/726 Machine Learning Bishop PRML Ch. 1


  1. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Outline Administrivia Introduction to Machine Learning Greg Mori - CMPT 419/726 Machine Learning Bishop PRML Ch. 1 Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Administrivia • Recommend doing associated readings from Bishop, • We will cover techniques in the standard ML toolkit Pattern Recognition and Machine Learning (PRML) after • maximum likelihood, regularization, neural networks, each lecture stochastic gradient descent, principal components analysis • Reference books for alternate descriptions (PCA), Markov random fields (MRF), graphical models, • The Elements of Statistical Learning , Trevor Hastie, Robert belief propagation, Markov Chain Monte Carlo (MCMC), Tibshirani, and Jerome Friedman hidden Markov models (HMM), particle filters, recurrent • Information Theory, Inference, and Learning Algorithms , David MacKay (available online) neural networks (RNNs), long short-term memory (LSTM), • Deep Learning , Ian Goodfellow, Yoshua Bengio and Aaron generative adversarial networks (GANs), variational Courville (available online) auto-encoders (VAEs), ... • Online courses • There will be 3 assignments • Coursera, Udacity • Exam in class on Dec. 2

  2. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia - Assignments Administrivia - Project • Project details • Practice doing research • Assignment late policy • Ideal project – take problem from your research/interests, • 3 grace days , use at your discretion (not on project) use ML (properly) • Other projects fine too ($1 million project: • Programming assignments use Python http://netflixprize.com ) • Too late :( • Others on http://www.kaggle.com Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia - Project Administrivia - Office Hours • Project details • Blocks (sections) • Work in groups (up to 5 students) • Produce (short) research paper • Block 1: Grad students, CMPT MSc/PhD thesis, other • Graded on proper research methodology, not just results • Block 2: Grad students, CMPT Prof. MSc (last name A-L) • Block 3: Grad students, CMPT Prof. MSc (last name M-Z) • Choice of problem / algorithms • Relation to previous work • See schedule on course website • Comparative experiments • Please attend office hours for your block, priority given to • Quality of exposition • Details on course webpage corresponding students • Poster session Dec. 8, 4-7pm Downtown Vancouver • Will have separate, bookable office hours for project groups (tentative) • Report due Dec. 13 at 11:59pm

  3. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia - Background What is Machine Learning (ML)? • Calculus: E = mc 2 ⇒ ∂ E ∂ c = 2 mc • Linear algebra: ∂ ∂ x ( x T a ) = a Au i = λ i u i ; • Algorithms that automatically improve performance through experience • See PRML Appendix C • Often this means define a model by hand, and use data to • Probability: fit its parameters � � � p ( X ) = p ( X , Y ); p ( x ) = p ( x , y ) dy ; E x [ f ] = p ( x ) f ( x ) dx Y • See PRML Ch. 1.2 It will be possible to refresh, but if you’ve never seen these before this course will be very difficult. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Why ML? Hand-written Digit Recognition • The real world is complex – difficult to hand-craft solutions. • ML is the preferred framework for applications in many fields: • Computer Vision • Natural Language Processing, Speech Recognition • Robotics • . . . Belongie et al. PAMI 2002 • Difficult to hand-craft rules about digits

  4. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Hand-written Digit Recognition Face Detection x i = t i = ( 0 , 0 , 0 , 1 , 0 , 0 , 0 , 0 , 0 , 0 ) • Represent input image as a vector x i ∈ R 784 . • Suppose we have a target vector t i • This is supervised learning • Discrete, finite label set: perhaps t i ∈ { 0 , 1 } 10 , a classification problem • Given a training set { ( x 1 , t 1 ) , . . . , ( x N , t N ) } , learning problem is to construct a “good” function y ( x ) from these. Schneiderman and Kanade, IJCV 2002 • y : R 784 → R 10 • Classification problem • t i ∈ { 0 , 1 , 2 } , non-face, frontal face, profile face. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Spam Detection Caveat - Horses (source?) • Once upon a time there were two neighboring farmers, Jed and Ned. Each owned a horse, and the horses both liked to jump the fence between the two farms. Clearly the farmers needed some means to tell whose horse was whose. • So Jed and Ned got together and agreed on a scheme for discriminating between horses. Jed would cut a small notch in one ear of his horse. Not a big, painful notch, but just big enough to be seen. Well, wouldn’t you know it, the day after Jed cut the notch in horse’s ear, Ned’s horse caught on the barbed wire fence and tore his ear the exact • Classification problem same way! • t i ∈ { 0 , 1 } , non-spam, spam • Something else had to be devised, so Jed tied a big blue • x i counts of words, e.g. Viagra , stock , outperform , bow on the tail of his horse. But the next day, Jed’s horse multi-bagger jumped the fence, ran into the field where Ned’s horse was grazing, and chewed the bow right off the other horse’s tail. Ate the whole bow!

  5. Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Caveat - Horses (source?) Stock Price Prediction • Finally, Jed suggested, and Ned concurred, that they should pick a feature that was less apt to change. Height seemed like a good feature to use. But were the heights different? Well, each farmer went and measured his horse, and do you know what? The brown horse was a full inch taller than the white one! Moral of the story: ML provides theory and tools for setting • Problems in which t i is continuous are called regression parameters. Make sure you have the right model and inputs. • E.g. t i is stock price, x i contains company profit, debt, cash flow, gross sales, number of spam emails sent, . . . Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Clustering Images Types of Learning Problems • Supervised Learning • Classification • Regression • Unsupervised Learning • Reinforcement Learning • maybe just a little Wang et al., CVPR 2006 • Only x i is defined: unsupervised learning • E.g. x i describes image, find groups of similar images

  6. � � ✁ � ✁ � ✁ � ✁ ✁ ✁ � Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing An Example - Polynomial Curve Fitting Polynomial Curve Fitting • What form is y ( x ) ? 1 • Let’s try polynomials of degree M : 1 y ( x , w ) = w 0 + w 1 x + w 2 x 2 + . . . + w M x M 0 0 −1 • This is the hypothesis space. 0 1 • How do we measure success? −1 t n t • Sum of squared errors: N y ( x n , w ) E ( w ) = 1 0 1 � { y ( x n , w ) − t n } 2 2 n = 1 • Suppose we are given training set of N observations • Among functions in the class, choose x x n ( x 1 , . . . , x N ) and ( t 1 , . . . , t N ) , x i , t i ∈ R that which minimizes this error • Regression problem, estimate y ( x ) from these data Administrivia Machine Learning Curve Fitting Coin Tossing Administrivia Machine Learning Curve Fitting Coin Tossing Polynomial Curve Fitting Which Degree of Polynomial? ✂☎✄✝✆ ✂☎✄✝✆ 1 1 • Error function 0 0 N −1 −1 E ( w ) = 1 � { y ( x n , w ) − t n } 2 0 1 0 1 2 n = 1 ✂☎✄✝✆ ✂☎✄✝✆ 1 1 • Best coefficients w ∗ = arg min 0 0 w E ( w ) −1 −1 • Found using pseudo-inverse (more later) 0 1 0 1 • A model selection problem • M = 9 → E ( w ∗ ) = 0 : This is over-fitting

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend