CMSC 678 Introduction to Machine Learning Spring 2019 - PowerPoint PPT Presentation

CMSC 678 Introduction to Machine Learning Spring 2019 https://www.csee.umbc.edu/courses/graduate/678/spring19/ Some slides adapted from Hamed Pirsiavash

Outline Welcome! Administrivia Basics of Learning Examples of Machine Learning

Frank Ferraro Natural language processing: Semantics ITE 358 ferraro@umbc.edu Vision & language processing Monday: 3:45-4:30 Tuesday: 11-11:30 Generative & neural modeling by appointment Learning with low-to-no supervision

TA: Caroline Kery Multilingual language learning Semantic parsing Location TBA ckery1@umbc.edu Active learning TBD Data visualization Analysis of educational data

https://cdn.arstechnica.net/wp-content/uploads/2015/11/Screen-Shot-2015-11-02-at-9.11.40-PM-640x543.png

http://www.adweek.com/wp-content/uploads/sites/2/2016/02/NewsFeedTeaser640.jpg

http://graphics.wsj.com/blue-feed-red-feed/

Course Goals Be introduced to some of the core problems and solutions of ML (big picture)

Course Goals Be introduced to some of the core problems and solutions of ML (big picture) This is not a survey course. We will go deep into the topics.

Course Goals Be introduced to some of the core problems and solutions of ML (big picture) Learn different ways that success and progress can be measured in ML

keras torch

Course Goals Be introduced to some of the core problems and solutions of ML (big picture) Learn different ways that success and progress can be measured in ML Relate to statistics, AI [671], and specialized areas (e.g., NLP [673] and CV [691]) Implement ML programs

Course Goals Be introduced to some of the core problems and solutions of ML (big picture) Learn different ways that success and progress can be measured in ML Relate to statistics, AI [671], and specialized areas (e.g., NLP [673] and CV [691]) Implement ML programs Assignments will require your own implementation.

Course Goals Be introduced to some of the core problems and solutions of ML (big picture) Learn different ways that success and progress can be measured in ML Relate to statistics, AI [671], and specialized areas (e.g., NLP [673] and CV [691]) Implement ML programs Read and analyze research papers Practice your (written) communication skills

Grading Component 678 Assignments 40% Course Project 40% Two Exams 20%

Grading Component 678 Assignments 40% Course Project 40% Two Exams 20% Each component is max(micro-average, macro-average)

Grading Component 678 Assignments 40% Course Project 40% Two Exams 20% max(micro-average, macro-average) 65/90 95/100 95/110 100/110

Grading Component 678 Assignments 40% Course Project 40% Two Exams 20% max(micro-average, macro-average) 65/90 65 + 95 + 95 + 100 microaverage = 90 + 100 + 110 + 110 ≈ 86.59% 95/100 95/110 100/110

Grading Component 678 Assignments 40% Course Project 40% Two Exams 20% max(micro-average, macro-average) 65/90 65 + 95 + 95 + 100 microaverage = 90 + 100 + 110 + 110 ≈ 86.59% 95/100 95/110 macroaverage = 1 65 90 + 95 100 + 95 110 + 100 ≈ 86.12% 100/110 4 110

Final Grades If you get ≥ You get at least a/an 90 A- 80 B- 70 C- 65 D 0 F

https://www.csee.umbc.edu/courses/graduate/678/spring19/

Online Discussions https://piazza.com/umbc/spring2019/cmsc678

Submitting Your Work https://www.csee.umbc.edu/courses/graduate/678/spring19/submit

Running the Assignments A "standard" x86-64 Linux machine, like gl A passable amount of memory (2GB-4GB) Modern but not necessarily cutting edge software Don’t assume a GPU (if you want to write CUDA yourself, talk to me) If in doubt, ask first

Running the Project An x86-64 Linux machine Memory and hardware constraints lifted (somewhat) If in doubt, ask first

Programming Languages for Assignments Use the tools you feel comfortable with Python+numpy, C, C++, Java, Matlab , …: OK (straight Python may not cut it) Libraries: Generally OK, as long as you don’t use their implementation of what you need to implement Math accelerators (blas, numpy, etc.): OK If in doubt, ask first

Programming Languages for the Project Use the tools you feel comfortable with Python+numpy, C, C++, Java, Matlab , …: OK (straight Python may not cut it) Libraries: Use what you want Math accelerators (blas, numpy, etc.): OK

Important Dates Date Due Friday, 2/8 Assignment 1 Wednesday, 3/6 Project Proposal Wednesday, 3/13 Exam 1 (In-class) Wednesday, 4/17 Project Update Friday, 5/17 Exam 2 (Final exam block) Wednesday, 5/22 Course Project Future assignment dates will be announced on Piazza, the website, and in class. All items due 11:59 AM UMBC time (unless specified otherwise)

Late Policy Everyone has a budget of 10 late days

Late Policy Everyone has a budget of 10 late days If you have them left: assignments turned in after the deadline will be graded and recorded, no questions asked

Late Policy Everyone has a budget of 10 late days If you have them left: assignments turned in after the deadline will be graded and recorded, no questions asked If you don’t have any left: still turn assignments in. They could count in your favor in borderline cases

Late Policy Everyone has a budget of 10 late days Use them as needed throughout the course They’re meant for personal reasons and emergencies Do not procrastinate

Late Policy Everyone has a budget of 10 late days Contact me privately if an extended absence will occur You must know how many you’ve used

Main Resource: CIML “A C ourse i n M achine L earning”, v0.99 Hal Daumé III http://ciml.info/ Full book: http://ciml.info/dl/v0_99/ ciml-v0_99-all.pdf Official

Optional Advanced Resource: ESL “ E lements of S tatistical L earning” Hastie, Tibshirani, Friedman https://web.stanford.edu/~hastie /ElemStatLearn/ Full book: https://web.stanford.edu/~hastie /ElemStatLearn/printings/ESLII_p rint12.pdf Unofficial: Recommended

Optional Advanced Resource: ITILA “ I nformation T heory, I nference and L earning A lgorithms” MacKay http://www.inference.org.u k/mackay/itprnn/ps/ Full book: http://www.inference.phy.c am.ac.uk/itprnn/book.pdf Unofficial: Recommended

Optional Advanced Resource: UML “ U nderstanding M achine L earning: From Theory to Algorithms” Shalev-Shwartz, Ben-David http://www.cs.huji.ac.il/~shais/Un derstandingMachineLearning/ Full book: http://www.cs.huji.ac.il/~shais/Un derstandingMachineLearning/und erstanding-machine-learning- theory-algorithms.pdf Unofficial: Recommended

Resources #5… ∞ Peer-reviewed articles (journals, conferences & workshops) ICML

Who should take this course? Is this the right course for you? good math and programming background? diligent and determined? willing to implement & write up your results? Unsure? Let’s talk after class (thank you to everyone who filled out the survey! :) ) https://goo.gl/forms/yqVH8QnwzggpRQJr1

Why do we care about math?! Calculus and linear algebra Techniques for finding maxima/minima of functions Convenient language for high dimensional data analysis Probability The study of the outcomes of repeated experiments The study of the plausibility of some event Statistics The analysis and interpretation of data

Course Announcement 1: Assignment 1 Due Friday, 2/8 (~11 days) Math & programming review Discuss with others, but write, implement and complete on your own

What does it mean to learn? Chris has just begun taking a machine learning course Pat, the instructor has to ascertain if Chris has “ learned ” the topics covered, at the end of the course What is a “reasonable” exam? (Bad) Choice 1: History of pottery Chris ’ s performance is not indicative of what was learned in ML (Bad) Choice 2: Questions answered during lectures Open book? A good test should test ability to answer “ related ” but “ new ” questions on the exam Generalization

Model, parameters and hyperparameters Model: mathematical formulation of system (e.g., classifier) Parameters: primary “knobs” of the model that are set by a learning algorithm Hyperparameter: secondary “knobs” http://www.uiparade.com/wp-content/uploads/2012/01/ui-design-pure-css.jpg

score( )

score θ ( ) scoring model F( θ ) objective

score θ ( ) scoring model F( θ ) objective (implicitly) dependent on the observed data X=

Machine Learning Framework: Learning instance 1 instance 2 Machine Learning Predictor instance 3 instance 4

CMSC 678 Introduction to Machine Learning Spring 2019 - PowerPoint PPT Presentation

CMSC 678 Introduction to Machine Learning Spring 2019 https://www.csee.umbc.edu/courses/graduate/678/spring19/ Some slides adapted from Hamed Pirsiavash Outline Welcome! Administrivia Basics of Learning Examples of Machine Learning Frank

CMSC 678 Introduction to Machine Learning Spring 2018

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

Honors Combinatorics CMSC-27410 = Math-28410 CMSC-37200 Instructor: Laszlo Babai University

CMSC 471 CMSC 471 Fall 2015 Fall 2015 Class #14 Class #14 Tuesday, October 13, 2015 Tuesday,

Overview of Decision Trees, Ensemble Methods and Reinforcement Learning CMSC 678 UMBC Outline

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

and Evaluation CMSC 678 UMBC Central Question: How Well Are We Doing? Precision, Recall,

ACCT 420: Machine Learning and AI Session 11 Dr. Richard M. Crowley 1 Front matter 2 . 1

SOFTWARE ARCHITECTURE SOFTWARE ARCHITECTURE OF AI-ENABLED SYSTEMS OF AI-ENABLED SYSTEMS

Applied Machine Learning Applied Machine Learning Convolutional Neural Networks Siamak

Topological essen-als Get the basics right and the rest

Information Structure Prediction for Visual-World Referring Expressions Micha Elsner Hannah

Mixing R with other languages JOHN D. COOK, PHD SINGULAR VALUE CONSULTING Why R? Libraries,

Woodlands odlands Ri Ring g Prima imary y School hool Primary 4 Parents Briefing 30 30

Interactive Media and Game Development 2-D Tiles and Sprites Outline Tiles Sprites