SLIDE 1

Introduction to machine learning

COMS 4721

SLIDE 2

Learning from data

  • Machine learning: the study of computational mechanisms that “learn” from data in order to make predictions and decisions.

SLIDE 3

Example 1: image classification

  • A birdwatcher takes pictures of birds and sorts the photos by species.
  • Goal: automatically recognize bird species in new photos.

Indigo bunting

SLIDE 4

Example 2: matchmaking

  • An online matchmaking service introduces thousands of pairs of students to each other, and receives feedback about whether the pair actually goes on a date or not.
  • Goal: predict how likely any pair of students will go on a date if introduced to each other.

[Table: feedback matrix with rows and columns Alice, Bob, Charlie, Daisy; a 1 marks a pair that went on a date]

SLIDE 5

Example 3: machine translation

  • Linguists provide translations of all English language books into French, sentence-by-sentence.
  • Goal: automatically translate any English sentence into French.
SLIDE 6

Example 4: personalized medicine

  • A physician attends to patients at a hospital and prescribes treatments on the basis of the patients’ symptoms, medical histories, genetic profiles, etc. The health outcome (e.g., recovery, death) for each patient is observed a day or so after the treatment.
  • Goal: prescribe a personalized treatment for any patient that delivers the best possible health outcome for that patient.

SLIDE 7

Basic setting

  • Data: labeled examples

(y₁, z₁), (y₂, z₂), …, (yₙ, zₙ) ∈ 𝒴 × 𝒵

  • Goal: “learn” a function

𝑔 : 𝒴 → 𝒝

from the data that is ultimately used for prediction/decision-making.

  • yⱼ ∈ 𝒴: representation of the j-th object (𝒴 = input/feature space)
  • zⱼ ∈ 𝒵: label pertinent to the j-th object (𝒵 = output/label space)
  • 𝒝: action space (usually 𝒵 = 𝒝 for prediction problems)
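To make the notation concrete, here is a minimal sketch in Python. The toy data, the feature encoding, and the simple rule inside 𝑔 are illustrative assumptions, not anything prescribed by the slides.

    # Each labeled example is a pair (y, z): y is an input (feature vector),
    # z is its label. The data below is made up purely for illustration.
    examples = [
        ((0.9, 0.1), 1),
        ((0.2, 0.8), 0),
        ((0.7, 0.3), 1),
    ]

    # A function g maps inputs to actions; for prediction problems the action
    # space equals the label space, so g returns a label.
    def g(y):
        return 1 if y[0] > 0.5 else 0

    predictions = [g(y) for y, _ in examples]
    print(predictions)   # [1, 0, 1]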
SLIDE 8

Prediction problems

  • Goal: learn a prediction function (predictor) that provides “correct” labels to inputs that may be encountered in the future (i.e., new unlabeled examples).

[Diagram] Collection of labeled examples → Learning algorithm → Learned predictor; New unlabeled example → Learned predictor → Prediction

Why should this be possible?
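To make the diagram’s pipeline concrete, here is a minimal sketch. The 1-nearest-neighbor rule (one of the non-parametric methods listed later in the course outline) is just one illustrative choice of learning algorithm, and the toy data is an assumption for illustration only.

    import math

    # Pipeline: labeled examples -> learning algorithm -> learned predictor;
    #           new unlabeled example -> learned predictor -> prediction.
    labeled_examples = [
        ((1.0, 0.9), "indigo bunting"),
        ((0.1, 0.2), "other"),
        ((0.8, 0.7), "indigo bunting"),
    ]

    def learning_algorithm(data):
        """Return a learned predictor; here it memorizes the data and predicts
        the label of the nearest stored example."""
        def predictor(y_new):
            nearest_y, nearest_z = min(data, key=lambda yz: math.dist(yz[0], y_new))
            return nearest_z
        return predictor

    predictor = learning_algorithm(labeled_examples)
    print(predictor((0.9, 0.8)))   # prediction for a new unlabeled example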

SLIDE 9

Some basic issues

  • 1. How should we represent the input objects?
  • 2. What types of prediction functions should we consider?
  • 3. How should data be used to select a predictor?
  • 4. How can we evaluate whether learning was successful?
SLIDE 10

Special case: binary classification

𝒵 = {0, 1} (e.g., is it an indigo bunting or not?)

Why is this hard?

  • 1. Only have labels for y₁, …, yₙ, which together comprise a minuscule fraction of the input space 𝒴.

  • 2. Relationship between an input 𝑦 ∈ 𝒴 and its correct label 𝑧 ∈ 𝒵 may be complicated, possibly ambiguous/non-deterministic!

  • 3. Can be many functions that perfectly match the input/output relationship on (y₁, z₁), …, (yₙ, zₙ). How should we pick one among these?
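Issue 3 can be illustrated with a tiny example. The setup below (scalar inputs, two hand-written predictors) is an assumption made only to show the point: two different functions can agree on every training example yet disagree on new inputs.

    # Toy training set of (y, z) pairs with scalar inputs.
    training = [(0.1, 0), (0.2, 0), (0.8, 1), (0.9, 1)]

    def g1(y):
        """Threshold rule: label 1 iff y > 0.5."""
        return 1 if y > 0.5 else 0

    def g2(y):
        """Memorization rule: recall the label if y was seen, else predict 0."""
        lookup = dict(training)
        return lookup.get(y, 0)

    # Both predictors fit the training data perfectly...
    assert all(g1(y) == z and g2(y) == z for y, z in training)
    # ...but disagree on a new input.
    print(g1(0.7), g2(0.7))   # 1 vs. 0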

SLIDE 11

Topics for this course (tentative)

  • 1. Non-parametric models (e.g., nearest neighbor, decision trees)
  • 2. Parametric models (e.g., generative models, linear models)
  • 3. Ensemble methods (e.g., boosting, hedging)
  • 4. Regression (e.g., least squares, Lasso)
  • 5. Representation learning (e.g., mixture models, PCA, auto-encoders)
  • 6. Other topics as time permits (e.g., sequence models, partial feedback)
SLIDE 12

A small sample of other topics in ML…

  • Advanced issues:
      • Distributed learning
      • Incomplete data
      • Causal inference
      • Privacy and fairness
  • Other models of learning:
      • Semi-supervised learning
      • Active learning
      • Online learning
      • Reinforcement learning
  • Application areas:
      • Natural language processing
      • Speech recognition
      • Computer vision
      • Computational advertising
  • Modes of study:
      • Mathematical analysis
      • Cross-domain evaluations
      • End-to-end application study
SLIDE 13

This course

  • Mathematical prerequisites:
      • Multivariable calculus
      • Linear algebra
      • Probability (and some basic statistics would be helpful)
      • Basic data structures and algorithms
  • Computational prerequisites:
      • You should have regular access to, and be able to program in, MATLAB, Python, or R.
  • Course requirements:
      • Around four homework assignments (theoretical & empirical exercises): 24%
      • Two in-class exams (March 3, April 28): 25% each
      • Practical modeling project: 26%
      • No late assignments accepted, no make-up exams
SLIDE 14

Resources

  • Course website: http://www.cs.columbia.edu/~djhsu/coms4721-s16
  • Course staff:
      • Instructor: Daniel Hsu
      • Instructional assistants: Edward Li, Siddharth Varshney, Robert Ying
      • Office hours, contact information, online forum: see course website
  • Course materials:
      • Course policies: posted on the course website
      • Lecture slides, notes, etc.: posted on the course website
      • Readings: “A Course in Machine Learning” and “The Elements of Statistical Learning” (both available free online), as well as other materials posted on the course website