

SLIDE 1

Announcement

  • HW 1 out TODAY – Watch your email


SLIDE 2

What is Machine Learning? (Formally)


SLIDE 3

What is Machine Learning?


Study of algorithms that

  • improve their performance (performance)
  • at some task (task)
  • with experience (experience)

[Diagram: data (experience) → Learning algorithm → improved performance at the task]

SLIDE 4

Supervised Learning Task

Task: classify a cell image X into a label Y, e.g. “Anemic cell (0)” vs. “Healthy cell (1)”

SLIDE 5

Performance Measures

  • Measure of closeness between true label Y and prediction f(X)

Performance: 0/1 loss

loss(Y, f(X)) = 1 if f(X) ≠ Y, 0 otherwise

[Example: cell image X with prediction f(X) = “Anemic cell” and true label Y = “Healthy cell”: 0/1 loss = 1]

SLIDE 6

Performance Measures

  • Measure of closeness between true label Y and prediction f(X)

Performance: square loss

loss(Y, f(X)) = (Y − f(X))²

[Example: X = past performance, trade volume etc. as of Sept 8, 2010; share price Y = “$24.50”; two candidate predictions, “$26.00” and “$26.10”, compared by square loss]

SLIDE 7

Performance Measures

  • Measure of closeness between true label Y and prediction f(X)
  • Don’t just want the label of one test data point (cell image), but of any cell image

Performance: Given a cell image drawn randomly from the collection of all cell images, how well does the predictor perform on average?

SLIDE 8

Performance Measures

Performance:

  • Classification (“Anemic cell”): 0/1 loss, whose expected value is the Probability of Error
  • Regression (share price “$24.50”): square loss, whose expected value is the Mean Square Error
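Both losses can be computed empirically. A minimal sketch in plain Python; the example labels and prices are illustrative:

```python
def zero_one_loss(y_true, y_pred):
    """Average 0/1 loss for classification: the fraction of examples
    where the prediction f(X) differs from the true label Y."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)

def mean_square_error(y_true, y_pred):
    """Average square loss for regression: mean of (Y - f(X))^2."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

# Classification: 0 = "Anemic cell", 1 = "Healthy cell"
print(zero_one_loss([0, 1, 1, 0], [0, 1, 0, 0]))  # 0.25 (1 error in 4)

# Regression: share prices in dollars
print(mean_square_error([24.50, 26.00], [24.50, 26.10]))  # ~0.005
```

Averaging these losses over random draws of X gives the Probability of Error and the Mean Square Error, respectively.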

SLIDE 9

Bayes Optimal Rule

Ideal goal: the Bayes optimal rule (for 0/1 loss, predict the most probable label given X)
Best possible performance: the Bayes Risk
BUT… the optimal rule is not computable: it depends on the unknown distribution PXY!
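If PXY were known, the Bayes optimal rule could be written down directly. A toy sketch with a made-up discrete PXY (the probabilities are illustrative, not from the slides):

```python
# Toy joint distribution P(X, Y) over inputs X in {0, 1} and labels Y in {0, 1}
# (illustrative numbers only).
P_XY = {(0, 0): 0.35, (0, 1): 0.05,   # X = 0 is usually labeled 0
        (1, 0): 0.15, (1, 1): 0.45}   # X = 1 is usually labeled 1

def bayes_rule(x):
    """Bayes optimal rule for 0/1 loss: argmax_y P(Y = y | X = x).
    Since P(Y | X) is proportional to P(X, Y), we can maximize the joint."""
    return max((0, 1), key=lambda y: P_XY[(x, y)])

# Bayes risk: probability that even the optimal rule errs.
bayes_risk = sum(p for (x, y), p in P_XY.items() if bayes_rule(x) != y)
print(bayes_risk)  # ~0.2
```

The point of the slide stands: in practice PXY is unknown, so neither the rule nor its risk can be computed this way.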

SLIDE 10

Experience - Training Data

Can’t minimize the risk since PXY is unknown! Training data (experience) provides a glimpse of PXY:

(X1,Y1), …, (Xn,Yn) drawn independent, identically distributed (i.i.d.) from the unknown PXY, but observed. Labels provided by an expert, a measuring device, some experiment, …

[Training examples: (cell image, Anemic cell), (cell image, Healthy cell), (cell image, Healthy cell), (cell image, Anemic cell)]

SLIDE 11

Supervised Learning

Task: predict the label Y of an input X (cell image)
Performance: expected loss under the (unknown) distribution PXY
Experience: Training data (X1,Y1), …, (Xn,Yn)

[Training examples: (cell image, Anemic cell), (cell image, Healthy cell), (cell image, Healthy cell), (cell image, Anemic cell)]

SLIDE 12

Machine Learning Algorithm

Training data: (cell image, Anemic cell), (cell image, Healthy cell), (cell image, Healthy cell), (cell image, Anemic cell) → Learning algorithm → predictor f

Test data: new cell image X → f(X) = “Anemic cell”

Note: test data ≠ training data

SLIDE 13

Issues in ML

  • A good machine learning algorithm

    – Does not overfit training data
    – Generalizes well to test data

[Figure: two Weight vs. Height scatter plots of “Football Player” / “No Football Player” training and test data, contrasting an overfit decision boundary with one that generalizes]

More later …

SLIDE 14

Performance Revisited

Performance (of a learning algorithm): How well does the algorithm do on average

  • 1. for a test cell image X drawn at random, and
  • 2. for a set of training images and labels drawn at random

Expected Risk (aka Generalization Error)

SLIDE 15

How to sense Generalization Error?

  • Can’t compute the generalization error. How can we get a sense of how well the algorithm is performing in practice?
  • One approach:

    – Split the available data into two sets
    – Training Data: used for training the algorithm
    – Test Data (a.k.a. Validation Data, Hold-out Data): provides an estimate of the generalization error

Test Error = average loss of the learned predictor on the test data. Why not use the Training Error?
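A minimal sketch of this approach in plain Python, with a synthetic dataset and a deliberately trivial “learning algorithm” (both made up for illustration):

```python
import random

random.seed(0)  # reproducibility

# Synthetic labeled data: feature x in [0, 1), label 1 iff x > 0.5.
data = [(x, int(x > 0.5)) for x in (random.random() for _ in range(100))]

# Split the available data into two sets.
random.shuffle(data)
train, test = data[:80], data[80:]

def learn(examples):
    """Trivial learner: threshold at the mean training feature."""
    thresh = sum(x for x, _ in examples) / len(examples)
    return lambda x: int(x > thresh)

f = learn(train)  # trained ONLY on the training set

# Test error: average 0/1 loss on held-out data, an estimate of the
# generalization error. The training error would be optimistically
# biased, since f was chosen to fit the training set.
test_error = sum(f(x) != y for x, y in test) / len(test)
print(test_error)
```

In practice a library helper such as `sklearn.model_selection.train_test_split` does the splitting; the principle is the same.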

SLIDE 16

Supervised vs. Unsupervised Learning

Supervised Learning: learning with a teacher

  • Input: (Documents, topics) → Learning algorithm → mapping between documents and topics

Unsupervised Learning: learning without a teacher

  • Input: Documents → Learning algorithm → model for word distribution OR clustering of similar documents

SLIDE 17

Let’s get to some learning algorithms!


SLIDE 18

Learning Distributions (Parametric Approach)

Aarti Singh

Machine Learning 10-701/15-781 Sept 13, 2010

SLIDE 19

Your first consulting job

  • A billionaire from the suburbs of Seattle asks you a question:

    – He says: I have a coin; if I flip it, what’s the probability it will fall with the head up?
    – You say: Please flip it a few times: (he flips 3 heads and 2 tails)
    – You say: The probability is: 3/5
    – He says: Why???
    – You say: Because…

SLIDE 20

Bernoulli distribution

Data, D = sequence of coin flips (e.g. 3 heads and 2 tails)

  • P(Heads) = θ, P(Tails) = 1 − θ
  • Flips are i.i.d.:

    – Independent events
    – Identically distributed according to the Bernoulli distribution

Choose θ that maximizes the probability of observed data

SLIDE 21

Maximum Likelihood Estimation

Choose  that maximizes the probability of observed data MLE of probability of head:

21

= 3/5

“Frequency of heads”
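The MLE is just the frequency of heads; a one-line sketch:

```python
def mle_heads(n_heads, n_tails):
    """MLE of the Bernoulli parameter theta: the frequency of heads."""
    return n_heads / (n_heads + n_tails)

print(mle_heads(3, 2))    # 0.6, i.e. 3/5
print(mle_heads(30, 20))  # 0.6, the same answer
```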

SLIDE 22

How many flips do I need?


  • Billionaire says: I flipped 3 heads and 2 tails.
  • You say:  = 3/5, I can prove it!
  • He says: What if I flipped 30 heads and 20 tails?
  • You say: Same answer, I can prove it!
  • He says: What’s better?
  • You say: Hmm… The more the merrier???
  • He says: Is this why I am paying you the big bucks???
SLIDE 23

Simple bound (Hoeffding’s inequality)

  • For n = n_H + n_T coin flips, θ̂_MLE = n_H / n
  • Let θ* be the true parameter; for any ε > 0:

P(|θ̂_MLE − θ*| ≥ ε) ≤ 2e^(−2nε²)
SLIDE 24

PAC Learning

  • PAC: Probably Approximately Correct
  • Billionaire says: I want to know the coin parameter θ, within ε = 0.1, with probability at least 1 − δ = 0.95. How many flips?

Sample complexity: from Hoeffding’s inequality, 2e^(−2nε²) ≤ δ holds when

n ≥ ln(2/δ) / (2ε²)

For ε = 0.1 and δ = 0.05, n ≥ 185 flips suffice.

SLIDE 25

What about prior knowledge?

  • Billionaire says: Wait, I know that the coin is “close” to 50-50. What can you do for me now?
  • You say: I can learn it the Bayesian way…
  • Rather than estimating a single θ, we obtain a distribution over possible values of θ

[Figure: prior concentrated around 50-50 before seeing data; a sharper posterior after seeing data]

SLIDE 26

Bayesian Learning

  • Use Bayes rule:

P(θ | D) = P(D | θ) P(θ) / P(D)

  • Or equivalently:

P(θ | D) ∝ P(D | θ) P(θ)

posterior ∝ likelihood × prior

SLIDE 27

Prior distribution

  • What about the prior?

    – Represents expert knowledge (philosophical approach)
    – Simple posterior form (engineer’s approach)

  • Uninformative priors:

    – Uniform distribution

  • Conjugate priors:

    – Closed-form representation of posterior
    – P(θ) and P(θ|D) have the same form

SLIDE 28

Conjugate Prior

  • P() and P(|D) have the same form
  • Eg. 1 Coin flip problem

Likelihood is ~ Binomial If prior is Beta distribution, Then posterior is Beta distribution For Binomial, conjugate prior is Beta distribution.

28

SLIDE 29

Beta distribution

Beta(β_H, β_T): P(θ) ∝ θ^(β_H − 1) (1 − θ)^(β_T − 1)

More concentrated as values of β_H, β_T increase

SLIDE 30

Beta conjugate prior

Posterior is Beta(β_H + n_H, β_T + n_T). As n = n_H + n_T increases, the likelihood dominates: as we get more samples, the effect of the prior is “washed out”.

SLIDE 31

Conjugate Prior

  • P() and P(|D) have the same form
  • Eg. 2 Dice roll problem (6 outcomes instead of 2)

Likelihood is ~ Multinomial( = {1, 2, … , k}) If prior is Dirichlet distribution, Then posterior is Dirichlet distribution For Multinomial, conjugate prior is Dirichlet distribution.

31

SLIDE 32

Maximum A Posteriori Estimation

Choose  that maximizes a posterior probability MAP estimate of probability of head:

32

Mode of Beta distribution
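A sketch contrasting MAP and MLE under the same illustrative Beta(50, 50) prior (the mode formula assumes both posterior parameters exceed 1):

```python
def map_heads(n_heads, n_tails, beta_h=50, beta_t=50):
    """MAP estimate: mode of the Beta(beta_h + n_heads, beta_t + n_tails)
    posterior, i.e. (a - 1) / (a + b - 2) for Beta(a, b)."""
    a = beta_h + n_heads
    b = beta_t + n_tails
    return (a - 1) / (a + b - 2)

def mle_heads(n_heads, n_tails):
    """MLE for comparison: the frequency of heads."""
    return n_heads / (n_heads + n_tails)

print(map_heads(3, 2))   # 52/103, about 0.505: pulled toward the 50-50 prior
print(mle_heads(3, 2))   # 0.6: ignores the prior
```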