Machine learning system design: Prioritizing what to work on

Machine learning system design
Prioritizing what to work on: Spam classification example
Machine Learning

Building a spam classifier
Email 1:
    From: cheapsales@buystufffromme.com
    To: ang@cs.stanford.edu
    Subject: Buy now!
    Deal of the week! Buy now!
    Rolex w4tchs - $100
    Med1cine (any kind) - $50
    Also low cost M0rgages available.
Email 2:
    From: Alfred Ng
    To: ang@cs.stanford.edu
    Subject: Christmas dates?
    Hey Andrew,
    Was talking to Mom about plans for Xmas. When do you get off work. Meet Dec 22?
    Alf
Andrew Ng

Building a spam classifier
Supervised learning. $x$ = features of email. $y$ = spam (1) or not spam (0).
Features $x$: Choose 100 words indicative of spam/not spam.
    From: cheapsales@buystufffromme.com
    To: ang@cs.stanford.edu
    Subject: Buy now!
    Deal of the week! Buy now!
Note: In practice, take the most frequently occurring $n$ words (10,000 to 50,000) in the training set, rather than manually picking 100 words.
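
The feature construction described above can be sketched in Python. This is a minimal illustration, not the lecture's implementation: it assumes a tiny hand-picked vocabulary and a simple regular-expression tokenizer, and the names `VOCABULARY`, `tokenize`, and `email_to_features` are hypothetical.

```python
import re

# Hypothetical hand-picked vocabulary (an assumption for illustration).
# In practice, use the most frequent 10,000 to 50,000 training-set words.
VOCABULARY = ["andrew", "buy", "deal", "discount", "now"]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def email_to_features(text, vocabulary=VOCABULARY):
    """Return a binary vector x where x[j] = 1 if vocabulary[j] occurs in the email."""
    words = set(tokenize(text))
    return [1 if word in words else 0 for word in vocabulary]


email = "Subject: Buy now! Deal of the week! Buy now!"
x = email_to_features(email)
print(x)
```

Each entry of `x` records only whether a word is present, not how often it occurs, so repeating "Buy now!" does not change the vector.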

Building a spam classifier
How to spend your time to make it have low error?
- Collect lots of data
  - E.g. “honeypot” project.
- Develop sophisticated features based on email routing information (from email header).
- Develop sophisticated features for message body, e.g. should “discount” and “discounts” be treated as the same word? How about “deal” and “Dealer”? Features about punctuation?
- Develop a sophisticated algorithm to detect misspellings (e.g. m0rtgage, med1cine, w4tches).

Machine learning system design: Error analysis

Recommended approach
- Start with a simple algorithm that you can implement quickly. Implement it and test it on your cross-validation data.
- Plot learning curves to decide if more data, more features, etc. are likely to help.
- Error analysis: Manually examine the examples (in the cross-validation set) that your algorithm made errors on. See if you spot any systematic trend in what type of examples it is making errors on.

Error Analysis
$m_{CV}$ = 500 examples in cross-validation set
Algorithm misclassifies 100 emails.
Manually examine the 100 errors, and categorize them based on:
(i) What type of email it is
(ii) What cues (features) you think would have helped the algorithm classify them correctly.
Email types: Pharma; Replica/fake; Steal passwords; Other.
Candidate cues: Deliberate misspellings (m0rgage, med1cine, etc.); Unusual email routing; Unusual (spamming) punctuation.
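
The manual tally can be kept with a few lines of Python. This is only a sketch: the labels in `labeled_errors` are made up for illustration and are not counts from the slides, and `tally_errors` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical labels a reviewer might assign to misclassified emails.
# These values are invented for illustration, not taken from the slides.
labeled_errors = [
    "pharma", "steal passwords", "pharma", "replica/fake",
    "steal passwords", "other", "steal passwords",
]


def tally_errors(labels):
    """Return (category, count) pairs sorted from most to least frequent."""
    return Counter(labels).most_common()


for category, count in tally_errors(labeled_errors):
    print(f"{category}: {count}")
```

The category with the largest count is a reasonable first candidate for new features, since fixing it would remove the most errors.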

The importance of numerical evaluation
Should discount/discounts/discounted/discounting be treated as the same word?
Can use “stemming” software (e.g. “Porter stemmer”). Caveat: universe/university (a stemmer may wrongly treat these as the same word).
Error analysis may not be helpful for deciding if this is likely to improve performance. The only solution is to try it and see if it works.
Need numerical evaluation (e.g., cross-validation error) of the algorithm's performance with and without stemming.
Compare the error without stemming against the error with stemming. The same kind of comparison applies to distinguishing upper vs. lower case (Mom/mom).
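
To make the stemming trade-off concrete, here is a deliberately crude suffix stripper in Python. It is not the Porter stemmer: the suffix list `SUFFIXES` and the helpers `crude_stem` and `error_rate` are assumptions made for illustration. It merges the discount variants as intended, but it also merges universe and university.

```python
# Hypothetical suffix list, checked in order. A real stemmer such as the
# Porter stemmer applies far more careful rules than this.
SUFFIXES = ("ing", "ity", "ed", "e", "s")


def crude_stem(word, min_stem_length=3):
    """Strip the first matching suffix, keeping at least min_stem_length letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_length:
            return word[: -len(suffix)]
    return word


def error_rate(predictions, labels):
    """Fraction of examples whose prediction differs from the true label."""
    wrong = sum(1 for p, y in zip(predictions, labels) if p != y)
    return wrong / len(labels)


for w in ["discount", "discounts", "discounted", "discounting",
          "universe", "university"]:
    print(w, "->", crude_stem(w))
```

Whether such merging helps is an empirical question: compute `error_rate` on the cross-validation set once with stemmed features and once without, and keep whichever variant gives the lower number.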

Machine learning system design: Error metrics for skewed classes

Cancer classification example
Train logistic regression model $h_\theta(x)$. ($y = 1$ if cancer, $y = 0$ otherwise)
Find that you got 1% error on test set. (99% correct diagnoses)
Only 0.50% of patients have cancer.
    function y = predictCancer(x)
      y = 0;  % ignore x!
    return
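
The problem with accuracy on skewed classes can be checked numerically. The sketch below assumes a hypothetical test set of 1,000 patients, 5 of whom have cancer (the 0.50% rate from the slide); `predict_cancer` mirrors the slide's `predictCancer`, and `error_rate` is a helper introduced here for illustration.

```python
def predict_cancer(x):
    """Python counterpart of the slide's predictCancer: ignore x, always predict 0."""
    return 0


def error_rate(predict, examples):
    """Fraction of (x, y) examples for which predict(x) differs from y."""
    wrong = sum(1 for x, y in examples if predict(x) != y)
    return wrong / len(examples)


# Hypothetical test set: 1,000 patients, 5 with cancer (y = 1), 995 without.
# The features are irrelevant to this predictor, so None stands in for x.
examples = [(None, 1)] * 5 + [(None, 0)] * 995

err = error_rate(predict_cancer, examples)
print(f"error of always predicting 0: {err:.3%}")
```

A predictor that never looks at the input reaches 0.5% error here, lower than the 1% of the trained model, which is why classification error alone is a poor metric when one class is rare.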

Precision/Recall
$y = 1$ in presence of rare class that we want to detect
Precision
(Of all patients where we predicted $y = 1$, what fraction actually has cancer?)
Recall
(Of all patients that actually have cancer, what fraction did we correctly detect as having cancer?)
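
Both definitions translate directly into code. The following Python sketch assumes predictions and labels are lists of 0/1 values; the function name `precision_recall` and the convention of returning 0.0 when a denominator is zero are assumptions made here, not something stated on the slide.

```python
def precision_recall(predictions, labels):
    """Return (precision, recall) for binary 0/1 predictions and labels.

    Assumed convention: a ratio with a zero denominator is reported as 0.0.
    """
    true_pos = sum(1 for p, y in zip(predictions, labels) if p == 1 and y == 1)
    predicted_pos = sum(1 for p in predictions if p == 1)
    actual_pos = sum(1 for y in labels if y == 1)
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    recall = true_pos / actual_pos if actual_pos else 0.0
    return precision, recall


# Hypothetical example: 3 patients actually have cancer, 2 are flagged.
labels      = [1, 1, 1, 0, 0, 0, 0, 0]
predictions = [1, 0, 0, 1, 0, 0, 0, 0]
print(precision_recall(predictions, labels))

# A classifier that always predicts 0 detects no cancer cases at all.
print(precision_recall([0] * 8, labels))
```

Under this convention the always-predict-0 classifier gets recall 0, so it can no longer look good the way it does under plain accuracy.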

Machine learning system design: Trading off precision and recall

Trading off precision and recall
$\text{precision} = \dfrac{\text{true positives}}{\text{no. of predicted positive}}$
$\text{recall} = \dfrac{\text{true positives}}{\text{no. of actual positive}}$
Logistic regression:
Predict 1 if $h_\theta(x) \ge 0.5$
Predict 0 if $h_\theta(x) < 0.5$
Suppose we want to predict $y = 1$ (cancer) only if very confident.
Suppose we want to avoid missing too many cases of cancer (avoid false negatives).
[Figure: precision (vertical axis, 0 to 1) plotted against recall (horizontal axis, 0 to 1)]
More generally: Predict 1 if $h_\theta(x) \ge$ threshold.
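
The effect of the threshold can be seen by sweeping it over a small set of scores. This Python sketch uses made-up classifier outputs and labels; `predict_with_threshold`, `precision_recall`, and `sweep` are hypothetical helper names, and a ratio with a zero denominator is reported as 0.0 by assumption.

```python
def predict_with_threshold(probabilities, threshold):
    """Predict 1 where the estimated probability is at least the threshold."""
    return [1 if p >= threshold else 0 for p in probabilities]


def precision_recall(predictions, labels):
    """Return (precision, recall); a zero denominator yields 0.0 by assumption."""
    true_pos = sum(1 for p, y in zip(predictions, labels) if p == 1 and y == 1)
    predicted_pos = sum(predictions)
    actual_pos = sum(labels)
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    recall = true_pos / actual_pos if actual_pos else 0.0
    return precision, recall


def sweep(probabilities, labels, thresholds):
    """Return a list of (threshold, precision, recall) tuples."""
    rows = []
    for t in thresholds:
        preds = predict_with_threshold(probabilities, t)
        rows.append((t, *precision_recall(preds, labels)))
    return rows


# Hypothetical classifier outputs and true labels, invented for illustration.
probabilities = [0.95, 0.80, 0.65, 0.55, 0.40, 0.30, 0.20, 0.10]
labels        = [1,    1,    0,    1,    1,    0,    0,    0]

for t, p, r in sweep(probabilities, labels, (0.3, 0.5, 0.7, 0.9)):
    print(f"threshold={t:.1f}  precision={p:.2f}  recall={r:.2f}")
```

On this toy data, raising the threshold from 0.3 to 0.9 moves precision up while recall falls, which is the trade-off the slide describes: predict 1 only when very confident for high precision, or lower the threshold to avoid false negatives.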

$F_1$ Score (F score)
How to compare precision/recall numbers?
             Precision (P)  Recall (R)  Average  F1 Score
Algorithm 1  0.5            0.4         0.45     0.444
Algorithm 2  0.7            0.1         0.4      0.175
Algorithm 3  0.02           1.0         0.51     0.0392
Average: $\dfrac{P + R}{2}$
$F_1$ Score: $2\dfrac{PR}{P + R}$
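
The table values can be reproduced with a short Python sketch using $F_1 = 2PR/(P + R)$ and the plain average $(P + R)/2$. The function names and the choice to return 0.0 when $P + R = 0$ are assumptions made for illustration.

```python
def average(precision, recall):
    """Plain arithmetic mean of precision and recall."""
    return (precision + recall) / 2


def f1_score(precision, recall):
    """F1 = 2PR / (P + R); returns 0.0 when P + R is zero (assumed convention)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall values from the slide's table.
algorithms = {
    "Algorithm 1": (0.5, 0.4),
    "Algorithm 2": (0.7, 0.1),
    "Algorithm 3": (0.02, 1.0),
}

for name, (p, r) in algorithms.items():
    print(f"{name}: average={average(p, r):.2f}  F1={f1_score(p, r):.4f}")
```

The plain average ranks Algorithm 3 highest even though its precision is only 0.02, whereas the $F_1$ score penalizes it heavily because the product $PR$ is small whenever either value is small.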

Machine learning system design: Data for machine learning

Designing a high accuracy learning system
E.g. Classify between confusable words. {to, two, too}, {then, than}
For breakfast I ate _____ eggs.
Algorithms
- Perceptron (Logistic regression)
- Winnow
- Memory-based
- Naïve Bayes
[Figure: accuracy versus training set size (millions)]
“It’s not who has the best algorithm that wins. It’s who has the most data.”
[Banko and Brill, 2001]

Large data rationale
Assume feature $x \in \mathbb{R}^{n+1}$ has sufficient information to predict $y$ accurately.
Example: For breakfast I ate _____ eggs.
Counterexample: Predict housing price from only size (feet$^2$) and no other features.
Useful test: Given the input $x$, can a human expert confidently predict $y$?

Large data rationale
Use a learning algorithm with many parameters (e.g. logistic regression/linear regression with many features; neural network with many hidden units).
Use a very large training set (unlikely to overfit).
