Introduction to Machine Learning: Classification and The Noisy Channel Model
CMSC 473/673 UMBC
Some slides adapted from 3SLP
Outline:
Classification
Why incorporate uncertainty
Classification with Bayes Rule
Example: Email Classifier
Discriminatively trained classifier: directly model the posterior.
Generatively trained classifier: model the posterior with Bayes rule.
Three people have been fatally shot, and five people, including a mayor, were seriously wounded as a result of a Shining Path attack today against a community in Junin department, central Peruvian mountain region.
Electronic alerts have been used to assist the authorities in moments of chaos and potential danger: after the Boston bombing in 2013, when the Boston suspects were still at large, and last month in Los Angeles, during an active shooter scare at the airport.
Source: http://www.nytimes.com/2016/09/20/nyregion/cellphone-alerts-used-in-search-of-manhattan-bombing-suspect.html
Use probabilities*
*There are non-probabilistic ways to handle uncertainty… but probabilities sure are handy!
For the alert text above, a probabilistic classifier might output:
POLITICS .05  TERRORISM .48  SPORTS .0001  TECH .39  HEALTH .0001  FINANCE .0002  …
Text classification tasks: assigning subject categories, topics, or genres; spam detection; authorship identification; age/gender identification; language identification; sentiment analysis; …
Input:
a document
a fixed set of classes C = {c1, c2, …, cJ}
Output: a predicted class c from C
More generally, the input need not be a document but any "linguistic blob":
Input:
a linguistic blob (e.g., a document)
a fixed set of classes C = {c1, c2, …, cJ}
Output: a predicted class c from C
Hand-coded rules: rules based on combinations of words or other features, e.g.
spam: black-list-address OR ("dollars" AND "have been selected")
Accuracy can be high if the rules are carefully refined by an expert, but building and maintaining these rules is expensive. Can humans faithfully assign uncertainty?
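The slide's example rule can be sketched as code. This is a minimal illustration, not a real spam filter; the blacklist address is a made-up placeholder.

```python
# A minimal sketch of a hand-coded rule classifier, following the
# slide's example rule. BLACKLIST is a hypothetical stand-in for a
# real blacklist of sender addresses.
BLACKLIST = {"winner@prizes.example"}

def is_spam(sender: str, body: str) -> bool:
    # spam: black-list-address OR ("dollars" AND "have been selected")
    return (sender in BLACKLIST
            or ("dollars" in body and "have been selected" in body))

print(is_spam("friend@umbc.edu", "Lunch tomorrow?"))                    # False
print(is_spam("a@b.example", "You have been selected to win dollars"))  # True
```

Note the brittleness: a single rephrasing of "have been selected" evades the rule, which is exactly why maintaining such rule sets is expensive.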
Input:
a document d
a fixed set of classes C = {c1, c2, …, cJ}
a training set of m hand-labeled documents (d1,c1), …, (dm,cm)
Output:
a learned classifier γ that maps documents to classes
Example classifiers: Naïve Bayes, logistic regression, support-vector machines, k-nearest neighbors, …
If y ∈ {0,1} (or y ∈ {True, False}), then it is a binary classification task.
If y ∈ {0, 1, …, K−1} (for finite K), then it is a multi-class classification task.
Q: What are some examples?

Single- vs. multi-label:
Given input x, predict multiple discrete labels y = (y1, …, yL).
If multiple yi are predicted, then it is a multi-label classification task. Each yi could be binary or multi-class.
Classification with Bayes rule:

p(class | data) = p(data | class) · p(class) / p(data)

p(data | class): class-based likelihood (a language model); how well does text X represent label Y?
p(class): prior probability of the class; how likely is label Y overall?
p(data): constant with respect to Y.
For "simple" or "flat" labels:
* iterate through labels
* evaluate the score for each label, keeping only the best (n best)
* return the best (or n best) label and score
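The flat-label decoding loop above can be sketched in Python. The priors and per-class unigram likelihoods below are toy numbers invented for illustration; a real system would estimate them from labeled data.

```python
from math import log

# A minimal sketch of flat-label decoding with Bayes rule.
# PRIOR is p(label); LIKELIHOOD holds toy unigram language models
# p(word | label). All numbers are illustrative assumptions.
PRIOR = {"sports": 0.3, "terrorism": 0.2, "tech": 0.5}
LIKELIHOOD = {
    "sports":    {"game": 0.5,  "bomb": 0.01, "code": 0.1},
    "terrorism": {"game": 0.05, "bomb": 0.6,  "code": 0.05},
    "tech":      {"game": 0.2,  "bomb": 0.02, "code": 0.5},
}

def classify(words):
    best_label, best_score = None, float("-inf")
    for label in PRIOR:                      # iterate through labels
        # score = log p(label) + sum_w log p(w | label)
        score = log(PRIOR[label]) + sum(log(LIKELIHOOD[label][w]) for w in words)
        if score > best_score:               # keep only the best
            best_label, best_score = label, score
    return best_label, best_score            # return the best label and score

print(classify(["game", "game"])[0])  # sports
```

Working in log space avoids underflow when documents are long, and the normalizing constant p(data) is dropped because it is the same for every label.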
When the label is itself text: how well does text (complex input) X represent text (complex output) Y? How likely is text (complex output) Y overall?
* iterate through labels
* evaluate the score for each label, keeping only the best (n best)
* return the best (or n best) label and score
If Y is a string (or some complex structure), this iteration can be complicated.
The noisy channel:
what I want to tell you: "sports"
what you actually see: "The Os lost again…"
Decode, hypothesized intent: "sad stories", "sports"
Rerank, reweight according to what's likely: "sports"
Noisy channel applications: machine translation, speech-to-text, spelling correction, text normalization, part-of-speech tagging, morphological analysis, image captioning, …
Score each possible (clean) output Y for the observed (noisy) text X:

p(X | Y) · p(Y)

p(X | Y): translation/decode model
p(Y): (clean) language model
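One of the applications above, spelling correction, gives a compact noisy-channel sketch. The language-model and channel probabilities below are made-up numbers for illustration, not estimates from any corpus.

```python
# A minimal noisy-channel sketch for spelling correction:
# p(Y) is a (clean) language model over candidate words, and
# p(X | Y) is a channel/decode model for observing the typo X
# given intended word Y. All numbers are illustrative assumptions.
LM = {"the": 0.07, "then": 0.01, "than": 0.008}   # p(Y)
CHANNEL = {                                        # p(X="teh" | Y)
    "the": 0.05,    # transposing adjacent letters is a common typo
    "then": 0.001,
    "than": 0.001,
}

def correct(observed: str, candidates):
    # Y* = argmax_Y p(X | Y) * p(Y)   (p(X) is constant in Y)
    return max(candidates, key=lambda y: CHANNEL[y] * LM[y])

print(correct("teh", ["the", "then", "than"]))  # the
```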
Discriminatively trained classifier: directly model the posterior (discriminative training, e.g., maxent models: we'll cover these soon).
Generatively trained classifier: model the posterior with Bayes rule (noisy channel model decoding).
Q: What type of classification problem is this?
A: multi-class (single label) classification.
Q: Why is p(Y | X) what we want to model?
Q: To classify a document, do we need to find the normalizing constant?
Q: If we can compute p(Y | X) up to a constant, how do we find the predicted label?
"Won't you please donate?"
Which class should this email get: Primary? Social? Forums?
Training: for each Class, get a bunch of Class documents D_Class and learn a new language model p_Class on just D_Class (e.g., a Primary model from Primary emails, and so on).
Two options for the class-based likelihoods:
(1) A separate model p_Class(…) for each Class, e.g., record separate trigram counts for Primary vs. Social vs. Forums vs. Spam documents.
OR
(2) Joint tables p(Class, …), e.g., record how often each trigram occurs with each class.
Q: Are these two conceptually the same?
Q: How might the option you choose influence implementation (or vice versa)?
Q: Will one approach always be better than the other?
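The two bookkeeping options can be sketched side by side. The two-document corpus below is a toy example invented for illustration.

```python
from collections import Counter, defaultdict

# A minimal sketch of the two options for storing trigram counts.
docs = [
    ("Primary", "won t you please donate".split()),
    ("Spam",    "you have been selected".split()),
]

def trigrams(tokens):
    return zip(tokens, tokens[1:], tokens[2:])

# Option 1: a separate count table per class (one LM per class)
per_class = defaultdict(Counter)
# Option 2: one joint table keyed by (class, trigram)
joint = Counter()

for label, tokens in docs:
    for tri in trigrams(tokens):
        per_class[label][tri] += 1
        joint[(label, tri)] += 1

# The two store the same information, organized differently:
assert per_class["Spam"][("have", "been", "selected")] == \
       joint[("Spam", ("have", "been", "selected"))] == 1
```

This suggests one answer to the questions above: the options are conceptually equivalent, but the per-class layout makes it easy to train classes independently, while the joint table keeps everything in one structure.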
We also need the prior probability of each class, e.g. p(Primary).
Q: What's an easy way to estimate it?
Q: Could we use our smoothing techniques?
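One answer to both questions in a short sketch: relative frequency is the easy estimate, and add-k smoothing (as used for n-gram models) also applies. The label list is toy data for illustration.

```python
from collections import Counter

# A minimal sketch of estimating class priors from labeled emails.
labels = ["Primary", "Primary", "Spam", "Primary", "Social"]
classes = ["Primary", "Social", "Forums", "Spam"]

counts = Counter(labels)
n = len(labels)

# Maximum-likelihood (relative frequency) estimate
p_mle = {c: counts[c] / n for c in classes}

# Add-k smoothed estimate, so unseen classes (e.g., Forums) get mass
k = 0.5
p_addk = {c: (counts[c] + k) / (n + k * len(classes)) for c in classes}

print(p_mle["Primary"])   # 0.6
print(p_addk["Forums"])   # 0.5 / 7.0, about 0.071
```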
All your data is split into Training Data, Dev Data, and Test Data.
Training: learn model parameters from the training set.
Dev: set hyperparameters; evaluate the learned model on dev with each hyperparameter setting.
Test: perform the final evaluation on test, using the hyperparameters that performed best on dev, retraining the model with those hyperparameters.
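The workflow above can be sketched as a loop. The `train` and `evaluate` functions and the hyperparameter grid here are hypothetical placeholders, not a real API; `evaluate` is rigged so accuracy peaks at k = 1.0 purely to make the example runnable.

```python
# A minimal sketch of the train/dev/test workflow with one
# hyperparameter k (e.g., a smoothing weight). All pieces are
# illustrative stand-ins.
def train(train_data, k):
    return {"k": k}                          # "model" remembers its setting

def evaluate(model, data):
    return 1.0 - abs(model["k"] - 1.0)       # pretend accuracy peaks at k=1.0

train_data, dev_data, test_data = ["..."], ["..."], ["..."]

best_k, best_acc = None, float("-inf")
for k in [0.1, 0.5, 1.0, 2.0]:               # candidate hyperparameters
    model = train(train_data, k)             # learn parameters on train
    acc = evaluate(model, dev_data)          # evaluate on dev
    if acc > best_acc:
        best_k, best_acc = k, acc

final_model = train(train_data, best_k)      # retrain with the best setting
print(evaluate(final_model, test_data))      # final evaluation on test
```

The key discipline is that test data is touched exactly once, after all hyperparameter choices have been made on dev.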
[Diagram: across the classes/choices, the set of Correct labels is compared with the set of Guessed labels.]
                          Actually Correct      Actually Incorrect
Selected/guessed          True Positive (TP)    False Positive (FP)
Not selected/not guessed  False Negative (FN)   True Negative (TN)

Accuracy = (TP + TN) / (TP + FP + FN + TN)
Precision P = TP / (TP + FP)
Recall R = TP / (TP + FN)
F1 = 2PR / (P + R)   (algebra: not important)
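The formulas above can be computed directly from a contingency table; the counts here are the Class 1 table from the next slide.

```python
# A minimal sketch computing accuracy, precision, recall, and F1
# from contingency-table counts.
tp, fp, fn, tn = 10, 10, 10, 970

accuracy = (tp + tn) / (tp + fp + fn + tn)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)

print(accuracy)   # 0.98
print(precision)  # 0.5
print(recall)     # 0.5
print(f1)         # 0.5
```

Note how accuracy (0.98) looks excellent while precision and recall (0.5) do not: with many true negatives, accuracy is a misleading summary.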
If we have more than one class, how do we combine multiple performance measures into one quantity?
Macroaveraging: compute performance for each class, then average.
Microaveraging: collect decisions for all classes, compute the contingency table, evaluate.
Class 1:              Truth: yes   Truth: no
Classifier: yes           10           10
Classifier: no            10          970

Class 2:              Truth: yes   Truth: no
Classifier: yes           90           10
Classifier: no            10          890

Micro-average table:  Truth: yes   Truth: no
Classifier: yes          100           20
Classifier: no            20         1860
Macroaveraged precision: (0.5 + 0.9)/2 = 0.7. Microaveraged precision: 100/120 ≈ 0.83. The microaveraged score is dominated by performance on the common classes.
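The two averages can be reproduced from the per-class (TP, FP) counts in the tables above:

```python
# A minimal sketch of macro- vs. micro-averaged precision using the
# two class tables above: (tp, fp) per class.
per_class = [(10, 10), (90, 10)]   # Class 1, Class 2

# Macroaverage: compute precision per class, then average
precisions = [tp / (tp + fp) for tp, fp in per_class]
macro = sum(precisions) / len(precisions)

# Microaverage: pool the counts into one table, then compute precision
tp_total = sum(tp for tp, _ in per_class)    # 100
fp_total = sum(fp for _, fp in per_class)    # 20
micro = tp_total / (tp_total + fp_total)

print(macro)            # 0.7
print(round(micro, 2))  # 0.83
```

Because Class 2 contributes far more positive decisions, it dominates the pooled counts; that is exactly why the microaverage (0.83) sits much closer to Class 2's precision (0.9) than the macroaverage (0.7) does.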