Apprentissage Artificiel (Statistical Machine-Learning)

General framework + Supervised Learning

Pr. Fabien MOUTARDE
Center for Robotics, MINES ParisTech, PSL Université Paris
Fabien.Moutarde@mines-paristech.fr
http://people.mines-paristech.fr/fabien.moutarde


Outline

  • Intro: What is Statistical Machine-Learning?
  • Typology of Machine-Learning
  • General formalism for SUPERVISED Learning
  • Evaluating learnt models: metrics for CLASSIFICATION

  • Generalization vs. overfitting

What is Statistical Machine-Learning?

[Diagram: Machine Learning at the intersection of STATISTICS / data analysis, OPTIMIZATION, and ARTIFICIAL INTELLIGENCE]


Statistical Machine-Learning

  • One of many sub-fields of Artificial Intelligence
  • Application of optimization methods to statistical modelling
  • Data-driven mathematical modelling, for automated classification, regression, partitioning/clustering, or decision/behavior rules


Real-world examples of Machine-Learning applications

  • Handwritten characters recognition
  • Object category visual recognition
  • Speech recognition
  • Multi-factorial forecasting
  • Natural Language understanding
  • Playing GO!
  • MANY MANY MORE…

[Figures: a handwritten digits recognition system; a pedestrian vs. « non-pedestrian » recognition system]


One of the simplest ML algorithms: Least Squares Linear Regression

  • Model: (straight) line y = ax + b (2 parameters a and b)
  • Data: n points with target value, (x_i, y_i) ∈ ℝ²
  • Cost function: sum of squares of deviations from the line:

K = Σ_i (y_i − a·x_i − b)²

  • Algorithm: direct (or iterative) solving of the linear system

$$\begin{pmatrix} \sum_{i=1}^{n} x_i^2 & \sum_{i=1}^{n} x_i \\ \sum_{i=1}^{n} x_i & n \end{pmatrix} \begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} \sum_{i=1}^{n} x_i y_i \\ \sum_{i=1}^{n} y_i \end{pmatrix}$$

[Question: Where does this equation come from?]
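As a concrete illustration (not from the slides), here is a minimal Python sketch of this direct solve, assuming NumPy; the helper name fit_line_least_squares is hypothetical:

```python
import numpy as np

def fit_line_least_squares(x, y):
    """Fit y = a*x + b by solving the 2x2 normal equations directly."""
    n = len(x)
    A = np.array([[np.sum(x**2), np.sum(x)],
                  [np.sum(x),    n        ]])
    rhs = np.array([np.sum(x * y), np.sum(y)])
    a, b = np.linalg.solve(A, rhs)
    return a, b

# Toy data: noisy points around y = 2x + 1
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2 * x + 1 + rng.normal(scale=0.5, size=x.shape)
a, b = fit_line_least_squares(x, y)
print(a, b)  # should recover approximately a=2.0, b=1.0
```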


Regression vs. classification

[Left figure] points = examples ⇒ curve = regression (input → output)
[Right figure] input = point position, target output = class label (□ = −1, + = +1) ⇒ function label = f(x) (and separation boundary)

Regression: continuous output(s). Classification: discrete output(s).


Simplest classification method: Nearest Neighbors algorithm

Principle of Nearest Neighbors (kNN) for classification: a new input is assigned the majority class label among its k nearest examples in the training set.

[What are the main drawbacks of this method??]
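A minimal sketch of this principle (our own illustration, assuming NumPy and Euclidean distance; knn_predict is a hypothetical helper name):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    """Classify x by majority vote among its k nearest training examples."""
    dists = np.linalg.norm(X_train - x, axis=1)   # Euclidean distances to all examples
    nearest = np.argsort(dists)[:k]               # indices of the k closest points
    votes = Counter(y_train[nearest])
    return votes.most_common(1)[0][0]

# Toy example: two classes clustered around (0,0) and (5,5)
X = np.array([[0, 0], [1, 0], [0, 1], [5, 5], [6, 5], [5, 6]])
y = np.array([-1, -1, -1, 1, 1, 1])
print(knn_predict(X, y, np.array([0.5, 0.5])))  # -1
print(knn_predict(X, y, np.array([5.5, 5.5])))  # +1
```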


Outline

  • Intro: What is Statistical Machine-Learning?
  • Typology of Machine-Learning
  • General formalism for SUPERVISED Learning
  • Evaluating learnt models: metrics for CLASSIFICATION

  • Generalization vs. overfitting


Supervised vs Unsupervised learning

Learning is called "supervised" when there are "target" values for every example in the training dataset: examples = (input, output) pairs = (x1,y1), (x2,y2), …, (xn,yn). The goal is to build a (generally non-linear) approximate model for interpolation, in order to be able to GENERALIZE to input values other than those in the training set.

"Unsupervised" = when there are NO target values: dataset = {x1, x2, …, xn}. The goal is typically either to do data mining (unveil structure in the distribution of examples in input space), or to find an output maximizing a given evaluation function.


Machine-Learning TYPOLOGY


SUPERVISED LEARNING: regression or classification

[Left figure, Regression] Examples {(x_i, y_i), i = 1,…,N}, x_i = input, y_i = target output ⇒ infer: curve = regression, y ≈ h(x). y: continuous output(s).
[Right figure, Classification] Input {x_i, i = 1,…,N} = point positions, target output = class label (□ = −1, + = +1) ⇒ infer: label = h(x) (and separation boundary). y: discrete output(s).


UNSUPERVISED LEARNING: Clustering vs. Generative model

Clustering

Points = examples ⇒ partitioning into “groups” (colors) based on similarity

Generative model

From examples x_n, estimate the PROBABILITY DISTRIBUTION p(x) ⇒ can GENERATE new examples SIMILAR to those in the training set


Reinforcement Learning (RL)

Goal: find a “policy” a_t = π(s_t) that maximizes the expected cumulative (discounted) reward, E[Σ_t γ^t r_t]. Typical use of RL: learn a BEHAVIOR.
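As an illustrative sketch only (the slide gives no algorithm), here is a minimal tabular Q-learning loop on a toy 5-state chain; the MDP and every name here are assumptions, not material from the course:

```python
import numpy as np

# Toy chain MDP (assumption): 5 states, 2 actions (0 = left, 1 = right),
# reward obtained when at the rightmost state.
n_states, n_actions, gamma, alpha, eps = 5, 2, 0.9, 0.1, 0.1
Q = np.zeros((n_states, n_actions))
rng = np.random.default_rng(5)

def step(s, a):
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    r = 1.0 if s2 == n_states - 1 else 0.0
    return s2, r

for _ in range(2000):          # episodes
    s = 0
    for _ in range(20):        # steps per episode
        # epsilon-greedy action choice
        a = rng.integers(n_actions) if rng.random() < eps else int(np.argmax(Q[s]))
        s2, r = step(s, a)
        # Q-learning update: move Q(s,a) toward r + gamma * max_a' Q(s',a')
        Q[s, a] += alpha * (r + gamma * np.max(Q[s2]) - Q[s, a])
        s = s2

policy = Q.argmax(axis=1)  # learnt policy a_t = pi(s_t): should prefer "right"
print(policy)
```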


Outline

  • Intro: What is Statistical Machine-Learning?
  • Typology of Machine-Learning
  • General formalism for SUPERVISED Learning
  • Evaluating learnt models: metrics for CLASSIFICATION

  • Generalization vs. overfitting


Many different supervised ML approaches & algorithms

  • Linear regressions
  • Decision trees (ID3 or CART algorithms)
  • Bayesian (probabilistic) methods
  • Multi-layer neural networks trained with gradient backpropagation

  • Support Vector Machines
  • Boosting of "weak" classifiers
  • Random forests
  • Deep Learning (Convolutional Neural Networks,…)

Supervised learning

Given as inputs to the learning algorithm:
  • Examples (input, output): (x1,y1), (x2,y2), …, (xn,yn)
  • H = (parameterized) family of mathematical models
  • Hyper-parameters for the training algorithm

LEARNING ALGORITHM (usually based on an optimization technique) ⇒ h* ∈ H such that h*(x_i) ≈ y_i

In most cases, h* = argmin_{h∈H} K(h, {(x_i, y_i)}), where K = cost: K = Σ_i loss(h(x_i), y_i) [+ regularization term] and loss = ‖h(x_i) − y_i‖²


Cost function and loss function

Most supervised Machine-Learning algorithms work by minimizing a "cost function".

  • The cost function is generally the average over all training examples of a "loss function": K = Σ_i loss(h(x_i), y_i) (+ sometimes an additional « regularization » term)
  • The loss function is usually some measure of the difference between the target value and the prediction output by the learnt model
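A minimal sketch of this cost-minimization view, assuming a linear model h(x) = w·x + b with squared-error loss, trained by plain gradient descent (the slides do not prescribe this particular algorithm):

```python
import numpy as np

def cost(w, b, X, y):
    """K = mean of squared-error losses over the training examples."""
    preds = X @ w + b
    return np.mean((preds - y) ** 2)

def gradient_step(w, b, X, y, lr=0.05):
    """One gradient-descent update on the cost K."""
    n = len(y)
    err = X @ w + b - y
    grad_w = 2 * X.T @ err / n
    grad_b = 2 * np.mean(err)
    return w - lr * grad_w, b - lr * grad_b

# Fit h(x) = w.x + b on toy data with known ground truth
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.3 + rng.normal(scale=0.1, size=200)
w, b = np.zeros(3), 0.0
for _ in range(2000):
    w, b = gradient_step(w, b, X, y)
print(w, b)  # approximately [1.0, -2.0, 0.5] and 0.3
```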


Linear Multivariate Regression

[From slide by ]


Logistic Multivariate Regression

[From slide by ]

Used if the target output is binary (classification).
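The slide's formula itself did not survive extraction; the standard logistic model it refers to can be sketched as follows, assuming the usual sigmoid parameterization and labels in {0, 1} (all function names here are ours):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict_proba(w, b, X):
    """Logistic model: P(y=1 | x) = sigmoid(w.x + b)."""
    return sigmoid(X @ w + b)

def log_loss_grad(w, b, X, y):
    """Gradient of the average cross-entropy (log) loss."""
    p = predict_proba(w, b, X)
    err = p - y  # derivative of the loss w.r.t. the pre-activation
    return X.T @ err / len(y), np.mean(err)

# Toy binary classification: two Gaussian blobs
rng = np.random.default_rng(2)
X = np.vstack([rng.normal(-1, 1, (100, 2)), rng.normal(+1, 1, (100, 2))])
y = np.array([0] * 100 + [1] * 100)
w, b = np.zeros(2), 0.0
for _ in range(1000):
    gw, gb = log_loss_grad(w, b, X, y)
    w, b = w - 0.1 * gw, b - 0.1 * gb
acc = ((predict_proba(w, b, X) > 0.5) == y).mean()
print(acc)  # training accuracy, close to 1 on this separable toy data
```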


Usual two distinct phases of supervised Machine-Learning

[Diagram, Training phase] labelled examples (pedestrians vs. « non-pedestrians », cars vs. « non-cars ») → STATISTICAL MACHINE-LEARNING ALGORITHM → CLASSIFIER
[Diagram, Recognition phase] input → CLASSIFIER → category (class)


Outline

  • Intro: What is Statistical Machine-Learning?
  • Typology of Machine-Learning
  • General formalism for SUPERVISED Learning
  • Evaluating learnt models: metrics for CLASSIFICATION

  • Generalization vs. overfitting

Different types of classification errors

BUT: False Negatives ("missed") ≠ False Positives!

Recall: percentage of relevant examples successfully predicted/retrieved.
Precision: percentage of actually relevant examples among all those returned by the classifier.

Error rate = (FP + FN) / (TP + TN + FP + FN)


Accuracy, recall & precision formulas

Recall (sensitivity, True Positive rate) = (# of correct positive predictions) / (# of real positives) = TP / (TP + FN)

Precision (positive predictive value) = (# of correct positive predictions) / (# of positive predictions) = TP / (TP + FP)

Accuracy ("correctness") [in French: exactitude] = (# of correct predictions) / (total # of examples) = (TP + TN) / (TP + TN + FP + FN)
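A small self-contained sketch of these formulas (the function name classification_metrics is ours, not the course's):

```python
import numpy as np

def classification_metrics(y_true, y_pred):
    """Compute TP/TN/FP/FN counts and derived metrics for binary labels in {0,1}."""
    tp = np.sum((y_pred == 1) & (y_true == 1))
    tn = np.sum((y_pred == 0) & (y_true == 0))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return {
        "accuracy":   (tp + tn) / (tp + tn + fp + fn),
        "recall":     tp / (tp + fn),   # sensitivity, True Positive rate
        "precision":  tp / (tp + fp),
        "error_rate": (fp + fn) / (tp + tn + fp + fn),
    }

y_true = np.array([1, 1, 1, 0, 0, 0, 0, 1])
y_pred = np.array([1, 0, 1, 0, 0, 1, 0, 1])
print(classification_metrics(y_true, y_pred))
# {'accuracy': 0.75, 'recall': 0.75, 'precision': 0.75, 'error_rate': 0.25}
```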


Classification performance metrics

  • Accuracy = proportion of correct predictions
  • Recall (sensitivity) ≈ proportion of "not missed" ≈ "completeness" level [in French: exhaustivité]
  • Precision ≈ reliability of predicted labels
  • Confusion matrix: predicted label vs. true label


Precision-recall trade-off and curve

[Figure: precision vs. recall curves of two classifiers]

Classifier C1 predicts better than C2 iff C1 has better recall AND precision. Since there is a trade-off between recall and precision ⇒ compare precision-recall curves! For numeric comparison (or if curves cross each other), use the Area Under Curve (AUC).


Quality measures of learnt model: loss function and error types

  • Quality measure for a learnt model h: Q(h) = E( L(h(x), y) ), where L(h(x), y) is the « LOSS function », generally = ‖h(x) − y‖²
  • What optimum for h?
    h* = absolute optimum = argmin_h E(h)
    h*_H = optimum within the family H = argmin_{h∈H} E(h)
    h*_{H,n} = optimum in H from a finite set of examples = argmin_{h∈H} E_n(h), where E_n(h) = (1/n) Σ_i L(h(x_i), y_i)

E(h*_{H,n}) − E(h*) = [E(h*_{H,n}) − E(h*_H)] + [E(h*_H) − E(h*)]
                      (ESTIMATION error)        (MODEL error)


Outline

  • Intro: What is Statistical Machine-Learning?
  • Typology of Machine-Learning
  • General formalism for SUPERVISED Learning
  • Evaluating learnt models: metrics for CLASSIFICATION

  • Generalization vs. overfitting

Formal definition of SUPERVISED LEARNING

"LEARNING = APPROXIMATE + GENERALIZE"

Given a FINITE set of examples (x1,y1), (x2,y2), …, (xn,yn), where x_i ∈ ℝ^d are input vectors and y_i ∈ ℝ^s are target values (given by the "teacher"), find a function h which "approximates AND GENERALIZES as best as possible" the underlying function f such that y_i = f(x_i) + noise.

⇒ goal = minimize the GENERALIZATION error

E_gen = ∫ ‖h(x) − f(x)‖² p(x) dx

(where p(x) = probability distribution of x)


About over-fitting

[Figure: fitting a data set with polynomials of different orders; from Bishop, "Pattern Recognition and Machine Learning"]

[Figure: detection of over-fitting for an iterative algorithm — error vs. learning iterations, on the training set and on the validation set]

The generalization error cannot be directly measured; only the empirical error on examples can be estimated:

E_emp = ( Σ_i ‖h(x_i) − y_i‖² ) / n


Machine-Learning methodology: importance of validation set

To avoid over-fitting and maximize generalization, it is absolutely essential to use some VALIDATION estimation for optimizing training hyper-parameters (and stopping criterion):

– either use a separate validation dataset (random split of data into Training-set + Validation-set)
– or use CROSS-VALIDATION:
  • Repeat k times: train on a (k−1)/k proportion of the data + estimate error on the remaining 1/k portion
  • Average the k error estimations

3-fold cross-validation (data split into subsets S1, S2, S3):
  • Train on S1 ∪ S2, then estimate error errS3 on S3
  • Train on S1 ∪ S3, then estimate error errS2 on S2
  • Train on S2 ∪ S3, then estimate error errS1 on S1
  • Average validation error: (errS1 + errS2 + errS3) / 3
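A minimal sketch of k-fold cross-validation as described above; train_fn and error_fn are hypothetical callables standing in for any learning algorithm and error estimate:

```python
import numpy as np

def k_fold_cv_error(X, y, train_fn, error_fn, k=3, seed=0):
    """Average validation error over k folds (random split of the data)."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))
    folds = np.array_split(idx, k)
    errors = []
    for i in range(k):
        val = folds[i]                                    # held-out 1/k portion
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        model = train_fn(X[train], y[train])              # train on (k-1)/k of the data
        errors.append(error_fn(model, X[val], y[val]))    # error on the held-out fold
    return np.mean(errors)

# Hypothetical usage with some fit/error helpers (not defined here):
# err = k_fold_cv_error(X, y, train_fn=fit_line, error_fn=mse, k=3)
```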


Empirical error and VC-dimension

  • In practice, the only error that can be estimated and minimized is the empirical error computed on a finite set of examples: E_emp = ( Σ_i ‖h(x_i) − y_i‖² ) / n
  • According to « regularization theory » and a theoretical result by Vapnik, minimizing E_emp(h) over h ∈ H shall also minimize E_gen if H has a finite VC-dimension

VC-dimension: maximum cardinality v such that there exists a set S of v points for which all dichotomies of S can be performed by some h ∈ H (VC-dim ≈ complexity of H)

[Question: VC-dimension of {hyperplanes of ℝ^d}?]


Regularization by adding a penalty to the cost function

Vapnik has shown that:

Proba( max_{h∈H} |E_gen(h) − E_emp(h)| ≥ ε ) < G(n, d, ε)

where n = # of examples, d = VC-dim, and G decreases as d/n decreases.

⇒ To be sure that E_gen decreases when minimizing E_emp, the smaller n is, the smaller the VC-dim d needs to be.

A possible way to automatically reduce the VC-dim is to modify the cost function into C = E_emp + Ω(h), where Ω(h) penalizes the « complexity » of h (⇒ reduction of the « effective » VC-dim).

NB: ≈ application of "Occam's razor"!! (≈ "why do it the complicated way if it can be done more simply?")


Usual form of regularization penalty: L1 norm

In many cases, the complexity (in the VC-dim sense) of a model increases with the maximum value of its parameters w_i ⇒ it is interesting to penalize large values of w_i. This is usually done by modifying the cost function into C = E_emp + λ Σ_i ‖w_i‖.

Example: LASSO = regularized linear regression [L1-norm penalization of the regressor]:

min_w ( Σ_j ‖y_j − w·x_j‖² + λ ‖w‖₁ )

NB: if using L0 penalization (# of NON-ZERO components) instead of L1, we can obtain a SPARSE model.
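A short illustration using scikit-learn's Lasso (assuming scikit-learn is available; its alpha parameter plays the role of λ above):

```python
import numpy as np
from sklearn.linear_model import Lasso

# Sparse ground truth: only 2 of 10 input dimensions actually matter
rng = np.random.default_rng(3)
X = rng.normal(size=(100, 10))
w_true = np.zeros(10)
w_true[0], w_true[3] = 2.0, -1.5
y = X @ w_true + rng.normal(scale=0.1, size=100)

# The L1 penalty drives most irrelevant coefficients to exactly zero
lasso = Lasso(alpha=0.1).fit(X, y)
print(np.round(lasso.coef_, 2))
```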


Data augmentation (for classification)

In the case of CLASSIFICATION, over-fitting avoidance and better generalization can also be favored by DATA AUGMENTATION: for each labelled example in the training set, generate several slightly distorted variants which keep the same label.

Particularly important (and easy) for image inputs or time-series inputs
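A minimal sketch of such label-preserving distortions for image inputs, assuming NumPy arrays with values in [0, 1]; note that each distortion must genuinely preserve the label (e.g. a horizontal flip is fine for pedestrian images but not for digits):

```python
import numpy as np

def augment_image(img, rng):
    """Generate a slightly distorted variant intended to keep the same label."""
    out = img.copy()
    if rng.random() < 0.5:
        out = out[:, ::-1]                        # horizontal flip (label-preserving
                                                  # only for suitable classes!)
    shift = rng.integers(-2, 3)                   # small horizontal translation
    out = np.roll(out, shift, axis=1)
    out = out + rng.normal(scale=0.01, size=out.shape)  # mild pixel noise
    return np.clip(out, 0.0, 1.0)

rng = np.random.default_rng(4)
img = rng.random((28, 28))   # stand-in for a labelled training image
variants = [augment_image(img, rng) for _ in range(5)]  # same label as img
```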


Synthesis on various algorithms for SUPERVISED Machine-Learning


Supervised learning (recap)

Examples (input, output): (x1,y1), (x2,y2), …, (xn,yn) + H = (parameterized) family of mathematical models + hyper-parameters for the training algorithm → LEARNING ALGORITHM (usually based on an optimization technique) → h ∈ H such that h(x_i) ≈ y_i


Summary of main shallow SUPERVISED learning algorithms

  • Decision trees: naturally adapted to symbolic inputs, very fast, good scaling for a very high number of classes, "white" box; BUT noise-sensitive
  • Multi-layer neural networks: universal approximators, good generalization, easy handling of multi-class; BUT optimum model NOT guaranteed, many critical hyper-parameters (# hidden neurons, weight init., learning rate, # training epochs, …)
  • Support Vector Machines: maths-guaranteed optimal separation, possible handling of structured input (graphs, etc.) via kernel; BUT not very efficient for multi-class (K times 1-vs-all SVMs, or at least log(K) times Ci-vs-Cj), and training computation rises quickly with input dim and # of examples: O( max(N,D) · min(N,D)² )
  • Boosting of « weak » classifiers: simple algo, can build a strong classifier from any weak classifier, can select features during training; BUT not very efficient for multi-class (K times 1-vs-all)
  • Random forests: OK for symbolic input, robust to noise, very fast to compute, efficient for large # of classes and high input dim; BUT training sometimes long


Model type choice criteria for SUPERVISED learning

(ratings from – to +++, per method: MLP Neural Network / ConvNets / SVM / Boosting / Decision Tree / Random Forest)

  • Many classes: MLP +, ConvNets +, SVM –, Boosting –, Decision Tree ++, Random Forest ++
  • High dimension of input: ConvNets +, SVM –, Random Forest ++
  • Many examples: REQUIRED for ConvNets (except if transfer-learning)
  • Interpretability (« white » box): YES for Decision Tree
  • Data OTHER than vectors of values: ConvNets: only “grid” data; SVM: structured (string, graph) via kernel; Decision Tree and Random Forest: symbolic
  • Robustness to noise and erroneous labels: MLP +, ConvNets +, SVM ++, Decision Tree –, Random Forest ++
  • Ease/speed of training: MLP –, SVM +, Decision Tree ++, Random Forest +
  • Handling of features: ConvNets learn them; Boosting performs automated selection
  • Execution time: ConvNets –, Decision Tree +++, Random Forest +


Some REFERENCE TEXTBOOKS on Statistical Machine-Learning

  • Introduction au machine learning, C. Azencott, Dunod, 2018. https://www.dunod.com/sciences-techniques/introduction-au-machine-learning-0
  • The Elements of Statistical Learning (2nd edition), T. Hastie, R. Tibshirani & J. Friedman, Springer, 2009. http://statweb.stanford.edu/~tibs/ElemStatLearn/
  • Deep Learning, I. Goodfellow, Y. Bengio & A. Courville, MIT Press, 2016. http://www.deeplearningbook.org/
  • Pattern Recognition and Machine Learning, C. M. Bishop, Springer, 2006.
  • Introduction to Data Mining, P.N. Tan, M. Steinbach & V. Kumar, Addison-Wesley, 2006.
  • Apprentissage artificiel : concepts et algorithmes, A. Cornuéjols, L. Miclet & Y. Kodratoff, Eyrolles, 2002.