
SLIDE 1

Wentworth Institute of Technology COMP4050 – Machine Learning | Fall 2015 | Derbinsky

Introduction to Machine Learning

Lecture 1

September 2, 2015 Introduction to Machine Learning 1

SLIDE 2

Outline

  • 1. What is Machine Learning?
  • 2. Key Terminology
  • 3. Machine Learning Tasks
  • 4. Challenges/Issues
  • 5. Developing a Machine Learning Application

SLIDE 3

What is Machine Learning (ML)?

The study/construction of algorithms that can learn from data

“The study of algorithms that improve their performance P at some task T with experience E” – Tom Mitchell (CMU)

A fusion of algorithms, artificial intelligence, statistics, optimization theory, visualization, …

SLIDE 4

Natural Language Processing (NLP)

Modern NLP algorithms are typically based on statistical ML

Applications:
  – Summarization
  – Machine Translation
  – Speech Processing
  – Sentiment Analysis
  – …

SLIDE 5

Computer Vision

Methods for acquiring, processing, analyzing, and understanding images

Applications:
  – Image search
  – Facial recognition
  – Object tracking
  – Image restoration
  – …

SLIDE 6

Games, Robotics, Medicine, Ads, …

SLIDE 7

Machine Learning is in Demand!

*glassdoor.com, National Avg as of August 24, 2015


Position                   Salary*
Data Scientist             $118,709
Machine Learning Engineer  $112,500
Software Engineer          $90,374

“A data scientist is someone who knows more statistics than a computer scientist and more computer science than a statistician.” – Josh Blumenstock (UW)

“Data Scientist = statistician + programmer + coach + storyteller + artist” – Shlomo Argamon (Ill. Inst. of Tech)

SLIDE 8

Key Terminology

Let’s consider a task [that we will revisit in greater detail]: handwritten digit recognition.

Given as input… have the computer correctly identify…


[Images of handwritten digits, labeled 0, 2, 1, 1, 5]

SLIDE 9

Instances and Features

An example (or instance) is the unit of input, composed of features (or attributes).

  • In this case, we could represent each digit via raw pixels: a 28x28 = 784-pixel vector of greyscale values [0-255]
    – Dimensionality: the number of features per instance (|vector|)
  • But other data representations are possible, and might be advantageous
  • In general, the problem of feature selection is challenging
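As a minimal sketch of this representation (assuming the image is stored as a 28x28 nested list of greyscale values, which is an illustrative choice rather than anything prescribed by the slides), flattening it into a 784-dimensional feature vector might look like:

```python
# Hypothetical 28x28 greyscale image: each value in [0, 255].
image = [[0] * 28 for _ in range(28)]
image[14][14] = 255  # a single dark pixel in the centre

# Flatten row-by-row into a single 784-dimensional feature vector.
features = [pixel for row in image for pixel in row]

# Dimensionality = number of features per instance.
dimensionality = len(features)  # 28 * 28 = 784
```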

SLIDE 10

Spot the Vocabulary!

[Figure labels: Instance, Features]

SLIDE 11

Common Feature Categorizations

Numeric/Quantitative
  • Continuous vs. Discrete
  • Measurement Scale
    – Interval: degree of difference is meaningful (e.g. Celsius)
    – Ratio: has a meaningful zero, so ratios are meaningful (e.g. Kelvin)

Symbolic/Qualitative
  • Fixed vs. open set
  • Measurement Scale
    – Nominal: supports equality, containment (e.g. hair color, part of speech)
    – Ordinal: supports ranking (e.g. Likert scale, true/false)
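A small illustration of the symbolic distinction (the feature values here are hypothetical): nominal features support only equality tests, while ordinal features support ranking:

```python
# Nominal feature (hair color): only equality/containment is meaningful.
hair_a, hair_b = "brown", "blonde"
same_hair = hair_a == hair_b  # a valid question
# ("brown" < "blonde" would run, but the ordering is meaningless.)

# Ordinal feature (Likert response): ranking is meaningful.
likert = {"disagree": 1, "neutral": 2, "agree": 3}
stronger = likert["agree"] > likert["neutral"]  # a valid comparison
```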

SLIDE 12

Summary of Measurement Scales

http://www.mymarketresearchmethods.com/types-of-data-nominal-ordinal-interval-ratio/

SLIDE 13

Describe the Features

SLIDE 14

Relational Instances

Typically make a closed-world assumption

Person1  Person2  Relationship
Ann      Bob      Friend
Ann      Sally    Friend
Ann      Billy    Sibling
Bob      Billy    Friend

SLIDE 15

“Target” Feature

When trying to predict a particular feature given the others, that feature is called the target, label, class, or concept.

SLIDE 16

Missing Data

  • An important issue in data processing (more later) is the idea of missing data
  • The cause could be failure (e.g. a faulty sensor) or a lack of information, but missing values should not be lightly confused with, or replaced by, a 0 or a default value
  • Similar to the concept of, and issues with, NULL in relational databases
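A small sketch of why this matters (the sensor readings are hypothetical): substituting 0 for a missing value distorts any statistic computed from the data, whereas keeping an explicit missing marker lets us restrict computation to observed values:

```python
# Hypothetical sensor readings; None marks a failed reading (missing),
# analogous to NULL in a relational database.
readings = [20.0, 21.0, None, 19.0]

# Wrong: treating missing as 0 drags the average down.
naive_mean = sum(r if r is not None else 0.0 for r in readings) / len(readings)

# Better: compute statistics over the observed values only.
observed = [r for r in readings if r is not None]
mean = sum(observed) / len(observed)
```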

SLIDE 17

Source Processes

  • Degree of randomness [w.r.t. modeling goals]
    – Deterministic: every output is uniquely determined by a set of parameters and by sets of previous states; always performs the same way for a given set of initial conditions
    – Stochastic (probabilistic): randomness is present, and variable states are described not by unique values, but by probability distributions
    – Often: a deterministic process plus a hypothesized distribution of noise
      • e.g. Gaussian Mixture Model
  • Problem state can be fully vs. partially observable
    – States/variables are either directly measured (observable) or inferred from data
      • Hidden: aspects of physical reality that cannot be/are not measured
      • Latent: abstract categories that are useful (e.g. to predict other data, reduce problem dimensionality)

SLIDE 18

Tasks, Datasets, Algorithms

  • It is important to keep clear the difference between the type of task, a particular dataset, and the various algorithms you could apply
  • Each task type specifies input/output constraints, to which a dataset must adhere
    – Forms a hypothesis space
  • Every algorithm makes certain modeling assumptions, and commits to performance tradeoffs in its hypothesis-space search and knowledge representation

SLIDE 19

Machine Learning Tasks

  • Supervised
    – Given a training set and a target variable, generalize; measured over a testing set
  • Unsupervised
    – Given a dataset, find “interesting” patterns; potentially no “right” answer
  • Reinforcement
    – Learn an optimal action policy over time: given an environment that provides states, affords actions, and provides feedback as numerical reward, maximize the expected future reward

SLIDE 20

Supervised Learning

[Figure: a labeled Training Set (α, β, β, …, γ) vs. an unlabeled Testing Set (?). Goal: generalization]

SLIDE 21

Supervised Tasks (1)

Classification: discrete target; binary vs. multi-class

SepalLength  SepalWidth  PetalLength  PetalWidth  Species
5.1          3.5         1.4          0.2         setosa
4.9          3.0         1.4          0.2         setosa
4.7          3.2         1.3          0.2         setosa

SLIDE 22

Supervised Tasks (2)

Regression: continuous target
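As a minimal sketch of regression (hypothetical data; closed-form least squares for a single feature, one of many possible fitting methods), fitting a line y = w*x + b:

```python
# Minimal 1-D least-squares fit: find w, b minimizing sum((y - (w*x + b))^2).
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]  # hypothetical data lying exactly on y = 2x + 1

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Closed-form solution: slope = covariance(x, y) / variance(x).
w = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) \
    / sum((x - mean_x) ** 2 for x in xs)
b = mean_y - w * mean_x
```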

SLIDE 23

Common Algorithms

  • Instance-based

– Nearest Neighbor (kNN)

  • Tree-based

– ID3, C4.5

  • Optimization-based

– Linear/logistic regression, support vector machines (SVM)

  • Probabilistic

– Naïve Bayes

  • Artificial Neural Networks

– Backpropagation – Deep learning

SLIDE 24

kNN

  • Store all examples
  • Find the k nearest neighbors to the target
    – Via a distance function
  • Vote on the class
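The three steps above can be sketched directly (the 2-D dataset is hypothetical; Euclidean distance is one common choice of distance function):

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training examples.

    `train` is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    # "Store all examples" is just keeping `train`; find the k nearest...
    neighbors = sorted(train, key=lambda ex: math.dist(ex[0], query))[:k]
    # ...then vote on the class.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical 2-D dataset with two classes.
train = [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"),
         ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
```

For example, `knn_predict(train, (0.5, 0.5))` votes among the three nearby "a" points and returns "a".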


[Figure panels: Training, Testing]

SLIDE 25

2D Multiclass Classification

[Figure panels: Boundary Tree, 1-NN via Linear Scan]

SLIDE 26

Decision Trees/Forests

Explicit knowledge representation, vs. implicit

SLIDE 27

Support Vector Machine (SVM)

Objective function; kernel trick

SLIDE 28

Artificial Neural Networks (ANN)

Perceptron (a linear classifier); gradient descent and backpropagation; feedforward vs. recurrent; deep architectures and the vanishing gradient

SLIDE 29

Types of Model Error

  • The goal of supervised learning is to develop a model that generalizes from the training set
  • In characterizing a model’s error, we decompose it into three types:
    – Bias: error from erroneous assumptions in the learning algorithm; w.r.t. a particular data point, the difference between the expected (or average) prediction of the model and the correct value we are trying to predict
    – Variance: error from sensitivity to small fluctuations in the training set; how much the predictions for a given point vary between different realizations of the model
    – Inherent/irreducible: the noise term in the data that cannot fundamentally be reduced by any model

SLIDE 30

Two Views of Bias and Variance

Mathematical view: model y = f(x), with learned estimate \hat{f}(x):

Err(x) = E[(Y - \hat{f}(x))^2] = Bias^2 + Variance + Irreducible Error

Bias = E[\hat{f}(x)] - f(x)
Variance = E[(\hat{f}(x) - E[\hat{f}(x)])^2]
Irreducible Error = \sigma^2

[Graphical view: figure]

SLIDE 31

The Bias-Variance Tradeoff

SLIDE 32

Under/Over-fitting

Underfitting: the model does not capture the important relationship(s)

Overfitting: the model describes noise instead of the underlying relationship

Approaches:

  • Regularization
  • Robust evaluation

– Cross validation
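A minimal sketch of k-fold cross validation (one form of robust evaluation): split the data into k folds and let each fold serve once as the testing set, so every example is tested on exactly once:

```python
def k_fold_splits(data, k=5):
    """Yield (train, test) partitions for k-fold cross validation.

    Each example appears in the test set exactly once across the k splits.
    """
    folds = [data[i::k] for i in range(k)]  # round-robin fold assignment
    for i in range(k):
        test = folds[i]
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, test
```

Real implementations usually shuffle the data first, and stratified variants preserve class proportions in each fold; this sketch omits both.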

SLIDE 33

Unsupervised Learning

No “right” answer; find “interesting” structure or patterns in the data

Tasks:
  – Clustering
  – Dimensionality reduction
  – Density estimation
  – Discovering graph structure
  – Matrix completion

SLIDE 34

Common Algorithms

  • k-Means Clustering
  • Collaborative Filtering
  • Principal Component Analysis (PCA)
  • Expectation Maximization (EM)
  • Artificial Neural Networks (e.g. RBM)
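As a minimal sketch of k-Means clustering (Lloyd's algorithm, alternating assignment and mean-update steps; the naive initialization and 2-D data here are illustrative assumptions):

```python
import math

def kmeans(points, k, iters=10):
    """Lloyd's algorithm on 2-D points: alternate assignment and mean update."""
    centers = list(points[:k])  # naive initialization: first k points
    for _ in range(iters):
        # Assignment step: each point joins its nearest center's cluster.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centers[c]))
            clusters[nearest].append(p)
        # Update step: each center moves to its cluster's mean.
        centers = [
            tuple(sum(coord) / len(cl) for coord in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
    return centers
```

With two well-separated blobs, e.g. `kmeans([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)], k=2)`, the centers converge to the two blob means.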

SLIDE 35

Reinforcement Learning (RL)


Choose actions to maximize future reward

SLIDE 36

The RL Cycle

[Figure: Agent–Environment loop; the agent observes state s_t, takes action a_t, and receives reward r_{t+1} and next state s_{t+1}]

  • Issues: credit assignment, exploration vs. exploitation, reward function, …

SLIDE 37

Temporal Difference (TD) Learning

  • Evidence that some (dopamine) neurons operate similarly
  • Led to world-class play via TD-Gammon (a neural network trained via TD-learning)

Q(s_t, a_t) ← Q(s_t, a_t) + α [ r_{t+1} + γ Q(s_{t+1}, a_{t+1}) − Q(s_t, a_t) ]
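The update above (the Sarsa form of TD learning) can be sketched as a one-step update of an action-value table; the states, actions, and parameter values here are hypothetical:

```python
def td_update(Q, s, a, reward, s_next, a_next, alpha=0.1, gamma=0.9):
    """One temporal-difference (Sarsa) update of the action-value table Q.

    alpha is the learning rate, gamma the discount factor.
    """
    td_target = reward + gamma * Q[(s_next, a_next)]  # bootstrapped estimate
    td_error = td_target - Q[(s, a)]                  # the "surprise"
    Q[(s, a)] += alpha * td_error
    return Q

# Hypothetical two-state table: s1 already looks valuable.
Q = {("s0", "right"): 0.0, ("s1", "right"): 1.0}
td_update(Q, "s0", "right", reward=0.0, s_next="s1", a_next="right")
```

After the update, some of s1's value has propagated back to ("s0", "right"): 0.1 * (0 + 0.9 * 1.0) = 0.09.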

SLIDE 38

Issues/Challenges

  • Big Data
  • Curse of Dimensionality
  • No Free Lunch

SLIDE 39

Big Data – The Four V’s

Parametric algorithm: the model does not grow with data size

Data Volume:   MB → GB → TB
Data Veracity: certain → uncertain
Data Velocity: static → real-time
Data Variety:  homogeneous → heterogeneous

SLIDE 40

The Curse of Dimensionality

“Various phenomena that arise when analyzing and organizing data in high-dimensional spaces (often with hundreds or thousands of dimensions) that do not occur in low-dimensional settings such as the three-dimensional physical space of everyday experience.” – Wikipedia

  • Memory requirement increases
  • Required sampling increases
  • Distance functions become less useful

SLIDE 41

No Free Lunch

  • There is no universally best model – a set of assumptions that works well in one domain may work poorly in another
  • We need many different models, and algorithms that have different speed-accuracy-complexity tradeoffs

SLIDE 42

Machine Learning Applications

  • 1. Collect the data
  • 2. Preprocess the data
  • 3. Analyze the input data

– Model selection

  • 4. Train, evaluate
  • 5. Deployment

SLIDE 43

Collecting Data

  • Public data sets

– RSS feeds

  • Application Programming Interface (API)
  • Generate via sensors/logs

SLIDE 44

Preprocessing

  • Converting formats
    – Binning
    – Mapping
    – Cleaning
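A minimal sketch of binning, turning a continuous feature into a discrete one (the feature and bin edges here are hypothetical; in practice edges come from domain knowledge or the data distribution):

```python
# Bin a continuous feature (e.g. age) into discrete categories.
edges = [18, 35, 65]  # hypothetical bins: <18, 18-34, 35-64, >=65

def bin_value(x, edges):
    """Return the index of the bin that x falls into (edges sorted ascending)."""
    return sum(x >= e for e in edges)  # count of edges at or below x

ages = [12, 25, 40, 70]
binned = [bin_value(a, edges) for a in ages]  # [0, 1, 2, 3]
```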

SLIDE 45

Data Analysis

  • Identifying incorrect/outlier/missing data
  • Use domain knowledge & simple statistical/visual results
    – Model selection
    – Feature selection/production
  • Understand under/over-representation

SLIDE 46

Train, Evaluate

  • Methods for meta-parameter selection (e.g. k in kNN)
    – Cross validation
  • Iteration is likely; consider multiple models if algorithmic assumptions do not match the application/data

SLIDE 47

Application Deployment

  • Automate the data collection/processing pipeline
  • May have to re-iterate given…
    – Real-world data
    – Performance constraints
    – Changes in application requirements

SLIDE 48

Summary

  • Machine Learning is the study of algorithms that can learn from data
  • Datasets are typically represented as a set of n instances/examples, each composed of k-dimensional feature vectors
  • Machine Learning tasks include supervised (classification, regression), unsupervised, and reinforcement
  • In the search for generalization over training data, supervised algorithms are seeking an ideal tradeoff between model bias and variance
  • Machine Learning applications involve an iterative process of data collection/preprocessing/analysis, training/evaluation, and eventual deployment
