SLIDE 2 6/2/2017 2
T ypic al Mac hine L e ar ning topic s
Supervised Learning Unsupervised Learning Supervised learning is a type of machine learning algorithm that uses a known dataset (called the training dataset) to make
- predictions. The training dataset includes input data and
response values. From it, the supervised learning algorithm seeks to build a model that can make predictions of the response values for a new dataset. Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting
- f input data without labeled responses.
Classification / Regression Clustering, semi‐supervised learning
- Classification: for categorical response values, where the
data can be separated into specific “classes”.
- Regression: for continuous-response values.
- Clustering: the method for exploratory data
analysis to find hidden patterns or grouping in
- data. The clusters are modeled using a measure of
similarity which is defined upon metrics such as Euclidean or probabilistic distance.
- Semi‐supervised learning is a class of supervised
learning tasks and techniques that also make use of unlabeled data for training – typically a small amount of labeled data with a large amount of unlabeled data.
learning is to build a model that makes predictions based on evidence in the presence of uncertainty.
- As adaptive algorithms identify patterns in data, a computer
ʺlearnsʺ from the observations. When exposed to more
the computer improves its predictive performance.
– Classification: Decision Trees, Discriminant Analysis, Naive Bayes, k Nearest Neighbors (kNN), Support Vector Machines (SVM), Classification Ensembles ( a predictive model composed of a weighted combination of multiple classification models) – Regression: Linear Regression, Generalized Linear Models, Nonlinear Regression, Support Vector Machines (SVM), Gaussian Process Regression Models, Decision Trees, Regression Tree Ensembles
Supe r vise d L e ar ning Algor ithms