SLIDE 1

Linear Regression

4/14/17

SLIDE 2

Hypothesis Space

Supervised learning

  • For every input in the data set, we know the output

Regression

  • Outputs are continuous
  • A number, not a category label

The learned model:

  • A linear function mapping input to output
  • A weight for each feature (including bias)
SLIDE 3

Linear Models

In two dimensions: $f(x) = wx + b$

In d dimensions:

$$\vec{x} \equiv \begin{bmatrix} x_0 \\ x_1 \\ \vdots \\ x_d \end{bmatrix}, \qquad f(\vec{x}) = \begin{bmatrix} w_b \\ w_0 \\ \vdots \\ w_d \end{bmatrix} \cdot \begin{bmatrix} 1 \\ x_0 \\ \vdots \\ x_d \end{bmatrix}$$

We want to find the linear model that fits our data best. When have we seen a model like this before?
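As a concrete sketch (Python/NumPy; the function name and example values are ours, not from the slides), the d-dimensional model can be evaluated by prepending a constant 1 to the input, so the bias weight $w_b$ is handled like any other weight:

```python
import numpy as np

def linear_model(w, x):
    """Evaluate f(x) = [w_b, w_0, ..., w_d] . [1, x_0, ..., x_d].

    w has one more entry than x: its leading entry is the bias weight w_b.
    """
    return np.dot(w, np.concatenate(([1.0], x)))

# Example: f(x) = 2 + 3*x_0 - 1*x_1
w = np.array([2.0, 3.0, -1.0])
x = np.array([1.0, 4.0])
print(linear_model(w, x))  # 2 + 3*1 - 1*4 = 1.0
```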

SLIDE 4

Linear Regression

We want to find the linear model that fits our data best. Key idea: model data as a linear model plus noise. Pick the weights to minimize the noise magnitude.

$$f(\vec{x}) = \begin{bmatrix} w_b \\ w_0 \\ \vdots \\ w_d \end{bmatrix} \cdot \begin{bmatrix} 1 \\ x_0 \\ \vdots \\ x_d \end{bmatrix} + \epsilon$$

SLIDE 5

Squared Error

The learned model $\hat{f}$ and the true model $f$:

$$\hat{f}(\vec{x}) = \begin{bmatrix} w_b \\ w_0 \\ \vdots \\ w_d \end{bmatrix} \cdot \begin{bmatrix} 1 \\ x_0 \\ \vdots \\ x_d \end{bmatrix}, \qquad f(\vec{x}) = \begin{bmatrix} w_b \\ w_0 \\ \vdots \\ w_d \end{bmatrix} \cdot \begin{bmatrix} 1 \\ x_0 \\ \vdots \\ x_d \end{bmatrix} + \epsilon$$

Define the error for a data point to be the squared distance between the correct output and the predicted output:

$$\left( f(\vec{x}) - \hat{f}(\vec{x}) \right)^2 = \epsilon^2$$

The error for the model is the sum of the point errors:

$$\sum_{\vec{x} \in \text{data}} \left( y - \hat{f}(\vec{x}) \right)^2 = \sum_{\vec{x} \in \text{data}} \epsilon_{\vec{x}}^2$$
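As a minimal sketch of this error in code (Python/NumPy; the function names are ours), using the design-matrix convention from the next slide, where data points are columns and the first row is all 1s:

```python
import numpy as np

def predict(w, X):
    """Apply the linear model to every data point.

    X is (d+1) x n with data points as columns and a first row of 1s,
    so X.T @ w computes w . [1, x_0, ..., x_d] for each point.
    """
    return X.T @ w

def sum_squared_error(w, X, y):
    """Sum over the data of (y - f_hat(x))^2."""
    residuals = y - predict(w, X)
    return np.sum(residuals ** 2)
```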

SLIDE 6

Minimizing Squared Error

Goal: pick weights that minimize squared error.

Approach #1: gradient descent. Your reading showed how to do this for 1D inputs:
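The reading's 1D derivation isn't reproduced here, but a hedged sketch of gradient descent on the squared error for 1D inputs looks like this (the learning rate and step count are illustrative choices, not values from the slides):

```python
import numpy as np

def gradient_descent_1d(x, y, lr=0.01, steps=1000):
    """Fit f(x) = w*x + b by gradient descent on the mean squared error.

    For E = mean((y - (w*x + b))^2):
      dE/dw = -2 * mean((y - (w*x + b)) * x)
      dE/db = -2 * mean(y - (w*x + b))
    """
    w, b = 0.0, 0.0
    for _ in range(steps):
        residual = y - (w * x + b)
        w += lr * 2 * np.mean(residual * x)  # step opposite the gradient
        b += lr * 2 * np.mean(residual)
    return w, b

# Example: noisy samples of y = 3x + 2
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 50)
y = 3.0 * x + 2.0 + 0.1 * rng.standard_normal(50)
print(gradient_descent_1d(x, y))  # roughly (3, 2)
```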

SLIDE 7

Minimizing Squared Error

Goal: pick weights that minimize squared error.

Approach #2 (the right way): analytical solution.

  • The gradient is 0 at the error minimum.
  • There is generally a unique global minimum.

$$\vec{w} = \left( X^T X \right)^{-1} X^T \vec{y}$$

$$X \equiv \begin{bmatrix} \vec{x}_0 & \vec{x}_1 & \cdots & \vec{x}_n \end{bmatrix} \equiv \begin{bmatrix} 1 & 1 & \cdots & 1 \\ x_{00} & x_{01} & \cdots & x_{0n} \\ x_{10} & x_{11} & \cdots & x_{1n} \\ \vdots & \vdots & \ddots & \vdots \\ x_{d0} & x_{d1} & \cdots & x_{dn} \end{bmatrix}$$
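In code, this is a few lines of NumPy (a hedged sketch; we solve the normal equations with `np.linalg.solve` rather than forming the explicit inverse, which is the numerically safer route):

```python
import numpy as np

def fit_linear_regression(X, y):
    """Solve the normal equations for the weight vector.

    X is (d+1) x n with data points as columns and a first row of 1s
    (the slide's convention), so A = X.T has one row per data point.
    Solving (A^T A) w = A^T y is equivalent to w = (A^T A)^{-1} A^T y
    but avoids computing a matrix inverse.
    """
    A = X.T
    return np.linalg.solve(A.T @ A, A.T @ y)

# Example: two input features plus the bias.
rng = np.random.default_rng(1)
pts = rng.standard_normal((2, 100))    # one data point per column
X = np.vstack([np.ones(100), pts])     # prepend the row of 1s
y = 2.0 + 3.0 * pts[0] - 1.0 * pts[1]  # noiseless targets for clarity
print(fit_linear_regression(X, y))     # approximately [2, 3, -1]
```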

SLIDE 8

Change of Basis

Polynomial regression is just linear regression with a change of basis: map each input to an expanded feature vector, then perform linear regression on the new representation.

$$\begin{bmatrix} x_0 \\ x_1 \\ \vdots \\ x_d \end{bmatrix} \longrightarrow \begin{bmatrix} x_0 \\ (x_0)^2 \\ x_1 \\ (x_1)^2 \\ \vdots \\ x_d \\ (x_d)^2 \end{bmatrix} \qquad\qquad \begin{bmatrix} x_0 \\ x_1 \\ \vdots \\ x_d \end{bmatrix} \longrightarrow \begin{bmatrix} x_0 \\ (x_0)^2 \\ (x_0)^3 \\ x_1 \\ (x_1)^2 \\ (x_1)^3 \\ \vdots \\ x_d \\ (x_d)^2 \\ (x_d)^3 \end{bmatrix}$$

quadratic basis (left), cubic basis (right)
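A minimal sketch of such a basis expansion (the function name is ours); after mapping every data point, the linear-regression machinery from the previous slides is reused unchanged:

```python
import numpy as np

def polynomial_basis(x, degree):
    """Map [x_0, ..., x_d] to [x_0, x_0^2, ..., x_0^degree, x_1, ...].

    degree=2 gives the quadratic basis, degree=3 the cubic basis.
    """
    return np.concatenate([[xi ** p for p in range(1, degree + 1)] for xi in x])

# Example: the cubic basis of a 2D input.
x = np.array([2.0, 5.0])
print(polynomial_basis(x, 3))  # [2. 4. 8. 5. 25. 125.]
```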

SLIDE 9

Change of Basis Demo

SLIDE 10

Locally Weighted Regression

Recall locally weighted averaging from KNN. We can apply the same idea here: points that are further away should contribute less to the estimate. To estimate the value for a specific test point $\vec{x}_t$, compute a linear regression with the error weighted by distance:

$$\sum_{\vec{x} \in \text{data}} \frac{\left( y - \hat{f}(\vec{x}) \right)^2}{\text{dist}(\vec{x}_t, \vec{x})} = \sum_{\vec{x} \in \text{data}} \frac{\epsilon_{\vec{x}}^2}{\|\vec{x}_t - \vec{x}\|^2}$$
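A hedged sketch of this locally weighted fit: scale each data point by the square root of its weight, which turns the problem into ordinary least squares (the small `eps` guarding against a zero distance at the test point is our addition, not from the slides):

```python
import numpy as np

def locally_weighted_prediction(X, y, x_t, eps=1e-8):
    """Predict at x_t by minimizing sum_i (y_i - w . a_i)^2 / ||x_t - x_i||^2.

    X is (d+1) x n with a first row of 1s, as in the earlier slides.
    Multiplying row i of A = X.T and y_i by sqrt(weight_i) reduces the
    weighted problem to an ordinary least-squares solve.
    """
    A = X.T
    a_t = np.concatenate(([1.0], x_t))
    dists = np.sum((X[1:] - x_t[:, None]) ** 2, axis=0)  # ||x_t - x_i||^2
    s = np.sqrt(1.0 / (dists + eps))
    w = np.linalg.lstsq(A * s[:, None], y * s, rcond=None)[0]
    return a_t @ w
```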

SLIDE 11

Exam Topics

Covers the machine learning portion of the class.

  • Supervised learning
      • Regression
      • Classification
  • Unsupervised learning
      • Clustering
      • Dimensionality reduction
  • Semi-supervised learning
  • Reinforcement learning

Know the differences between these topics. Know what algorithms apply to which problems.

SLIDE 12

Machine Learning Algorithms

  • neural networks
      • perceptrons
      • backpropagation
      • auto-encoders
      • deep learning
  • decision trees
  • naive Bayes
  • k-nearest neighbors
  • support vector machines
  • locally-weighted average
  • linear regression
  • EM
  • K-means
  • Gaussian mixtures
  • hierarchical clustering
      • agglomerative
      • divisive
  • principal component analysis
  • growing neural gas
  • Q-learning
  • approximate Q-learning
  • ensemble learning