STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin - PowerPoint PPT Presentation

Outline Simple Linear Regression Model STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016

Outline Simple Linear Regression Model Outline Simple Linear Regression Model

Outline Simple Linear Regression Model The Project Find a relationship between a response variable ( Y ) and one or more predictor/explanatory variables, X 1 , . . . , X k . Y = f ( X ) + ε DATA = PATTERN + IDIOSYNCRACIES • One vs two means: Y quantitative, X categorical • Simple Linear Regression: Both quantitative (but still just one X )

Outline Simple Linear Regression Model Examples • Y = Home Price X = Home size • Y = Exam score X = Hours spent studying • Y = State % in poverty X = State % with no health insurance • Y = SAT score X = Family income

Outline Simple Linear Regression Model The Simple Linear Model Y = β 0 + β 1 · X + ε aka Response = Intercept + Slope · Predictor + Random Error Standard form: Assume the ε ∼ N (0 , σ ε ) and are independent Parameters to estimate: β 0 , β 1 and σ ε

Outline Simple Linear Regression Model SLM Visualized

Outline Simple Linear Regression Model SLM With Data

Outline Simple Linear Regression Model Presidential Approval and Re-election Margin ● ● 20 Reelection Margin (%) ● ● 10 ● ● 5 ● ● 0 ● ● −10 ● 30 40 50 60 70 Incumbent Approval (%)

Outline Simple Linear Regression Model Conditions for SLM Pattern 1. Mean Y at each X is a linear function of X : µ Y ( X ) = f ( X ) = β 0 + β 1 X Residuals 2. Zero mean: Residuals centered at 0 3. Constant variance: Same variability at all X (Homoskedasticity) 4. Independence: No relationship among errors 5. Normality (for standard form): At each X , Y s are Normally distributed

Outline Simple Linear Regression Model Exploring violations of conditions https://gallery.shinyapps.io/slr_diag/

Outline Simple Linear Regression Model Re-election Margin: Two Models ● ● ● ● 20 20 Reelection Margin (%) Reelection Margin (%) ● ● ● ● 10 10 ● ● ● ● 5 5 ● ● ● ● 0 0 ● ● ● ● −10 −10 ● ● 30 40 50 60 70 30 40 50 60 70 Incumbent Approval (%) Incumbent Approval (%) Figure: Left: Constant Model Y = β 0 + ε ; Right: Best Fit Linear Model: Y = β 0 + β 1 X + ε

Outline Simple Linear Regression Model FIT: What parameters? The Simple Linear Model Y = β 0 + β 1 · X + ε aka Response = Intercept + Slope · Predictor + Random Error Standard form: Assume the ε ∼ N (0 , σ ε ) and are independent Parameters to estimate: β 0 , β 1 and σ ε

Outline Simple Linear Regression Model Minimizing Sum of Squared Residuals • From data, pick estimates ˆ β 0 and ˆ β 1 to define an estimated f ( X ) (can write ˆ f ( X ) ). Defines prediction equation: Y i = ˆ ˆ f ( X i ) = ˆ β 0 + ˆ β 1 X i • If we want ˆ f ( X i ) to represent mean Y at X i , choose ˆ β 0 and ˆ β 1 to minimize sum of squared residuals: � ( Y i − ˆ Y i ) 2 SSR = • How? Multivariable calculus gives us formulae: � ( X i − ¯ X )( Y i − ¯ Y ) ˆ β 0 = ¯ ˆ Y − ˆ β 1 ¯ β 1 = X � ( X i − ¯ X ) 2

Outline Simple Linear Regression Model Re-election Margin: Two Models ● ● ● ● 20 20 Reelection Margin (%) Reelection Margin (%) ● ● ● ● 10 10 ● ● ● ● 5 5 ● ● ● ● 0 0 ● ● ● ● −10 −10 ● ● 30 40 50 60 70 30 40 50 60 70 Incumbent Approval (%) Incumbent Approval (%) Figure: Left: Best fit Constant Model Y = ¯ ε ; Right: Best Fit Y + ˆ Linear Model: Y = ˆ β 0 + ˆ β 1 X + ˆ ε

Outline Simple Linear Regression Model Estimating σ ε • The standard estimate of the population standard deviation of residuals, σ ε is (almost) the sample standard deviation of the residuals �� ( Y i − ˆ � SSR Y i ) 2 σ ε = ˆ n − 2 = n − 2 • We usually have n − 1 in the denominator when computing sample variance. Why n − 2 here?

Outline Simple Linear Regression Model ASSESS: Check conditions with residual plots https://gallery.shinyapps.io/slr_diag/

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin - PowerPoint PPT Presentation

Outline Simple Linear Regression Model STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline Simple Linear Regression Model Outline Simple Linear Regression Model Outline Simple Linear Regression

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Simple linear regression STAT 401A - Statistical Methods for Research Workers Jarad Niemi Iowa

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

STAT 213 Interactions in Multiple Regression Colin Reimer Dawson Oberlin College 29 March 2016

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Slide 4 / 213 Slide 4 (Answer) / 213 Slide 5 / 213 Derivatives Exploration Exploration into the

Linear regression Linear regression is a simple approach to supervised learning. It assumes

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Bayesian linear regression Dr. Jarad Niemi STAT 544 - Iowa State University April 23, 2019

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Logistic Regression II Colin Reimer Dawson Oberlin College 28 April 2016 Outline

STAT 213 ANOVA as Multiple Regression Colin Reimer Dawson Oberlin College 5 April 2016 Outline

R01 - Simple linear regression STAT 587 (Engineering) Iowa State University October 17, 2020

Outline The Simple Linear Regression Model (12.1) Fitting the Regression Line (12.2)

Berkeley/Stanford Recovery-oriented October 25, 2001 Computing Course Lecture Problem definition

How Green is Multipath TCP for Mobile Devices? Yeon-sup Lim 1 , Yung-Chih Chen 1 , Erich M. Nahum

Forecasting Methodologies Dave Appleby Types Standard Erlang-C Holt-Winters ARIMA So

Large systems of diffusions interacting through their ranks Mykhaylo Shkolnikov INTECH

Welcome Back! EDUC 7610 Chapter 2 The Simple Regression Model Fall 2018 Tyson S. Barrett,

2.4 OLS: Goodness of Fit and Bias ECON 480 Econometrics Fall 2020 Ryan Safner

CEE 697K ENVIRONMENTAL REACTION KINETICS Lecture #18 Chloramines with Surface Reactions: Pipe

CEE 697K ENVIRONMENTAL REACTION KINETICS Lecture #18 Chloramines with Surface Reactions: Pipe

Sambuz

Useful Links

Newsletter

Mail Us