Key Algebraic Results in Linear Regression James H. Steiger - PowerPoint PPT Presentation

Key Algebraic Results in Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30

Key Algebraic Results in Linear Regression Introduction 1 Bivariate Linear Regression 2 Multiple Linear Regression 3 Multivariate Linear Regression 4 Extensions to Random Variables and Random Vectors 5 Partial Correlation 6 James H. Steiger (Vanderbilt University) 2 / 30

Introduction Introduction In this module, we explore the algebra of least squares linear regression systems with a special eye toward developing the properties useful for deriving factor analysis and structural equation modeling. A key insight is that important properties hold whether or not variables are observed. James H. Steiger (Vanderbilt University) 3 / 30

Bivariate Linear Regression Bivariate Linear Regression In bivariate linear regression performed on a sample of n observations, we seek to examine the extent of the linear relationship between two observed variables, X and Y . One variable (usually the one labeled Y ) is the dependent or criterion variable, the other (usually labeled X ) is the independent or predictor variable. Each data point represents a pair of scores, x i , y i that may be plotted as a point in the plane. Such a plot, called a scatterplot , is shown on the next slide. In these data, gathered on a group of male college students, the independent variable plotted on the horizontal ( X ) axis is shoe size, and the dependent variable plotted on the vertical ( Y ) axis is height in inches. James H. Steiger (Vanderbilt University) 4 / 30

Bivariate Linear Regression Bivariate Linear Regression 80 75 Height in Inches 70 65 8 10 12 14 Shoe Size James H. Steiger (Vanderbilt University) 5 / 30

Bivariate Linear Regression Bivariate Linear Regression It would be a rare event, indeed, if all the points fell on a straight line. However, if Y and X have an approximate linear relationship, then a straight line, properly placed, should fall close to many of the points. Choosing a straight line involves choosing the slope and intercept, since these two parameters define any straight line. The regression model in the sample is that y i = ˆ β 0 + ˆ β 1 x i + e i (1) Generally, the least squares criterion, minimizing � n i =1 e 2 i under choice of ˆ β 0 and ˆ β 1 , is employed. Minimizing � n i =1 e 2 i is accomplished with the following well-known least squares solution . r Y , X S Y = s Y , X ˆ = s − 1 β 1 = X , X s x , y (2) s 2 S X X ˆ β 0 = Y • − β 1 X • (3) James H. Steiger (Vanderbilt University) 6 / 30

Bivariate Linear Regression Bivariate Linear Regression Deviation Score Formulas Suppose we were to convert X into deviation score form. This would have no effect on any variance, covariance or correlation involving X , but would change the mean of X to zero. What would be the effect on the least squares regression? Defining x ∗ i = x i − X • , we have the new least squares setup y i = ˆ 0 + ˆ β ∗ β ∗ 1 x ∗ i + e ∗ (4) i From the previous slide, we know that ˆ 1 = S Y , X ∗ / S X ∗ , X ∗ = S Y , X / S X , X = ˆ β ∗ β 1 , and that ∗ ˆ 0 = Y • − ˆ β ∗ β ∗ 1 X • = Y • . Thus, if X is shifted to deviation score form, the slope of the regression line remains unchanged, but the intercept shifts to Y • . It is easy to see that, should we also re-express the Y variable in deviation score form, the regression line intercept will shift to zero and the slope will still remain unchanged. James H. Steiger (Vanderbilt University) 7 / 30

Bivariate Linear Regression Bivariate Linear Regression Variance of Predicted Scores Using linear transformation rules, one may derive expressions for the variance of the predicted (ˆ y i ) scores, the residual ( e i ) scores, and the covariance between them. For example consider the variance of the predicted scores. Remember that adding a constant (in this case ˆ β 0 ) has no effect on a variance, and multiplying by a constant multiplies the variance by the square of y i = ˆ β 1 x i + ˆ the multiplier. So, since ˆ β 0 , it follows immediately that s 2 β 2 ˆ 1 S 2 = ˆ X Y ( r Y , X S Y / S X ) 2 S 2 = X r 2 Y , X S 2 = (5) Y James H. Steiger (Vanderbilt University) 8 / 30

Bivariate Linear Regression Bivariate Linear Regression Covariance of Predicted and Criterion Scores The covariance between the criterion scores ( y i ) and predicted scores (ˆ y i ) is obtained by the heuristic rule. Begin by re-expressing ˆ y i as β 1 x i + β 0 , then recall that additive constant β 0 cannot affect a covariance. So the covariance between y i and ˆ y i is the same as the covariance between y i and ˆ β 1 x i . β 1 X = ˆ Using the heuristic approach, we find that S Y , ˆ Y = S Y , ˆ β 1 S Y , X Recalling that S Y , X = r Y , X S Y S X , and ˆ β 1 = r Y , X S Y / S X , one quickly arrives at ˆ S Y , ˆ = β 1 S Y , X Y = ( r Y , X S Y S X )( r Y , X S Y / S X ) r 2 Y , X S 2 = Y S 2 = (6) ˆ Y James H. Steiger (Vanderbilt University) 9 / 30

Bivariate Linear Regression Bivariate Linear Regression Covariance of Predicted and Residual Scores Calculation of the covariance between the predicted scores and residual scores proceeds in much the same way. Re-express e i as y i − ˆ y i , then use the heuristic rule. One obtains = S ˆ S ˆ Y , Y − ˆ Y , E Y Y , Y − S 2 = S ˆ ˆ Y S 2 Y − S 2 = (from Equation 6) ˆ ˆ Y = 0 (7) James H. Steiger (Vanderbilt University) 10 / 30

Bivariate Linear Regression Bivariate Linear Regression Covariance of Predicted and Residual Scores Calculation of the covariance between the predicted scores and residual scores proceeds in much the same way. Re-express e i as y i − ˆ Y i , then use the heuristic rule. One obtains = S ˆ S ˆ Y , y − ˆ Y , E Y Y , y − S 2 = S ˆ ˆ Y S 2 Y − S 2 = (from Equation 6) ˆ ˆ Y = 0 (8) Predicted and error scores always have exactly zero covariance, and zero correlation, in linear regression. James H. Steiger (Vanderbilt University) 11 / 30

Bivariate Linear Regression Bivariate Linear Regression Additivity of Variances Linear regression partitions the variance of Y into non-overlapping portions. Using a similar approach to the previous proofs, we may show easily that S 2 Y = S 2 Y + S 2 (9) ˆ E James H. Steiger (Vanderbilt University) 12 / 30

Multiple Linear Regression Multiple Linear Regression Multiple linear regression with a single criterion variable and several predictors is a straightforward generalization of bivariatelinear regression. To make the notation simpler, assume that the criterion variable Y and the p predictor variables X j , j = 1 , . . . , p are in deviation score form. Let y be an n × 1 vector of criterion scores, and X be the n × p matrix with the predictor variables in columns. Then the multiple regression prediction equation in the sample is ˆ y = y + e X ˆ = β + e (10) James H. Steiger (Vanderbilt University) 13 / 30

Multiple Linear Regression Multiple Linear Regression The least squares criterion remains essentially as before, i.e., minimize � e 2 i = e ′ e under choice of ˆ β . The unique solution is � − 1 X ′ y ˆ � X ′ X β = (11) which may also be written as β = S − 1 ˆ XX S XY (12) James H. Steiger (Vanderbilt University) 14 / 30

Multivariate Linear Regression Multivariate Linear Regression The notation for multiple linear regression with a single criterion generalizes immediately to situations where more than one criterion is being predicted simultaneously. Specifically, let n × q matrix Y contain q criterion variables, and let ˆ β be a p × q matrix of regression weights. The least squares criterion is satisfied when the sum of squared errors across all variables (i.e. Tr( E ′ E )) is minimized. The unique solution is the obvious generalization of Equation 11, i.e., � − 1 X ′ Y ˆ � X ′ X B = (13) James H. Steiger (Vanderbilt University) 15 / 30

Multivariate Linear Regression Multivariate Linear Regression We will now prove some multivariate generalizations of the properties we developed earlier for bivariate linear regression systems. First, we prove that ˆ Y = XB and E = Y − X ˆ B are uncorrelated. To do this, we examine the covariance matrix between them, and prove that it is a null matrix. Recall from the definition of the sample covariance matrix that, when scores in Y and X are in deviation score form, that S YX = 1 / ( n − 1) Y ′ X . Hence, (moving the n − 1 to the left of the formula for simplicity), James H. Steiger (Vanderbilt University) 16 / 30

Multivariate Linear Regression Multivariate Linear Regression ′ E ˆ ( n − 1) S YE = Y � ′ � � � X ˆ Y − X ˆ = B B ′ X ′ � � ˆ Y − X ˆ = B B ˆ ′ X ′ Y − ˆ ′ X ′ X ˆ = B B B � − 1 X ′ Y − Y ′ X � − 1 X ′ X � − 1 X ′ Y Y ′ X � X ′ X � X ′ X � X ′ X = � − 1 X ′ Y − Y ′ X � − 1 X ′ Y Y ′ X � X ′ X � X ′ X = = 0 (14) James H. Steiger (Vanderbilt University) 17 / 30

Key Algebraic Results in Linear Regression James H. Steiger - PowerPoint PPT Presentation

Key Algebraic Results in Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30 Key Algebraic Results in Linear Regression Introduction 1

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

CS70: Lecture 35. Regression (contd.): Linear and Beyond CS70: Lecture 35. Regression (contd.):

Chapter 7 Linear Regression 04/05/2016 Huamei Dong 1. Review Least square regression line 2.

Technical conditions for linear regression Jo Hardin Professor, Pomona College DataCamp

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan CSE 5334 Saravanan

Statistics! EDUC 7610 Chapter 3 The Multiple Regression Model ! " = $ % + $ ' ( '" $

Game theory (Ch. 17.5) Game theory Typically game theory uses a payoff matrix to represent the

Welcome to the BOOST Collaborative! Please familiarize yourself with the control panel. The

Business Through a Crisis Brian J. Sharkey, Director-in-Charge, Business Advisory Robert. S.

Outline Introduction. Paper: System Design for Ultra-Low Power. Bernier, C. Hameau, F.,

High-dimensional consistency in score-based and hybrid structure learning Marloes Maathuis joint

Variational Inference of Sparse Network from Count Data Julien Chiquet, Mahendra Mariadasou, St

Prs Prsrt Pt

Key Algebraic Results in Linear Regression James H. Steiger - PowerPoint PPT Presentation

Key Algebraic Results in Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30 Key Algebraic Results in Linear Regression Introduction 1

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

CS70: Lecture 35. Regression (contd.): Linear and Beyond CS70: Lecture 35. Regression (contd.):

Chapter 7 Linear Regression 04/05/2016 Huamei Dong 1. Review Least square regression line 2.

Technical conditions for linear regression Jo Hardin Professor, Pomona College DataCamp

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan CSE 5334 Saravanan

Statistics! EDUC 7610 Chapter 3 The Multiple Regression Model ! &quot; = $ % + $ ' ( '&quot; $

Game theory (Ch. 17.5) Game theory Typically game theory uses a payoff matrix to represent the

Welcome to the BOOST Collaborative! Please familiarize yourself with the control panel. The

Business Through a Crisis Brian J. Sharkey, Director-in-Charge, Business Advisory Robert. S.

Outline Introduction. Paper: System Design for Ultra-Low Power. Bernier, C. Hameau, F.,

High-dimensional consistency in score-based and hybrid structure learning Marloes Maathuis joint

Variational Inference of Sparse Network from Count Data Julien Chiquet, Mahendra Mariadasou, St

Prs Prsrt Pt

Statistics! EDUC 7610 Chapter 3 The Multiple Regression Model ! " = $ % + $ ' ( '" $