1
Review of Linear Regression
STAT E-150 Statistical Methods
2
Regression analysis is used to investigate whether there is a linear relationship between two quantitative variables. The variable we want to predict is the response variable; the variable we use for this prediction is the explanatory variable.
3
If a linear relationship exists, we can create a model for the relationship, and use this model to answer these questions:
What is the relationship between the variables? What does the slope of this linear model tell us? When is it appropriate to use this linear model to make predictions?
4
A First-Order Linear Model is of the form y = 0 + 1x + where y = the response variable x = the independent, or predictor, or explanatory variable = the random error 0 = where the regression line crosses the y-axis; the y-intercept of the regression line is the point (0, 0 ) 1 = the slope of the regression line change in y change in x change in y for every unit increase in x = =
5 6
Steps in regression
- 1. Hypothesize the form of the model for E(y), the mean or expected
value of y
- 2. Collect the sample data
- 3. Use the sample data to estimate the unknown parameters in the
model.
- 4. Specify the probability distribution of and estimate any unknown
parameters in the distribution. Check the validity of the assumptions made about the probability distribution.
- 5. Statistically check the usefulness of the model
- 6. If the model is useful, use the model for appropriate prediction and