Regression Modeling A Conceptual Introduction James H. Steiger - - PowerPoint PPT Presentation

regression modeling a conceptual introduction
SMART_READER_LITE
LIVE PREVIEW

Regression Modeling A Conceptual Introduction James H. Steiger - - PowerPoint PPT Presentation

Models as Representations of Reality Eliminating Systematic Model Error Summary Regression Modeling A Conceptual Introduction James H. Steiger Department of Psychology and Human Development Vanderbilt University Multilevel Regression


slide-1
SLIDE 1

Models as Representations of Reality Eliminating Systematic Model Error Summary

Regression Modeling — A Conceptual Introduction

James H. Steiger

Department of Psychology and Human Development Vanderbilt University

Multilevel Regression Modeling, 2009

Multilevel Conceptual Introduction

slide-2
SLIDE 2

Models as Representations of Reality Eliminating Systematic Model Error Summary

Regression Modeling — A Conceptual Introduction

1

Models as Representations of Reality The Fundamental Equation of Regression Modeling

2

Eliminating Systematic Model Error Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

3

Summary

Multilevel Conceptual Introduction

slide-3
SLIDE 3

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation At some time in the dim past, we were all exposed for the first time to simple linear regression and correlation analysis Because the equations surrounding these analyses were messy and very challenging, we may have missed the“big picture” The“big picture”is embodied in the following simple equation Data = Model + Error

Multilevel Conceptual Introduction

slide-4
SLIDE 4

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation At some time in the dim past, we were all exposed for the first time to simple linear regression and correlation analysis Because the equations surrounding these analyses were messy and very challenging, we may have missed the“big picture” The“big picture”is embodied in the following simple equation Data = Model + Error

Multilevel Conceptual Introduction

slide-5
SLIDE 5

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation At some time in the dim past, we were all exposed for the first time to simple linear regression and correlation analysis Because the equations surrounding these analyses were messy and very challenging, we may have missed the“big picture” The“big picture”is embodied in the following simple equation Data = Model + Error

Multilevel Conceptual Introduction

slide-6
SLIDE 6

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation At some time in the dim past, we were all exposed for the first time to simple linear regression and correlation analysis Because the equations surrounding these analyses were messy and very challenging, we may have missed the“big picture” The“big picture”is embodied in the following simple equation Data = Model + Error

Multilevel Conceptual Introduction

slide-7
SLIDE 7

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation For example, suppose we have data relating shoe size to standardized reading level for 100 boys, and our model is that there is a linear relationship between the two variables We do a standard linear regression, and the scatterplot looks like this

Multilevel Conceptual Introduction

slide-8
SLIDE 8

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation For example, suppose we have data relating shoe size to standardized reading level for 100 boys, and our model is that there is a linear relationship between the two variables We do a standard linear regression, and the scatterplot looks like this

Multilevel Conceptual Introduction

slide-9
SLIDE 9

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

The Fundamental Equation For example, suppose we have data relating shoe size to standardized reading level for 100 boys, and our model is that there is a linear relationship between the two variables We do a standard linear regression, and the scatterplot looks like this

Multilevel Conceptual Introduction

slide-10
SLIDE 10

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

  • 2

4 6 8 10 12 2 3 4 5 6 7 8 Shoe Size Reading Level

Multilevel Conceptual Introduction

slide-11
SLIDE 11

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-12
SLIDE 12

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-13
SLIDE 13

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-14
SLIDE 14

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-15
SLIDE 15

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-16
SLIDE 16

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error In the preceding slide, it appeared that, in fact, shoe size and reading level are linearly related in this sample of boys However, the data deviated from a straight line In this case, using standard linear regression, we“modeled”the error as independent and normally distributed around the regression line The model seems to have some validity Of course, that doesn’t mean that the model is“conveying the truth”about the relationship between shoe size and reading ability

Multilevel Conceptual Introduction

slide-17
SLIDE 17

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-18
SLIDE 18

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-19
SLIDE 19

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-20
SLIDE 20

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-21
SLIDE 21

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-22
SLIDE 22

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-23
SLIDE 23

Models as Representations of Reality Eliminating Systematic Model Error Summary The Fundamental Equation of Regression Modeling

The Fundamental Equation of Regression Modeling

Systematic vs. Random Error

Systematic vs. Random Error Model error can be systematic or random Systematic error can result from several sources:

The model may have ignored important predictors The functional form of the model may be incorrect The data may have a hierarchical structure that the model has ignored

In general, we will find that when we eliminate systematic error, we gain in accuracy and statistical power

Multilevel Conceptual Introduction

slide-24
SLIDE 24

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Identify Missing Independent Variables

Identify Missing Independent Variables A model for the data generally involves selecting one or more dependent variables, then constructing a model function to explain the dependent variable data as a function of independent variables Often, we can improve a model by realizing that an important independent variable is missing from the model For example, suppose we had measured IQ scores for the 120 boys in the reading level example cited earlier We might wish to incorporate IQ into our model, and, in so doing, we might reduce the amount of error

Multilevel Conceptual Introduction

slide-25
SLIDE 25

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Identify Missing Independent Variables

Identify Missing Independent Variables A model for the data generally involves selecting one or more dependent variables, then constructing a model function to explain the dependent variable data as a function of independent variables Often, we can improve a model by realizing that an important independent variable is missing from the model For example, suppose we had measured IQ scores for the 120 boys in the reading level example cited earlier We might wish to incorporate IQ into our model, and, in so doing, we might reduce the amount of error

Multilevel Conceptual Introduction

slide-26
SLIDE 26

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Identify Missing Independent Variables

Identify Missing Independent Variables A model for the data generally involves selecting one or more dependent variables, then constructing a model function to explain the dependent variable data as a function of independent variables Often, we can improve a model by realizing that an important independent variable is missing from the model For example, suppose we had measured IQ scores for the 120 boys in the reading level example cited earlier We might wish to incorporate IQ into our model, and, in so doing, we might reduce the amount of error

Multilevel Conceptual Introduction

slide-27
SLIDE 27

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Identify Missing Independent Variables

Identify Missing Independent Variables A model for the data generally involves selecting one or more dependent variables, then constructing a model function to explain the dependent variable data as a function of independent variables Often, we can improve a model by realizing that an important independent variable is missing from the model For example, suppose we had measured IQ scores for the 120 boys in the reading level example cited earlier We might wish to incorporate IQ into our model, and, in so doing, we might reduce the amount of error

Multilevel Conceptual Introduction

slide-28
SLIDE 28

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Identify Missing Independent Variables

Identify Missing Independent Variables A model for the data generally involves selecting one or more dependent variables, then constructing a model function to explain the dependent variable data as a function of independent variables Often, we can improve a model by realizing that an important independent variable is missing from the model For example, suppose we had measured IQ scores for the 120 boys in the reading level example cited earlier We might wish to incorporate IQ into our model, and, in so doing, we might reduce the amount of error

Multilevel Conceptual Introduction

slide-29
SLIDE 29

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form Sometimes we have the“right”independent variables, but our functional form is suboptimal On occasion, we can fix things by“transforming”one or more

  • f the variables

So, for example, if y is not a linear function of x, but is a linear function of log(x), then we can simply log-transform x and fit a linear function to the transformed data

Multilevel Conceptual Introduction

slide-30
SLIDE 30

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form Sometimes we have the“right”independent variables, but our functional form is suboptimal On occasion, we can fix things by“transforming”one or more

  • f the variables

So, for example, if y is not a linear function of x, but is a linear function of log(x), then we can simply log-transform x and fit a linear function to the transformed data

Multilevel Conceptual Introduction

slide-31
SLIDE 31

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form Sometimes we have the“right”independent variables, but our functional form is suboptimal On occasion, we can fix things by“transforming”one or more

  • f the variables

So, for example, if y is not a linear function of x, but is a linear function of log(x), then we can simply log-transform x and fit a linear function to the transformed data

Multilevel Conceptual Introduction

slide-32
SLIDE 32

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form Sometimes we have the“right”independent variables, but our functional form is suboptimal On occasion, we can fix things by“transforming”one or more

  • f the variables

So, for example, if y is not a linear function of x, but is a linear function of log(x), then we can simply log-transform x and fit a linear function to the transformed data

Multilevel Conceptual Introduction

slide-33
SLIDE 33

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In other situations, we know in advance that the scale of the dependent variable makes a straightforward linear model suboptimal For example, suppose we are interested in constructing a model to predict the probability of being admitted to law school as a function of LSAT scores We know that probabilities range from zero to 1, and that in a variety of situations, the graph will become nonlinear near the edges of its range

Multilevel Conceptual Introduction

slide-34
SLIDE 34

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In other situations, we know in advance that the scale of the dependent variable makes a straightforward linear model suboptimal For example, suppose we are interested in constructing a model to predict the probability of being admitted to law school as a function of LSAT scores We know that probabilities range from zero to 1, and that in a variety of situations, the graph will become nonlinear near the edges of its range

Multilevel Conceptual Introduction

slide-35
SLIDE 35

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In other situations, we know in advance that the scale of the dependent variable makes a straightforward linear model suboptimal For example, suppose we are interested in constructing a model to predict the probability of being admitted to law school as a function of LSAT scores We know that probabilities range from zero to 1, and that in a variety of situations, the graph will become nonlinear near the edges of its range

Multilevel Conceptual Introduction

slide-36
SLIDE 36

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In other situations, we know in advance that the scale of the dependent variable makes a straightforward linear model suboptimal For example, suppose we are interested in constructing a model to predict the probability of being admitted to law school as a function of LSAT scores We know that probabilities range from zero to 1, and that in a variety of situations, the graph will become nonlinear near the edges of its range

Multilevel Conceptual Introduction

slide-37
SLIDE 37

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In this case, we can employ a“statistical trick”known as generalized linear modeling to change the dependent variable so that we can fit a straight line Two of the best known special cases are logistic regression and Poisson regression

Multilevel Conceptual Introduction

slide-38
SLIDE 38

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In this case, we can employ a“statistical trick”known as generalized linear modeling to change the dependent variable so that we can fit a straight line Two of the best known special cases are logistic regression and Poisson regression

Multilevel Conceptual Introduction

slide-39
SLIDE 39

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Changing the Functional Form In this case, we can employ a“statistical trick”known as generalized linear modeling to change the dependent variable so that we can fit a straight line Two of the best known special cases are logistic regression and Poisson regression

Multilevel Conceptual Introduction

slide-40
SLIDE 40

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-41
SLIDE 41

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-42
SLIDE 42

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-43
SLIDE 43

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-44
SLIDE 44

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-45
SLIDE 45

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Changing the Functional Form

Dangers of Overfitting

Dangers of Overfitting There is a significant danger when we modify a model repeatedly, each time rechecking the fit, to reduce error Almost always a more complicated variant of a model will fit better Adding additional independent variables, for example, almost always improves model fit Repeated rechecks increase the probability that you are customizing your model to conform to chance variation in the data We have to be careful to avoid misleading ourselves

Multilevel Conceptual Introduction

slide-46
SLIDE 46

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure Many data sets, especially in education and the social sciences, are hierarchical in nature For example, children study within classrooms, which are situated within schools, which are in turn situated within school districts, and so on Each level of the hierarchy may require special modeling in

  • rder to properly capture the variation between children

Multilevel Conceptual Introduction

slide-47
SLIDE 47

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure Many data sets, especially in education and the social sciences, are hierarchical in nature For example, children study within classrooms, which are situated within schools, which are in turn situated within school districts, and so on Each level of the hierarchy may require special modeling in

  • rder to properly capture the variation between children

Multilevel Conceptual Introduction

slide-48
SLIDE 48

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure Many data sets, especially in education and the social sciences, are hierarchical in nature For example, children study within classrooms, which are situated within schools, which are in turn situated within school districts, and so on Each level of the hierarchy may require special modeling in

  • rder to properly capture the variation between children

Multilevel Conceptual Introduction

slide-49
SLIDE 49

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure Many data sets, especially in education and the social sciences, are hierarchical in nature For example, children study within classrooms, which are situated within schools, which are in turn situated within school districts, and so on Each level of the hierarchy may require special modeling in

  • rder to properly capture the variation between children

Multilevel Conceptual Introduction

slide-50
SLIDE 50

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-51
SLIDE 51

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-52
SLIDE 52

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-53
SLIDE 53

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-54
SLIDE 54

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-55
SLIDE 55

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Incorporating Hierarchical Structure For example, suppose that, in our shoe size data, the 120 boys were taken from 4th, 5th, and 6th grades in the same school Our original model did not take into account this hierarchical structure It simply lumped all the boys together into one large group We should probably look inside the grade levels, fit a linear model to each grade’s data separately, and see whether the relationship between shoe size and reading level persists and/or changes across levels It turns out that, if we do that, we get a rather different picture of the relationship between shoe size and reading level

Multilevel Conceptual Introduction

slide-56
SLIDE 56

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

2 4 6 8 10 12 2 3 4 5 6 7 8 Shoe Size Reading Level

  • grade

4 5 6

Multilevel Conceptual Introduction

slide-57
SLIDE 57

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

Analyzing by Grade We see that the overall linear fit, shown in black, is not the same as the fit within groups As grade level increases, we see a change in the mean level of shoe size and reading level We also see a change in the slope and intercept of the lines of fit within groups These lines are shown in red, blue, and green

Multilevel Conceptual Introduction

slide-58
SLIDE 58

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

Analyzing by Grade We see that the overall linear fit, shown in black, is not the same as the fit within groups As grade level increases, we see a change in the mean level of shoe size and reading level We also see a change in the slope and intercept of the lines of fit within groups These lines are shown in red, blue, and green

Multilevel Conceptual Introduction

slide-59
SLIDE 59

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

Analyzing by Grade We see that the overall linear fit, shown in black, is not the same as the fit within groups As grade level increases, we see a change in the mean level of shoe size and reading level We also see a change in the slope and intercept of the lines of fit within groups These lines are shown in red, blue, and green

Multilevel Conceptual Introduction

slide-60
SLIDE 60

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

Analyzing by Grade We see that the overall linear fit, shown in black, is not the same as the fit within groups As grade level increases, we see a change in the mean level of shoe size and reading level We also see a change in the slope and intercept of the lines of fit within groups These lines are shown in red, blue, and green

Multilevel Conceptual Introduction

slide-61
SLIDE 61

Models as Representations of Reality Eliminating Systematic Model Error Summary Identify Missing Independent Variables Change the Functional Form Incorporate Hierarchical Structure

Incorporating Hierarchical Structure

Analyzing by Grade

Analyzing by Grade We see that the overall linear fit, shown in black, is not the same as the fit within groups As grade level increases, we see a change in the mean level of shoe size and reading level We also see a change in the slope and intercept of the lines of fit within groups These lines are shown in red, blue, and green

Multilevel Conceptual Introduction

slide-62
SLIDE 62

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-63
SLIDE 63

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-64
SLIDE 64

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-65
SLIDE 65

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-66
SLIDE 66

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-67
SLIDE 67

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Goals

Summary In setting up a regression model, we need to think carefully about how to Select variables relevant to our theoretical goals Choose an appropriate linear, nonlinear, or generalized linear model for our data Avoid overfitting Reduce random error noise Exploit and investigate hierarchical aspects of the data structure

Multilevel Conceptual Introduction

slide-68
SLIDE 68

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction

slide-69
SLIDE 69

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction

slide-70
SLIDE 70

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction

slide-71
SLIDE 71

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction

slide-72
SLIDE 72

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction

slide-73
SLIDE 73

Models as Representations of Reality Eliminating Systematic Model Error Summary

Summary

Necessary Skills

Summary In achieving these goals, and to feel comfortable reading the textbook we need to develop some skills: Master or recall some key aspects of linear regression modeling Learn a tiny bit of matrix algebra (about one day’s worth) Learn the basic ideas behind generalized linear models Become familiar with R, WINBUGS, and (to a lesser extent) HLM Master enough technical details (things like when to center, when to standardize) to keep out of trouble

Multilevel Conceptual Introduction