Why Mixed Effects Models? Mixed Effects Models Recap/Intro Three - - PowerPoint PPT Presentation

why mixed effects models mixed effects models recap intro
SMART_READER_LITE
LIVE PREVIEW

Why Mixed Effects Models? Mixed Effects Models Recap/Intro Three - - PowerPoint PPT Presentation

Why Mixed Effects Models? Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative


slide-1
SLIDE 1

Why Mixed Effects Models?

slide-2
SLIDE 2

Mixed Effects Models Recap/Intro

  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting
slide-3
SLIDE 3

Problem One: Multiple Random Effects Problem One: Multiple Random Effects

  • Most studies sample

both subjects and items Subject 1 Subject 1 Subject 2 Subject 2 Knight Knight story story Monkey Monkey story story

slide-4
SLIDE 4

Problem One: Crossed Random Effects Problem One: Crossed Random Effects

  • Most studies sample

both subjects and items

– Typically, subjects

crossed with items

  • Each subject sees a

version of each item

– May also be only

partially crossed

  • Each subject sees only

some of the items

slide-5
SLIDE 5

...or Hierarchical Random Effects ...or Hierarchical Random Effects

  • Most studies sample

both subjects and items

– Typically, subjects

crossed with items

– May also have one

nested within the

  • ther (hierarchical)
  • e.g. autobiographical

memory

  • How to incorporate

this into model?

slide-6
SLIDE 6

Problem One: Multiple Random Effects Problem One: Multiple Random Effects

  • Why do we care about items, anyway?
  • #1: Investigate robustness of effects across items

– Concern is that effect could be driven by just 1 or 2

items – might not really be what we thought it was

– Psycholinguistics: View is that we studying language

too, not just people

  • Other areas of psychology have not tended to care about this

– Note: Including items in a model doesn't really “confirm” that the

effect is robust across items. It's still possible to get a reliable effect driven by a small number of items. But it allows you investigate how variable the effect is across items and why different items might be differentially influenced.

slide-7
SLIDE 7

Problem One: Multiple Random Effects Problem One: Multiple Random Effects

  • Why do we care about items?
  • #2: Violations of independence

– A BIG ISSUE – Suppose Amélie and Zhenghan see

items A & B but Tuan sees items C & D

– Likely that Amélie's results are more like

Zhenghan's than like Tuan's

– But ANOVA assumes observations

independent

– Even a small amount of dependency

can lead to spurious results (Quene & van

den Bergh, 2008)

  • Dependency you didn't account for makes the variance

look smaller than it actually is

A A B B C C D D

slide-8
SLIDE 8

What Constitutes an “Item”? What Constitutes an “Item”?

  • Items assumed to be independently sampled

sampled from population of relevant items

  • 2 related words / sentences not

independently sampled

– “The coach knew you missed practice.” – “The coach knew that you missed practice.” – Not a coincidence both are in your

experiment!

  • Should be considered the same

item

  • But 2 unrelated things can be

different items

ALL POSSIBLE DISCOURSES

slide-9
SLIDE 9
  • ANOVA solution

– Subjects analysis:

Average over multiple items for each subject

– Items analysis:

Average over multiple subjects for each item

  • Two sets of results

– Sometime combined

with min F'

– An approximation of

true min F

F1 = 18.31, p < .001 F2 = 22.10, p < .0001

Problem One: Crossed Random Effects Problem One: Crossed Random Effects

Note: not real data or statistical tests

slide-10
SLIDE 10
  • Some debate on how

accurate min F' is

– Scott will admit to not be fully

read up on this since I came in after people started switching to mixed effects models

  • Somewhat less

relevant now that we can use mixed effects models instead

F1 = 18.31, p < .001 F2 = 22.10, p < .0001

Problem One: Crossed Random Effects Problem One: Crossed Random Effects

Note: not real data or statistical tests

slide-11
SLIDE 11

Mixed Effects Models Recap/Intro

  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting
slide-12
SLIDE 12

Problem Two: Categorical Data

  • ANOVA assumes our response is continuous
  • But, we often want to look at categorical data

'Lightning hit the church.” vs. “The church was hit by lightning.”

RT: 833 ms

Choice of syntactic structure Item recalled

  • r not

Region fixated in eye-tracking experiment

slide-13
SLIDE 13

Problem One: Categorical Data

  • Traditional solution:

Analyze proportions

  • Violates assumptions of

ANOVA

– Among other issues: ANOVA

assumes normal distribution, which has infinite tails

– But proportions are clearly

bounded

– Model could predict

impossible values like 110%

Problem Two: Categorical Data

But proportions 1

slide-14
SLIDE 14

Problem One: Categorical Data

  • Traditional solution:

Analyze proportions

  • Violates assumptions of

ANOVA

– Among other issues: ANOVA

assumes normal distribution, which has infinite tails

– But proportions are clearly

bounded

– Model could predict

impossible values like 110%

Problem Two: Categorical Data

But proportions 1

slide-15
SLIDE 15

Problem One: Categorical Data

  • Traditional solution:

Analyze proportions

  • Violates assumptions
  • f ANOVA
  • Can lead to:

– Spurious effects (Type

I error)

– Missing a true effects

(Type II error)

Problem Two: Categorical Data

slide-16
SLIDE 16

Problem One: Categorical Data

  • Transformations improve the situation but

don't solve it

– Empirical logit is good (Jaeger, 2008) – Arcsine less so

  • Situation is worse for very high or very low

proportions (Jaeger, 2008)

– .30 to .70 are OK

Problem Two: Categorical Data

slide-17
SLIDE 17

Problem One: Categorical Data

  • Why can't we just use logistic regression?

– Predict if each trial's response is in category A or

category B

  • This is essentially what we will end up doing
  • But, if we are looking at things at a trial-by-

trial basis...

– Need to control for the different items on each trial – Problem One again!

Problem Two: Categorical Data

slide-18
SLIDE 18

Mixed Effects Models Recap/Intro

  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting
slide-19
SLIDE 19

Problem Three: Focus on Fixed Effects Problem Three: Focus on Fixed Effects

  • ANOVA doesn't characterize differences

between subjects or items

  • The bird that they spotted was a ....
  • We just have a mean effect
  • No info. about how much it varies

across participants or items

Predictable 283 ms Unpredictable 309 ms

cardinal cardinal pitohui pitohui

26 ms

MEAN READING TIME ENDING

slide-20
SLIDE 20

Problem Three: Focus on Fixed Effects Problem Three: Focus on Fixed Effects

  • Can try to account for some of this with an

ANCOVA

– But not typically done – And would have to be done separately for

participants and items (Problem One again)

Predictable 283 ms Unpredictable 309 ms 26 ms

MEAN

slide-21
SLIDE 21
  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focused on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting

Mixed Effects Models Recap/Intro

Power of subjects analysis! Power of items analysis!

Captain MLM to the rescue!

slide-22
SLIDE 22

Mixed Effects Models to the Rescue!

  • ANOVA: Unit of analysis is cell mean
  • MLM: Unit of analysis is individual trial!
slide-23
SLIDE 23

Mixed Models to the Rescue!

  • Look at individual trials
  • Model outcome using regression

=

Item Item

+ +

RT RT Prime? Prime? Subject Subject Semantic categorization: Is it a dinosaur? Problem One solved! Problem One solved!

slide-24
SLIDE 24

Mixed Models to the Rescue!

  • This means you will need your data formatted

differently than you would for an ANOVA

– Each trial gets its own line

slide-25
SLIDE 25

Mixed Models to the Rescue!

  • Is this useful for what we care about?

– Stereotypical view of regression is that it's about

predicting values

– In experimental settings we more typically want to

know if Variable X matters

  • Yes! We can test individual effects: Do they

contribute to the model?

– e.g. does priming predict something about RT?

=

Item Item

+ +

RT RT Prime? Prime? Jason Jason Subject Subject

slide-26
SLIDE 26
  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting

Mixed Effects Models Recap/Intro

slide-27
SLIDE 27

Fixed vs. Random Slopes

  • Fixed Slope: Same for all participants/items
  • Random Slope: Can vary by participants/items

=

+ +

RT RT Prime? Prime?

+

26 ms 88 ms

Laurel Laurel Stego. Stego.

slide-28
SLIDE 28

Fixed vs. Random Slopes

  • Fixed Slope: Same for all participants/items
  • Random Slope: Can vary by participants/items

=

+ +

RT RT Prime? Prime? Laurel Laurel

+

26 ms 315 ms

  • Dr. L
  • Dr. L

Example: Some items may show a larger priming effect than others

slide-29
SLIDE 29

Fixed vs. Random Slopes

  • Fixed Slope: Same for all participants/items
  • Random Slope: Can vary by participants/items
  • Can also test what explains variation

=

+ +

RT RT Prime? Prime? Laurel Laurel

+

26 ms 15 ms

  • Dr. L
  • Dr. L

+

Lex.Freq. Lex.Freq.

300 ms

e.g. Adding lexical frequency to the model may account for variation in priming effect

slide-30
SLIDE 30

Fixed vs. Random Slopes

  • Fixed Slope: Same for all participants/items
  • Random Slope: Can vary by participants/items
  • Can also test what explains variation

=

+ +

RT RT Prime? Prime? Laurel Laurel

+

26 ms 15 ms

  • Dr. L
  • Dr. L

+

Lex.Freq. Lex.Freq.

300 ms Problem Three Problem Three Solved! Solved!

slide-31
SLIDE 31
  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting

Mixed Effects Models Recap/Intro

slide-32
SLIDE 32

Link Functions

  • Specifies how to connect predictors to

the outcome

  • Every model has one....
  • ...sometimes, just the identity function

– With Gaussian (normal) data

+ + + + + + + +

RT RT Item Item Prime? Prime? Subject Subject 1300 ms

slide-33
SLIDE 33

Link Functions

  • Specifies how to connect predictors to

the outcome

  • For binomial (yes/no) outcomes: Model

log odds to predict outcome

+ + + + + + + +

Item Item Prime? Prime? Subject Subject Yes/No

Problem Two solved!

Accuracy Accuracy

slide-34
SLIDE 34

Link Functions

  • Default link function for binomial data is logit

(log odds)

– Odds: p(yes)/p(no) or p(yes)/[1-p(yes)]

  • No upper bound, but lower bound at 0

– Log Odds: ln(Odds)

  • Now unbounded at both ends
  • Can also use probit

– Based on cumulative distribution function of normal

distribution

– Very highly correlated with logit; almost always give

you same results as logit

  • Probit assumes slightly fewer hits at low end of distribution

& slightly more hits at high end

slide-35
SLIDE 35
  • Three issues with ANOVA

– Multiple random effects – Categorical data – Focus on fixed effects

  • What mixed effects models do

– Random slopes – Link functions

  • Iterative fitting

Mixed Effects Models Recap/Intro

slide-36
SLIDE 36

One Caveat...

Where do model results come from?

(Answer: When a design matrix and a data matrix really love each other...)

slide-37
SLIDE 37

One Caveat...

  • Fitting ANOVA / linear

regression has easy solution

  • A few matrix multiplications a computer can

do easily

– A “closed form solution”

  • Like a “beta machine” … you put your data in

and automatically get the One True Model

  • ut

b = (X'X)-1X'Y

slide-38
SLIDE 38

One Caveat...

  • MEMs requires iteration

– Check various sets of

betas until you find the best one

– R does this for you

  • An estimation

– Not mathematically guaranteed to be best fit

  • Complicated models take longer to fit

– If too many parameters relative to data, might completely fail

to converge (find the best set of betas)

– Scott's only experience with this is with multiple random

slopes of interactions

The best model: The one that smiles with its eyes