Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity - - PowerPoint PPT Presentation

applied statistical analysis
SMART_READER_LITE
LIVE PREVIEW

Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity - - PowerPoint PPT Presentation

Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity using data Today 1. Relationships! 2. Correlation and Intro to Regression 3. Chapter 13 in Book 2 Comparing Means Assessing Relationships Is there a relationship between the


slide-1
SLIDE 1

Applied Statistical Analysis

EDUC 6050 Week 9

Finding clarity using data

slide-2
SLIDE 2

Today

2
  • 1. Relationships!
  • 2. Correlation and Intro to Regression
  • 3. Chapter 13 in Book
slide-3
SLIDE 3 3

Comparing Means

Is one group different than the

  • ther(s)?
  • Z-tests
  • T-tests
  • ANOVA

We compare the means and use the variability to decide if the difference is significant

Assessing Relationships

Is there a relationship between the two variables?

  • Correlation
  • Regression

We look at how much the variables “move together”

slide-4
SLIDE 4

Correlation

4
  • It is a whole class of methods
  • Generally used with observational designs
  • Has similar assumptions to t-test
  • Is a measure of effect size
  • Very related (and based on) z-scores
  • Tells us direction and strength of a

relationship between two variables

slide-5
SLIDE 5

Correlation and Z-Scores

5
  • Z-score is a univariate statistic (only

uses info from ONE variable)

  • Correlation is essentially the z-score

between TWO variables

𝑠 = ∑𝑎%𝑎& 𝑂 − 1

slide-6
SLIDE 6

Correlation and Z-Scores

6
  • Z-score is a univariate statistic (only

uses info from ONE variable)

  • Correlation is essentially the z-score

between TWO variables

𝑠 = ∑𝑎%𝑎& 𝑂 − 1

z-score of variable x z-score of variable y

slide-7
SLIDE 7 7
  • 1. Two or more

continuous variables,

  • 2. Not necessarily

directional (one causes the other)

General Requirements

ID Var 1 Var 2 1 8 7 2 6 2 3 9 6 4 7 6 5 7 8 6 8 5 7 5 3 8 5 5

slide-8
SLIDE 8 8
  • 1. Two or more

continuous variables,

  • 2. Not necessarily

directional (one causes the other)

  • 3. Linear Relationship

(or at least ordinal)

General Requirements

1 2 3 4 5 6 7 8 9 4 5 6 7 8 9 10 Var 2 Var 1
slide-9
SLIDE 9

Hypothesis Testing with Correlation

9
  • 1. Examine Variables to Assess Statistical

Assumptions

  • 2. State the Null and Research Hypotheses

(symbolically and verbally)

  • 3. Define Critical Regions
  • 4. Compute the Test Statistic
  • 5. Compute an Effect Size and Describe it
  • 6. Interpreting the results

The same 6 step approach!

slide-10
SLIDE 10

Examine Variables to Assess Statistical Assumptions

10

1

Basic Assumptions

  • 1. Independence of data
  • 2. Appropriate measurement of variables

for the analysis

  • 3. Normality of distributions
  • 4. Homoscedastic
slide-11
SLIDE 11

Examine Variables to Assess Statistical Assumptions

11

1

Basic Assumptions

  • 1. Independence of data
  • 2. Appropriate measurement of variables

for the analysis

  • 3. Normality of distributions
  • 4. Homoscedastic

Individuals are independent of each other (one person’s scores does not affect another’s)

slide-12
SLIDE 12

Examine Variables to Assess Statistical Assumptions

12

1

Basic Assumptions

  • 1. Independence of data
  • 2. Appropriate measurement of variables

for the analysis

  • 3. Normality of distributions
  • 4. Homoscedastic

Here we need interval/ratio variables

slide-13
SLIDE 13

Examine Variables to Assess Statistical Assumptions

1

Basic Assumptions

  • 1. Independence of data
  • 2. Appropriate measurement of variables

for the analysis

  • 3. Normality of distributions
  • 4. Homoscedastic

Multivariate normality (the two variables are jointly normal)

slide-14
SLIDE 14

Examine Variables to Assess Statistical Assumptions

14

1

Basic Assumptions

  • 1. Independence of data
  • 2. Appropriate measurement of variables

for the analysis

  • 3. Normality of distributions
  • 4. Homoscedastic

Variance around the line should be roughly equal across the whole line

slide-15
SLIDE 15

Examine Variables to Assess Statistical Assumptions

15

1

Examining the Basic Assumptions

  • 1. Independence: random sample
  • 2. Appropriate measurement: know what your

variables are

  • 3. Normality: Histograms, Q-Q, skew and

kurtosis

  • 4. Homoscedastic: Scatterplots
slide-16
SLIDE 16

State the Null and Research Hypotheses (symbolically and verbally)

16

2

Hypothesis Type Symbolic Verbal Difference between means created by: Research Hypothesis 𝜍 ≠ 0 There is a relationship between the variables True relationship Null Hypothesis 𝜍 = 0 There is no real relationship between the variables. Random chance (sampling error)

slide-17
SLIDE 17

Define Critical Regions

17

How much evidence is enough to believe the null is not true?

3

generally based on an alpha = .05

slide-18
SLIDE 18

Compute the Test Statistic

18

4

Click on “Correlation Matrix”

slide-19
SLIDE 19

Compute the Test Statistic

19

4

Bring variables to be correlated over here Results

slide-20
SLIDE 20

Compute the Test Statistic

20

4

Average of Y Y X Average of X

slide-21
SLIDE 21

Compute the Test Statistic

21

4

Average of Y Y X Average of X If more points are in the green than not, then correlation is positive

slide-22
SLIDE 22

Compute the Test Statistic

22

4

Average of Y Y X Average of X If more points are in the red than not, then correlation is negative

slide-23
SLIDE 23

Compute an Effect Size and Describe it

23

One of the main effect sizes for correlation is r2

5

𝒔𝟑 = 𝒔 𝟑

𝒔𝟑 Estimated Size of the Effect Close to .01 Small Close to .09 Moderate Close to .25 Large

slide-24
SLIDE 24

Interpreting the results

24

Put your results into words

6

Use the example around page 529 as a template

slide-25
SLIDE 25

Intro to Regression

25
slide-26
SLIDE 26

Intro to Regression

26

The foundation of almost everything we do in statistics

Comparing group means Assess relationships Compare means AND assess relationships at the same time

Can handle many types of outcome and predictor data types Results are interpretable

slide-27
SLIDE 27

Two Main Types of Regression

27

Simple Multiple

  • Only one predictor in

the model

  • When variables are

standardized, gives same results as correlation

  • When using a grouping

variable, same results as t-test or ANOVA

  • More than one variable in

the model

  • When variables are

standardized, is close to “partial” correlation

  • Predictors can be any

combination of categorical and continuous

slide-28
SLIDE 28

Logic of Regression

28

Y X

We are trying to find the best fitting line

slide-29
SLIDE 29

Logic of Regression

29

Y X

We are trying to find the best fitting line We do this by minimizing the difference between the points and the line (called the residuals)

slide-30
SLIDE 30

Logic of Regression

30

Average of Y Y X Average of X

Line always goes through the averages

  • f X and Y
slide-31
SLIDE 31

Questions?

Please post them to the discussion board before class starts

31

End of Pre-Recorded Lecture Slides

slide-32
SLIDE 32

In-class discussion slides

32
slide-33
SLIDE 33

https://www.youtube.com/watc h?v=sxYrzzy3cq8

slide-34
SLIDE 34 34

Average of Y Y X Average of X

How Correlation Works

slide-35
SLIDE 35

How Regression Works

35

Y X

We are trying to find the best fitting line We do this by minimizing the difference between the points and the line (called the residuals)

slide-36
SLIDE 36

Application

36

Example Using The Office/Parks and Rec Data Set Hypothesis Test with Correlation