applied statistical analysis
play

Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity - PowerPoint PPT Presentation

Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity using data Today 1. Relationships! 2. Correlation and Intro to Regression 3. Chapter 13 in Book 2 Comparing Means Assessing Relationships Is there a relationship between the


  1. Applied Statistical Analysis EDUC 6050 Week 9 Finding clarity using data

  2. Today 1. Relationships! 2. Correlation and Intro to Regression 3. Chapter 13 in Book 2

  3. Comparing Means Assessing Relationships Is there a relationship between the Is one group different than the two variables? other(s)? - Correlation - Z-tests - Regression - T-tests We look at how We compare the - ANOVA much the means and use variables “move the variability together” to decide if the difference is significant 3

  4. Correlation • It is a whole class of methods • Generally used with observational designs • Has similar assumptions to t-test • Is a measure of effect size • Very related (and based on) z-scores • Tells us direction and strength of a relationship between two variables 4

  5. Correlation and Z-Scores • Z-score is a univariate statistic (only uses info from ONE variable) • Correlation is essentially the z-score between TWO variables 𝑠 = ∑𝑎 % 𝑎 & 𝑂 − 1 5

  6. Correlation and Z-Scores • Z-score is a univariate statistic (only uses info from ONE variable) • Correlation is essentially the z-score between TWO variables z-score of variable x 𝑠 = ∑𝑎 % 𝑎 & z-score of variable y 𝑂 − 1 6

  7. ID Var 1 Var 2 General 1 8 7 Requirements 2 6 2 3 9 6 1. Two or more 4 7 6 continuous variables, 5 7 8 2. Not necessarily 6 8 5 directional (one causes the other) 7 5 3 8 5 5 7

  8. General 9 Requirements 8 7 6 5 Var 2 1. Two or more 4 3 continuous variables, 2 2. Not necessarily 1 0 directional (one 4 5 6 7 8 9 10 Var 1 causes the other) 3. Linear Relationship (or at least ordinal) 8

  9. Hypothesis Testing with Correlation The same 6 step approach! 1. Examine Variables to Assess Statistical Assumptions 2. State the Null and Research Hypotheses (symbolically and verbally) 3. Define Critical Regions 4. Compute the Test Statistic 5. Compute an Effect Size and Describe it 6. Interpreting the results 9

  10. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic 10

  11. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables Individuals are independent of for the analysis each other (one person’s scores 3. Normality of distributions does not affect another’s) 4. Homoscedastic 11

  12. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic Here we need interval/ratio variables 12

  13. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions Multivariate normality (the two 1. Independence of data variables are jointly normal) 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic

  14. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data Variance around the line should 2. Appropriate measurement of variables be roughly equal across the for the analysis whole line 3. Normality of distributions 4. Homoscedastic 14

  15. 1 Examine Variables to Assess Statistical Assumptions Examining the Basic Assumptions 1. Independence: random sample 2. Appropriate measurement: know what your variables are 3. Normality: Histograms, Q-Q, skew and kurtosis 4. Homoscedastic: Scatterplots 15

  16. 2 State the Null and Research Hypotheses (symbolically and verbally) Hypothesis Symbolic Verbal Difference between Type means created by: Research There is a relationship True relationship 𝜍 ≠ 0 Hypothesis between the variables Null There is no real Random chance 𝜍 = 0 Hypothesis relationship between (sampling error) the variables. 16

  17. 3 Define Critical Regions How much evidence is enough to believe the null is not true? generally based on an alpha = .05 17

  18. 4 Compute the Test Statistic Click on “Correlation Matrix” 18

  19. 4 Compute the Test Statistic Results Bring variables to be correlated over here 19

  20. 4 Compute the Test Statistic Average of X Y Average of Y X 20

  21. 4 Compute the Test Statistic Average of X If more points are in the green than not, then correlation is positive Y Average of Y X 21

  22. 4 Compute the Test Statistic Average of X Y Average of Y If more points are in the red than not, then correlation is negative X 22

  23. 5 Compute an Effect Size and Describe it One of the main effect sizes for correlation is r 2 𝒔 𝟑 = 𝒔 𝟑 𝒔 𝟑 Estimated Size of the Effect Close to .01 Small Close to .09 Moderate Close to .25 Large 23

  24. 6 Interpreting the results Put your results into words Use the example around page 529 as a template 24

  25. Intro to Regression 25

  26. Intro to Regression The foundation of almost everything we do in statistics Comparing group means Assess relationships Compare means AND assess relationships at the same time Can handle many types of outcome and predictor data types Results are interpretable 26

  27. Two Main Types of Regression Simple Multiple • • Only one predictor in More than one variable in the model the model • • When variables are When variables are standardized, gives same standardized, is close to results as correlation “partial” correlation • • When using a grouping Predictors can be any variable, same results combination of categorical as t-test or ANOVA and continuous 27

  28. Logic of Regression We are trying to find the best fitting line Y X 28

  29. Logic of Regression We are trying to find the best fitting line We do this by Y minimizing the difference between the points and the line (called the residuals) X 29

  30. Logic of Regression Average of X Line always goes through the averages of X and Y Y Average of Y X 30

  31. Questions? Please post them to the discussion board before class starts End of Pre-Recorded Lecture Slides 31

  32. In-class discussion slides 32

  33. https://www.youtube.com/watc h?v=sxYrzzy3cq8

  34. How Correlation Works Average of X Y Average of Y X 34

  35. How Regression Works We are trying to find the best fitting line We do this by Y minimizing the difference between the points and the line (called the residuals) X 35

  36. Application Example Using The Office/Parks and Rec Data Set Hypothesis Test with Correlation 36

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend