Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 - PowerPoint PPT Presentation

Chapter 9 Slide 1 Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 Regression 9-4 Variation and Prediction Intervals 9-5 Multiple Regression 9-6 Modeling Chapter 9, Triola, Elementary Statistics , MATH 1342

Section 9-1 & 9-2 Overview and Correlation and Regression Created by Erin Hodgess, Houston, Texas Chapter 9, Triola, Elementary Statistics , MATH 1342

Overview Slide 3 Paired Data (p.506) � Is there a relationship? � If so, what is the equation? � Use that equation for prediction. Chapter 9, Triola, Elementary Statistics , MATH 1342

Definition Slide 4 � A correlation exists between two variables when one of them is related to the other in some way. Chapter 9, Triola, Elementary Statistics , MATH 1342

Definition Slide 5 � A Scatterplot (or scatter diagram) is a graph in which the paired ( x, y ) sample data are plotted with a horizontal x- axis and a vertical y- axis. Each individual ( x, y ) pair is plotted as a single point. Chapter 9, Triola, Elementary Statistics , MATH 1342

Scatter Diagram Slide 6 of Paired Data (p.507) Chapter 9, Triola, Elementary Statistics , MATH 1342

Positive Linear Slide 7 Correlation (p.498) Figure 9-2 Scatter Plots Chapter 9, Triola, Elementary Statistics , MATH 1342

Negative Linear Slide 8 Correlation Figure 9-2 Scatter Plots Chapter 9, Triola, Elementary Statistics , MATH 1342

No Linear Correlation Slide 9 Figure 9-2 Scatter Plots Chapter 9, Triola, Elementary Statistics , MATH 1342

Definition (p.509) Slide 10 The linear correlation coefficient r measures strength of the linear relationship between paired x and y values in a sample. Chapter 9, Triola, Elementary Statistics , MATH 1342

Assumptions (p.507) Slide 11 1. The sample of paired data ( x, y ) is a random sample. 2. The pairs of ( x, y ) data have a bivariate normal distribution. Chapter 9, Triola, Elementary Statistics , MATH 1342

Notation for the Linear Correlation Coefficient Slide 12 n = number of pairs of data presented Σ denotes the addition of the items indicated. Σ x denotes the sum of all x - values. Σ x 2 indicates that each x - value should be squared and then those squares added. ( Σ x ) 2 indicates that the x - values should be added and the total then squared. Σ xy indicates that each x -value should be first multiplied by its corresponding y - value. After obtaining all such products, find their sum. r represents linear correlation coefficient for a sample ρ represents linear correlation coefficient for a population Chapter 9, Triola, Elementary Statistics , MATH 1342

Definition Slide 13 The linear correlation coefficient r measures the strength of a linear relationship between the paired values in a sample. n Σ xy – ( Σ x )( Σ y ) r = n ( Σ x 2 ) – ( Σ x ) 2 n ( Σ y 2 ) – ( Σ y ) 2 Formula 9-1 Calculators can compute r ρ (rho) is the linear correlation coefficient for all paired data in the population. Chapter 9, Triola, Elementary Statistics , MATH 1342

Rounding the Linear Slide 14 Correlation Coefficient r � Round to three decimal places so that it can be compared to critical values in Table A-6. (see p.510) � Use calculator or computer if possible. Chapter 9, Triola, Elementary Statistics , MATH 1342

Calculating r Slide 15 Data x 1 1 3 5 2 8 6 4 y This data is from exercise #7 on p.521. Chapter 9, Triola, Elementary Statistics , MATH 1342

Chapter 9, Triola, Elementary Statistics , MATH 1342 Calculating r

Calculating r Slide 17 Data x 1 1 3 5 2 8 6 4 y n Σ xy – ( Σ x )( Σ y ) r = n ( Σ x 2 ) – ( Σ x ) 2 n ( Σ y 2 ) – ( Σ y ) 2 4( 48 ) – (10)(20) r = 4(36) – (10) 2 4(120) – (20) 2 –8 r = = – 0.135 59.329 Chapter 9, Triola, Elementary Statistics , MATH 1342

Interpreting the Linear Slide 18 Correlation Coefficient (p.511) � If the absolute value of r exceeds the value in Table A - 6, conclude that there is a significant linear correlation. � Otherwise, there is not sufficient evidence to support the conclusion of significant linear correlation. Chapter 9, Triola, Elementary Statistics , MATH 1342

Example: Slide 19 Boats and Manatees Given the sample data in Table 9-1, find the value of the linear correlation coefficient r , then refer to Table A-6 to determine whether there is a significant linear correlation between the number of registered boats and the number of manatees killed by boats. Using the same procedure previously illustrated, we find that r = 0.922. Referring to Table A-6, we locate the row for which n =10. Using the critical value for α =5, we have 0.632. Because r = 0.922, its absolute value exceeds 0.632, so we conclude that there is a significant linear correlation between number of registered boats and number of manatee deaths from boats. Chapter 9, Triola, Elementary Statistics , MATH 1342

Properties of the Slide 20 Linear Correlation Coefficient r 1. –1 ≤ r ≤ 1 (see also p.512) 2. Value of r does not change if all values of either variable are converted to a different scale. 3. The r is not affected by the choice of x and y . interchange x and y and the value of r will not change. 4. r measures strength of a linear relationship. Chapter 9, Triola, Elementary Statistics , MATH 1342

Interpreting r : Slide 21 Explained Variation The value of r 2 is the proportion of the variation in y that is explained by the linear relationship between x and y . (p.503 and p.533) Chapter 9, Triola, Elementary Statistics , MATH 1342

Example: Slide 22 Boats and Manatees Using the boat/manatee data in Table 9-1, we have found that the value of the linear correlation coefficient r = 0.922 . What proportion of the variation of the manatee deaths can be explained by the variation in the number of boat registrations? With r = 0.922, we get r 2 = 0.850. We conclude that 0.850 (or about 85%) of the variation in manatee deaths can be explained by the linear relationship between the number of boat registrations and the number of manatee deaths from boats. This implies that 15% of the variation of manatee deaths cannot be explained by the number of boat registrations. Chapter 9, Triola, Elementary Statistics , MATH 1342

Common Errors Slide 23 Involving Correlation (pp.503-504) 1. Causation: It is wrong to conclude that correlation implies causality. 2. Averages: Averages suppress individual variation and may inflate the correlation coefficient. 3. Linearity: There may be some relationship between x and y even when there is no significant linear correlation. Chapter 9, Triola, Elementary Statistics , MATH 1342

Common Errors Slide 24 Involving Correlation FIGURE 9-3 Scatterplot of Distance above Ground and Time for Object Thrown Upward Chapter 9, Triola, Elementary Statistics , MATH 1342

Formal Slide 25 Hypothesis Test (p.504) � We wish to determine whether there is a significant linear correlation between two variables. � We present two methods. � Both methods let H 0 : ρ = 0 (no significant linear correlation) H 1 : ρ ≠ 0 (significant linear correlation) Chapter 9, Triola, Elementary Statistics , MATH 1342

FIGURE 9-4 Slide 26 Testing for a Linear Correlation (p.505) Chapter 9, Triola, Elementary Statistics , MATH 1342

Method 1: Slide 27 Test Statistic is t (follows format of earlier chapters) Test statistic: r t = 1 – r 2 n – 2 Critical values: Use Table A-3 with degrees of freedom = n – 2 Chapter 9, Triola, Elementary Statistics , MATH 1342

Method 2: Slide 28 Test Statistic is r (uses fewer calculations) � Test statistic: r � Critical values: Refer to Table A-6 (no degrees of freedom) Chapter 9, Triola, Elementary Statistics , MATH 1342

Example: Slide 29 Boats and Manatees Using the boat/manatee data in Table 9-1, test the claim that there is a linear correlation between the number of registered boats and the number of manatee deaths from boats. Use Method 1. r t = 1 – r 2 n – 2 0.922 t = = 6.735 1 – 0.922 2 10 – 2 Chapter 9, Triola, Elementary Statistics , MATH 1342

Method 1: Slide 30 Test Statistic is t (follows format of earlier chapters) Figure 9-5 (p.516) Chapter 9, Triola, Elementary Statistics , MATH 1342

Example: Slide 31 Boats and Manatees Using the boat/manatee data in Table 9-1, test the claim that there is a linear correlation between the number of registered boats and the number of manatee deaths from boats. Use Method 2. The test statistic is r = 0.922. The critical values of r = ± 0.632 are found in Table A-6 with n = 10 and α = 0.05. Chapter 9, Triola, Elementary Statistics , MATH 1342

Method 2: Slide 32 Test Statistic is r (uses fewer calculations) � Test statistic: r � Critical values: Refer to Table A-6 (10 degrees of freedom) Figure 9-6 (p.507) Chapter 9, Triola, Elementary Statistics , MATH 1342

Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 - PowerPoint PPT Presentation

Chapter 9 Slide 1 Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 Regression 9-4 Variation and Prediction Intervals 9-5 Multiple Regression 9-6 Modeling Chapter 9, Triola, Elementary Statistics , MATH 1342 Slide 2 Section 9-1

Correlation Course Title Correlation Correlation coe ffi cient between -1 and 1 Sign

Getting to Regression: The Workhorse of Quantitative Political Analysis Department of

Visualization of Linear Models Correlation and Regression Possums > ggplot(data = possum,

Interpretation of regression coe ffi cients Correlation and Regression Is that textbook

Theory of correlation transfer and correlation structure in recurrent networks Ruben Moreno-Bote

Business Statistics CONTENTS The correlation coefficient The rank correlation coefficient

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Regression 3: Logistic Regression Marco Baroni Practical Statistics in R Outline Logistic

Introduction to Regression and Correlation James H. Steiger Department of Psychology and Human

201ab Quantitative methods L.09: Correlation, regression (2) Alt-text: Correlation doesn't imply

Bivariate Correlation r > 0 r < 0 r = 0 r = 0 r > 0 r = 0 remember: r measures

Chapter 7 Linear Regression 04/05/2016 Huamei Dong 1. Review Least square regression line 2.

Biostatistics Correlation and linear regression Burkhardt Seifert & Alois Tschopp

Assessing model fit Correlation and Regression How well does our textbook model fit? >

Coefficient of Correlation The regression equation Y = 0 + 1 x + shows the linear

Planning and Optimization B2. Regression: Introduction & STRIPS Case Malte Helmert and

Introduction to Race Management (Updated to reflect The Racing Rules of Sailing 2009-2012 and

Ship Building: A workshop for the WIP curious Length 60 - 90 min Topics Systems Thinking,

on Leveled Networks Costas Busch Shailesh Kelkar Malik Magdon-Ismail Rensselaer Polytechnic

Control of the motion of a boat Lionel Rosier Universit e Henri Poincar e Nancy 1 Control

Example Instances 22 101 10/10/96 58 103 11/12/96 We will use these S1 sid sname

Relational Calculus Another Theoretical QL-Relational Calculus Comes in two flavors: Tuple

Marriage Migrations, moral demands and Natural Resources in Northwestern Benin

Governance Body Meeting Thursday, January 12, 2017 12:00 PM 1:00 PM EDT Audio:

Sambuz

Useful Links

Newsletter

Mail Us

Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 - PowerPoint PPT Presentation

Chapter 9 Slide 1 Correlation and Regression 9-1 Overview 9-2 Correlation 9-3 Regression 9-4 Variation and Prediction Intervals 9-5 Multiple Regression 9-6 Modeling Chapter 9, Triola, Elementary Statistics , MATH 1342 Slide 2 Section 9-1

Correlation Course Title Correlation Correlation coe ffi cient between -1 and 1 Sign

Getting to Regression: The Workhorse of Quantitative Political Analysis Department of

Visualization of Linear Models Correlation and Regression Possums &gt; ggplot(data = possum,

Interpretation of regression coe ffi cients Correlation and Regression Is that textbook

Theory of correlation transfer and correlation structure in recurrent networks Ruben Moreno-Bote

Business Statistics CONTENTS The correlation coefficient The rank correlation coefficient

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Regression 3: Logistic Regression Marco Baroni Practical Statistics in R Outline Logistic

Introduction to Regression and Correlation James H. Steiger Department of Psychology and Human

201ab Quantitative methods L.09: Correlation, regression (2) Alt-text: Correlation doesn't imply

Bivariate Correlation r &gt; 0 r &lt; 0 r = 0 r = 0 r &gt; 0 r = 0 remember: r measures

Chapter 7 Linear Regression 04/05/2016 Huamei Dong 1. Review Least square regression line 2.

Biostatistics Correlation and linear regression Burkhardt Seifert &amp; Alois Tschopp

Assessing model fit Correlation and Regression How well does our textbook model fit? &gt;

Coefficient of Correlation The regression equation Y = 0 + 1 x + shows the linear

Planning and Optimization B2. Regression: Introduction &amp; STRIPS Case Malte Helmert and

Introduction to Race Management (Updated to reflect The Racing Rules of Sailing 2009-2012 and

Ship Building: A workshop for the WIP curious Length 60 - 90 min Topics Systems Thinking,

on Leveled Networks Costas Busch Shailesh Kelkar Malik Magdon-Ismail Rensselaer Polytechnic

Control of the motion of a boat Lionel Rosier Universit e Henri Poincar e Nancy 1 Control

Example Instances 22 101 10/10/96 58 103 11/12/96 We will use these S1 sid sname

Relational Calculus Another Theoretical QL-Relational Calculus Comes in two flavors: Tuple

Marriage Migrations, moral demands and Natural Resources in Northwestern Benin

Governance Body Meeting Thursday, January 12, 2017 12:00 PM 1:00 PM EDT Audio:

Sambuz

Useful Links

Newsletter

Mail Us

Visualization of Linear Models Correlation and Regression Possums > ggplot(data = possum,

Bivariate Correlation r > 0 r < 0 r = 0 r = 0 r > 0 r = 0 remember: r measures

Biostatistics Correlation and linear regression Burkhardt Seifert & Alois Tschopp

Assessing model fit Correlation and Regression How well does our textbook model fit? >

Planning and Optimization B2. Regression: Introduction & STRIPS Case Malte Helmert and