Cochran's Theorem

Yang Feng (Columbia University)

Importance of Cochran's Theorem

- Cochran's theorem tells us about the distributions of partitioned sums of squares of normally distributed random variables.
- Traditional linear regression analysis relies upon making statistical claims about the distribution of sums of squares of normally distributed random variables (and ratios between them).
- In the simple normal regression model:

\[ \frac{SSE}{\sigma^2} = \frac{\sum (Y_i - \hat{Y}_i)^2}{\sigma^2} \sim \chi^2(n-2) \]

- Where does this come from?
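Before tracing where the claim comes from, a small simulation can make it concrete. The sketch below is only an illustration under assumed values: the design points, the coefficients `beta0` and `beta1`, the noise level `sigma`, and the function name `sse_over_sigma2` are hypothetical choices, not taken from the slides. It fits a simple linear regression many times and compares the sample mean and variance of SSE/σ² with the χ²(n − 2) values n − 2 and 2(n − 2).

```python
import random
import statistics


def sse_over_sigma2(n, beta0, beta1, sigma, rng):
    """Simulate one simple-regression data set and return SSE / sigma^2."""
    xs = [float(i) for i in range(n)]  # fixed design points (an assumption)
    ys = [beta0 + beta1 * x + rng.gauss(0.0, sigma) for x in xs]
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b1 = sxy / sxx            # least-squares slope
    b0 = y_bar - b1 * x_bar   # least-squares intercept
    sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
    return sse / sigma ** 2


rng = random.Random(42)
n = 10
values = [sse_over_sigma2(n, 1.0, 0.5, 2.0, rng) for _ in range(5000)]
mean_value = statistics.mean(values)
var_value = statistics.variance(values)

# A chi-square(n - 2) variable has mean n - 2 and variance 2(n - 2).
print(f"mean of SSE/sigma^2: {mean_value:.3f} (theory: {n - 2})")
print(f"variance of SSE/sigma^2: {var_value:.3f} (theory: {2 * (n - 2)})")
```

Matching the first two moments is of course not a proof of the distributional claim; it only shows that the n − 2 degrees of freedom are plausible.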

Outline

- Establish the fact that the multivariate Gaussian sum of squares is $\chi^2(n)$ distributed
- Provide intuition for Cochran's theorem
- Prove a lemma in support of Cochran's theorem
- Prove Cochran's theorem
- Connect Cochran's theorem back to matrix linear regression

$\chi^2$ distribution

Theorem 1: Suppose $Z_1, \dots, Z_n$ are i.i.d. $N(0, 1)$. Then

\[ \sum_{i=1}^{n} Z_i^2 \sim \chi^2(n). \]
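Theorem 1 can be checked empirically. The following is a minimal sketch, assuming arbitrary illustrative values for `n`, the number of replications, and the seed; the function name `simulate_sum_of_squares` is hypothetical. It draws sums of squared standard normals and compares their sample mean and variance with the $\chi^2(n)$ values $n$ and $2n$.

```python
import random
import statistics


def simulate_sum_of_squares(n, reps, seed=0):
    """Return `reps` draws of Z_1^2 + ... + Z_n^2 with Z_i i.i.d. N(0, 1)."""
    rng = random.Random(seed)
    return [sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(n)) for _ in range(reps)]


n = 5
draws = simulate_sum_of_squares(n, reps=20000)
sample_mean = statistics.mean(draws)
sample_var = statistics.variance(draws)

# A chi-square(n) variable has mean n and variance 2n.
print(f"sample mean: {sample_mean:.3f} (theory: {n})")
print(f"sample variance: {sample_var:.3f} (theory: {2 * n})")
```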

Proof:

- $Z_i^2 \sim \chi^2(1)$.
- If $Y_1, \dots, Y_n$ are independent random variables with moment generating functions (MGFs) $m_{Y_1}(t), \dots, m_{Y_n}(t)$, then the moment generating function of $U = Y_1 + \cdots + Y_n$ is

\[ m_U(t) = m_{Y_1}(t) \times m_{Y_2}(t) \times \cdots \times m_{Y_n}(t). \]

- The MGF fully characterizes the distribution.
- The MGF of $\chi^2(n)$ is $(1 - 2t)^{-n/2}$ for $t < 1/2$.
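The steps of the proof can be written out explicitly. The derivation below is a sketch of the standard argument: it computes the MGF of a single $Z_i^2$ by direct integration (valid for $t < 1/2$) and then applies the product rule for independent summands.

```latex
\begin{align*}
m_{Z_i^2}(t) = E\!\left[e^{tZ_i^2}\right]
  &= \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{tz^2} e^{-z^2/2}\, dz
   = \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-(1-2t)z^2/2}\, dz
   = (1-2t)^{-1/2}, \qquad t < \tfrac{1}{2}, \\
m_{\sum_{i=1}^{n} Z_i^2}(t)
  &= \prod_{i=1}^{n} m_{Z_i^2}(t)
   = \left[(1-2t)^{-1/2}\right]^{n}
   = (1-2t)^{-n/2}.
\end{align*}
```

The last expression is the MGF of $\chi^2(n)$, and since the MGF characterizes the distribution, the sum is $\chi^2(n)$ distributed.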

Quadratic Forms and Cochran's Theorem

Quadratic forms of normal random variables are of great importance in many branches of statistics:

- Least squares
- ANOVA
- Regression analysis

General idea: split the sum of the squares of observations into a number of quadratic forms, where each corresponds to some cause of variation.

Quadratic Forms and Cochran's Theorem

- The conclusion of Cochran's theorem is that, under the assumption of normality, the various quadratic forms are independent and $\chi^2$ distributed.
- This fact is the foundation upon which many statistical tests rest.

Preliminaries: A Common Quadratic Form

- Let $X \sim N(\mu, \Lambda)$.
- Consider the quadratic form that appears in the exponent of the normal density:

\[ (X - \mu)' \Lambda^{-1} (X - \mu) \]

- In the special case of $\mu = 0$ and $\Lambda = I$, this reduces to $X'X$, which by what we just proved is $\chi^2(n)$ distributed.
- Let's prove that it holds in the general case.

Lemma 1

Let $X \sim N(\mu, \Lambda)$ with $|\Lambda| > 0$, and let $n$ be the dimension of $X$. Then

\[ (X - \mu)' \Lambda^{-1} (X - \mu) \sim \chi^2(n). \]

Proof. Let $Y = \Lambda^{-1/2}(X - \mu)$; then $Y \sim N(0, I)$. Hence

\[ (X - \mu)' \Lambda^{-1} (X - \mu) = Y'Y \sim \chi^2(n). \]
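The two facts used in the proof can be spelled out. The sketch below assumes that $\Lambda^{-1/2}$ denotes the symmetric square root of $\Lambda^{-1}$, which exists because a covariance matrix with $|\Lambda| > 0$ is positive definite. Since $Y$ is a linear transformation of a normal vector, it is normal with mean $0$, and:

```latex
\begin{align*}
\operatorname{Cov}(Y)
  &= \Lambda^{-1/2}\,\operatorname{Cov}(X)\,\bigl(\Lambda^{-1/2}\bigr)'
   = \Lambda^{-1/2}\,\Lambda\,\Lambda^{-1/2}
   = I, \\
Y'Y
  &= (X-\mu)'\,\Lambda^{-1/2}\,\Lambda^{-1/2}\,(X-\mu)
   = (X-\mu)'\,\Lambda^{-1}\,(X-\mu).
\end{align*}
```

The components of $Y$ are therefore i.i.d. $N(0, 1)$, and Theorem 1 gives $Y'Y \sim \chi^2(n)$.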

Cochran's Theorem

Let $X_1, X_2, \dots, X_n$ be i.i.d. $N(0, \sigma^2)$-distributed random variables, and suppose that

\[ \sum_{i=1}^{n} X_i^2 = Q_1 + Q_2 + \cdots + Q_k, \]

where $Q_1, Q_2, \dots, Q_k$ are positive semi-definite quadratic forms in $X_1, X_2, \dots, X_n$, i.e.,

\[ Q_i = X' A_i X, \quad i = 1, 2, \dots, k. \]

Set $r_i = \operatorname{rank}(A_i)$. If $r_1 + r_2 + \cdots + r_k = n$, then

1. $Q_1, Q_2, \dots, Q_k$ are independent.
2. $Q_i \sim \sigma^2 \chi^2(r_i)$.
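A familiar instance of the hypotheses is the split of $\sum X_i^2$ into a mean part and a deviation part, $\sum X_i^2 = n\bar{X}^2 + \sum (X_i - \bar{X})^2$, with $A_1 = J/n$ ($J$ the all-ones matrix) and $A_2 = I - J/n$. This example is not on the slides; the sketch below, with hypothetical helper names `matmul`, `quad_form`, and `is_idempotent`, checks numerically that the two forms add up to the total and that the ranks are $1$ and $n - 1$. The ranks are read off the traces, which is valid here only because both matrices are symmetric and idempotent (projections).

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def quad_form(a, x):
    """Evaluate the quadratic form x' A x."""
    m = len(x)
    return sum(x[i] * a[i][j] * x[j] for i in range(m) for j in range(m))


def is_idempotent(a, tol=1e-12):
    """Check A A = A entrywise, up to a tolerance."""
    p = matmul(a, a)
    return all(abs(p[i][j] - a[i][j]) < tol
               for i in range(len(a)) for j in range(len(a)))


n = 4
A1 = [[1.0 / n for _ in range(n)] for _ in range(n)]          # J / n
A2 = [[(1.0 if i == j else 0.0) - 1.0 / n for j in range(n)]  # I - J / n
      for i in range(n)]

rng = random.Random(1)
x = [rng.gauss(0.0, 2.0) for _ in range(n)]  # sigma = 2 is an arbitrary choice
x_bar = sum(x) / n

total = sum(v * v for v in x)
q1 = quad_form(A1, x)  # equals n * x_bar^2
q2 = quad_form(A2, x)  # equals sum of (x_i - x_bar)^2

# For a symmetric idempotent matrix the rank equals the trace.
rank1 = round(sum(A1[i][i] for i in range(n)))
rank2 = round(sum(A2[i][i] for i in range(n)))

print(f"Q1 + Q2 = {q1 + q2:.6f}, sum of squares = {total:.6f}")
print(f"ranks: {rank1} + {rank2} = {rank1 + rank2} (n = {n})")
```

Under the theorem, this is the decomposition behind the independence of the sample mean and the sample variance for normal data.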

Several linear algebra results

- Let $X$ be a normal random vector. The components of $X$ are independent if and only if they are uncorrelated.
- Let $X \sim N(\mu, \Lambda)$; then $Y = C'X \sim N(C'\mu, C'\Lambda C)$.
- We can find an orthogonal matrix $C$ such that $D = C'\Lambda C$ is a diagonal matrix (eigenvalue decomposition of a positive semi-definite matrix).
- The components of $Y$ will then be independent with $\operatorname{var}(Y_k) = \lambda_k$, where $\lambda_1, \dots, \lambda_n$ are the eigenvalues of $\Lambda$.
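The diagonalization step can be illustrated on a small case. The sketch below assumes a hypothetical $2 \times 2$ covariance matrix $\Lambda$ with known eigenvectors $(1, 1)/\sqrt{2}$ and $(1, -1)/\sqrt{2}$; the matrix and the helper names `transpose` and `matmul` are illustrative choices, not from the slides. It verifies that $C$ is orthogonal and that $C'\Lambda C$ is diagonal with the eigenvalues $3$ and $1$ on the diagonal.

```python
import math


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*a)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


Lam = [[2.0, 1.0],
       [1.0, 2.0]]  # hypothetical covariance matrix

s = 1.0 / math.sqrt(2.0)
C = [[s, s],
     [s, -s]]       # columns are the unit eigenvectors of Lam

CtC = matmul(transpose(C), C)             # should be the identity
D = matmul(matmul(transpose(C), Lam), C)  # should be diag(3, 1)

for row in D:
    print([round(v, 10) for v in row])
```

With $Y = C'X$, the off-diagonal zeros of $D$ say the components of $Y$ are uncorrelated, hence independent under normality, with variances equal to the eigenvalues.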

Lemma 2

Let $X_1, X_2, \dots, X_n$ be real numbers. Suppose that $\sum X_i^2$ can be split into a sum of positive semi-definite quadratic forms, that is,

\[ \sum X_i^2 = Q_1 + Q_2 + \cdots + Q_k, \]

where $Q_i = X' A_i X$ with $\operatorname{rank}(A_i) = r_i$. If $\sum r_i = n$, then there exists an orthogonal matrix $C$ such that, with $X = CY$, we have

\[
\begin{aligned}
Q_1 &= Y_1^2 + Y_2^2 + \cdots + Y_{r_1}^2 \\
Q_2 &= Y_{r_1+1}^2 + Y_{r_1+2}^2 + \cdots + Y_{r_1+r_2}^2 \\
    &\;\;\vdots \\
Q_k &= Y_{n-r_k+1}^2 + Y_{n-r_k+2}^2 + \cdots + Y_n^2
\end{aligned}
\]
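To see what such a matrix $C$ can look like, consider the split $\sum X_i^2 = n\bar{X}^2 + \sum (X_i - \bar{X})^2$ for $n = 3$, where $Q_1 = n\bar{X}^2$ has rank $1$ and $Q_2 = \sum (X_i - \bar{X})^2$ has rank $2$. This example and the particular (Helmert-type) columns below are illustrative assumptions, not taken from the slides. The sketch computes $Y = C'X$ and checks that $Q_1 = Y_1^2$ and $Q_2 = Y_2^2 + Y_3^2$.

```python
import math
import random

n = 3

# Columns of an orthogonal matrix C; the first column is proportional to (1, 1, 1).
columns = [
    [1 / math.sqrt(3), 1 / math.sqrt(3), 1 / math.sqrt(3)],
    [1 / math.sqrt(2), -1 / math.sqrt(2), 0.0],
    [1 / math.sqrt(6), 1 / math.sqrt(6), -2 / math.sqrt(6)],
]

rng = random.Random(7)
x = [rng.gauss(0.0, 1.0) for _ in range(n)]  # any real numbers would do

# X = C Y with C orthogonal means Y = C' X, so Y_j is the dot product of column j with x.
y = [sum(c * v for c, v in zip(col, x)) for col in columns]

x_bar = sum(x) / n
q1 = n * x_bar ** 2                     # quadratic form of rank 1
q2 = sum((v - x_bar) ** 2 for v in x)   # quadratic form of rank n - 1 = 2

print(f"Q1 = {q1:.6f}, Y1^2 = {y[0] ** 2:.6f}")
print(f"Q2 = {q2:.6f}, Y2^2 + Y3^2 = {y[1] ** 2 + y[2] ** 2:.6f}")
```

Each $Y_j$ lands in exactly one of the two sums, which is the property the lemma asserts in general.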

Remark

- Different quadratic forms contain different $Y$-variables, and the number of terms in each $Q_i$ equals the rank, $r_i$, of $Q_i$.
- The $Y_i^2$ end up in different sums; we'll use this to prove independence of the different quadratic forms.
- We prove only the case $k = 2$; the general case can be obtained by induction.

Proof

- For $k = 2$, we have $Q = X'X = X' A_1 X + X' A_2 X$.
- There exists an orthogonal matrix $C$ such that $C' A_1 C = D$, where $D$ is a diagonal matrix whose diagonal entries are the eigenvalues of $A_1$.
- Since $A_1$ is positive semi-definite with $\operatorname{rank}(A_1) = r_1$, $r_1$ eigenvalues are positive and $n - r_1$ eigenvalues are 0.
- Suppose, without loss of generality, that the first $r_1$ eigenvalues are positive.
- Set $X = CY$; then $X'X = Y'C'CY = Y'Y$.

Proof (continued)

Therefore,

\[ Q = \sum_{i=1}^{n} Y_i^2 = \sum_{i=1}^{r_1} \lambda_i Y_i^2 + Y' C' A_2 C Y. \]

Rearranging the terms, we have

\[ \sum_{i=1}^{r_1} (1 - \lambda_i) Y_i^2 + \sum_{i=r_1+1}^{n} Y_i^2 = Y' C' A_2 C Y. \]

The left-hand side is a diagonal quadratic form in $Y$ whose last $n - r_1$ coefficients equal 1, so its rank is at least $n - r_1$, with equality only if every $1 - \lambda_i$ vanishes. Since $\operatorname{rank}(C' A_2 C) = \operatorname{rank}(A_2) = r_2 = n - r_1$, we conclude that

\[ \lambda_1 = \lambda_2 = \cdots = \lambda_{r_1} = 1 \]

and therefore

\[ Q_1 = \sum_{i=1}^{r_1} Y_i^2, \qquad Q_2 = \sum_{i=r_1+1}^{n} Y_i^2. \]

From this Lemma

- This lemma is about real numbers, not random variables.
- It says that if $\sum X_i^2$ can be split into a sum of positive semi-definite quadratic forms whose ranks add up to $n$, then there is an orthogonal transformation $X = CY$ such that each of the quadratic forms has nice properties: each $Y_i$ appears in only one resulting sum of squares, which leads to the independence of the sums of squares.
