the matrix f prior for estimating and testing covariance
play

The Matrix- F Prior for Estimating and Testing Covariance Matrices - PowerPoint PPT Presentation

The Matrix- F Prior for Estimating and Testing Covariance Matrices Joris Mulder & Luis R. Pericchi Department of Methodology & Statistics Tilburg University, the Netherlands CWI talk 2018, Amsterdam, 5-4-18 Mulder (Tilburg University)


  1. The Matrix- F Prior for Estimating and Testing Covariance Matrices Joris Mulder & Luis R. Pericchi Department of Methodology & Statistics Tilburg University, the Netherlands CWI talk 2018, Amsterdam, 5-4-18 Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 1 / 44

  2. Outline Problems with inverse gamma priors 1 Introducing the univariate F and matrix- F prior 2 The matrix- F prior in regularized regression 3 The matrix- F prior for testing covariance matrices 4 Testing a precise hypothesis Testing inequality constrained hypotheses The matrix- F prior for modeling random effects covariance matrices 5 Summary 6 Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 2 / 44

  3. Problems with inverse gamma priors Outline Problems with inverse gamma priors 1 Introducing the univariate F and matrix- F prior 2 The matrix- F prior in regularized regression 3 The matrix- F prior for testing covariance matrices 4 Testing a precise hypothesis Testing inequality constrained hypotheses The matrix- F prior for modeling random effects covariance matrices 5 Summary 6 Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 3 / 44

  4. Problems with inverse gamma priors Modeling variance components The inverse gamma prior is the default choice for modeling variance components, σ 2 ∼ IG ( α, β ) , with prior shape parameter α and prior scale parameter β . Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 4 / 44

  5. Problems with inverse gamma priors Modeling variance components The inverse gamma prior is the default choice for modeling variance components, σ 2 ∼ IG ( α, β ) , with prior shape parameter α and prior scale parameter β . The inverse gamma prior is conjugate for a variance of a normal population. Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 4 / 44

  6. Problems with inverse gamma priors Modeling variance components The inverse gamma prior is the default choice for modeling variance components, σ 2 ∼ IG ( α, β ) , with prior shape parameter α and prior scale parameter β . The inverse gamma prior is conjugate for a variance of a normal population. Default choice: α = β = ǫ > 0, with ǫ small, e.g., . 001. Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 4 / 44

  7. Problems with inverse gamma priors Modeling variance components The inverse gamma prior is the default choice for modeling variance components, σ 2 ∼ IG ( α, β ) , with prior shape parameter α and prior scale parameter β . The inverse gamma prior is conjugate for a variance of a normal population. Default choice: α = β = ǫ > 0, with ǫ small, e.g., . 001. The inverse gamma prior is a proper neighboring prior of the popular Jeffreys prior σ − 2 . Let p N ( σ 2 | x ) σ − 2 f ( x | σ 2 ) ∝ p ( σ 2 | x ) IG ( σ 2 ; ǫ, ǫ ) f ( x | σ 2 ) , ∝ then p ( σ 2 | x ) → p N ( σ 2 | x ) , as ǫ → 0 . Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 4 / 44

  8. Problems with inverse gamma priors Problems with the inverse gamma prior Surprisingly, the inverse gamma can unduly be highly informative as a prior for the random effects variance in a hierarchical model, N ( µ j , σ 2 ) i -th observation in group j : ∼ y ij N ( µ, τ 2 ) . random mean of group j : ∼ µ j Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 5 / 44

  9. Problems with inverse gamma priors Problems with the inverse gamma prior Surprisingly, the inverse gamma can unduly be highly informative as a prior for the random effects variance in a hierarchical model, N ( µ j , σ 2 ) i -th observation in group j : ∼ y ij N ( µ, τ 2 ) . random mean of group j : ∼ µ j The 8 schools example of Gelman (2006) showed the effect of the inverse gamma prior on τ 2 : 8 schools: posterior on τ given 8 schools: posterior on τ given 8 schools: posterior on τ given inv−gamma (1, 1) prior on τ 2 inv−gamma (.001, .001) prior on τ 2 uniform prior on τ 0 5 10 15 20 25 30 0 5 10 15 20 25 30 0 5 10 15 20 25 30 τ τ τ Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 5 / 44

  10. Introducing the univariate F and matrix- F prior Outline Problems with inverse gamma priors 1 Introducing the univariate F and matrix- F prior 2 The matrix- F prior in regularized regression 3 The matrix- F prior for testing covariance matrices 4 Testing a precise hypothesis Testing inequality constrained hypotheses The matrix- F prior for modeling random effects covariance matrices 5 Summary 6 Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 6 / 44

  11. Introducing the univariate F and matrix- F prior The F prior The issue of the inverse gamma prior can be resolved by mixing the scale parameter with a gamma distribution. This results in a univariate F prior: � F ( σ 2 ; ν, δ, b ) = IG ( σ 2 ; δ 2 , ψ 2 ) × G ( ψ 2 ; ν 2 , b − 1 ) d ψ 2 , with degrees of freedom parameters ν and δ , and scale parameter b . Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 7 / 44

  12. Introducing the univariate F and matrix- F prior The F prior The issue of the inverse gamma prior can be resolved by mixing the scale parameter with a gamma distribution. This results in a univariate F prior: � F ( σ 2 ; ν, δ, b ) = IG ( σ 2 ; δ 2 , ψ 2 ) × G ( ψ 2 ; ν 2 , b − 1 ) d ψ 2 , with degrees of freedom parameters ν and δ , and scale parameter b . Mixing a hyperparameter with another distribution is a way to robustify a prior. Example: The Student t prior is known to be more robust than a normal prior for regression analysis. The Student t prior is obtained by mixing the variance of a normal prior: � N ( β ; µ, σ 2 ) IG ( σ 2 ; ν 2 ) d σ 2 . t ( β ; µ, γ, ν ) = 2 , γ Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 7 / 44

  13. τ τ τ τ τ τ Introducing the univariate F and matrix- F prior The F prior Setting ν = 1, the standard deviation has a half- t distribution: � δ +1 2Γ( δ +1 ) � 1 + σ 2 2 2 p ( σ | ν = 1 , δ, b ) = . √ Γ( δ b 2 ) b π Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 8 / 44

  14. Introducing the univariate F and matrix- F prior The F prior Setting ν = 1, the standard deviation has a half- t distribution: � δ +1 2Γ( δ +1 ) � 1 + σ 2 2 2 p ( σ | ν = 1 , δ, b ) = . √ Γ( δ b 2 ) b π The F prior results in more desirable behavior than the inverse gamma prior for school data (Gelman, 2006). 3 schools: posterior on τ given 3 schools: posterior on τ given F(1,1,25)-prior on τ 2 uniform prior on τ 0 50 100 150 200 0 50 100 150 200 τ τ Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 8 / 44

  15. Introducing the univariate F and matrix- F prior The matrix- F prior In a multivariate setting, the inverse Wishart prior is the default choice for a k × k covariance matrix. Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 9 / 44

  16. Introducing the univariate F and matrix- F prior The matrix- F prior In a multivariate setting, the inverse Wishart prior is the default choice for a k × k covariance matrix. The inverse Wishart prior is a matrix generalization of the inverse gamma prior, and thus has similar issues. Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 9 / 44

  17. Introducing the univariate F and matrix- F prior The matrix- F prior In a multivariate setting, the inverse Wishart prior is the default choice for a k × k covariance matrix. The inverse Wishart prior is a matrix generalization of the inverse gamma prior, and thus has similar issues. We propose to robustify the inverse Wishart by mixing the scale matrix with a Wishart distribution : � F ( Σ ; ν, δ, S ) = IW ( Σ ; δ + k − 1 , Ψ ) × W ( Ψ ; ν, B ) d Ψ , where ν controls the behavior near the origin of | Σ | , δ controls the behavior in the tails of | Σ | , and B is a scale matrix. Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 9 / 44

  18. Introducing the univariate F and matrix- F prior The matrix- F prior In a multivariate setting, the inverse Wishart prior is the default choice for a k × k covariance matrix. The inverse Wishart prior is a matrix generalization of the inverse gamma prior, and thus has similar issues. We propose to robustify the inverse Wishart by mixing the scale matrix with a Wishart distribution : � F ( Σ ; ν, δ, S ) = IW ( Σ ; δ + k − 1 , Ψ ) × W ( Ψ ; ν, B ) d Ψ , where ν controls the behavior near the origin of | Σ | , δ controls the behavior in the tails of | Σ | , and B is a scale matrix. Setting S = I k yields the standard matrix- F distribution (Dawid, 1981). Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 9 / 44

  19. Introducing the univariate F and matrix- F prior Properties of the matrix- F distribution Reciprocity: Σ ∼ F ( ν, δ, S ) ⇒ Σ − 1 ∼ F ( δ + k − 1 , ν − k + 1 , S − 1 ) Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 10 / 44

  20. Introducing the univariate F and matrix- F prior Properties of the matrix- F distribution Reciprocity: Σ ∼ F ( ν, δ, S ) ⇒ Σ − 1 ∼ F ( δ + k − 1 , ν − k + 1 , S − 1 ) Invariant under marginalization: Σ ∼ F ( ν, δ, S ) ⇒ Σ 11 ∼ F ( ν, δ, S 11 ) Mulder (Tilburg University) The Matrix- F Prior CWI, Amsterdam 10 / 44

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend