  1. Beta distributions on matrices and applications. Gérard Letac, Université Paul Sabatier, Toulouse, France. Conference at Banff, January 14th, 2008.

  2. Hermitian matrices. We take care simultaneously of the real case (d = 1), the complex case (d = 2) and the quaternionic case (d = 4). It is convenient to sometimes denote by F_1, F_2 and F_4 respectively the real numbers, the complex numbers and the quaternions. We fix a positive integer r. We denote by M_r the real linear space of (r, r) matrices X = (x_ij)_{1 ≤ i,j ≤ r} with elements in F_d. The adjoint X* = (y_ij)_{1 ≤ i,j ≤ r} of X = (x_ij)_{1 ≤ i,j ≤ r} is defined by y_ij = x̄_ji, the conjugate of x_ji. The group K_r is the group of u ∈ M_r such that uu* = u*u = 1. Thus K_r is the orthogonal group for d = 1, the unitary group for d = 2 and the symplectic group for d = 4. Furthermore X ∈ M_r is said to be Hermitian if X = X*. Determinants of quaternionic Hermitian matrices and their eigenvalues need some care to be properly defined.
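
As a small concrete illustration in the complex case d = 2, where the adjoint is the ordinary conjugate transpose: writing matrices with rows separated by semicolons, X = (1  i ; 2  3−i) has X* = (1  2 ; −i  3+i), so X is not Hermitian, while (1  i ; −i  3) is; and K_2 is then the unitary group U(2).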

  3. You people of random matrices, you who call ensemble∗ what others call probability law on matrices, remember that what you call β is the Peirce constant d of Jordan algebras. The number f = d/2 is the half Peirce constant. (∗ after Boltzmann)

  4. We denote by V_r the real linear space of Hermitian matrices (therefore real symmetric for d = 1 and quaternionic Hermitian for d = 4). If X ∈ V_r, then X is said to be positive definite if for any nonzero z ∈ F_d^r, written as a column, the number z*Xz is a real number and is positive. We denote by Ω the cone of positive definite Hermitian matrices of order r, and by I_r the identity matrix.
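
For the real case d = 1 with r = 2 this is the familiar criterion: a symmetric matrix (a  b ; b  c) belongs to Ω exactly when a > 0 and ac − b² > 0. For instance (2  1 ; 1  1) is in Ω, while (1  2 ; 2  1) is not, since its determinant is −3.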

  5. Wishart. If σ ∈ Ω and if p ∈ Λ = {f, 2f, ..., (r − 1)f} ∪ ((r − 1)f, ∞), the Wishart distribution γ_{p,σ}(dx) is the distribution on Ω whose Laplace transform is

    ∫_Ω e^{−tr(θx)} γ_{p,σ}(dx) = det(I_r + σ^{1/2} θ σ^{1/2})^{−p}.

Of course, for d = 1, 2 the Laplace transform can be given the simpler form

    ∫_Ω e^{−tr(θx)} γ_{p,σ}(dx) = det(I_r + θσ)^{−p}.

When p > (r − 1)f, then γ_{p,σ}(dx) is

    (1/Γ_Ω(p)) e^{−tr(σ^{−1}x)} (det x)^{p − 1 − (r−1)f} (det σ)^{−p} 1_Ω(x) dx,

where

    Γ_Ω(p) = C ∏_{j=1}^{r} Γ(p − (j − 1)f).   (1)

The numerical constant C does not depend on p but only on d and r.
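
A sanity check in the scalar case r = 1, where (r − 1)f = 0 and Λ = (0, ∞): the density above reduces to the ordinary gamma density

    (1/(Γ(p) σ^p)) e^{−x/σ} x^{p−1} 1_{(0,∞)}(x) dx,

whose Laplace transform is indeed (1 + θσ)^{−p}.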

  6. If p = kf with k = 1, 2, ..., r − 1, then γ_{p,σ} is concentrated on the positive semidefinite Hermitian matrices of rank k and is a singular distribution. Λ is called the Gindikin set, since Gindikin showed in 1975 that the above Laplace transform is not the Laplace transform of a positive measure when p ∉ Λ, but only of a Schwartz distribution.
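
For instance, in the real case d = 1 (so f = 1/2) with r = 3, Λ = {1/2, 1} ∪ (1, ∞): the values p = 1/2 and p = 1 give singular Wishart distributions carried by the positive semidefinite matrices of rank 1 and 2 respectively, while every p > 1 gives an absolutely continuous γ_{p,σ}.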

  7. Beta. Proposition 1. If U and V are independent random variables with respective distributions γ_{p,σ} and γ_{q,σ}, where p and q are in Λ and are such that p + q > (r − 1)f, then
1. S = U + V is invertible.
2. Z = S^{−1/2} U S^{−1/2} is independent of S.
3. The distribution of Z does not depend on σ.
4. The distribution of Z is invariant by the action of K_r defined by z ↦ uzu*.
5. If p > (r − 1)f and q > (r − 1)f, then Z has a density concentrated on Ω ∩ (I_r − Ω):

    C (det z)^{p − 1 − (r−1)f} (det(I_r − z))^{q − 1 − (r−1)f}

with C = Γ_Ω(p + q)/(Γ_Ω(p) Γ_Ω(q)). The distribution of such a Z is called the beta distribution B_{p,q} with parameters p, q.
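
In the scalar case r = 1 this recovers the classical beta distribution: the density of item 5 becomes

    (Γ(p + q)/(Γ(p) Γ(q))) z^{p−1} (1 − z)^{q−1} on (0, 1),

and items 2 and 3 reduce to the familiar fact that if U and V are independent gamma variables with the same scale, then U/(U + V) is independent of U + V and its law does not depend on that scale.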

  8. Some linear algebra. We equip V_r with the inner product ⟨a, b⟩ = tr(ab). Thus V_r becomes Euclidean. Since V_r is Euclidean, speaking of symmetric linear operators acting on V_r makes sense. By definition such a symmetric operator ϕ : V_r → V_r must satisfy, for all a, b ∈ V_r, tr(a ϕ(b)) = tr(ϕ(a) b). Denote by S(V_r) the linear space of these symmetric operators ϕ on V_r. Here are two important examples of elements of S(V_r).
Example 1: the operator P(z). If z ∈ V_r and a ∈ V_r, denote P(z)(a) = zaz. Thus for fixed z ∈ V_r the map P(z) defined by a ↦ zaz is linear. Furthermore it is symmetric, since tr(a P(z)(b)) = tr(P(z)(a) b), that is tr(azbz) = tr(zazb), by commutativity of traces.
Example 2: the operator z ⊗ z. If z ∈ V_r and a ∈ V_r, define (z ⊗ z)(a) = z tr(za). Thus a ↦ (z ⊗ z)(a) is linear from V_r to V_r and it defines a symmetric operator on V_r, since tr((z ⊗ z)(a) b) = tr(za) tr(zb) = tr(a (z ⊗ z)(b)).
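
A minimal numerical sketch of these two operators in the real case d = 1, written in Python with numpy (the helper rand_sym and the names P_z, T_z are ad hoc choices, not from the slides); it merely checks the symmetry of P(z) and z ⊗ z for the inner product ⟨a, b⟩ = tr(ab).

    import numpy as np

    rng = np.random.default_rng(0)
    r = 3

    def rand_sym():
        # a random element of V_r in the real case d = 1 (real symmetric matrices)
        m = rng.standard_normal((r, r))
        return (m + m.T) / 2

    z, a, b = rand_sym(), rand_sym(), rand_sym()

    def P_z(x):
        return z @ x @ z                # P(z): a -> z a z

    def T_z(x):
        return z * np.trace(z @ x)      # z (x) z: a -> z tr(za)

    # symmetry with respect to <a, b> = tr(ab)
    print(np.isclose(np.trace(a @ P_z(b)), np.trace(P_z(a) @ b)))   # True
    print(np.isclose(np.trace(a @ T_z(b)), np.trace(T_z(a) @ b)))   # True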

  9. The magic operator Ψ. We want to avoid calculations (actually hidden in the proof of the next proposition). The magic operator Ψ is a special linear map of S(V_r) into itself which has the property that Ψ(z ⊗ z) = P(z) for all z ∈ V_r. Since this holds for all z ∈ V_r, and since the rank-one elements z ⊗ z of S(V_r) are numerous enough to generate S(V_r) itself, it is not surprising that Ψ(z ⊗ z) = P(z) defines at most one Ψ.
Proposition 2. There exists one and only one endomorphism Ψ of S(V_r) such that for all z

    Ψ(z ⊗ z) = P(z).

Furthermore (recall that f = d/2):

    Ψ(P(z)) = f z ⊗ z + (1 − f) P(z).
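
Two quick consistency checks of these formulas. In the scalar case r = 1 the operators z ⊗ z and P(z) coincide (both are multiplication by z²), and the second formula collapses to Ψ(P(z)) = f P(z) + (1 − f) P(z) = P(z), as it must. In the complex case f = 1 it reads Ψ(P(z)) = z ⊗ z, so Ψ exchanges z ⊗ z and P(z) and is an involution of S(V_r).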

  10. It is not advisable to represent the operator Ψ by a matrix: dim S(V_r) = n(n + 1)/2, where n = dim V_r = r + dr(r − 1)/2. Suppose that d = r = 2: then n = 4, so dim S(V_2) = 10, and after having chosen a basis of S(V_2) you still have to find the representative matrix of Ψ corresponding to this basis. A colleague became convinced of the usefulness of Ψ after he had completely written the hundred entries of such a matrix.

  11. The Olkin and Rubin Lemma. For simplicity, for u in K_r denote k_u(z) = uzu* for all z ∈ V_r. Since V_r is Euclidean, when k is a linear operator on V_r it makes sense to define the adjoint operator k* by tr(a k(b)) = tr(k*(a) b) for all a and b in V_r. In particular (k_u)* = k_{u*}. We denote by K the image of K_r by u ↦ k_u.

  12. The lemma looks for all the elements f of S(V_r) such that for all k ∈ K one has f = k f k*. An example of such an f is I_r ⊗ I_r: if k = k_u, let us compute f(z) and k f k*(z) for all z ∈ V_r. Thus f(z) = I_r tr(z) and

    k f k*(z) = k f(u*zu) = k(I_r tr(u*zu)) = k(I_r tr(z)) = tr(z) k(I_r) = tr(z) u I_r u* = tr(z) I_r.

Another example of f such that f = k f k* is simply the identity on V_r: if f = id_{V_r} then f(z) = z and k f k*(z) = uu*zuu* = z since uu* = I_r. The lemma says that these two examples are essentially the only ones. More specifically:
Lemma. Let f ∈ S(V_r). Then f = k f k* for all k ∈ K if and only if there exist two real numbers λ and µ such that f = λ id_{V_r} + µ I_r ⊗ I_r.

  13. First and second moments of Wishart. Proposition 3. Let U ∼ γ_{p,σ} where p ∈ Λ and σ ∈ Ω. Then E(U) = pσ and

    E(U ⊗ U) = p² σ ⊗ σ + p P(σ),
    E(P(U)) = pf σ ⊗ σ + (p² + p(1 − f)) P(σ),

which means

    E(tr(Ua) tr(Ub)) = p² tr(σa) tr(σb) + p tr(aσbσ),
    E(tr(aUbU)) = pf tr(σa) tr(σb) + (p² + p(1 − f)) tr(aσbσ).

Proof of Proposition 3. The computations of E(U) and of E(U ⊗ U) are directly obtained from the Laplace transform. The computation of E(P(U)) is an application of Proposition 2: apply Ψ-the-Magic to both sides of the formula for E(U ⊗ U). We get by linearity

    E(P(U)) = E(Ψ(U ⊗ U)) = p² Ψ(σ ⊗ σ) + p Ψ(P(σ)) = p² P(σ) + pf σ ⊗ σ + p(1 − f) P(σ).
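
A Monte-Carlo sketch of Proposition 3 in the real case d = 1 (so f = 1/2), in Python with numpy. It relies on the representation, not stated on these slides but consistent with the Laplace transform det(I_r + θσ)^{−p} above, of γ_{k/2,σ} as a sum of k rank-one terms zz^T with z ∼ N_r(0, σ/2); the matrices sigma, a, b below are arbitrary test choices.

    import numpy as np

    rng = np.random.default_rng(1)
    r, k, n = 2, 5, 100_000
    p, f = k / 2, 0.5

    sigma = np.array([[2.0, 0.5], [0.5, 1.0]])
    a = np.array([[1.0, 0.3], [0.3, 0.0]])
    b = np.array([[0.5, -0.2], [-0.2, 2.0]])

    L = np.linalg.cholesky(sigma / 2)        # z = L g with g ~ N(0, I) gives z ~ N(0, sigma/2)
    m1 = m2 = 0.0
    for _ in range(n):
        g = L @ rng.standard_normal((r, k))
        U = g @ g.T                          # U ~ gamma_{p, sigma} with p = k/2
        m1 += np.trace(U @ a) * np.trace(U @ b)
        m2 += np.trace(a @ U @ b @ U)
    m1, m2 = m1 / n, m2 / n

    tsa, tsb = np.trace(sigma @ a), np.trace(sigma @ b)
    tabss = np.trace(a @ sigma @ b @ sigma)
    print(m1, p**2 * tsa * tsb + p * tabss)                       # close up to Monte-Carlo error
    print(m2, p * f * tsa * tsb + (p**2 + p * (1 - f)) * tabss)   # close up to Monte-Carlo error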

  14. First and second moments of beta. Proposition 4. Let Z ∼ B_{p,q} where p, q ∈ Λ and p + q > (r − 1)f. Then E(Z) = (p/(p + q)) I_r and

    E(Z ⊗ Z) = λ_1 id_{V_r} + µ_1 I_r ⊗ I_r   (2)
    E(P(Z)) = λ_2 id_{V_r} + µ_2 I_r ⊗ I_r   (3)

where

    λ_1 = (p/(p + q)) · q/((p + q)² + (p + q)(1 − f) − f),
    µ_1 = (p/(p + q)) · (p(p + q + 1 − f) − f)/((p + q)² + (p + q)(1 − f) − f),

and λ_2 = (1 − f)λ_1 + µ_1 and µ_2 = fλ_1. (Note the simplicity of the formulas in the complex case f = 1.)
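
A quick check of these constants in the scalar case r = 1, where λ_1 + µ_1 should equal the second moment of the ordinary beta distribution. The common denominator factors as

    (p + q)² + (p + q)(1 − f) − f = (p + q + 1)(p + q − f),

the sum of the two numerators is q + p(p + q + 1 − f) − f = (p + 1)(p + q − f), hence

    λ_1 + µ_1 = (p/(p + q)) · (p + 1)/(p + q + 1) = p(p + 1)/((p + q)(p + q + 1)) = E(Z²),

as expected.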

  15. Proof. We use Proposition 1 and write Z = S^{−1/2} U S^{−1/2} with S = U + V, where U ∼ γ_{p,σ} and V ∼ γ_{q,σ} are independent. Since the distribution of Z is invariant by K, m = E(Z) is invariant by K, that is umu* = m for all u ∈ K_r: this implies that there exists a real number λ such that m = λ I_r (just diagonalize m to verify this fact). For computing λ we write λ I_r = E(Z | S), since Z and S are independent. Applying P(S^{1/2}) to both sides, this implies

    λ S = P(S^{1/2})(E(Z | S)) = E(P(S^{1/2})(Z) | S) = E(U | S).

Now we take the expectation of both sides: λ E(S) = E(U), which leads from Proposition 3 to λ(p + q)σ = pσ (recall that S ∼ γ_{p+q,σ} from the Laplace transform). Finally E(Z) = (p/(p + q)) I_r, as desired.
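
A short Monte-Carlo sketch of this identity in the real case d = 1, in Python with numpy: it builds Z = S^{−1/2} U S^{−1/2} from simulated Wishart matrices (same rank-one sampling assumption as in the previous sketch; sample_wishart and inv_sqrt are ad hoc helpers) and checks that the average of Z is close to (p/(p + q)) I_r.

    import numpy as np

    rng = np.random.default_rng(2)
    r, kp, kq, n = 2, 4, 6, 50_000
    p, q = kp / 2, kq / 2
    sigma = np.array([[1.0, 0.4], [0.4, 2.0]])
    L = np.linalg.cholesky(sigma / 2)

    def sample_wishart(k):
        # gamma_{k/2, sigma} in the real case, as a sum of k rank-one Gaussian terms
        g = L @ rng.standard_normal((r, k))
        return g @ g.T

    def inv_sqrt(s):
        # symmetric inverse square root s^{-1/2}
        w, v = np.linalg.eigh(s)
        return v @ np.diag(w ** -0.5) @ v.T

    acc = np.zeros((r, r))
    for _ in range(n):
        U, V = sample_wishart(kp), sample_wishart(kq)
        s12 = inv_sqrt(U + V)
        acc += s12 @ U @ s12
    print(acc / n)        # approximately p/(p+q) * I_2 = 0.4 * I_2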

  16. For the second moments, we use the Olkin and Rubin lemma above. Since the distribution of Z is invariant by K, the operator f = E(Z ⊗ Z) must satisfy f = k f k* for all k ∈ K, and there exist two numbers λ and µ such that E(Z ⊗ Z) = λ id_{V_r} + µ I_r ⊗ I_r. We translate this into

    E(tr(aZ) tr(bZ)) = λ tr(ab) + µ tr(a) tr(b).
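
One way to extract λ and µ from this identity (not necessarily the route the author takes) is to evaluate it at special choices of a and b: a = b = I_r gives E((tr Z)²) = λ r + µ r², while a = b = e_{11}, the Hermitian matrix with a single 1 in position (1, 1), gives E(Z_{11}²) = λ + µ. For r ≥ 2 these two equations determine λ and µ, and by (2) their values are the λ_1 and µ_1 of Proposition 4.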
