a clt for information theoretic statistics of gram random
play

A CLT for Information-Theoretic Statistics of Gram Random Matrices - PowerPoint PPT Presentation

A CLT for Information-Theoretic Statistics of Gram Random Matrices Malika Kharouf Joint work with W.Hachem, J.Najim and J.Silverstein October 12, 2010 Workshop Large Random Matrices and their applications - October 11-13, 2010. The Model: A


  1. A CLT for Information-Theoretic Statistics of Gram Random Matrices Malika Kharouf Joint work with W.Hachem, J.Najim and J.Silverstein October 12, 2010 Workshop Large Random Matrices and their applications - October 11-13, 2010.

  2. The Model: A Non-Centered Random Matrices Consider a p × n random matrices: 1 √ nX n + A n , Σ n = where, ◮ X n ij , 1 ≤ i ≤ p , 1 ≤ j ≤ n are i.i.d. centered with unit variance and E | X 11 | 16 < ∞ . ◮ A n is a p × n deterministic matrix with uniformly bounded spectral norm.

  3. The Model: Information-Theoretic Statistics of Gram random matrices Linear spectral statistics: p I n ( ρ ) = 1 � � λ ( n ) � log + ρ , i p i =1 where, λ ( n ) , i = 1 , . . . , p are the eigenvalues of the Gram random i matrix Σ n Σ ∗ n and ρ is a nonnegative parameter. Objective: Understanding the asymptotic distribution of the fluctuations of I n ( ρ ), when the dimensions of the matrix Σ n converge to infinity at the same pace and obtain a simple form of the variance.

  4. Plan Motivations: Mutual Information for Multiple Antenna Radio Channels Asymptotic behavior of I n ( ρ ): First-order results Fundamental system of equations Deterministic equivalents Study of the fluctuations Definition of the variance The Central Limit Theorem Outline of the proof of the CLT The approach: REFORM method Main steps of the proof The bias Outline of the proof of the bias term

  5. Motivations: Mutual Information for Multiple Antenna Radio Channels

  6. Multi-user MIMO scheme Figure: MIMO Systems

  7. MIMO System: Mathematical Model The p -dimensional receiver vector r n is given by: r n = Σ n t n + b n , where, ◮ Σ n represents the channel matrix which assumed to be random. ◮ t n is the n -dimensional transmitter vector. ◮ b n is an additive white Gaussian noise with covariance matrix E b n b ∗ n = ρ I p . Performance indicator: The Mutual Information: p I n ( ρ ) = 1 n + ρ I p ) = 1 � p log det (Σ n Σ ∗ log ( λ i + ρ ) p i =1 Asymptotic behavior of I n ( ρ ) when n , p → ∞ at the same rate ?

  8. Asymptotic behavior of I n ( ρ ) : First-order results

  9. First-order results Let f n denotes the ST of µ Σ n Σ ∗ n , the spectral measure of the eigenvalues of Σ n Σ ∗ n . Then, � ∞ I n ( ρ ) = − f n ( − ω ) d ω. ρ Then the asymptotic behavior of I n ( ρ ) is closely linked to the asymptotic behavior of f n as p , n → ∞ with the same pace.

  10. State of the art n → H , H is a deterministic probability measure. ◮ F A n A ∗ Dozier and Silverstein (04): n weakly F Σ n Σ ∗ − − − − − → F , where, F is a deterministic probability measure which the Stieltjes transform is a unique solution of a given coupled equation.

  11. State of the art n → H , H is a deterministic probability measure. ◮ F A n A ∗ Dozier and Silverstein (04): n weakly F Σ n Σ ∗ − − − − − → F , where, F is a deterministic probability measure which the Stieltjes transform is a unique solution of a given coupled equation. ◮ V. L. Girko (91), Hachem-Loubaton-Najim (07) : Look for a deterministic approximation of the Stieltjes transform f n of F Σ n Σ ∗ n . ∃ a p × p deterministic valued function T n ( ρ ) such that: f n ( − ρ ) − 1 a . s p Tr T n ( − ρ ) − − − → n →∞ 0

  12. Fundamental equations Theorem (Girko ’91, Hachem-Loubaton-Najim ’07) The following system of two equations � − 1 △  A n A ∗ δ n ( ρ ) = 1 � = 1 � � 1 + ˜ n δ n ( ρ ) I p + n Tr T n ( ρ )  n Tr ρ   1 + δ n ( ρ )  � − 1 △ A ∗ δ n ( ρ ) = 1 � = 1 n A n ˜ n Tr ˜  n Tr ρ (1 + δ n ( ρ )) I n + T n ( ρ ) ,   1 + ˜  δ n ( ρ ) admits a unique solution ( δ n , ˜ δ n ) in S ( R + ) 2 . Moreover, � � a . s R + f ( λ ) dF Σ n Σ ∗ ∀ f ∈ C B ( R + ) , n ( λ ) − R + f ( λ ) π n ( d λ ) − n →∞ 0 , − − → where π n is the positive measure where δ n is the Stieltjes transform.

  13. First order result: Deterministic equivalents Theorem (Hachem-Loubaton-Najim ’07) � Let V n ( ρ ) = R + log( λ + ρ ) π n ( d λ ) . Then we have: E I n ( ρ ) − V n ( ρ ) − − − − − − − − − − → n → c > 0 0 . n , p →∞ , p Moreover, V n ( ρ ) admits a closed-form expression p � µ 2 � 1 � � n , i � 1 + ˜ V n ( ρ ) = log ρ δ n + 1 + δ n p i =1 + n p log (1 + δ n ) − ρ n p δ n ˜ δ n , where µ n , i are the singular values of the mean matrix A n .

  14. In the non-centered case, the first-order asymptotic study of the mutual information depends mainly on the limiting behavior of the singular values of the mean matrix A n .

  15. Study of the fluctuations

  16. CLT for p ( I n ( ρ ) − V n ( ρ )) In order to study the CLT for p ( I n ( ρ ) − V n ( ρ )) we study separately two quantities: ◮ The random quantity p ( I n ( ρ ) − E I n ( ρ )) from which the fluctuations arise and, ◮ The deterministic quantity p ( E I n ( ρ ) − V n ( ρ )) which yields a bias.

  17. Asymptotic distribution of the fluctuations: Definition of the variance Theorem (Hachem-Kharouf-Najim-Silverstein ’10) 11 , κ = E | X 11 | 4 − 2 − ϑ 2 and let Let ϑ = E X 2 T ¯ n Tr T 2 , ˜ T 2 , γ = 1 γ = 1 γ = 1 n Tr ˜ n Tr T ¯ γ = 1 n Tr ˜ ˜ T , ˜ T . Denote by 2     1 Θ 2 � Tr TAA ∗ T − ρ 2 γ ˜ = − log  1 − γ n    � 1 + ˜ n δ 2 � �   � � 1 � Tr ¯ T ¯ � AA ∗ T � − | ϑ | 2 ρ 2 γ ˜ − log 1 − ϑ γ  � �  � 1 + ˜ � � n δ � � + κρ 2 � � t 2 t 2 ˜ ii jj n 2 i j Then Θ 2 n is well defined.

  18. Some remarks ◮ The variance is the sum of tree terms: the first term would be the same in the Gaussian case. ◮ The variance depends on the singular values of the main matrix as well as on its singular vectors. = X ij e i α for all α ), the second term D ◮ In the circular case ( X ij disappears.

  19. Asymptotic distribution of the fluctuations: The CLT Theorem (Hachem-Kharouf-Najim-Silverstein ’10) The following convergence holds true: p D ( I n ( ρ ) − E I n ( ρ )) − p , n →∞ N (0 , 1) , − − − → Θ n where D stands for convergence in distribution.

  20. Proof of the CLT: The approach REFORM ( RE solvent FOR mula and M artingale). ◮ I n ( ρ ) − E I n ( ρ ) as a sum of increments of martingale. ◮ Identification of the variance.

  21. CLT for martingales Theorem Let Γ ( n ) 1 , . . . , Γ ( n ) be a sequence of increments of martingale with n respect to a given filtration F ( n ) 1 , . . . , F ( n ) n . Assume that there exists a sequence of nonnegative real numbers (Θ 2 n ) n uniformly bounded away from zero and from infinity. Assume that: ◮ n � Γ ( n ) 2 � |F ( n ) P � − Θ 2 − n →∞ 0 . − − → E n j j − 1 j =1 ◮ The Lyapunov’s condition n 1 | 2+ α − � E | Γ ( n ) ∃ α > 0 , n →∞ 0 , − − → holds . j Θ 2(1+ α ) n j =1 j =1 Γ ( n ) � n Then Θ − 1 converges in distribution to N (0 , 1) . n j

  22. Sum of martingale differences We have, n n △ � � I n − E I n = ( E j − E j − 1 ) ( − log(1 + ξ j )) = Γ j , j =1 j =1 where, � � 1 η ∗ n Tr Q j + a ∗ j Q j η j − j Q j a j ξ j = . 1 + 1 n Tr Q j + a ∗ j Q j a j with η j , a j are resp. the jth columns of matrices Σ n and A n , Q j is the resolvent of the matrix Σ j Σ ∗ j and E j stands for the conditional expectation with respect to the σ -algebra F ( n ) = σ ( x 1 , . . . , x j ). j

  23. Sum of the conditional variances Some properties of the function log, n n E j − 1 (( E j − E j − 1 ) log(1 + ξ j )) 2 − � � E j − 1 ( E j ξ j ) 2 P − p , n →∞ 0 − − − → j =1 j =1 where (recall) � � 1 η ∗ n Tr Q j + a ∗ j Q j η j − j Q j a j ξ j = . 1 + 1 n Tr Q j + a ∗ j Q j a j

  24. Study of the sum of conditional variances Standard calculations remain the problem to the study of the asymptotic behavior of the quantities: 1 n Tr ( E j Q n ) 2 j ( E j Q n ) 2 a j , a ∗ and where Q n is the resolvent of Σ n Σ ∗ n matrix.

  25. Outline of the proof A good comprehension of the asymptotic behavior of these terms requires a specific study of bilinear forms of type u ∗ n Q ( ρ ) v n where at least u n or v n is a given column of the deterministic mean matrix A n . If u n and v n are deterministics, Hachem-Loubaton-Najim-Vallet (preprint’10) u ∗ n Q ( ρ ) v n ≈ u ∗ n T ( ρ ) v n

  26. Asymptotic behavior of the bias: Theorem (Hachem-Kharouf-Najim-Silverstein ’10) We have, p ( E I n ( ρ ) − V n ( ρ )) − B n ( ρ ) − p , n →∞ 0 − − − → where, B n ( ρ ) = κ Cte ( ρ, δ, ˜ δ ) κ = E | X 11 | 4 − 2 − ϑ 2 .

  27. Outline of the proof of the bias term The bias term is given by p ( E I n ( ρ ) − V n ( ρ )) χ n ( ρ ) = � ∞ d d ω E log det (Σ n Σ ∗ = n + ω I p ) d ω p ρ � ∞ �� � d − p R + log ( λ + ω ) π n ( d λ ) d ω d ω ρ � ∞ = Tr ( E Q n ( ω ) − T n ( ω )) d ω. ρ Then it remains to study the asymptotic behavior of Tr ( E Q n ( ω ) − T n ( ω )). We prove, Tr ( E Q n ( ω ) − T n ( ω )) − κ Cte ( ρ, δ, ˜ δ ) − n →∞ 0 − − →

  28. Case of a non-centered separable random matrix model

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend