EigenvaluesofLvy CovariationMatrices Random matrix models for - - PowerPoint PPT Presentation

eigenvaluesofl vy covariationmatrices
SMART_READER_LITE
LIVE PREVIEW

EigenvaluesofLvy CovariationMatrices Random matrix models for - - PowerPoint PPT Presentation

EigenvaluesofLvy CovariationMatrices Random matrix models for datasets with fixed time horizons Gregory Zitelli SIAM-ALA18, Hong Kong May, 2018 Laloux et al. (1999) Which is real? Which is fake? Contents 1. Motivation Markowitz portfolio


slide-1
SLIDE 1

EigenvaluesofLévy CovariationMatrices

Random matrix models for datasets with fixed time horizons

Gregory Zitelli

SIAM-ALA18, Hong Kong May, 2018

slide-2
SLIDE 2

Laloux et al. (1999)

slide-3
SLIDE 3

Which is real? Which is fake?

slide-4
SLIDE 4

Contents

  • 1. Motivation

Markowitz portfolio theory, Bai’s Theorem

  • 2. M–P Law

Universality, eigenvalue outliers

  • 3. Sample Lévy Covariance Ensemble (SLCE)

Lévy processes, GGC/EGGC processes

  • 4. Limiting Spectral Distributions

Stieltjes transform algorithm, approximate densities

slide-5
SLIDE 5

Motivation

slide-6
SLIDE 6

Markowitz Portfolio Theory

1/29

p risky assets with return vector r = [r1 . . . rp]. µ µ µ = E[r] is the fixed 1 × p vector of expected returns Σ Σ Σ is the symmetric p × p covariance matrix. w is the 1 × p vector of weights on each asset, w1† ≤ 1.

For fixed 0 < σ2 < ∞, consider the optimization maximize wµ µ µ† (expected return on portfolio) subject to w ∈ R1×p, w1† ≤ 1, wΣ Σ Σw† ≤ σ2 (acceptable volatility) Let R(σ2,µ µ µ,Σ Σ Σ) denote the optimal return µ = wµ µ µ†.

slide-7
SLIDE 7

Markowitz Portfolio Theory

2/29

maximize wµ µ µ† = R(σ2,µ µ µ,Σ Σ Σ) subject to w ∈ R1×p, w1† ≤ 1, wΣ Σ Σw† ≤ σ2

R(σ2,µ µ µ,Σ Σ Σ) and the optimal w have closed-form expressions in terms of σ2, µ µ µ, and Σ Σ Σ. In practice, µ µ µ and Σ Σ Σ are unknown. How accurate is the portfolio construction when µ µ µ and Σ Σ Σ are estimated by independent sampling?

slide-8
SLIDE 8

Estimating the Efgicient Frontier

3/29

slide-9
SLIDE 9

Markowitz Portfolio Theory

4/29

Rectangular N × p data matrix. Let λ = p/N ∈ (0, 1), N samples, p assets. Bouchaud and Potters, mid 2000s, out-of-sample risk underestimated by √ 1 − λ, unproven. 2006, Bai Zhidong, Wong Wing-Keung, A Note on the Mean-Variance Analysis of Self-Financing Portfolios (unpublished), full proof published in 2009. Overestimation of returns, underestimation of risk. Relies on Marčenko–Pastur law.

slide-10
SLIDE 10

Bai’s Theorem

5/29

p assets, N observations, p/N → λ(0, 1). Σ Σ ΣN sequence of p × p covariance matrices µ µ µN sequence of 1 × p expected return vectors. Model: XN = 1 1 1†µ µ µN + YNΣ Σ Σ1/2

N .

Under mild assumptions (finite fourth moment), lim

N→∞

R(σ2, µ µ µN, Σ Σ ΣN) R(σ2,µ µ µN,Σ Σ ΣN) = √ 1 1 − λ

Proof is built on the Marčenko–Pastur (M–P) law applied to the sample data matrix YNΣ Σ Σ1/2

N .

slide-11
SLIDE 11

M–PLaw

slide-12
SLIDE 12

Marčenko–Pastur Law

6/29

Y =       y1,1 · · · y1,p . . . ... . . . . . . . . . yN,1 · · · yN,p                 

  • p

N, yj,k i.i.d. ∼ Y for all N

Rows are independent samples of p i.i.d. random variables with true covariance Ip, sample covariance matrix is S = 1 NY†Y If p is fixed and N → ∞, S → Ip, S has p eigenvalues converging to 1. What if p, N → ∞, p/N → λ ∈ (0, 1)?

slide-13
SLIDE 13

Marčenko–Pastur Densities

7/29

slide-14
SLIDE 14

Example

8/29

N = 2000, p = 500, λ = 0.25, i.i.d. Gaussian entries.

slide-15
SLIDE 15

Laloux et al. (1999)

9/29

slide-16
SLIDE 16

Outliers?

10/29

N = 2000, p = 500, λ = 0.25, i.i.d. Lognormal entries (normalized).

slide-17
SLIDE 17

SampleLévyCovariance Ensemble(SLCE)

slide-18
SLIDE 18

N → ∞

11/29

Marčenko–Pastur: N [y1,k y2,k y3,k . . . yN−1,k yN,k]† 

N′ [y1,k y2,k y3,k . . . yN−1,k yN,k yN+1,k . . . yN′−1,k yN′,k]† New approach, fixed horizon [0, T]: N [y1,k y2,k y3,k . . . yN−1,k yN,k]† 

  • N′

[y1,k y2,k y3,k . . . yN−1,k yN,k yN+1,k . . . yN′−1,k yN′,k]†

slide-19
SLIDE 19

Marčenko–Pastur

12/29

slide-20
SLIDE 20

Marčenko–Pastur, N increases

12/29

slide-21
SLIDE 21

Fixed time horizon

13/29

slide-22
SLIDE 22

Fixed time horizon, N increases

13/29

slide-23
SLIDE 23

Lévy Processes

14/29

Correspondence between Lévy processes Xt ⇐ ⇒ infinitely divisible (ID) distributions X through X1 ∼ X Lévy–Khintchine: Decomposition of characteristic function in terms

  • f Brownian motion (continuous part) and an additional measure

Π.

slide-24
SLIDE 24

Lévy Processes

15/29

1 t log φXt(ϑ) = iµϑ − 1 2σ2ϑ2 + ∫

R

[ eiϑx − 1 − iϑx 1 + x2 ] dΠ(x) Π positive Borel measure, Π({0}) = 0 dΠ(x) integrable near ±∞ x2dΠ(x) integrable near zero behavior of dΠ(x) near 0 ⇐ ⇒ path properties of Xt tail behavior of dΠ(x) ⇐ ⇒ tail behavior of fX

slide-25
SLIDE 25

GGCs

16/29

Thorin (1977): Nonnegative Lévy process such that dΠ(x) = x−1g+(x) dx, x > 0 where g+ : (0, ∞) → R is completely monotone (CM). Extended GGC (EGGC): Lefu and right tails, dΠ(x) = { x−1g+(x) dx, x > 0 |x|−1g−(|x|) dx, x < 0 g± both CM.

slide-26
SLIDE 26

EGGCs

17/29

Good properties: infinite activity, continuous densities. Lots of common distributions are GGC or EGGC.

α-stable, Student’s t, lognormal, gamma, Laplace, Pareto, generalized skew hyperbolic, generalized inverse Gaussian.

Popular models of asset returns are EGGC.

variance-gamma (VG) model of Madan and Seneta (1990), normal-inverse Gaussian (NIG) model of Barndorfg-Nielsen (1997), CGMY model of Carr et al. (2002).

Symmetric EGGCs are precisely the processes created by Brownian motion time-changed by a GGC!

slide-27
SLIDE 27

EGGCs

18/29

slide-28
SLIDE 28

Sample Lévy Covariance Ensemble

19/29

X =       x1,1 · · · x1,p . . . ... . . . . . . . . . xN,1 · · · xN,p                 

  • p

N, xj,k i.i.d. ∼ XT/N for each N

Fix an EGGC process Xt. Fix a time horizon T > 0. Fix a shape parameter λ ∈ (0, 1), p : N → N with p(N)/N → λ. Let X = X(N) be a sequence of independent random matrices with i.i.d. entries, such that [X(N)]jk ∼ XT/N.

slide-29
SLIDE 29

Sample Lévy Covariance Ensemble

20/29

X =       x1,1 · · · x1,p . . . ... . . . . . . . . . xN,1 · · · xN,p                 

  • p

N, xj,k i.i.d. ∼ XT/N for each N

Now consider the sample covariance S = 1 TX†X and its eigenvalues. Are these distinct from M–P?

slide-30
SLIDE 30

Variance-Gamma Process

21/29

slide-31
SLIDE 31

Normal-Inverse-Gaussian Process

22/29

slide-32
SLIDE 32

LimitingSpectralDistributions

slide-33
SLIDE 33

Variance of columns

23/29

MP law relies on the convergence of the sample variance of the columns to converge to something non-random.       x1,1 · · · x1,p . . . ... . . . . . . . . . xN,1 · · · xN,p       1 N

N

j=1

x2

j,k

If xj,k ∼ Y (mean zero) are i.i.d. then these converge to Var(Y). If xj,k ∼ XT/N then these converge to [X]T, the (random) quadratic variation process corresponding to Xt at time t = T.

slide-34
SLIDE 34

Random column variance

24/29

      x1,1 · · · x1,p . . . ... . . . . . . . . . xN,1 · · · xN,p       ∼       z1,1 · · · z1,p . . . ... . . . . . . . . . zN,1 · · · zN,p             √ν1 . . . √ν2 . . . . . . ... · · · √νp      

νj ∼ [X]T are i.i.d. copies of the quadratic variation process. This type of matrix has been well-studied, limiting spectral densities can be approximated by a modification

  • f the algorithm introduced in Yao et al. (2015).
slide-35
SLIDE 35

Algorithm

25/29

Let s(z) = ∫

R 1 x−zdL(x) be the Stieltjes transform of the

limiting spectral distribution L for z ∈ C+. The modified Stieltjes transform is s(z) = − 1−λ

z

+ λs(z). If ν ∼ [X]T is the measure of the quadratic variation process at time T > 0, then s(z) is given by s(z) = 1 −z + λ ∫

R w 1+ws(z)dν(w)

Iterations of the map for z = x + iϵ can be used with Stieltjes inversion to approximate the density for L. Density is approximately f(x) ≈ s(x + iϵ).

slide-36
SLIDE 36

Estimating the Density

26/29

slide-37
SLIDE 37

Empirical deviations from M–P

27/29

Comparison of S&P 500 (lefu, 476 assets) and Nikkei 225 (middle, 221 assets). Daily data (top) June 2013-May 2017 (approx. 900 datapoints) and minute-by-minute (bottom) January 2017-May 2017 (approx. 40000 [SPX] and 30000 [NKY] datapoints). Right column: SLCE generated with i.i.d. NIG entries, T = 0.05 × 900 (top) and T = 0.05 × 100 (bottom).

slide-38
SLIDE 38

Empirical deviations from M–P

28/29

slide-39
SLIDE 39

Conclusions

slide-40
SLIDE 40

Conclusions

29/29

Empirical evidence shows limiting distributions are independent of odd cumulants (skewness). Sensitive to higher moments, kurtosis not enough to determine shape (VG and NIG were normalized to have same excess kurtosis). Behavior of eigenvectors and top eigenvalue need to be investigated, efgects accuracy of weights in portfolio selection.

slide-41
SLIDE 41

Thank you!

slide-42
SLIDE 42

References

Bai, Z., Liu, H., Wong, W.K., 2009: Enhancement of the applicability of Markowitz’s portfolio optimization by utilizing random matrix theory. Mathematical Finance,

  • Vol. 19.

Laloux, L., Pierre, C., Bouchaud, J.P., Potters, M., 1999: Noise dressing of financial correlation matrices. Phys. Review Letters, Vol. 83, No. 7. Marčenko, V.A., Pastur, L.A. 1967: Distribution of eigenvalues for some sets of random matrices. Journal of Multivariate Analysis, Vol. 20. Yao, J., Zheng, S., Bai, Z., 2015: Large Sample Covariance Matrices and High-Dimensional Data Analysis. Cambridge University Press, New York.

slide-43
SLIDE 43

Convergence of a histogram

Y is N × p, p/N ≈ λ ∈ (0, 1), i.i.d. ∼ Y (fixed). S = 1

NY†Y, p × p symmetric matrix.

The histogram measure of S is µS = 1 p ∑

σ∈Eig[Y]

δσ M–P Law: As N, p → ∞, µS converges weakly to the M–P distribution with parameter λ (almost surely).

slide-44
SLIDE 44

Markowitz Portfolio Theory

Model: XN = 1 1 1†µ µ µN + YNΣ Σ Σ1/2

N .

p : N → N such that p(N)/N = p/N → λ ∈ (0, 1) µ µ µN ∈ R1×p, Σ Σ Σ is a positive-definite p × p symmetric matrix, s.t. the following limits exist 1Σ Σ Σ−1

N 1†

N → a1, 1Σ Σ Σ−1

N µ

µ µ†

N

N → a2, µ µ µNΣ Σ Σ−1

N µ

µ µ†

N

N → a3 YN is an N × p matrix whose entries are i.i.d. ∼ Y, where Y is some probability distribution which has mean zero, unit variance, finite fourth moment.

slide-45
SLIDE 45

Bai’s Theorem

Given the previous model, almost surely we have 1 Σ Σ Σ

−1 N 1†

N → 1 1 − λa1, 1 Σ Σ Σ

−1 N

µ µ µ†

N

N → 1 1 − λa2,

  • µ

µ µN Σ Σ Σ

−1 N

µ µ µ†

N

N → 1 1 − λa3 and in particular for any σ2 > 0 we have lim

N→∞

R(σ2, µ µ µN, Σ Σ ΣN) R(σ2,µ µ µN,Σ Σ ΣN) = √ 1 1 − λ where µ µ µN and Σ Σ ΣN = Σ Σ Σ1/2

N Y† NYNΣ

Σ Σ1/2

N

are the sample mean and covariance. Proof is built on the Marčenko–Pastur (M–P) law applied to the sample data matrix YNΣ Σ Σ1/2

N .