c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 1 –
Dimension Reduction with Heavy Tails Gabriel Kuhn Munich University - - PowerPoint PPT Presentation
Dimension Reduction with Heavy Tails Gabriel Kuhn Munich University - - PowerPoint PPT Presentation
Dimension Reduction with Heavy Tails Gabriel Kuhn Munich University of Technology http/www.ma.tum.de/stat 4th Conference on Extreme Value Analysis Gothenburg, August 15 . 19 . , 2005 Kl uppelberg, C. and Kuhn, G. (2005) Dimension
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 2 –
Factor Model
- Observable d-dimensional random vector X
X X
- Model: X
X X = µ µ µ + L L Lf f f + V V Ve e e – L L Lf f f: k-dimensional non-observable common factors f f f, loading matrix L L L – V V Ve e e: specific factors e e e, diagonal matrix V V V – (f f f,e e e) are uncorrelated (independent) Idea: Distribution of X X X described by linear combination of k factors with componentwise extra source of randomness.
- Classical model: (f
f f,e e e) ∼ Nk+d(0 0,I I I) and Cov(X X X) =: Σ Σ Σ = L L LL L LT + V V V2 Disadvantages: – Data may not be normal – No heavy-tailed model – Dependence in extremes cannot be modeled – Margins of the same type
- Task: Overcome disadvantages above
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 3 –
Distribution Models
- Elliptical Distribution:
X X X
d
=µ µ µ + GA A AU U U (k) G > 0 independent of U U U (k) ∼ unif{s s s : s s s = 1}, A A A ∈ Rd×k, Σ Σ Σ := A A AA A AT E(X X X) = µ µ µ, Cov(X X X) = EG2Σ Σ Σ/k, Corr(X X X) = diag(Σ Σ Σ)−1/2Σ Σ Σdiag(Σ Σ Σ)−1/2 =: R R R
- Example: – normal: X
X X
d
=µ µ µ +
- χ2
kA
A AU U U (k) ∼ Nd(µ µ µ,Σ Σ Σ) – multivariate tν: X X X
d
=µ µ µ +
- νχ2
k/χ2 νA
A AU U U (k) d =µ µ µ +
- ν/χ2
νNd(0
0,Σ Σ Σ)
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 4 –
Extended Factor Model
Drop normal assumption and consider (with L L LL L LT + V V V2 = Σ Σ Σ): X X X
d
=µ µ µ + L L Lf f f + V V Ve e e is elliptical: L L Lf f f + V V Ve e e = (L L L,V V V)
- f
f f e e e
- d
=G(L L L,V V V)U U U (k+d), e.g. choose multivariate tν-distribution Remark: – f f f and e e e are uncorrelated but not independent – alternatively: same dependence structure, but arbitrary margins (copula approach) – P(G > x) ∼ Cx−ν needed for modelling tail dependence
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 5 –
Factorization
- Standard approach uses (normal) ml algorithm for decomposition Σ
Σ Σ = L L LL L LT +V V V2 Definition: f (k)
ml (A
A A) = (L L L,V V V), (f (k)
ml (A
A A))2 := (L L L,V V V)(L L L,V V V)T = L L LL L LT + V V V2
- Lemma:
Σ Σ Σn
P
− → Σ Σ Σ = L L LL L LT +V V V2 and neighborhood of Σ Σ Σ decomposable by f (k)
ml
⇒
- f (k)
ml (Σ
Σ Σn) 2
P
− → Σ Σ Σ
- Interpretation:
Given some consistent and composable covariance (correlation) estimator, the algorithm computes a consistent decomposition (independent of distribution model)
- Remark:
In application algorithm almost always produces a decomposition
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 6 –
Dependence Concepts
- Kendall’s τ: Let (X, Y )
iid
∼ ( ˜ X, ˜ Y ) τ := P
- (X − ˜
X)(Y − ˜ Y ) > 0
- − P
- (X − ˜
X)(Y − ˜ Y ) < 0
- Elliptical distribution ⇒ R
R R = sin(πT T T/2), T T T = (τij)1≤i,j≤d
- Tail Dependence: (X, Y ) with margins FX, FY
λ := limuց0 P (Y < F ←
Y (u) |X < F ← X (u))
Elliptical distribution and P(G > x) ∼ Cx−ν, ν ∈ (0, ∞) ⇒ R R R = 1 − 2
- F ←
t,ν+1(1 − Λ/2)
2 ν + 1 +
- F ←
t,ν+1(1 − Λ/2)
2 Λ Λ Λ = (λij)1≤i,j≤d and Ft,ν : df of 1-dim tν
- Remark:
Both dependence concepts independent of margins
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 7 –
Example
- Choose d = 10, k = 2 factors with loadings
component 1 2 3 4 5 6 7 8 9 10 L L L·,1 .85 .76 .67 .58 .49 .41 .32 .23 .14 .05 L L L·,2 .17 .41 .55 .64 .71 .77 .81 .84 .85 .86 diag(V V V2) .25 .25 .25 .25 .25 .25 .25 .25 .25 .25 (⇒ L L LL L LT + V V V2 = R R R is a correlation matrix)
- consider factor model(s) (given before)
- 1. X
X X
d
=L L Lf f f + V V Ve e e ∼ td(0 0,R R R, ν) with ν = 6
- 2. X
X X has same t(0 0,R R R, ν) dependence structure, but different margins Fi = tνi, ν ν ν = (3, . . . , 10)
- Simulation length n = 2000, repeat 500 times
Plots of different estimation methods and 95%-CI’s of loadings
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 8 –
X X X ∼ td(0 0,R R R, ν)
2 2 2 2 2 2 2 2 2 2 2 2 4 4 4 4 4 4 4 4 4 4 4 4 6 6 6 6 6 6 6 6 6 6 6 6 8 8 8 8 8 8 8 8 8 8 8 8 10 10 10 10 10 10 10 10 10 10 10 10 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 1 1 1 1 1 1 1 1 1 1 1 1 loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings components components components components components components components components components components components components R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
factor 1 factor 2 specific factor
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 9 –
X X X has t-dependence, different margins
2 2 2 2 2 2 2 2 2 2 2 2 4 4 4 4 4 4 4 4 4 4 4 4 6 6 6 6 6 6 6 6 6 6 6 6 8 8 8 8 8 8 8 8 8 8 8 8 10 10 10 10 10 10 10 10 10 10 10 10 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 .6 1 1 1 1 1 1 1 1 1 1 1 1 loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings loadings components components components components components components components components components components components components R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
R R Rn R R RT
T T n
R R RΛ
Λ Λ n
R R Rmle
n
factor 1 factor 2 specific factor
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 10 –
Example
- Consider 8-dimensional set of data:
- il, s&p500, gbp, usd, chf, jpy, dkk and sek (exchange rates w.r.t. euro)
- Daily log-returns between May, 1985 to June, 2004 (n=4904)
- Apply factor analysis with different estimators as before
c (Gabriel Kuhn, TU Munich) EVA2005, Gothenburg – 11 –
−0.5 −0.5 −0.5 −0.5 0.0 0.0 0.0 0.0 0.5 0.5 0.5 0.5 1.0 1.0 1.0 1.0
factor 1 factor 2 factor 3 specific factors loadings loadings
emp emp emp emp mle mle mle mle tau tau tau tau taild taild taild taild
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4
1 1 1 1 2 2 2 2 3 3 3 3 4 4 4 4
- i
l
- i
l
- i
l
- i