ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails Marc S. - - PowerPoint PPT Presentation

alright asymmetric large scale i garch with hetero tails
SMART_READER_LITE
LIVE PREVIEW

ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails Marc S. - - PowerPoint PPT Presentation

ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails Marc S. Paolella Swiss Banking Institute, University of Z urich Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails Do Asset Returns Have Different Tail


slide-1
SLIDE 1

ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

Marc S. Paolella

Swiss Banking Institute, University of Z¨ urich

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-2
SLIDE 2

Do Asset Returns Have Different Tail Indices?

−30 −20 −10 10 20 30 −15 −10 −5 5 10 15 Bank of America Wal−Mart Scatterplot of BoA and Wal−Mart

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-3
SLIDE 3

Asset Returns Have Different Tail Indices

−30 −20 −10 10 20 30 −15 −10 −5 5 10 15 Bank of America Wal−Mart Scatterplot of BoA and Wal−Mart −10 −5 5 10 −10 −5 5 10 Bank of America Wal−Mart Fitted Multivariate Student t

ˆ k = 2.014

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-4
SLIDE 4

Asset Returns Have Different Tail Indices

5 10 15 20 25 30 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 Estimated Parameter k and 95% Bootstrap C.I.s The 30 individual stock return series Degrees of Freedom 5 10 15 20 25 30 3 4 5 6 7 8 9 10 11 12 Estimated Parameter k and 95% Bootstrap C.I.s The 30 individual stock return series Degrees of Freedom 5 10 15 20 25 30 −0.6 −0.4 −0.2 0.2 0.4 0.6 Estimated Parameter θ and 95% Bootstrap C.I.s The 30 individual stock return series Noncentrality Parameter 5 10 15 20 25 30 −0.6 −0.4 −0.2 0.2 0.4 0.6 Estimated Parameter θ and 95% Bootstrap C.I.s The 30 individual stock return series Noncentrality Parameter

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-5
SLIDE 5

2001 2002 2004 2005 2006 2008 −30 −20 −10 10 20 Bank of America Percentage Returns 2001 2002 2004 2005 2006 2008 −8 −6 −4 −2 2 4 6 8 10 Wal−Mart Percentage Returns 2001 2002 2004 2005 2006 2008 −10 −5 5 Bank of America GARCH−Filtered Residuals 2001 2002 2004 2005 2006 2008 −4 −2 2 4 6 Wal−Mart GARCH−Filtered Residuals Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-6
SLIDE 6

Meta-Elliptical t Distribution

The pdf of the meta-elliptical t distribution is given by fX(x; k, R) = ψ

  • Φ−1

k0 (Φk1(x1)), . . . , Φ−1 k0 (Φkd(xd)); R, k0

  • d
  • i=1

φki(xi), (1) where x = (x1, . . . , xd)′ ∈ Rd; k = (k0, k1, . . . , kd)′ ∈ Rd+1

>0 ;

φk(x) and Φk(x) denote, respectively, the univariate Student’s t pdf and cumulative distribution function (cdf) with k degrees of freedom, evaluated at x ∈ R; R is a d-dimensional correlation matrix, ...

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-7
SLIDE 7

Meta-Elliptical t Distribution

and, with z = (z1, z2, . . . , zd)′ ∈ Rd, the copula density function ψ(·; ·) = ψ (z1, z2, . . . , zd; R, k) multiplicatively relating the joint distribution of X to their distribution under independence is given by ψ(·; ·) = Γ{(k + d)/2}{Γ(k/2)}d−1

  • Γ{(k + 1)/2}

d |R|1/2

  • 1 + z′R−1z

k −(k+d)/2 ×

d

  • i=1
  • 1 + z2

i

k (k+1)/2 .

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-8
SLIDE 8

FaK (Fang, Fang Kotz)

We express a random variable T with location parameter µ = (µ1, . . . , µd)′ ∈ Rd, scale terms σ = (σ1, . . . , σd)′ ∈ Rd

>0, and

correlation matrix R, as T ∼ FaK (k, µ, σ, R), with FaK a reminder

  • f the involved authors, and density

fT(y; k, µ, σ, R) = fX(x; k, R) σ1σ2 · · · σd , x = y1 − µ1 σ1 , . . . , yd − µd σd

  • ,

(2) where fX(x; k, R) is given in (1). From its construction as a copula, the marginal distribution of each (Ti − µi)/σi is a standard Student’s t with ki degrees of freedom, irrespective of k0. If second moments exist for each Ti, then the variance-covariance matrix of T is given by Σ = V(T) = MRM, where M = diag(σ ⊙ κ), κ = (κ1, . . . , κd)′, and κi =

  • ki/(ki − 2),

i = 1, . . . , d. In particular, E[Ti] = µi and V(Ti) = σ2

i κ2 i .

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-9
SLIDE 9

FaK Parameter k0

While the marginals are not influenced by k0, its value does alter the dependency structure of the distribution. Via comparison with scatterplots of actual financial returns data,

  • ne might speculate that only values of k0 ≥ maxi ki, i = 1, . . . , d,

are of interest, and one could entertain just setting k0 = maxi ki. In the empirical comparison, we indeed find that ˆ k0 is very close to max(ˆ k1, ˆ k2) when it is freely estimated jointly with all other model parameters; and its attained maximum log-likelihood is statistically indistinguishable from that of the model which imposes the restriction k0 = maxi ki.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-10
SLIDE 10

Effect of Parameter k0

−80 −60 −40 −20 20 40 60 80 −15 −10 −5 5 10 15 R = I, k0 = 1, k1 = 2, k2 = 4 −80 −60 −40 −20 20 40 60 80 −15 −10 −5 5 10 15 R = I, k0 = 3, k1 = 2, k2 = 4 −80 −60 −40 −20 20 40 60 80 −15 −10 −5 5 10 15 R = I, k0 = 4, k1 = 2, k2 = 4 −80 −60 −40 −20 20 40 60 80 −15 −10 −5 5 10 15 R = I, k0 = 10, k1 = 2, k2 = 4 Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-11
SLIDE 11

FaK with Asymmetric Marginals: AFaK

Introduce noncentrality parameters θi ∈ R, i = 1, 2, . . . , d, so that, with φk,θ(x) and Φk,θ(x) the pdf and cdf of the noncentral t distribution at x ∈ R, fX(x; k, R, θ) is ψ

  • Φ−1

k0,θ0(Φk1,θ1(x1)), . . . , Φ−1 k0,θ0(Φkd,θd(xd)); R, k0

  • d
  • i=1

φki,θi(xi), still in conjunction with (2), and with θ0 = 0. The location-scale variant fT(y; k, µ, σ, R, θ) is analogous to (2), and we write T ∼ AFaK(k, µ, σ, R, θ), for asymmetric FaK. We have V(T) = MRM, where M = diag(σ ⊙ v1/2), where v = (V(S1), . . . , V(Sd))′, for Si = (Ti − µi)/σi ∼ t′(ki, θi, 0, 1), with the variance of Si computed from E

  • Si
  • = θi

ki 2 1/2 Γ(ki/2 − 1/2) Γ(ki/2) , ki > 1, (3) E[S2

i ] = [ki/(ki − 2)](1 + θ2 i ) for ki > 2, V(S) = E[S2] − (E[S])2.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-12
SLIDE 12

Examples of Bivariate AFaK

−10 −5 5 10 −10 −5 5 10 k0 = k1 = k2 = 3, θ0 = 0, θ1 = −0.7, θ2 = −0.7 r = 0 −10 −5 5 10 −10 −5 5 10 k0 = 4, k1 = 1.5, k2 = 3.5, θ0 = 0, θ1 = −0.7, θ2 = −0.7 r = 0 −10 −5 5 10 −10 −5 5 10 k0 = k1 = k2 = 3, θ0 = −0.7, θ1 = −0.7, θ2 = −0.7 r = 0 −10 −5 5 10 −10 −5 5 10 k0 = 4, k1 = 1.5, k2 = 3.5, θ0 = −0.7, θ1 = −0.7, θ2 = −0.7 r = 0

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-13
SLIDE 13

Bivariate Example: BoA and Wal-Mart

FaK loglika k0 k1 k2 µ1 µ2 scale MLE −7086.1 3.975 1.464 3.873 0.0331 0.0027 0.857 std err Hess (0.497) (0.067) (0.344) (0.026) (0.028) (0.028) std err NPB (0.562) (0.058) (0.376) (0.025) (0.031) (0.024) std err PB (0.526) (0.068) (0.349) (0.026) (0.028) (0.029) AFaK k0 k1 k2 θ1 θ2 µ1 µ2 scale MLE −7079.1 3.903 1.472 3.879 −0.165 0.136 0.190 −0.192 0.856 std err Hess (0.481) (0.068) (0.344) (0.055) (0.094) (0.057) (0.119) (0.028) std err NPB (0.551) (0.059) (0.374) (0.060) (0.094) (0.062) (0.115) (0.024) std err PB (0.486) (0.081) (0.330) (0.049) (0.096) (0.051) (0.122) (0.030) S-L 1 v1 v2 µ1 µ2 scale MLE −7092.2 1.618 3.731 0.0275 −0.0068 0.922 std err Hess (0.074) (0.306) (0.027) (0.028) (0.029) std err NPB (0.078) (0.317) (0.026) (0.029) (0.027) std err PB (0.082) (0.337) (0.035) (0.036) (0.033) S-L 2 v1 v2 µ1 µ2 scale MLE −7142.7 1.601 4.813 0.0313 −0.0057 0.926 std err Hess (0.077) (0.491) (0.027) (0.030) (0.030) std err NPB (0.072) (0.508) (0.027) (0.028) (0.027) std err PB (0.067) (0.498) (0.024) (0.024) (0.029)

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-14
SLIDE 14

Data Scatterplot and the Fitted Densities

−10 −5 5 10 −10 −5 5 10 Bank of America Wal−Mart Fitted Shaw−Lee Model #1 −10 −5 5 10 −10 −5 5 10 Bank of America Wal−Mart Fitted Shaw−Lee Model #2 −10 −5 5 10 −10 −5 5 10 Bank of America Wal−Mart Fitted FaK Distribution −10 −5 5 10 −10 −5 5 10 Bank of America Wal−Mart Fitted AFaK Distribution

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-15
SLIDE 15

Two-Step Unconditional Estimation

We propose the following, essentially obvious, two-step procedure:

1

The three (or four) parameters ki, µi and σi (and θi) based on the univariate data set corresponding to the ith variable are estimated via maximum likelihood, i = 1, . . . , d. Observe that only three (or four) parameters need to be estimated simultaneously. Set ˆ k0 to maxi(ˆ ki).

2

Parameter R is estimated as the sample correlation matrix, R, or a shrinkage-based variant of it; see below.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-16
SLIDE 16

Remarks on: Two-Step Unconditional Estimation

  • 1. Unlike with maximum likelihood, application of this two step

procedure (in particular, the second step) only makes sense if min(ki) > 2. Have a solution... In the more realistic case that a conditional model via GARCH will be used, the conditional tail index ki is, in all probability, larger than two.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-17
SLIDE 17

Remarks on: Two-Step Unconditional Estimation

  • 2. Observe that step 1 will be extremely fast in the symmetric (FaK)

case, as only the usual univariate Student’s t density is required for the likelihood. For the asymmetric case, computing the density of the noncentral t distribution at each point involves either a univariate numeric integration, or evaluation of an infinite sum, and will thus be massively slower than computing the usual Student’s t distribution. This bottleneck can be overcome by using the second-order closed-form saddlepoint approximation to the density, which is extremely accurate (even, and especially, in the tails) and about 1200 times faster to compute. The derivation and relevant formulae are given in Broda and Paolella (2007) and the references therein. Crucially, there is virtually no difference in the estimates when using either the true or the saddlepoint density.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-18
SLIDE 18

Remarks on: Two-Step Unconditional Estimation

  • 3. It is well-known that shrinkage of the estimated covariance matrix in

the traditional portfolio optimization setup is highly beneficial. They could be shrunk towards their mean value. We can express this algebraically as, with a = 1′˜ R − I

  • 1/ [d (d − 1)] and 1 a d-length

column of ones,

  • R = (1 − sc)

R + sc

  • (1 − a)I + a11′

. (4)

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-19
SLIDE 19

Remarks on: Two-Step Unconditional Estimation

  • 4. One might consider robust estimation of the covariance matrix, say
  • Σ, from which

R = D−1 Σ D−1 can be computed, where D = diag(σ), and the scale terms σi are estimated in step one.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-20
SLIDE 20

Simulation to Assess Quality: FaK

Consider the tri-dimensional FaK distribution with parameters k1 = 3, k2 = 5, k3 = 7, k0 = max(ki) = 7, µ1 = 0.2, µ2 = 0, µ3 = −0.2, σ1 = σ2 = σ3 = 2, R12 = 0.25, R13 = 0.5, R23 = 0.75, (and θ = 0). We assess, via simulation, the differences in the quality (bias and spread) of the estimated parameters when using joint maximum likelihood and the two-step procedure. This is conducted for the sample size T = 250, and based on 500 replications.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-21
SLIDE 21

Simulation to Assess Quality: FaK

−4 −2 2 4 6 8 10 12 14

k0 k1 k2 k3

MLE Parameter Bias using T=250 Observations −0.5 −0.4 −0.3 −0.2 −0.1 0.1 0.2 0.3 0.4 0.5

µ1 µ2 µ3 σ1 σ2 σ3 R12 R13 R23

MLE Parameter Bias using T=250 Observations −4 −2 2 4 6 8 10 12 14

k0 k1 k2 k3

2−Step Parameter Bias using T=250 Observations −0.5 −0.4 −0.3 −0.2 −0.1 0.1 0.2 0.3 0.4 0.5

µ1 µ2 µ3 σ1 σ2 σ3 R12 R13 R23

2−Step Parameter Bias using T=250 Observations Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-22
SLIDE 22

Simulation to Assess Quality: FaK

The average time for joint parameter estimation of this FaK model (using a 3GHz PC, Matlab) is 34 seconds. The two-step method requires 0.050 seconds. Observe that, by design, the required estimation time for the two-step method increases linearly in d, but will increase exponentially in d for the joint parameter estimation. Furthermore, as the number of parameters to be simultaneously estimated increases, the problems associated with avoiding inferior local maxima of the log-likelihood become exacerbated.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-23
SLIDE 23

Simulation to Assess Quality: AFaK

We use the tri-dimensional AFaK distribution with the parameters as given above, but additionally take the noncentrality parameters to be θ1 = −0.2 , θ2 = 0, θ3 = 0.2. A distinction can be seen for parameters k3, θ3 and µ3, for which the joint MLE does indeed perform noticeably better, albeit not demonstrably so. With regard to estimation time, using the same computing platform mentioned above, joint maximum likelihood (AFaK for d = 3 and T = 250) takes, on average, 14.0 minutes, while the two-step procedure, using the saddlepoint approximation, takes on average 0.82 seconds, i.e., it is over 1,000 times faster.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-24
SLIDE 24

Simulation to Assess Quality: FaK

−6 −4 −2 2 4 6 8 10 12 14

k0 k1 k2 k3 µ1 µ2 µ3

MLE Parameter Bias using T=250 Observations −2 −1.5 −1 −0.5 0.5 1 1.5 2

θ1 θ2 θ3 σ1 σ2 σ3 R12 R13 R23

MLE Parameter Bias using T=250 Observations −6 −4 −2 2 4 6 8 10 12 14

k0 k1 k2 k3 µ1 µ2 µ3

2−Step Parameter Bias using T=250 Observations −2 −1.5 −1 −0.5 0.5 1 1.5 2

θ1 θ2 θ3 σ1 σ2 σ3 R12 R13 R23

2−Step Parameter Bias using T=250 Observations Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-25
SLIDE 25

CCC-GARCH

We extend the model to CCC-GARCH. The 2-step procedure applies. Each marginal distribution is a (noncentral) Student’s t with its own degree of freedom (and asymmetry parameter). They are linked via the t-copula as the (A)FaK distribution, but such that each univariate time series is endowed with a time-varying scale term via a t′-(I)GARCH model. The correlation matrix is estimated from the multivariate set of t′-(I)GARCH residuals and is not time-varying. We will refer to this as the (A)FaK-(I-)CCC model.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-26
SLIDE 26

Density Forecasting

Good in-sample fit is nice... Good simulation results are good... but what counts is the ability to forecast. We forecast the entire multivariate density. The measure of interest is what we will call the (realized) predictive log-likelihood, given by πt(M, v) = log f M

t|It−1(yt;

ψ), (5) where v denotes the size of the rolling window used to determine It−1 (and the set of observations used for estimation of ψ) for each time point t.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-27
SLIDE 27

Density Forecasting

We suggest to use what we refer to as the normalized sum of the realized predictive log-likelihood, given by Sτ0,T(M, v) = 1 (T − τ0) d

T

  • t=τ0+1

πt(M, v), (6) where d is the dimension of the data. It is thus the average realized predictive log-likelihood, averaged over the number of time points used and the dimension of the random variable under study. This facilitates comparison over different d, τ0 and T. In our setting, we use the d = 30 daily return series of the DJ-30, with v = τ0 = 500, which corresponds to two years of data, and T = 1, 945.

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-28
SLIDE 28

Forecast Cusum Plots

To nicely illustrate the differences among the models and to contrast their sources of forecast improvement, plot difference of the cumulative sum (cusum) of the πt(Mi, 500), for two models i, and does so for 3 combinations

  • f interest.

600 800 1000 1200 1400 1600 1800 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 Cusum Difference Plots FaK−CCC minus CCC−GARCH FaK minus MVT FaK−CCC minus FaK

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-29
SLIDE 29

Shrinkage for the FaK-CCC Model

0.05 0.1 0.15 0.2 0.25 0.3 −1.526 −1.524 −1.522 −1.52 −1.518 −1.516 −1.514 −1.512 −1.51

FaK S500,1945(Ms, 500) Correlation Shrinkage

IGARCH: Shrinkage to zero IGARCH: Shrinkage to mean GARCH: Shrinkage to zero GARCH: Shrinkage to mean

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-30
SLIDE 30

Shrinkage for the FaK-CCC Model

0.05 0.1 0.15 0.2 0.25 0.3 −1.526 −1.524 −1.522 −1.52 −1.518 −1.516 −1.514 −1.512 −1.51

FaK S500,1945(Ms, 500) Correlation Shrinkage

IGARCH: Shrinkage to zero IGARCH: Shrinkage to mean GARCH: Shrinkage to zero GARCH: Shrinkage to mean 0.2 0.4 0.6 0.8 1 −1.526 −1.524 −1.522 −1.52 −1.518 −1.516 −1.514 −1.512 −1.51

FaK S500,1945(Ms, 500) DF Shrinkage

k* = median ki k* = 1 k* = 2 k* = 3 k* = 4 0.1 0.2 0.3 0.4 0.5 0.6 0.7 −1.526 −1.524 −1.522 −1.52 −1.518 −1.516 −1.514 −1.512 −1.51

FaK S500,1945(Ms, 500) Scale Shrinkage

GARCH IGARCH 0.2 0.4 0.6 0.8 1 −1.516 −1.515 −1.514 −1.513 −1.512 −1.511 −1.51

FaK S500,1945(Ms, 500) Mean Shrinkage

µ* = Median µi µ* = 0 Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-31
SLIDE 31

Asymmetry: Shrinkage for the AFaK-CCC Model

0.2 0.4 0.6 0.8 1 −1.516 −1.515 −1.514 −1.513 −1.512 −1.511 −1.51

AFaK S500,1945(Ms, 500) Shrinkage

Degrees of Freedom (to 3) Correlation (to mean) Scale (to mean) Asymmetry (to zero)

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-32
SLIDE 32

How Much Does Asymmetry Help? AFaK vs. FaK

500 1000 1500 2000 −100 −80 −60 −40 −20 20 Cusum Difference of AFaK and FaK No shrinkage Optimal Shrinkage

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-33
SLIDE 33

How Much Does Shrinkage Help?

500 1000 1500 2000 50 100 150 200 250 300 Cusum Difference of Shrinkage and No Shrinkage FaK AFaK

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-34
SLIDE 34

Estimating (Time Varying) k0

Now use a three-step procedure, with the final step allowing the incorporation of a time-varying copula into the model, by estimating the value of k0, conditional on all other model parameters.

500 700 900 1100 1300 1500 1700 1900 10 20 30 40 50 60 70 80 Conditional FaK Estimate of k0 0.2 0.4 0.6 0.8 1 −1.52 −1.519 −1.518 −1.517 −1.516 −1.515 −1.514 −1.513 −1.512 −1.511 −1.51

AFaK S500,1945(Ms, 500) k0 Shrinkage

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-35
SLIDE 35

Weighted Likelihood

The model is wrong w.p.1, but has value as a simplified filter, so use weighted likelihood to put more weight on recent observations. We use wt ∝ (T − t + 1)ρ−1, where the single parameter ρ dictates the shape of the weighting function, and the actual weights are just re-normalized such that they sum to one. When researchers choose a window length (usually an arbitrary multiple of 100), an implicit decision is made to weight all the

  • bservations in the window equally likely, and observations which

came (right) before it receive zero weight. Such a scheme should appear rather crude and primitive! The procedure applied to step one helps significantly with univariate density forecasting, but not with d = 30 assets. However, it does help with the correlation matrix. The weighted correlation matrix is formed in a natural way by taking the sample means, covariances, and correlations for assets i and j as

mi = T −1

T

  • t=1

wtri,t, vi,j = T −1

T

  • t=1

wt(ri,t −mi)(rj,t −mj), Ri,j = vi,j √vi,ivj,j ,

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails

slide-36
SLIDE 36

Weighted Likelihood for the Correlation Matrix

0.5 0.6 0.7 0.8 0.9 1 −1.5155 −1.515 −1.5145 −1.514 −1.5135 −1.513 −1.5125 −1.512 −1.5115 −1.511 Weighted Likelihood for FaK−CCC Correlation

Marc S. Paolella ALRIGHT: Asymmetric LaRge-Scale (I)GARCH with Hetero-Tails