Dynamic Equicorrelation Bryan Kelly (Joint work with Rob Engle) - - PowerPoint PPT Presentation

dynamic equicorrelation
SMART_READER_LITE
LIVE PREVIEW

Dynamic Equicorrelation Bryan Kelly (Joint work with Rob Engle) - - PowerPoint PPT Presentation

Dynamic Equicorrelation Bryan Kelly (Joint work with Rob Engle) The Problem with Covariances... Since early 80s, attempts have been made to estimate multivariate GARCH models Specifications so complex that traditional models are difficult


slide-1
SLIDE 1

Dynamic Equicorrelation

Bryan Kelly (Joint work with Rob Engle)

slide-2
SLIDE 2

The Problem with Covariances...

Since early ‘80s, attempts have been made to estimate multivariate GARCH models Specifications so complex that traditional models are difficult to estimate for more than a few assets In finance, we want to work with large cross sections Portfolio selection Derivatives (basket options, CDOs, etc.) Risk Management

slide-3
SLIDE 3

DCC: Problem Solved?

Engle (2002) introduces Dynamic Conditional Correlation Massive parameter reduction: an entire matrix evolution can be described by two parameters (sort of...) Computational burden even for a few parameters: must calculate inverse and determinant of N x N matrices many thousands of times in likelihood maximization A pain for a moderate systems Infeasible for very large systems? Other concerns Storing correlation matrices Digesting massive output: N(N-1)/2 series

slide-4
SLIDE 4

Dynamic Equicorrelation (DECO)

Where to begin? Simplify the problem: All assets share the same correlation each period, but this “equicorrelation” varies through time What does it buy? Analytic inverse and determinant likelihood simple to compute for system of any dimension Entire correlation evolution summarized by a single time series

slide-5
SLIDE 5

Outline

Model and theoretical properties DECO amid extant covariance models Monte Carlo evaluation Correlations among the S&P 500 Equicorrelation in action

slide-6
SLIDE 6

Introducing DECO

slide-7
SLIDE 7

Defining Equicorrelation

An equicorrelation matrix takes the form Lemma 1: Invertible and positive definite if and only if

Rt = (1 − ρt)In + ρtJn =     1 − ρt · · · ... . . . 1 − ρt     +     ρt ρt · · · ρt ... . . . ρt    

R−1

t

= 1 1 − ρt In + −ρt (1 − ρt)(1 + [n − 1]ρt)Jn det(Rt) = (1 − ρt)n−1(1 + [n − 1]ρt). ρt ∈ ( −1 n − 1, 1)

slide-8
SLIDE 8

The Model

rt, n x 1 vector, unit variance, correlations Rt DECO is born from the DCC process Average pairwise DCC correlations

RDCC

t

= ˜ Qt

− 1

2 Qt ˜

Qt

− 1

2

RDECO

t

= (1 − ρt)In + ρtJn×n

ρt = 1 n(n − 1)

  • ι′RDCC

t

ι − n

  • =

2 n(n − 1)

  • i=j,i>j

qi,j,t √qi,i,tqj,j,t

Qt = ¯ Q(1 − α − β) + α ˜ Qt−1

1 2 rt−1r′

t−1

˜ Qt−1

1 2 + βQt−1.

slide-9
SLIDE 9

The Model

Assumption 1: Theorem 1: Correlation matrices generated by every realization of a DECO process are p.d. and mean reverting

¯ Q is p.d., α + β < 1, α > 0, β > 0.

slide-10
SLIDE 10

Estimation

Gaussian (Quasi-) Maximum Likelihood Assume returns are conditionally normal Log likelihood can be decomposed into Important theorem: two-stage estimator will be consistent!

L = − 1 T

  • t
  • log |Dt|2 + ˜

r′

tD−2 t

˜ rt − r′

trt

  • − 1

T

  • t
  • log |Rt| + r′

tR−1 t rt

  • ˜

rt|t−1 ∼ N(0, Ht), Ht = DtRtDt, rt ≡ D−1

t

˜ rt

slide-11
SLIDE 11

Estimation

Proceed in two easy steps

  • 1. Stock-by-stock GARCH models to “de-volatize”

returns

  • 2. Estimate DECO on standardized returns
slide-12
SLIDE 12

Data Is Non-Equicorrelated?

Have no fear, DECO will provide consistent estimates anyway Theorem 2: As long as DCC (a very general, non-equicorrelated covariance model) is a consistent estimator of correlations, DECO will be too How useful: arbitrary dimension DCC model can be estimated via DECO, this could be infeasible with DCC alone

slide-13
SLIDE 13

Block DECO

More flexible structure with the tractability and robustness of DECO Example: industry model - each industry has a single DECO parameter and each industry pair has a single cross-equicorrelation parameter

Rt =     (1 − ρ1,1,t)In1 · · · ... . . . (1 − ρK,K,t)InK     +     ρ1,1,tJn1 ρ1,2,tJn1×n2 · · · ρ2,1,tJn2×n1 ... . . . ρK,K,tJnK    

slide-14
SLIDE 14

Block DECO

Theorem 3: Two-block DECO has easy analytic inverses and determinants - thus as computationally feasible as DECO

R−1 = b1In1 b2In2

  • +

c1Jn1×n1 c3Jn1×n2 c3Jn2×n1 c2Jn2×n2

  • det(R) = (1 − ρ1,1)n1−1(1 − ρ2,2)n2−1

(1 + [n1 − 1]ρ1,1)(1 + [n2 − 1]ρ2,2) − ρ2

1,2n1n2

  • c1

= ρ1,1

  • ρ2,2(n2 − 1) + 1
  • − ρ2

1,2n2

(ρ1,1 − 1)

  • [ρ1,1(n1 − 1) + 1][ρ2,2(n2 − 1) + 1] − n1n2ρ2

1,2

  • c2

= ρ2,2

  • ρ1,1(n1 − 1) + 1
  • − ρ2

1,2n1

(ρ2,2 − 1)

  • [ρ1,1(n1 − 1) + 1][ρ2,2(n2 − 1) + 1] − n1n2ρ2

1,2

  • c3

= ρ1,2 n1n2ρ2

1,2 −

  • ρ1,1(n1 − 1) + 1
  • ρ2,2(n2 − 1) + 1
slide-15
SLIDE 15

Block DECO

For more blocks - difficult analytics, but cozily falls into composite likelihood framework More information in block composite likelihood than DCC version - potentially more efficient Theorem 4: like DECO, block DECO is a QML estimator of non-block-equicorrelated systems

slide-16
SLIDE 16

Digression: Using Composite Likelihood

Composite likelihood splices together likelihood

  • f subsets of assets

In DCC, a subset is a pair of stocks, i and j In Block DECO, a subset is all the stocks in pair of blocks i and j

Rt =    

Pairs of stocks Pairs of Blocks

slide-17
SLIDE 17

DECO Amid Current Literature

slide-18
SLIDE 18

Related Models

Two types of approaches to estimating time- varying covariances in large systems

  • 1. Factor GARCH (Engle, Ng, Rothschild 1992,

Engle 2008)

  • 2. Composite likelihood (Engle, Shephard,

Sheppard, 2008)

slide-19
SLIDE 19

Factor (Double) ARCH

Impose factor structure on system

rt = BFt + ǫt V ar(rt) = BV ar(Ft)B′ + V ar(ǫt)

slide-20
SLIDE 20

Factor (Double) ARCH

Impose factor structure on system Univariate GARCH dynamics in factors can generate time-varying correlations while keeping the residual covariance matrix constant through time.

rt = BFt + ǫt V ar(rt) = BV ar(Ft)B′ + V ar(ǫt) V art(rt) = BV art(Ft)B′ + V ar(ǫt) Ft ∼ GARCH

slide-21
SLIDE 21

Factor (Double) ARCH

Impose factor structure on system Univariate GARCH dynamics in factors and residuals can generate time-varying correlations while keeping the residual correlation matrix constant through time.

rt = BFt + ǫt V ar(rt) = BV ar(Ft)B′ + V ar(ǫt) Ft, ǫt ∼ GARCH V art(rt) = BV art(Ft)B′ + V art(ǫt)

slide-22
SLIDE 22

Factor (Double) ARCH

Benefits

  • 1. Feasibility for large numbers of assets - only

estimate n+K GARCH (regression) models

  • 2. Full likelihood, potential for efficiency

Limitations

  • 1. Don’t know factors? Don’t have data?
  • 2. Misspecification - dynamics in residual correlations?
slide-23
SLIDE 23

Composite Likelihood DCC

Estimate DCC for arbitrary cross sections Modeling any pair will give consistent estimates

  • f

Randomly select subset of all pairs - a partial likelihood technique

RDCC

t

= ˜ Qt

− 1

2 Qt ˜

Qt

− 1

2

Qt = ¯ Q(1 − α − β) + α ˜ Qt

1 2 rt−1r′

t−1 ˜

Qt

1 2 + βQt−1

α, β

slide-24
SLIDE 24

Composite Likelihood

Benefits

  • 1. Very flexible - no structural assumption required

Limitations

  • 1. Partial likelihood - never efficient
slide-25
SLIDE 25

Fundamental Tradeoff

Factor ARCH - strict structural assumptions Composite Likelihood - abandons useful information

slide-26
SLIDE 26

Where Does DECO Fit?

Flexibly balances this tradeoff Structural models (like factor structures) can be estimated as part of the first stage, and DECO can clean up correlation dynamics in residuals With blocks or first-stage structure, can be as well-specified as composite likelihood, yet more efficient

slide-27
SLIDE 27

Monte Carlos

slide-28
SLIDE 28

Performance: DECO as DGP

As a first check, we ask “How does DECO do when correctly specified?” Simulate DECO processes using various

  • 1. Time series dimensions
  • 2. Cross section sizes
  • 3. Parameter ( ) values

α, β

slide-29
SLIDE 29

!

Table 1: DECO as Generating Process

slide-30
SLIDE 30

Performance: DCC as DGP

“How does DECO do when incorrectly specified?” Simulate DCC processes Standard deviation of pairwise correlations large, ~0.33

slide-31
SLIDE 31

! Table 2: DCC as Generating Process

slide-32
SLIDE 32

Correlation Among the S&P 500

slide-33
SLIDE 33

S&P 500, 1995-2008

Stocks included if traded over entire sample and a member of S&P 500 at some point in that time Final count: 466 stocks

slide-34
SLIDE 34

Estimation

Model menu: Choose one of each... First-Stage Model

  • 1. Constant Factor
  • 2. CAPM
  • 3. Fama-French Three-Factor
  • 4. 10 Industry Factors

Second-Stage (Correlation) Model

  • 1. DECO
  • 2. 10-Block DECO
  • 3. DCC
slide-35
SLIDE 35

Using Composite Likelihood

Composite likelihood splices together likelihood

  • f subsets of assets

In DCC, a subset is a pair of stocks, i and j In Block DECO, a subset is all the stocks in pair of blocks i and j

slide-36
SLIDE 36

!

Table 3: Full Sample Results

slide-37
SLIDE 37

Interpretation

Intuitively, DECO will outperform DCC when there is a dominating component of pairwise correlations inducing all pairwise correlations to move together In this case, smoothing reduces noise without compromising structure

slide-38
SLIDE 38
slide-39
SLIDE 39
slide-40
SLIDE 40

Out-of-Sample Forecasts

slide-41
SLIDE 41

Out-of-Sample Hedging

Pre-estimation window, 1995-1999 Forecast one-day ahead, form minimum variance portfolios Calculate sample variance of portfolios Which model delivers lower variance? Re-estimate model parameters every 22 days

slide-42
SLIDE 42

(G)MV Portfolios

Solution to Markowitz problem:

ωGMV = 1 AΣ−1ι ωMV = C − qB AC − B2 Σ−1ι + qA − B AC − B2 Σ−1µ, A = ι′Σ−1ι B = ι′Σ−1µ C = µ′Σ−1µ

slide-43
SLIDE 43

A Twist: Varying Block Structure

No reason that best block structure for estimation should be best for your application Once estimated (with any model) can vary block structure ex post After we estimate each model, we will also look at how ex post changes in blocks affect hedges

slide-44
SLIDE 44

Table 4: S&P 500 O.S. Hedging

!

slide-45
SLIDE 45

Equicorrelation in Action

slide-46
SLIDE 46

Equicorrelation Appeal

Life in a one-factor world If cross sectional dispersion of βj is small and idiosyncrasies have similar variance each period, system well-described by Dynamic Equicorrelation S&P data, perhaps surprisingly, well described by this case

rj = βjrm + ej, σ2

j = β2 j σ2 m + vj

slide-47
SLIDE 47

Equicorrelation and Options (1)

Natural one-factor model: credit derivatives (esp. CDO’s) Key feature in loan portfolios: correlation in default risk Wall Street model: one correlation if firms in same industry, another in different industries. More broadly, to price CDO’s, an LHP assumption

  • ften made: Each loan has same var, the same covar

with market and the same idiosyncratic var. In fact, LHP implies equicorrelation

ρ = β2σ2

m

β2σ2

m + v .

slide-48
SLIDE 48

Equicorrelation and Options (2)

Dispersion trades: long option on a basket, short

  • ptions on components

With delta hedging, value of strategy depends solely on correlations. Let basket weights given by w, covariance matrix of components S, variance of basket is We only know about implied variance, not covariances - so assume all correlations are equal

σ2 = w′Sw ρ = σ2 − n

j=1 w2 js2 j

  • i=j wiwjsisj

. σ2 =

n

  • j=1

w2

js2 j + ρ

  • i=j

wiwjsisj

slide-49
SLIDE 49

Equicorrelation and Portfolio Choice

Elton and Gruber (1973): Averaging pairwise correlations can reduce estimation noise and deliver superior portfolios Ledoit and Wolf (2003, 2004): Bayesian shrinkage to equicorrelated target improves portfolios

slide-50
SLIDE 50

Conclusion

DECO: estimating covariance models of arbitrary dimension Consistent even when equicorrelation is violated Block DECO loosens structure yet retains simplicity and robustness Good descriptor of correlation in the S&P 500