Goodness-of-Fit Testing with Empirical Copulas Sami Umut Can John - - PowerPoint PPT Presentation

goodness of fit testing with empirical copulas
SMART_READER_LITE
LIVE PREVIEW

Goodness-of-Fit Testing with Empirical Copulas Sami Umut Can John - - PowerPoint PPT Presentation

Overview of Copulas Goodness-of-Fit Testing Scanning Goodness-of-Fit Testing with Empirical Copulas Sami Umut Can John Einmahl Roger Laeven EURANDOM August 29, 2011 Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas


slide-1
SLIDE 1

Overview of Copulas Goodness-of-Fit Testing Scanning

Goodness-of-Fit Testing with Empirical Copulas

Sami Umut Can John Einmahl Roger Laeven

EURANDOM

August 29, 2011

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-2
SLIDE 2

Overview of Copulas Goodness-of-Fit Testing Scanning

Overview of Copulas

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-3
SLIDE 3

Overview of Copulas Goodness-of-Fit Testing Scanning

Overview of Copulas

A bivariate copula C is a bivariate cdf defined on [0, 1]2 with uniform marginal distributions on [0, 1].

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-4
SLIDE 4

Overview of Copulas Goodness-of-Fit Testing Scanning

Overview of Copulas

A bivariate copula C is a bivariate cdf defined on [0, 1]2 with uniform marginal distributions on [0, 1]. More precisely, a function C : [0, 1]2 → [0, 1] is called a bivariate copula if

C(x, 0) = C(0, y) = 0 for any x, y ∈ [0, 1] C(x, 1) = x, C(1, y) = y for any x, y ∈ [0, 1] C(x2, y2) − C(x1, y2) − C(x2, y1) + C(x1, y1) ≥ 0 for any x1, x2, y1, y2 ∈ [0, 1] with x1 ≤ x2 and y1 ≤ y2

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-5
SLIDE 5

Overview of Copulas Goodness-of-Fit Testing Scanning

Sklar’s Theorem: Let H be a bivariate cdf with continuous marginal cdf’s H(x, ∞) = F(x), H(∞, y) = G(y). Then there exists a unique copula C such that H(x, y) = C(F(x), G(y)). (1) Conversely, for any univariate cdf’s F and G and any copula C, (1) defines a bivariate cdf H with marginals F and G.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-6
SLIDE 6

Overview of Copulas Goodness-of-Fit Testing Scanning

Sklar’s Theorem: Let H be a bivariate cdf with continuous marginal cdf’s H(x, ∞) = F(x), H(∞, y) = G(y). Then there exists a unique copula C such that H(x, y) = C(F(x), G(y)). (1) Conversely, for any univariate cdf’s F and G and any copula C, (1) defines a bivariate cdf H with marginals F and G. C captures the dependence structure of two random

  • variables. It is used for dependence modeling in finance

and actuarial science.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-7
SLIDE 7

Overview of Copulas Goodness-of-Fit Testing Scanning

Goodness-of-Fit Testing

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-8
SLIDE 8

Overview of Copulas Goodness-of-Fit Testing Scanning

Goodness-of-Fit Testing

Given a sample (X1, Y1), . . . , (Xn, Yn) from an unknown bivariate distribution H, with unknown continuous marginal distributions F and G, and a corresponding copula C, how can we decide if a given copula C0 or a given parametric family of copulas {Cθ, θ ∈ Θ} is a good fit for the sample?

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-9
SLIDE 9

Overview of Copulas Goodness-of-Fit Testing Scanning

Goodness-of-Fit Testing

Given a sample (X1, Y1), . . . , (Xn, Yn) from an unknown bivariate distribution H, with unknown continuous marginal distributions F and G, and a corresponding copula C, how can we decide if a given copula C0 or a given parametric family of copulas {Cθ, θ ∈ Θ} is a good fit for the sample? In other words, we would like to perform a hypothesis test about C, with a null hypothesis of the form C = C0 or C ∈ {Cθ, θ ∈ Θ}. For now, we consider the simple hypothesis (C = C0) only.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-10
SLIDE 10

Overview of Copulas Goodness-of-Fit Testing Scanning

Goodness-of-Fit Testing

Given a sample (X1, Y1), . . . , (Xn, Yn) from an unknown bivariate distribution H, with unknown continuous marginal distributions F and G, and a corresponding copula C, how can we decide if a given copula C0 or a given parametric family of copulas {Cθ, θ ∈ Θ} is a good fit for the sample? In other words, we would like to perform a hypothesis test about C, with a null hypothesis of the form C = C0 or C ∈ {Cθ, θ ∈ Θ}. For now, we consider the simple hypothesis (C = C0) only. A natural starting point for constructing goodness-of-fit tests is the so-called empirical copula.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-11
SLIDE 11

Overview of Copulas Goodness-of-Fit Testing Scanning

Note that we can write C(x, y) = H(F −1(x), G−1(y)), (x, y) ∈ [0, 1]2, with F −1(x) = inf{t ∈ R : F(t) ≥ x}, and similarly for G−1.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-12
SLIDE 12

Overview of Copulas Goodness-of-Fit Testing Scanning

Note that we can write C(x, y) = H(F −1(x), G−1(y)), (x, y) ∈ [0, 1]2, with F −1(x) = inf{t ∈ R : F(t) ≥ x}, and similarly for G−1. So a natural way of estimating the copula C is using the empirical copula Cn(x, y) = Hn(F −1

n (x), G−1 n (y)),

(x, y) ∈ [0, 1]2, with Hn(x, y) = 1 n

n

  • i=1

1{Xi ≤ x, Yi ≤ y}, Fn(x) = 1 n

n

  • i=1

1{Xi ≤ x}, Gn(y) = 1 n

n

  • i=1

1{Yi ≤ y}

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-13
SLIDE 13

Overview of Copulas Goodness-of-Fit Testing Scanning

It is known that the empirical copula process Dn(x, y) = √ n(Cn(x, y) − C(x, y)), (x, y) ∈ [0, 1]2 converges weakly in ℓ∞([0, 1]2) to a C-Brownian pillow, under the assumption that Cx(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < x < 1}, Cy(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < y < 1}.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-14
SLIDE 14

Overview of Copulas Goodness-of-Fit Testing Scanning

It is known that the empirical copula process Dn(x, y) = √ n(Cn(x, y) − C(x, y)), (x, y) ∈ [0, 1]2 converges weakly in ℓ∞([0, 1]2) to a C-Brownian pillow, under the assumption that Cx(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < x < 1}, Cy(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < y < 1}. A C-Brownian sheet W(x, y) is a mean zero Gaussian process with covariance function Cov[W(x, y), W(x′, y′)] = C(x∧x′, y∧y′), x, x′, y, y′ ∈ [0, 1].

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-15
SLIDE 15

Overview of Copulas Goodness-of-Fit Testing Scanning

It is known that the empirical copula process Dn(x, y) = √ n(Cn(x, y) − C(x, y)), (x, y) ∈ [0, 1]2 converges weakly in ℓ∞([0, 1]2) to a C-Brownian pillow, under the assumption that Cx(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < x < 1}, Cy(x, y) is continuous on {(x, y) ∈ [0, 1]2 : 0 < y < 1}. A C-Brownian sheet W(x, y) is a mean zero Gaussian process with covariance function Cov[W(x, y), W(x′, y′)] = C(x∧x′, y∧y′), x, x′, y, y′ ∈ [0, 1]. A C-Brownian pillow D(x, y) is a mean zero Gaussian process that is equal in distribution to the C-Brownian sheet W, conditioned on W(x, y) = 0 for any (x, y) ∈ [0, 1]2 \ (0, 1)2.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-16
SLIDE 16

Overview of Copulas Goodness-of-Fit Testing Scanning

We have D(x, y) = W(x, y) − Cx(x, y)W(x, 1) − Cy(x, y)W(1, y) − (C(x, y) − xCx(x, y) − yCy(x, y))W(1, 1).

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-17
SLIDE 17

Overview of Copulas Goodness-of-Fit Testing Scanning

We have D(x, y) = W(x, y) − Cx(x, y)W(x, 1) − Cy(x, y)W(1, y) − (C(x, y) − xCx(x, y) − yCy(x, y))W(1, 1). So we know the asymptotic distribution of the empirical copula process Dn(x, y) = √ n(Cn(x, y) − C(x, y)), and we can take a functional of Dn (such as the sup over [0, 1]2 or an appropriate integral) as a test statistic for a goodness-of-fit test.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-18
SLIDE 18

Overview of Copulas Goodness-of-Fit Testing Scanning

We have D(x, y) = W(x, y) − Cx(x, y)W(x, 1) − Cy(x, y)W(1, y) − (C(x, y) − xCx(x, y) − yCy(x, y))W(1, 1). So we know the asymptotic distribution of the empirical copula process Dn(x, y) = √ n(Cn(x, y) − C(x, y)), and we can take a functional of Dn (such as the sup over [0, 1]2 or an appropriate integral) as a test statistic for a goodness-of-fit test. Problem: The asymptotic distribution of Dn, and that of the test statistic, depends on C. We would like to have a distribution-free goodness-of-fit test.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-19
SLIDE 19

Overview of Copulas Goodness-of-Fit Testing Scanning

Scanning

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-20
SLIDE 20

Overview of Copulas Goodness-of-Fit Testing Scanning

Scanning

Idea: Transform Dn into another process, say Zn, whose asymptotic distribution is independent of C. Use an appropriate functional of the new process Zn as a test statistic for goodness-of-fit tests.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-21
SLIDE 21

Overview of Copulas Goodness-of-Fit Testing Scanning

Scanning

Idea: Transform Dn into another process, say Zn, whose asymptotic distribution is independent of C. Use an appropriate functional of the new process Zn as a test statistic for goodness-of-fit tests. We use E. Khmaladze’s “scanning” idea to transform D into a standard two-parameter Wiener process Z defined on [0, 1]2. The same transformation applied to Dn will then produce a process Zn that will, hopefully, converge to Z.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-22
SLIDE 22

Overview of Copulas Goodness-of-Fit Testing Scanning

Scanning

Idea: Transform Dn into another process, say Zn, whose asymptotic distribution is independent of C. Use an appropriate functional of the new process Zn as a test statistic for goodness-of-fit tests. We use E. Khmaladze’s “scanning” idea to transform D into a standard two-parameter Wiener process Z defined on [0, 1]2. The same transformation applied to Dn will then produce a process Zn that will, hopefully, converge to Z. Assumptions on C: Continuous first-order partial derivatives on [0, 1]2 \ {(0, 0), (0, 1), (1, 0), (1, 1)}, continuous second-order partial derivatives on (0, 1)2, strictly positive mixed partial Cxy on (0, 1)2, and more (to be determined).

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-23
SLIDE 23

Overview of Copulas Goodness-of-Fit Testing Scanning

Define a grid {(xi, yj) : 0 ≤ i, j ≤ N} on [0, 1]2 such that 0 = x0 < x1 < . . . < xN = 1 0 = y0 < y1 < . . . < yN = 1 and define filtrations Fx(xi) = σ{D(xh, yk) : 0 ≤ h ≤ i, 0 ≤ k ≤ N}, 0 ≤ i ≤ N Fy(yj) = σ{D(xh, yk) : 0 ≤ h ≤ N, 0 ≤ k ≤ j}, 0 ≤ j ≤ N

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-24
SLIDE 24

Overview of Copulas Goodness-of-Fit Testing Scanning

Define a grid {(xi, yj) : 0 ≤ i, j ≤ N} on [0, 1]2 such that 0 = x0 < x1 < . . . < xN = 1 0 = y0 < y1 < . . . < yN = 1 and define filtrations Fx(xi) = σ{D(xh, yk) : 0 ≤ h ≤ i, 0 ≤ k ≤ N}, 0 ≤ i ≤ N Fy(yj) = σ{D(xh, yk) : 0 ≤ h ≤ N, 0 ≤ k ≤ j}, 0 ≤ j ≤ N “Scan” the process D with respect to the filtration {Fx}: K (N)

1

(xi, yj) =

i−1

  • h=0
  • D(xh+1, yj) − D(xh, yj)

− E[D(xh+1, yj) − D(xh, yj)|Fx(xh)]

  • Can, Einmahl, Laeven

Goodness-of-Fit Testing with Empirical Copulas

slide-25
SLIDE 25

Overview of Copulas Goodness-of-Fit Testing Scanning

We compute K (N)

1

(xi, yj) = D(xi, yj) −

i−1

  • h=0

D(xh, yj) E[D(xh, yj)D(xh+1, yj)] E[D(xh, yj)2] − 1

  • Can, Einmahl, Laeven

Goodness-of-Fit Testing with Empirical Copulas

slide-26
SLIDE 26

Overview of Copulas Goodness-of-Fit Testing Scanning

We compute K (N)

1

(xi, yj) = D(xi, yj) −

i−1

  • h=0

D(xh, yj) E[D(xh, yj)D(xh+1, yj)] E[D(xh, yj)2] − 1

  • Making the x-partitioning finer and finer, we obtain

K1(x, yj) = D(x, yj) − x D(s, yj)ξ1(ds, yj) as a limit in probability, where ξ1 is an absolutely continuous measure whose density is determined by C and its first- and second-order derivatives.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-27
SLIDE 27

Overview of Copulas Goodness-of-Fit Testing Scanning

Next, we scan K1 with respect to the filtration {Fy} and take the limit as the y-partition gets finer and finer: K(x, y) = D(x, y) − x D(s, y)ξ1(ds, y) − y D(x, t)ξ2(x, dt) + x y D(s, t)ξ1(ds, t)ξ2(s, dt), where ξ2 is another absolutely continuous measure whose density is determined by C and its first- and second-order derivatives.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-28
SLIDE 28

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: K is a C-Brownian sheet.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-29
SLIDE 29

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: K is a C-Brownian sheet. Proof:

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-30
SLIDE 30

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: K is a C-Brownian sheet. Proof:

K is a mean zero Gaussian process since D is, so it remains to show that K has the covariance structure of a C-Brownian sheet.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-31
SLIDE 31

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: K is a C-Brownian sheet. Proof:

K is a mean zero Gaussian process since D is, so it remains to show that K has the covariance structure of a C-Brownian sheet. K has independent (rectangle) increments by construction, so it will suffice to show that Var[K(x, y)] = C(x, y) for all (x, y) ∈ [0, 1]2.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-32
SLIDE 32

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: K is a C-Brownian sheet. Proof:

K is a mean zero Gaussian process since D is, so it remains to show that K has the covariance structure of a C-Brownian sheet. K has independent (rectangle) increments by construction, so it will suffice to show that Var[K(x, y)] = C(x, y) for all (x, y) ∈ [0, 1]2. The variance of the K-increment over a small rectangle is “close” to the variance of the D-increment over the same rectangle, which is in turn “close” to the W-increment over the same rectangle.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-33
SLIDE 33

Overview of Copulas Goodness-of-Fit Testing Scanning

Corollary: The process Z(x, y) = x y 1

  • Cxy(s, t)

dK(s, t), (x, y) ∈ [0, 1]2 is a standard two-parameter Wiener process.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-34
SLIDE 34

Overview of Copulas Goodness-of-Fit Testing Scanning

Corollary: The process Z(x, y) = x y 1

  • Cxy(s, t)

dK(s, t), (x, y) ∈ [0, 1]2 is a standard two-parameter Wiener process. We have thus transformed the C-Brownian pillow D into a standard two-parameter Wiener process Z, through a two-step transformation: D → K → Z.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-35
SLIDE 35

Overview of Copulas Goodness-of-Fit Testing Scanning

We apply the same two-step transformation to Dn, i.e. we define Kn(x, y) = Dn(x, y) − x Dn(s, y)ξ1(ds, y) − y Dn(x, t)ξ2(x, dt) + x y Dn(s, t)ξ1(ds, t)ξ2(s, dt), Zn(x, y) = x y 1

  • Cxy(s, t)

dKn(s, t) for (x, y) ∈ [0, 1]2.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-36
SLIDE 36

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: Zn converges weakly to Z.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-37
SLIDE 37

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: Zn converges weakly to Z. As an intermediate step, we need to show that Kn converges weakly to K.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-38
SLIDE 38

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: Zn converges weakly to Z. As an intermediate step, we need to show that Kn converges weakly to K. Thus the asymptotical distribution of Zn is independent of C, and functionals of Zn can be used for goodness-of-fit tests.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-39
SLIDE 39

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: Zn converges weakly to Z. As an intermediate step, we need to show that Kn converges weakly to K. Thus the asymptotical distribution of Zn is independent of C, and functionals of Zn can be used for goodness-of-fit tests. Future: Construct actual test statistics and procedures for goodness-of-fit tests. Consider composite null hypotheses

  • f the form C ∈ {Cθ : θ ∈ Θ} and consider m-dimensional

copulas with m > 2.

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas

slide-40
SLIDE 40

Overview of Copulas Goodness-of-Fit Testing Scanning

Theorem: Zn converges weakly to Z. As an intermediate step, we need to show that Kn converges weakly to K. Thus the asymptotical distribution of Zn is independent of C, and functionals of Zn can be used for goodness-of-fit tests. Future: Construct actual test statistics and procedures for goodness-of-fit tests. Consider composite null hypotheses

  • f the form C ∈ {Cθ : θ ∈ Θ} and consider m-dimensional

copulas with m > 2. Thank you for listening!

Can, Einmahl, Laeven Goodness-of-Fit Testing with Empirical Copulas