GTL Regression: A Linear Model with Skewed and Thick-Tailed - - PowerPoint PPT Presentation

gtl regression a linear model with skewed and thick
SMART_READER_LITE
LIVE PREVIEW

GTL Regression: A Linear Model with Skewed and Thick-Tailed - - PowerPoint PPT Presentation

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion GTL Regression: A Linear Model with Skewed and Thick-Tailed Disturbances Wim Vijverberg & Takuya Hasebe CUNY-Graduate Center &


slide-1
SLIDE 1

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL Regression: A Linear Model with Skewed and Thick-Tailed Disturbances

Wim Vijverberg & Takuya Hasebe

CUNY-Graduate Center & Sophia University

24 February 2015

slide-2
SLIDE 2

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Linear model seems reasonable yi = X ′

i β + ǫ

Xs are plausibly exogenous ǫi is plausibly iid : So ... what are you waiting for? OLS is BLUE B stands for BEST L stands for LINEAR U stands for UNBIASED : So ... why write a paper on just such a model? OLS is not always BUE

slide-3
SLIDE 3

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

The Classical Linear Regression Model assumes yi = X ′

i β + ǫ

X is of full rank X ′X/n converges to a finite matrix (Grenander conditions) ǫi|X is iid(0, σ2) What is not covered? Poorly behaved Xs Higher moments of ǫ What does this paper offer? A flexible distributional assumption for ǫ An estimator that is efficient relative to OLS A test for normality that, if normality is rejected, points to this flexible distribution

slide-4
SLIDE 4

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Agenda Define this flexible distribution: Generalized Tukey Lambda (GTL) Define the GTL regression model Describe the GTL estimator of the linear model Discuss alternative approaches Monte Carlo results Applications Extensions

slide-5
SLIDE 5

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Representing statistical distribution: N(µ, σ2) Let u ∼ U[0, 1] Probability density function (PDF) φ(ǫ) = 1 (2πσ2)1/2 exp

  • − 1

2 ǫ − µ σ 2 Cumulative distribution function (CDF) u = Φ ǫ − µ σ

  • Link (quantile) function

ǫ = G(u) = µ + σΦ−1(u) The CDF has no analytical representation.

slide-6
SLIDE 6

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Tukey Lambda distribution and its extensions Tukey lambda distribution: defined by a link (quantile) function ǫ = G(u) = µ + σ

  • uλ − (1 − u)λ

, Its CDF is F(ǫ) = u = G −1(ǫ) In general, the CDF has no analytical solution Given u = G −1(ǫ), its PDF is

f (ǫ) = 1 G ′(G −1(ǫ)) G ′(u) = σλ

  • uλ−1 + (1 − u)λ−1
slide-7
SLIDE 7

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Tukey Lambda distribution and its extensions Tukey Lambda distribution: Tukey (1960) ǫ = G(u) = µ + σ

  • uλ − (1 − u)λ

Generalized Lambda Distribution GLD (GLD-RS: Ramberg and Schmeiser, 1974) ǫ = G(u) = µ + σ

  • uλ3 − (1 − u)λ4
  • Generalized Tukey Lambda Distribution GTL (GLD-FMKL:

Freimer, Mudholkar, Kollia, Lin, 1978) ǫ = Q(u) = µ + σ uα−δ − 1 α − δ − (1 − u)α+δ − 1 α + δ

slide-8
SLIDE 8

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

The impracticality of GLD The parameter space of the Generalized Lambda Distribution GLD has gaps ǫ = G(u) = µ + σ

  • uλ3 − (1 − u)λ4
  • Source: Karian et al. (1996)
slide-9
SLIDE 9

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Generalized Tukey Lambda distribution GTL is defined by a link (quantile) function ǫ = G(u) = µ + σ uα−δ − 1 α − δ − (1 − u)α+δ − 1 α + δ

  • ,

In general, its CDF has no analytical solution F(ǫ) = u = G −1(ǫ) Given u = G −1(ǫ), its PDF is

f (ǫ) = σ−1 uα−δ−1 + (1 − u)α+δ−1−1

GTL has no gaps in its parameter space

slide-10
SLIDE 10

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Flexibility of the GTL distribution GTL link function: ǫ = G(u) = µ + σ uα−δ − 1 α − δ − (1 − u)α+δ − 1 α + δ

  • where µ is a location parameter (but is not the mean)

where σ is a scale parameter (but is not the standard deviation) where α is a shape parameter controlling tail thickness where δ is a shape parameter controlling skewness GTL in canonical form: set µ = 0 and σ = 1. Standardized GTL: ˜ ǫ = ǫ−µǫ

σǫ

General GTL: r = τ1 + τ2ǫ

slide-11
SLIDE 11

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Flexibility of the GTL distribution The parameter α controls thickness of tails.

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

  • 3
  • 2
  • 1

1 2 3

α=0.35, δ=0.00 α=0.00, δ=0.00 α=-0.35, δ=0.00

The smaller α is, the thicker tails are. (δ is fixed at 0.)

slide-12
SLIDE 12

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Flexibility of the GTL distribution The parameter δ controls direction of skewness.

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

  • 3
  • 2
  • 1

1 2 3

α=0.00, δ=-0.33 α=0.00, δ=0.00 α=0.00, δ=0.33

δ = 0 (black) ⇒ symmetric δ > 0 (red) ⇒ left-skewed δ < 0 (blue) ⇒ right-skewed (α is fixed at 0.)

slide-13
SLIDE 13

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Flexibility of the GTL distribution A GTL distribution exhibits a variety of shapes.

0.00 0.10 0.20 0.30 0.40

  • 4
  • 2

2 4

Normal GTL α = 0.1436 δ = 0.0 α = -0.20 δ = -0.15 α = 0.25 δ = -0.22 α = 1.50 δ = 0.40

slide-14
SLIDE 14

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Flexibility of GTL distribution Relative to other distributions, GTL nests:

Logistic distribution α = 0, δ = 0 Exponential distribution α − δ → ∞ and α + δ = 0 Uniform distribution α = 1, δ = 0 ” α = 2, δ = 0 ” (α, δ) = (α, α − 1) for α → ∞ ” (α, δ) = (α, −α + 1) for α → ∞

and GTL approximately nests

Normal distribution α = 0.1436, δ = 0 Student’s t(r) for r ≥ 1 −0.8416 ≤ α ≤ 0.1436, δ = 0 Gumbel α = 0.1422, δ = −0.2290 χ2, Weibull, etc.

slide-15
SLIDE 15

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL Approximation of Well-known Distributions Normal Distribution

−4 −3 −2 −1 1 2 3 4 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 GTL Normal

slide-16
SLIDE 16

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL Approximation of Well-known Distributions Student’s t Distribution (degrees of freedom equal to 5)

−5 5 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 GTL t(5)

slide-17
SLIDE 17

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL Approximation of Well-known Distributions χ2 Distribution (degrees of freedom equal to 10)

5 10 15 20 25 30 35 40 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 GTL χ 2(10)

slide-18
SLIDE 18

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL Approximation of Well-known Distributions Weibull Distribution (1,2)

0.5 1 1.5 2 2.5 3 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 GTL Weibull(1,2)

slide-19
SLIDE 19

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Characteristics of the GTL distribution GTL link function: ǫ = G(u) = µ + σ uα−δ − 1 α − δ − (1 − u)α+δ − 1 α + δ

  • Range: ǫL ≤ ǫ ≤ ǫU

Lower bound:

ǫL = −∞ if α − δ ≤ 0 ǫL = −

1 α+δ if α − δ > 0

Upper bound:

ǫU = ∞ if α + δ ≤ 0 ǫU =

1 α+δ if α + δ > 0

The kth moment exists if min(α − δ, α + δ) > − 1

k

slide-20
SLIDE 20

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Characteristics of the GTL distribution Range: ǫL ≤ ǫ ≤ ǫU The kth moment exists if min(α − δ, α + δ) > − 1

k

1 1.25 0.5 0.75 1 1.25 Left bound on μ Left bound on σ Left bound on κ4

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 0.75 1 1.25

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 0.75 1 1.25 Left bound on εL finite Left bound on εR finite

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 0.75 1 1.25

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 0.75 1 1.25 Left bound on μ Left bound on σ Left bound on κ4

slide-21
SLIDE 21

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

The GTL regression model Model: yi = X ′

i β + σǫi

ǫi ∼ iid GTL(α, δ) The GTL distribution is stated in its canonical form. We allow (α, δ) ∈ R2 We do not impose E[ǫi] = 0 since E[ǫi] does not exist for all (α, δ). σ is merely a scaling parameter. If min(α − δ, α + δ) > − 1

2, the variance of σǫi exists.

If min(α − δ, α + δ) ≤ − 1

2, this variance does not exist.

slide-22
SLIDE 22

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

MLE estimation Define θ = (β′, α, δ)′. Define ui = G −1( 1

σ(yi − x′ i β))

The GTL estimator (GTLE) maximizes the following likelihood function: L(y, x, θ) = −n ln σ −

n

  • i=1

ln G ′(ui) = −n ln σ −

n

  • i=1

ln

  • uλ1−1

i

+ (1 − ui)λ2−1 . The function G −1 is numerically evaluated with a hybrid algorithm that mixes the bisection and Newton-Raphson algorithms (Press et al., 1986, Numerical Recipes).

slide-23
SLIDE 23

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Consistency and Asymptotic Normality Define θ = (β′, α, δ)′. Let θ0 be the true parameter vector. Let Θ be the parameter space. Assumptions A.1 Θ is compact with σ > 0. A.2 (α, δ) are such that α − δ < 1

2 and α + δ < 1 2.

A.3 θ0 ∈ int(Θ). A.4 (i) x ∈ X ⊂ Rk. (ii) x is weakly exogenous. (iii) x has finite moments up to the fourth order. (iv) E[xx′] has full rank. A.5 (i) ǫ is independently and identically GTL(α, δ)-distributed, with the GTL distribution stated in canonical form. (ii) ǫ is independent of x.

slide-24
SLIDE 24

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Consistency and Asymptotic Normality Theorem Given Assumptions 1, 3, 4, and 5, ˆ θ is a consistent estimator of θ0 for all θ0 ∈ Θ except for those θ0 for which (α0, δ0) = (1, 0) or (α0, δ0) = (2, 0). Theorem Given the linear regression model yi = X ′

i β + σǫi and Assumptions

1-5, ˆ θ is asymptotically normally distributed N(θ0, V0), where V0 = n−1A0 is estimated as V (ˆ θ) = −

  • n
  • i=1

∇θθℓi(ˆ θ) −1 where ℓi(θ) = ln gy(y|xi; θ) and gy(y|xi; θ) is the conditional density of y.

slide-25
SLIDE 25

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Consistency and Asymptotic Normality The additional assumption imposed to obtain asymptotic normality is Assumption 2: A.2 (α, δ) are such that α − δ < 1

2 and α + δ < 1 2.

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 0.75 1

  • 1.25
  • 1
  • 0.75
  • 0.5
  • 0.25

0.25 0.5 Left bound on μ Left bound on σ Right bound on Asy.N.

δ α

slide-26
SLIDE 26

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

OLS vs GTL OLS is equivalent to quasi-MLE assuming iid N(0, σ2) disturbances FOC under QMLE-normal yield

n

  • i=1

˜ ǫixi = 0 FOC under MLE-GTL yield

n

  • i=1

1 σ G ′′(ui) (G ′(ui))2 xi = 0 ≡

n

  • i=1

w(˜ ǫi)˜ ǫixi ≡

n

  • i=1

ψ(˜ ǫi)xi where ui = G −1(σǫ˜ ǫi + µǫ).

slide-27
SLIDE 27

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

OLS vs GTL ψ(˜ ǫ) for selected values of α and δ

  • 10
  • 8
  • 6
  • 4
  • 2

2 4 6 8 10

  • 5
  • 3
  • 1

1 3 5 OLS α = 0.1436, δ = 0

Comparing N(0, 1) with GTL(0.1436, 0)

slide-28
SLIDE 28

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

OLS vs GTL ψ(˜ ǫ) for selected values of α and δ

  • 5
  • 4
  • 3
  • 2
  • 1

1 2 3 4 5

  • 5
  • 3
  • 1

1 3 5 OLS α = 0 α = -0.15 α = -0.30 α = -0.45

Increasingly thick tails: δ = 0

slide-29
SLIDE 29

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

OLS vs GTL ψ(˜ ǫ) for selected values of α and δ

  • 10
  • 6
  • 2

2 6 10

  • 5
  • 3
  • 1

1 3 5 OLS δ = 0.15 δ = 0 δ = -0.15

Thin tails with skewness: α = 0.30

slide-30
SLIDE 30

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

OLS vs GTL ψ(˜ ǫ) for selected values of α and δ

  • 5
  • 4
  • 3
  • 2
  • 1

1 2 3 4 5

  • 5
  • 3
  • 1

1 3 5 OLS δ = 0.15 δ = 0 δ = -0.15

Thick tails with skewness: α = −0.30

slide-31
SLIDE 31

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Test for normality – even when running OLS Wald test: general GTL(α, δ) model Likelihood ratio test: general GTL(α, δ) + restricted GTL(0.1436, 0) models Lagrange multiplier test: restricted GTL(0.1436, 0) model Vuong test: general GTL(α, δ) + “general” N(0, σ2) models

slide-32
SLIDE 32

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Test for normality – even when running OLS Lagrange multiplier test: restricted GTL(0.1436, 0) model Could we insert OLS residuals into the LM formula instead of estimating a restricted GTL(0.1436, 0) model? Range violations may occur: p(|ǫ| > 4.819|ǫ ∼ N(0, 1)) = 1.44 × 10−6. Thus, the probability of any violation in sample of n = 5000 equals 0.0072. (α, δ) = (0.1436, 0) vs N(0, σ2): power = 0.074 (α, δ) = (0.1436, 0) vs (α, δ) = (0.10, 0): power = 0.849 (α, δ) = (0.1436, 0) vs (α, δ) = (0.20, 0): power = 0.995 (α, δ) = (0.1436, 0) vs (α, δ) = (0.1436, 0.02): power = 0.770

slide-33
SLIDE 33

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Prediction Model for i = 1, . . . , n: yi = X ′

i β + ǫi ≡ µi + ǫi where we impose E[ǫj] = 0

Forecast type 1 for j > n: ˜ µj = ˜ X ′

j ˆ

βGTL ˜ µj ∼ AN( ˜ X ′

j β, ˜

X ′

j Var( ˆ

βGTL) ˜ Xj) Forecast type 2 for j > n: ˜ yj = ˜ X ′

j ˆ

βGTL ˜ ej ≡ ˜ yj − yj = ˜ X ′

j (ˆ

βGTL − β) − ǫj ˜ ej ∼ AN( ˜ X ′

j β, ˜

X ′

j Var(ˆ

βGTL) ˜ Xj) + GTL(ˆ α, ˆ δ)

slide-34
SLIDE 34

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results Data generating process: yi = β1 + β2x2i + β3x3i + σǫi for i = 1, . . . , n n = 250 or 5000 x2i is a standard normal random variate x3i is a χ2(5) random variate that is standardized to have mean 0 and variance x2i and x3i are independent ǫi is a canonical GTL(α, δ) variate for various combinations of α and δ. β1 = β2 = β3 = 1 Scaling parameter σ is chosen such that the range of the GTL distribution from the 0.1% quantile to the 99.9% quantile has the same length as that of a normal N(0, 2) distribution.

slide-35
SLIDE 35

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results

DGP RMSE of OLS RMSE of GTLE α δ σ β2 β3 β1 β2 β3 β1 A: GTL as an approximation of the standard normal distribution, N = 250 0.1436 0.00 1.188 0.107 0.110 0.110 0.108 0.111 0.126 B: Various GTL distributions, small sample, N = 250 0.33

  • 0.10

1.477 0.106 0.108 0.199 0.089 0.094 0.132 0.33 0.00 1.508 0.107 0.109 0.109 0.097 0.101 0.133 0.33 0.10 1.477 0.106 0.108 0.200 0.091 0.093 0.132

  • 0.33
  • 0.10

0.454 0.133 0.144 0.255 0.066 0.068 0.066

  • 0.33

0.00 0.482 0.118 0.122 0.124 0.071 0.073 0.071

  • 0.33

0.10 0.454 0.127 0.126 0.239 0.067 0.069 0.067

  • 0.67
  • 0.25

0.136 1.737 1.667 1.932 0.025 0.026 0.025

  • 0.67

0.00 0.202 0.336 0.337 0.341 0.038 0.040 0.037

  • 0.67

0.25 0.136 1.046 0.823 1.126 0.025 0.026 0.025

  • 1.00
  • 0.50

0.024 112.585 89.227 103.243 0.005 0.005 0.005

  • 1.00

0.00 0.079 2.503 2.217 2.408 0.018 0.019 0.018

  • 1.00

0.50 0.024 39.742 27.930 35.099 0.005 0.005 0.005

slide-36
SLIDE 36

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results QQ plots of OLS estimators of β2 and β3, n = 250

0.4 0.6 0.8 1 1.2 1.4 1.6 0.6 0.8 1 1.2 1.4 OLS estimate of β2 Normal quantile 95% upper 95% lower β2 45 degree 1 0.4 0.6 0.8 1 1.2 1.4 1.6 0.6 0.8 1 1.2 1.4 OLS estimate of β3 Normal quantile 95% upper 95% lower β3 45 degree 1

ˆ β2 for (α, δ) = (0.1436, 0) ˆ β3 for (α, δ) = (0.1436, 0)

slide-37
SLIDE 37

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results QQ plots of OLS estimators of β2 and β3, n = 250

  • 0.2

0.2 0.6 1 1.4 1.8 0.5 0.7 0.9 1.1 1.3 1.5 OLS estimate of β2 Normal quantile 95% upper 95% lower β2 45 degree 1 0.5 1 1.5 2 2.5 0.5 0.7 0.9 1.1 1.3 1.5 OLS estimate of β3 Normal quantile 95% upper 95% lower β3 45 degree 1

ˆ β2 for (α, δ) = (−0.33, 0) ˆ β3 for (α, δ) = (−0.33, 0)

slide-38
SLIDE 38

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results QQ plots of OLS estimators of β2 and β3, n = 250

  • 35
  • 25
  • 15
  • 5

5 15 25

  • 5
  • 3
  • 1

1 3 5 7 OLS estimate of β2 Normal quantile 95% upper 95% lower β2 45 degree 1

  • 25
  • 15
  • 5

5 15 25

  • 5
  • 3
  • 1

1 3 5 7 OLS estimate of β3 Normal quantile 95% upper 95% lower β3 45 degree 1

ˆ β2 for (α, δ) = (−0.67, 0) ˆ β3 for (α, δ) = (−0.67, 0)

slide-39
SLIDE 39

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results QQ plots of GTL estimators of β2 and β3, n = 250

0.85 0.9 0.95 1 1.05 1.1 1.15 0.9 0.95 1 1.05 1.1 GTL estimate of β2 Normal quantile 95% upper 95% lower β2 45 degree 1 0.85 0.9 0.95 1 1.05 1.1 1.15 0.9 0.95 1 1.05 1.1 GTL estimate of β3 Normal quantile 95% upper 95% lower β3 45 degree 1

ˆ β2 for (α, δ) = (−0.67, 0) ˆ β3 for (α, δ) = (−0.67, 0)

slide-40
SLIDE 40

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Monte Carlo results Ratio of average estimated variance to Monte Carlo variance

DGP OLS GTL α δ σ β2 β3 β1 β2 β3 β1 A: Various GTL distributions, small sample, N = 250

  • 0.33

0.00 0.482 1.073 1.004 0.960 1.005 0.945 1.024

  • 0.33

0.10 0.454 1.037 1.071 1.018 0.989 0.933 1.016

  • 0.67

0.00 0.202 0.960 0.994 0.931 0.997 0.922 1.045

  • 0.67

0.25 0.136 0.794 1.368 1.042 0.987 0.912 1.032

  • 1.00

0.00 0.079 0.830 1.132 0.893 0.978 0.896 1.054

  • 1.00

0.50 0.024 0.777 1.707 0.996 0.991 0.903 1.033 B: Various GTL distributions, large sample, N = 5000

  • 0.33

0.00 0.482 1.035 1.006 0.897 1.034 1.038 0.997

  • 0.33

0.10 0.454 1.076 0.999 0.921 1.042 1.035 1.003

  • 0.67

0.00 0.202 1.307 1.446 0.980 1.019 1.066 1.005

  • 0.67

0.25 0.136 1.529 2.627 1.020 1.035 1.034 1.011

  • 1.00

0.00 0.079 2.727 0.966 0.998 1.008 1.080 1.017

  • 1.00

0.50 0.024 20.456 0.382 1.005 1.024 0.998 1.014

slide-41
SLIDE 41

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Alternatives Choose another distribution that permits skewness and thick tails.

Skewed-t distribution Skewed Generalized Error distribution “Stable” distribution (has no second moment unless it coincides with N(0, σ2)) Mixture of normal distributions

Parameter-intensive: 3 normals use 6 parameters Allows more than one mode Overfitting

Choose another estimation approach

Compute Var(ˆ βOLS) by sandwich estimator (QMLE) or bootstrap Estimate by Least Absolute Deviation Use the M-estimation approach (QMLE) Estimate by data trimming or winsorizing

slide-42
SLIDE 42

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Alternatives Choose a non-parametric estimation approach

This ignores the linearity of the location assumption: yi = X ′

i β + ǫi

A nonparametric (kernel) estimator is essentially a weighted average of y values with Xs in the neighborhood of Xi. The weights account for the distance from Xi, not for the thickness

  • f the tails of the distribution of ǫi.

Thus, the nonparametric estimator is even more unstable than OLS.

Choose a semi-parametric estimation approach: single-index models

This makes the linear location assumption: yi = X ′

i β + ǫi

Assumption about the distribution of ǫi: E[ǫi|X] = 0, E[ǫ2

i |X]

must be finite. What else? Anything about higher-order moments?

slide-43
SLIDE 43

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Summary Tests for normality in six applications

Application n Wald LR LM LM V Vuong lnWage males 54687 1127.5 1303.4 583.7 42

  • 11.16

MORG US lnWage females 46045 1339.4 1618.2 670.4 52

  • 11.80

MORG US Housing prices 546 8.5 11.4 8.17

  • 1.69

Canada Speeding tickets, 31674 791.1 69015.0 5072.0 8

  • 52.67

Massachusetts Trade cr. and div. 37983 1326.8 4374.6 1299.0 24

  • 24.96

World CAPM, 25 portf 635 (25) (17) Fama/French, US Note: Critical value for Wald, LR, LM = 9.21 (1%), 5.99 (5%). Note: Critical value for Vuong = -2.33 (1% one-tailed), -1.645 (5% one-tailed)

slide-44
SLIDE 44

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Summary Estimates of (α, δ) in six applications

OLS residuals ˆ α ˆ δ Application Skewness Kurtosis Est. Sr.Err. Est. St.Err. lnWage males

  • 0.23

4.36 0.030 0.004 0.029 0.002 lnWage females

  • 0.19

4.89 0.003 0.004 0.013 0.003 Housing prices

  • 0.19

3.51 0.010 0.050 0.034 0.025 Speeding tickets,

  • 1.20

5.07

  • 1.761

0.083 1.308 0.048 Trade cr. and div.

  • 0.71

4.90

  • 0.078

0.008 0.139 0.005 CAPM, 25 portf. graph graph

Recall: For a normal distribution, (α, δ) = (0.1436, 0). Recall: For a logistic distribution, (α, δ) = (0, 0)

slide-45
SLIDE 45

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Summary Difference between normal and logistic (standardized)

Left bound on σ Left bound on κ4 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5

  • 4
  • 3
  • 2
  • 1

1 2 3 4 Normal Logistic

Range Normal Logistic −1000 < −2.375 0.0088 0.0133 −2.375 < −0.682 0.2395 0.2123 −0.682 < 0.682 0.5035 0.5488 0.682 < 2.375 0.2395 0.2123 2.375 < 1000 0.0088 0.0133

slide-46
SLIDE 46

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Trade creation and trade diversion, 1960-2005 Trade creation estimates

OLS GTLE Estimate Stan.Err. Estimate Stan.Err.

ˆ βOLS − ˆ βGTL ˆ βGTL SEGTL SEOLS

Trade creation dummy variables tc.nafta 0.371 0.252 0.500 0.220

  • 0.257

0.874 tc.eu 0.427 0.115 0.196 0.098 1.182 0.853 tc.efta 0.685 0.129 0.518 0.116 0.323 0.896 tc.eea 0.179 0.095 0.261 0.079

  • 0.312

0.837 tc.caricom 2.823 0.513 2.417 0.459 0.168 0.894 tc.ap 0.828 0.186 0.804 0.159 0.030 0.852 tc.mercosur 1.086 0.306 1.035 0.315 0.049 1.030 tc.asean 0.467 0.216 0.492 0.185

  • 0.051

0.858 tc.anzcerta 0.969 0.141 0.748 0.130 0.295 0.920 tc.apec 1.599 0.095 1.291 0.085 0.238 0.889 tc.laia

  • 0.133

0.141

  • 0.432

0.134

  • 0.691

0.950 tc.cacm 2.314 0.150 1.931 0.139 0.198 0.927 tc.bilateralPTA 0.110 0.128 0.098 0.117 0.122 0.916 Dependent variable: Log of bilateral imports.

slide-47
SLIDE 47

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Trade creation and trade diversion, 1960-2005 Trade diversion estimates

OLS GTLE Estimate Stan.Err. Estimate Stan.Err.

ˆ βOLS − ˆ βGTL ˆ βGTL SEGTL SEOLS

Trade diversion dummy variables td.nafta 0.151 0.073 0.081 0.061 0.875 0.841 td.eu 0.651 0.051 0.434 0.047 0.500 0.908 td.efta 0.376 0.059 0.202 0.054 0.866 0.921 td.eea

  • 0.142

0.048

  • 0.101

0.043 0.409 0.894 td.caricom

  • 0.577

0.100

  • 0.539

0.097 0.071 0.972 td.ap 0.105 0.074 0.115 0.068

  • 0.088

0.925 td.mercosur 0.030 0.073

  • 0.019

0.063

  • 2.554

0.869 td.asean 0.474 0.070 0.395 0.061 0.198 0.869 td.anzcerta

  • 0.759

0.098

  • 0.657

0.086 0.156 0.879 td.apec 0.439 0.049 0.341 0.042 0.288 0.871 td.laia

  • 0.561

0.060

  • 0.533

0.054 0.051 0.913 td.cacm

  • 0.174

0.078

  • 0.120

0.074 0.451 0.945 td.bilateralPTA

  • 0.292

0.054

  • 0.275

0.045 0.064 0.832 Dependent variable: Log of bilateral imports.

slide-48
SLIDE 48

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Trade creation and trade diversion, 1960-2005 Distributions estimated for the trade creation and diversion equation

0.05 0.1 0.15 0.2 0.25 0.3

  • 6
  • 4
  • 2

2 4 6 Normal GTL

slide-49
SLIDE 49

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Speeding tickets, Massachussets, 2001

OLS GTLE Estimate Stan.Err. Estimate Stan.Err.

ˆ βOLS − ˆ βGTL ˆ βGTL SEGTL SEOLS

ln(Mph over) 0.8649 0.0144 1.2332 0.0057

  • 0.299

0.396 OutTown 0.0071 0.0107 0.0001 0.0006 119.524 0.054 OutState 0.0353 0.0070 0.0007 0.0004 52.026 0.062 Afr.American

  • 0.0207

0.0095

  • 0.0010

0.0004 20.402 0.045 Hispanic 0.0216 0.0101

  • 0.0003

0.0007

  • 85.838

0.064 Female

  • 0.0564

0.0375 0.0003 0.0017

  • 204.497

0.046 ln(Age) 0.0002 0.0073 0.0015 0.0007

  • 0.878

0.092 Female × ln(Age) 0.0092 0.0105 0.0000 0.0005 946.064 0.049 ln(CourtDist) 0.0158 0.0035

  • 0.0002

0.0003

  • 81.445

0.074 ln(Pvalue.pc)

  • 0.0043

0.0218

  • 0.0007

0.0011 5.209 0.050 OR 0.0202 0.0700 0.0058 0.0020 2.502 0.028 OR × OutTown 0.0083 0.0630

  • 0.0082

0.0019

  • 2.008

0.031 OR × ln(CourtDist) 0.0019 0.0089 0.0003 0.0004 5.139 0.042 SP 0.0299 0.2949

  • 0.0016

0.0155

  • 19.662

0.053 SP × OutTown 0.0258 0.0187 0.0000 0.0008

  • 4569.156

0.041 SP × ln(CourtDist) 0.0059 0.0040 0.0007 0.0004 7.853 0.101 SP × ln(Pvalue.pc)

  • 0.0018

0.0265

  • 0.0002

0.0014 9.790 0.051 SP × OR

  • 0.0164

0.0319

  • 0.0018

0.0017 8.270 0.053 Intercept 2.2716 0.2551 1.4817 0.0190 σ 0.0030 0.0005 α

  • 1.7612

0.0833 δ 1.3083 0.0483 log Likelihood

  • 9440.55

25066.89 (Absolute) Average 360.723 0.075 Dependent variable: Log of amount of fine.

slide-50
SLIDE 50

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Speeding tickets, Massachussets, 2001 Distributions estimated for the speeding ticket equation

The GTL density is top-truncated; it peaks at 33.23 at ǫ = 0.0075.

1 2 3

  • 1.5
  • 1
  • 0.5

0.5 1 1.5 Normal GTL

slide-51
SLIDE 51

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

CAPM, basic and extended, 1960-2012 (monthly) Estimates of (α, δ), basic CAPM

  • 0.12
  • 0.1
  • 0.08
  • 0.06
  • 0.04
  • 0.02

0.02

  • 0.2
  • 0.15
  • 0.1
  • 0.05

0.05 0.1 0.15

δ α

slide-52
SLIDE 52

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

CAPM, basic and extended, 1960-2012 (monthly) Estimates of (α, δ), extended CAPM

  • 0.12
  • 0.1
  • 0.08
  • 0.06
  • 0.04
  • 0.02

0.02

  • 0.2
  • 0.15
  • 0.1
  • 0.05

0.05 0.1 0.15

δ α

slide-53
SLIDE 53

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

CAPM, basic and extended, 1960-2012 (monthly) Normal and GTL densities for CAPM and Extended CAPM models Portfolio ME:1, B/M:1, ˆ α = −0.135, ˆ δ = −0.044

0.05 0.1 0.15 0.2

  • 18
  • 12
  • 6

6 12 18 CAPM: normal CAPM: GTL ECAPM: normal ECAPM: GTL

slide-54
SLIDE 54

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Financial risk management and GTL-GARCH

Given the GTL link function ǫ = G(u), it is straightforward to compute Value at Risk of a variable r = τ1 + τ2ǫ: VaRp = τ1 + τ2G(p) Given p, the Expected Shortfall is computed analytically as ESp = τ1 + τ2ESǫ(p) ESǫ(p) = E (ǫ | ǫ ≤ VaRǫ) = − 2δ α2 − δ2 + pα−δ (α − δ) (α − δ + 1)+ (1 − p)α+δ+1 − 1 p(α + δ)(α + δ + 1) : A GARCH(q1, q2) process: rt = σt ˜ ǫt σ2

t

= ω +

q2

  • i=1

ηir 2

t−i + q1

  • j=1

βjσ2

t−j,

t ∈ Z = {0, ±1, . . .} :

Source: Chu-Ping Vijverberg, Wim Vijverberg, and S¨ uleyman Ta¸ spinar, “Financial Risk Measurement with the Generalized Tukey Lambda Distribution.”

slide-55
SLIDE 55

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

GTL and Binary Choice Models With the GTL(α, δ) family of distributions, a family of binary choice models arises: y∗

i

= X ′

i β + ǫi

yi = I(y∗

i ≥ 0) = I(ǫi ≥ −X ′ i β)

ǫi ∼ GTL(α, δ) This “pregibit” family nests exactly or approximately: Probit Logit Gossit or robit Loglog and cloglog Linear probability model

Source: Wim Vijverberg and Chu-Ping Vijverberg, “Pregibit: A Family of Binary Choice Models.” Empirical Economics, forthcoming.

slide-56
SLIDE 56

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Extending Heckman’s selection model A GTL(α, δ) density is univariate. With a copula function, two or more univariate densities can be joined to create a multivariate density. Thus, combine two univariate GTL densities to create a flexible generalization of the Heckman selection model:

1 Selection equation: si = 1(Ziγ + νi > 0) 2 Outcome equation: yi = Xi ′β + σǫi if si = 1 3 νi ∼ GTL(αs, δs) 4 ǫi ∼ GTL(α1, δ1) 5 F(νi, ǫi) = C(Fν(νi), Fǫ(ǫi))

This can be expanded with a second outcome equation, where one

  • r the other outcome is observed.

:

Source: Wim Vijverberg and Takuya Hasebe, “A Flexible Sample Selection Model: A GTL-Copula Approach.”

slide-57
SLIDE 57

Intro GTL distribution GTL regression What about OLS? Alternatives Applications Extensions Conclusion

Conclusion GTL regression: Deals with non-normal disturbances in a flexible way Efficient relative to OLS when disturbances are non-normal Time-efficient relative to semi-parametric estimation methods—and statistically efficient if the GTL distribution captures the data generating process well enough Permits disturbances that don’t even have a mean or variance (or skewness or kurtosis) LM test can be used as a post-estimation test after running OLS Can be extended to include heteroskedasticity, serial correlation, GARCH-type behavior Might also increase efficiency of IV and GMM estimators