Econometrics 1: IV, GMM and MLE
James A. Duffy1 Oxford, Michaelmas 2016 (revised: 28/12/16)
1I thank N. Geesing, L. Freund, K. Kuske, and E. Munro for comments. The manuscript was
Econometrics 1: IV, GMM and MLE James A. Duffy 1 Oxford, Michaelmas - - PDF document
Econometrics 1: IV, GMM and MLE James A. Duffy 1 Oxford, Michaelmas 2016 (revised: 28/12/16) 1 I thank N. Geesing, L. Freund, K. Kuske, and E. Munro for comments. The manuscript was prepared with L YX 2.2.2. Contents 1 Instrumental variables
1I thank N. Geesing, L. Freund, K. Kuske, and E. Munro for comments. The manuscript was
ECONOMETRICS 1, MT 2016 20/04/17
i β0 + ui
IV-ORTH Eziui = 0.
i β)zi = Eziyi − EzixT i β
i .
i β0 + ui − xT i β)zi =(2) EzixT i (β0 − β),
i is a dz × dx matrix, the equation
i ]δ = 0
i < dx (see Appendix A.2). In
IV-RANK rk EzixT
i = dx,
i β ⇐
i (β0 − β)
i = LEzixT i has full rank (i.e. rank dx).
i π0,k + vk,i
i = zT i Π0 + v T i , or rather
0 zi + vi
i = Ezi(zT i Π0 + v T i ) = (EzizT i )Π0
i = rk(EzizT i )Π0 =(2) rk Π0
i
IV-RANK′ rk Π0 = dx and rk EzizT
i = dz.
i β
i )−1Eziyi,
n
i
n
i
n
i
i has full
i β
i )−1EzL,iyi
i
n
i β
i β] = EzL,iyi − EzL,ixT i β.
n
n
i β
n
n
i β
i and EzL,iyi, the r.h.s.
i=1 ziui = 0.
1 n
i=1 ziui p
p
n
i
n
i
n
n zi
i=1 zi ˆ
i = 0, by construction, whence n
i = n
n
i + ˆ
n n
i = n
i .
n
i
n
i = (ZTZ)−1ZTX
i
n
0 zi
n zi denotes their feasible
IV-MOM Exi2 < ∞, Ezi4 < ∞, and E|ui|4 < ∞
IV-HMSK E[u2
i | zi] = Eu2 i =: σ2 u.
i zizT i = E[ E[u2 i zizT i | zi] ] = E[ E[u2 i | zi] zizT i ] = σ2 uEzizT i
d
i )−1(Eu2 i ˜
i )(E˜
i )−1.
u(E˜
i )−1.
i β0 + ui into the formula for the 2SLS estimator gives
i
n
i β0 + ui) = β0 +
i
n
i=1 ˆ
i = n i=1 ˆ
i , yields
n
i
n
n sn.
n zi, and ˆ
n
i
n
i p
i )−1EzixT i = Π0
p
i ≤(1) Exizi ≤(2) (Exi2)1/2(Ezi2)1/2 <(3) ∞
n
n
i
p
0 EzizT i Π0 = E˜
i .
p
i is invertible (see the problem set).
n
d
i zizT i ]
n
n
d
0 · N[0, Eu2 i zizT i ] ∼ N[0, Eu2 i ˜
i ].
i )−1(Eu2 i ˜
i )(E˜
i )−1
n
i
n
i ˆ
i
n
i
i ˆ
i ˆ
p
d
d
i β0 + ui = xT 1iβ0,1 + xT 2iβ0,2 + ui
0,1
0,2
1iβ0,1 + xT 2iβ0,2 + ui
0,1
0,2
0 zi + vi
2i) = 0 (by IV-ORTH and the definition of Π0 as a matrix of population
2iρ0 + ǫi,
i β0 + v T 2iρ0 + ǫi.
0 zi + vi)ǫi = ΠT 0 Ezi(ui − v T 2iρ0) =(3) 0
2i) = 0.
2iρ0 + ǫi) =(2) Ev2iv T 2iρ0
2i is full
0 : ρ0 = 0
1 : ρ0 = 0.
0 (and only under H′ 0) ˆ
n ˆ
n,ρ ˆ
d
i β0 + ui
IV-ORTH Eziui = 0; IV-RANK′ rk Π0 = dx and rk EzizT
i = dz.
n
i ˆ
n n
i β)
n
i ˆ
n
n
i β)
u
n
i
n
d
n ξn d
d
n ˆ
u n
i
s
i ∼(4) χ2[s]
1
2
1 x1i + πT 2 z2i + v2i,
11x1i + πT 12z2i + v2,1i
21x1i + πT 22z2i + v2,2i.
V V
2
2i
V V
n
i=1 ˆ
i . This can be regarded as a kind of matrix analogue of a Wald
p
V V
2
2i
V V
d
i=1 ziui
i=1 zixi d
uEz2 i ]
i
uπ−2 0 (Ez2 i )−1]
d
n
n
p
i
i=1 ziui
i=1 zixi
i=1 ziui
i=1 zivi
n
u
v
i .
i=1 ziui
i=1 zivi d
u
i
d
2 (against H1 : β2 = β∗ 2) in the model:
1iβ1 + xT 2iβ2 + ui
1
2
1i, zT 2i)T are the available instruments.
2) := yi − xT 2iβ∗ 2 = xT 1iβ1 + ui,
2, letting δ := β2 − β∗ 2
2) = xT 1iβ1 + xT 2i(β2 − β∗ 2) + ui
1iβ1 + (ΠT 1 x1i + ΠT 2 z2i + v2i)Tδ + ui
1i(β1 + Π1δ) + zT 2i(Π2δ) + (v T 2iδ + ui)
1iκ1 + z2iκ2 + ηi.
2 under the null, we can compute yi(β∗ 2), and thus we can
2) on zi =
1i, zT 2i)T. Indeed, the usual t and Wald tests remain entirely valid here.
0 : κ2 = 0 therefore implies
2 thus consists of testing κ2 = 0 in (1.29), using the
2
0 : κ2 = 0 involves dz2 restrictions, rather than merely dx2 restrictions. In this
0 : κ2 = 0 also
0 as a rejection of H0, rather than as signifying that some instruments
ECONOMETRICS 1, MT 2016 20/04/17
i β0 + ui
i β) = 0,
n
i β) ≈ 0
i (β) := y ∗(ωi; ξi, β) =
1(ωi; ξi, β)
2(ωi; ξi, β)
1 , y 2 2 , y1y2, y1ω, y2ω)T.
i (β), ωi], if (and
i (β), ωi]} = E[m(yi, ωi) − µ(ωi, β)],
i (β), ωi], because the latter depends on ξi, which is not observed.
i=1 of Rdw-valued i.i.d. random vectors.
GMM-ID θ = θ0 is the unique solution to:
n
θ∈Θ
p
p
θ∈Θ
θ∈Θ
GMM-WGHT Wn is positive semi-definite, and Wn
p
n
i=1 is just a collection of i.i.d. random variables. Thus
p
p
p
INTR θ0 ∈ int Θ.
n
p
p
p
n
d
p
p
n,1Wn[gn(θ0) + Dn,2(ˆ
n,1WnDn,2)−1DT n,1Wnn1/2gn(θ0)
d
GMM-JAC rk D = rk DθEg(wi, θ0) = dθ.
GMM-VAR S := Eg(wi, θ0)g(wi, θ0)T is positive definite.
d
n
n
n
i=1 zi(yi − xT i β), and so
n
i
i
GMM-JAC is thus the non-linear GMM counterpart of IV-RANK.
JAC holds, but D is ‘close’ to having rank strictly less than dθ; we say that θ0 is
p
n
i=1 g(wi, ˆ
n gn(θ); this is asymptotically
n p
n -weighted) sample moment conditions. This real-
n
n
i β),
i zizT i .
uEzizT i , which sug-
n = ˆ
u
n
i
i=1 zizT i )−1 will give the same es-
u).
n gn(β) = 0
(1) ˆ
u
n
i
n
i
n
i β) = 0
n
i β) = 0
n n
i β) = 0.
(1) follows from Dβgn(β) = 1 n
i=1 zixT i = 1 nZTX.
n =
n
i zizT i
i ˆ
n gn(ˆ
d
n
p
n
n
n,1 ˆ
n Dn,2)−1DT n,1 ˆ
n ]n1/2gn(θ0)
n
n,1 ˆ
n Dn,2)−1DT n,1 ˆ
n
n
d
H ξ,
H denotes the orthogonal projection onto the dg −dθ subspace orthogonal to
H is thus a rank dg − dθ matrix.
n gn(ˆ
d
H ξ ∼ χ2[dg − dθ].
n
H S−1/2n1/2gn(θ0),
H S−1/2δ = 0
d
d
d
d
p
d
d
d
p
d
d
d
p
n )−1[r(ˆ
d
n A−1 n xn.
p
0 : r(θ0) := dθ
0,k = 0
d
0 : θ3 0 = 1; this gives the Wald
n = n[ˆ
n − 1]
n ˆ
d
n have the same limiting distribution, they may differ
0 : r(ϕ−1(γ)) = ρ; the Wald statistic computed in this case may
n d
θ∈Θρ Qn(θ; ˆ
n ) − min θ∈Θ Qn(θ; ˆ
n ) d
p
n ) to
ECONOMETRICS 1, MT 2016 20/04/17
i β0 + ui;
i β0 + ui;
i = dx (IV-RANK), but were
θ∈Θ
n
n
n
n
i=1. This
n
n
n
c n(θ) = 1
n
0) =
0)1/2 exp
n
n
n
n
i=1 wi, the sample mean; so this must be the MLE
0, take the FOC with respect to σ2 ,
n
n = 1
n
σ2 =
n
n
n
n
p
−∞
0 (1 − θ0)1−w, w ∈ {0, 1}
n
n
i=1 wi
i=1 wi
1 n
i=1 wi
n
i=1 wi
n
i=1 wi. (Once again, the MLE is the sample average!)
i=1 wi
n
i=1 wi
n
n
n
i=1 wi also in this case.
i ∼ N[µ, 1] at zero:
i , 0}.
i might represent an individual’s ‘latent propensity’ to purchase meat
i ≤ w}
∈Wd|w≤w}
∈Wd|w≤w}
i , 0}
i ∼ N[µ, 1]. Recall from (3.5) that wi has density
n
i ∼ N[µ, σ2], rather than forcing σ2 = 1. It
0]. We now wish to generalise this to
i β0, σ2 0],
i β0. [In
i β0 + σ0ǫi
0), but
0)r(x).
n(β, σ2) = n
n
i β
n
i β)2.
0 is
n := 1
n
i ˆ
0 by σ2(zT i γ0), where zi is another
i β0, σ2(zT i γ0)],
i β0 + σ(zT i γ0)ǫi.
i γ0)φ
i β0
i γ0)
n(β, γ) = −n
i γ) − 1
n
i β)2
i γ) .
n(β, γ) is thus maximised with respect to β by the generalised
i = σ2(zT i γ).
n by sequentially maximising
n simultaneously with respect to β and γ (which typically must be
i β0),
i β0 into the unit interval, with limz→−∞ θ(z) = 0 and limz→+∞ θ(z) = 1. [In this
i β0)
i β0 + ǫi ≥ 0}
i β0 + ǫi ≥ 0 | xi}
i β0 | xi}
i β0)
i β0),
n(β) = n
n
i β) + (1 − yi) log[1 − Φ(xT i β)]}
i β) +
i β)].
i measures an individual’s under-
i = xT i β0 + σ0ǫi
i through
i , 0}.
i ≤ y | xi = x}
n(β, σ) = n
i β0
i β0
i=1 is i.i.d. with marginal density
n
c n = 1
n
n
p
ML-ID For each θ = θ0, there exists a w ∈ W such that
n
n
n
p
ML-DIFF θ → p(w; θ) is twice continuously differentiable, for every w ∈ W .
INTR θ0 ∈ int Θ,
θℓn(θ) to ∇2 θℓ0(θ)) yields
p
θℓ0(θ0) =: H. (Again, as discussed in the context of GMM, we need
INTR and consistency to ensure the validity of (3.14).)
n n1/2∇θℓn(θ0).
n
n
n
d
n n1/2∇θℓn(θ0) d
θℓ0(θ0) is invertible (has full rank); we
ML-HESS rk H = dθ.
ML-VAR S := Es(wi; θ0)s(wi; θ0)T is positive definite.
d
θℓ0(θ0) and S := Es(wi; θ0)s(wi; θ0)T.
θℓn(ˆ
n
d
d
θ∈Θ
θ∈Θρ
n )[r(ˆ
d
n ˆ
n
n
n
θ∈Θ ℓn(θ) − max θ∈Θρ ℓn(θ)
n ∇θℓn(ˆ
d
n ˆ
n ) remains valid; and it is possible to modify the LM statistic such
n
ECONOMETRICS 1, MT 2016 20/04/17
ECONOMETRICS 1, MT 2016 20/04/17
i=1 x2 i )1/2 is also a common choice on Rk.)1
1None of our results depend on the specific choice of norm, since all norms on a finite-dimensional space
are equivalent. That is, if ·∗ is another norm on Rk, then there must exist nonzero, finite constants c0 and c1 such that c0x ≤ x∗ ≤ c1x. Convergence (in probability, or in distribution) in one norm thus implies convergence in all the others.
1 , . . . , λ−1 k }.
1 , . . . , λ1/2 k }, is itself positive (semi-)definite, and has the property
p
n→∞ P{xn − x∞ > ǫ} = 0.
d
n→∞ Fn(x) = F∞(x)
d
p
d
d
d
n xn d
n
p
n