Estimating risk
Estimating risk
Maximilian Kasy
Department of Economics, Harvard University
May 4, 2018
1 / 17
Estimating risk Maximilian Kasy Department of Economics, Harvard - - PowerPoint PPT Presentation
Estimating risk Estimating risk Maximilian Kasy Department of Economics, Harvard University May 4, 2018 1 / 17 Estimating risk Introduction Introduction Some of the topics about which I learned from Gary: The normal means model.
Estimating risk
Department of Economics, Harvard University
1 / 17
Estimating risk Introduction
◮ The normal means model. ◮ Finite sample risk and point estimation. ◮ Shrinkage and tuning. ◮ Random coefficients and empirical Bayes.
◮ Brief review of these topics. ◮ Building on that, some new results from my own work.
2 / 17
Estimating risk Introduction
k Eθ
k ∑ j
3 / 17
Estimating risk Introduction
◮ Covariance penalties, ◮ Stein’s Unbiased Risk Estimate (SURE), ◮ Cross-Validation (CV).
◮ X ∈ Rk as sample mean of n i.i.d. draws Yi. ◮ ⇒ n-fold Cross-Validation.
◮ Large n ⇒ CV approximates SURE. ◮ Large k ⇒ CV and SURE converge to MSE,
4 / 17
Estimating risk Introduction
5 / 17
Estimating risk SURE and CV
k ∑j MSEj, where
6 / 17
Estimating risk SURE and CV
7 / 17
Estimating risk SURE and CV
k ∑ j
k ∑ j
k Eθ
k
◮ X is normally distributed. ◮ Σ is known. ◮
8 / 17
Estimating risk SURE and CV
n ∑ i
1 n−1 ∑ i′=i
n ∑ i
9 / 17
Estimating risk Large n
n ∑i Y n i with (Y n i −θ)/√
n2 ∑i(Y n i − X n)(Y n i − X n)′. Then
10 / 17
Estimating risk Large n
s(Yi − X)
s Ui
s
s Ui)
n ∑ i
i .
s
+2
θ,(s+ 1
s
θ′)Ui
1
s2
θ′(X)·Ui2+2∆i,Yi− θ−i
n ∑ i
+0+op( 1 n ).
11 / 17
Estimating risk Large k
◮ Does this hold uniformly over π? ◮ If so, does this yield oracle-optimal tuning parameters?
12 / 17
Estimating risk Large k
◮ Ridge: mR(x,λ) =
1 1+λ x.
◮ Lasso: mL(x,λ) = 1(x < −λ)(x +λ)+ 1(x > λ)(x −λ).
k k
j=1
13 / 17
Estimating risk Large k
14 / 17
Estimating risk Large k
15 / 17
Estimating risk Large k
j
◮ Fast alternative to CV for tuning of neural nets, etc. ◮ Additional acceleration by only calculating this for subset of i, j.
◮ Relevant for many restrictions implied by economic theory. ◮ Proving uniform dominance using SURE, extending James-Stein. ◮ Open question: Smooth choice of “degrees of freedom” that is not
16 / 17
Estimating risk Large k
17 / 17