Nonparametric Sparsity
John Lafferty Larry Wasserman
Computer Science Dept. Department of Statistics Machine Learning Dept. Machine Learning Dept. Carnegie Mellon University Carnegie Mellon University
Nonparametric Sparsity John Lafferty Larry Wasserman - - PowerPoint PPT Presentation
Nonparametric Sparsity John Lafferty Larry Wasserman Computer Science Dept. Department of Statistics Machine Learning Dept. Machine Learning Dept. Carnegie Mellon University Carnegie Mellon University
Computer Science Dept. Department of Statistics Machine Learning Dept. Machine Learning Dept. Carnegie Mellon University Carnegie Mellon University
2
3
4
5
d
newβ)2.
(Chen & Donoho, 1994; Tibshirani, 1996)
6
d
j=1 |βj| ≤ t
Level sets of squared error For orthogonal designs, solution given by soft thresholding ˆ βj = sign(βj) (|βj| − λ)+
7
(Cand` es and Tao, Donoho, Tropp, Meinshausen and B¨ uhlmann, Wainwright, Zhao and Yu, Fan and Peng,...)
8
ˆ m sup m∈F
(Sobolev space of order 2)
9 1e+02 1e+04 1e+06 1e+08 0.0 0.1 0.2 0.3 0.4 0.5
sample size Risk
10 12 14 16 18 20 0e+00 2e+11 4e+11 6e+11 8e+11 1e+12 dimension sample size
Risk = 0.01 d = 20
10
11
12
13
14
15
The estimator can be written as ˆ mh(x) =
n
G(Xi, x, h)Yi Our method is based on the statistic Zj = ∂ ˆ mh(x) ∂hj =
n
Gj(Xi, x, h)Yi The estimated variance is s2
j = Var(Zj | X1, . . . , Xn) = σ2 n
G2
j(Xi, x, h)
16
17 5 10 15 0.0 0.2 0.4 0.6 0.8 1.0
Rodeo Step Bandwidth
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Average over 50 runs Typical Run
2 4 6 8 10 12 14 0.0 0.2 0.4 0.6 0.8 1.0
Rodeo Step Bandwidth
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
18 5 10 15 20 25 30 5 10 15 20 25 30 0.00 0.02 0.04 0.06 0.08 0.10
Leave-one-out cross-validation Rodeo
19
j = h0 for all j > r
j ≤ n−1/(4+r)+ for all j ≤ r
20
21 0.0 0.2 0.4 0.6 0.8 1.0 2 2 4 6 8 |beta|/max|beta| Standardized Coefficients 5 10 2 9 7 1 3 13 4
22
20 40 60 80 100 0.0 0.1 0.2 0.3 0.4 0.5
Greedy Rodeo Step Bandwidth
1 2 3 4 7 8 9
Rodeo order: 3 (body mass index), 9 (serum), 7 (serum), 4 (blood pressure), 1 (age), 2 (sex), 8 (serum), 5 (serum), 10 (serum), 6 (serum). LARS order: 3, 9, 4, 7, 2, 10, 5, 8, 6, 1.
23
(with Han Liu)
24
0.0 0.2 0.4 0.6 0.8 1.0 0.4 0.2 0.0 0.2 0.4
true regression line
x y 0.0 0.2 0.4 0.6 0.8 1.0 0.4 0.2 0.0 0.2 0.4
data adaptive basis, J=36
x fitted
25
β
26
0.0 0.2 0.4 0.6 0.8 1.0 0.03 0.02 0.01 0.00 0.01 0.02 0.03 base 27 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.02 0.01 0.00 0.01 0.02 0.03 base 30 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.05 0.00 0.05 base 15 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.06 0.04 0.02 0.00 0.02 0.04 0.06 base 18 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.04 0.02 0.00 0.02 base 6 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.06 0.04 0.02 0.00 0.02 0.04 0.06 base 9 x y27