Estimator selection
Christophe Giraud
Universit´ e Paris-Sud et Paris-Saclay
M2 MSV et MDA
1/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 1 / 22
Estimator selection Christophe Giraud Universit e Paris-Sud et - - PowerPoint PPT Presentation
Estimator selection Christophe Giraud Universit e Paris-Sud et Paris-Saclay M2 MSV et MDA 1/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 1 / 22 What shall I do with these data ? Classical steps 1
Universit´ e Paris-Sud et Paris-Saclay
1/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 1 / 22
1 Elucidate the question(s) you want to answer to, and check your data
◮ deep discussions with specialists (biologists, physicians, etc), ◮ low level analyses (PCA, LDA, etc) to detect key features, outliers, etc ◮ and ... experience ! 2 Choose and apply an estimation procedure 3 Check your results (residues, possible bias, stability, etc) 2/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 2 / 22
i + εi with εi i.i.d.
1 , . . . , f ∗ n )T and σ2 are unknown
3/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 3 / 22
4/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 4 / 22
5/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 5 / 22
◮ some unknown characteristics of f (sparsity, smoothness, etc) ◮ the unknown variance σ2.
6/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 6 / 22
7/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 7 / 22
8/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 8 / 22
9/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 9 / 22
10/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 10 / 22
11/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 11 / 22
12/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 12 / 22
m∈M
1 We cannot explore a huge collection of models : we restrict to a
3 the criterion must not depend on the unknown variance σ2: We
m = Y − ProjSmY 2
13/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 13 / 22
λ, with
m∈ M
m
m is given by (1) and penπ(m) ≈ penBM(m).
14/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 14 / 22
λ − β∗)2 ≤ C inf β=0
15/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 15 / 22
16/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 16 / 22
β
17/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 17 / 22
β
β∈Rp
18/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 18 / 22
19/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 19 / 22
λℓ; β0
20/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 20 / 22
21/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 21 / 22
k
σ unknown and k unknown σ known or k known
Ultra-high dimension 2k log(p/k) ≥ n n
Minimax risk
22/22 Christophe Giraud (Paris Sud) High-dimensional statistics M2 MSV & MDA 22 / 22