Edgeworth and confidence interval correction in spiked PCA Iain - - PowerPoint PPT Presentation

edgeworth and confidence interval correction in spiked pca
SMART_READER_LITE
LIVE PREVIEW

Edgeworth and confidence interval correction in spiked PCA Iain - - PowerPoint PPT Presentation

Edgeworth and confidence interval correction in spiked PCA Iain Johnstone & Jeha Yang Statistics & Biomedical Data Science, Stanford & Two Sigma Shanghai, December 10, 2019 Edgeworth and confidence interval correction in spiked PCA


slide-1
SLIDE 1

Edgeworth and confidence interval correction in spiked PCA

Iain Johnstone & Jeha Yang

Statistics & Biomedical Data Science, Stanford & Two Sigma

Shanghai, December 10, 2019

slide-2
SLIDE 2

Edgeworth and confidence interval correction in spiked PCA

Iain Johnstone & Jeha Yang

Statistics & Biomedical Data Science, Stanford & Two Sigma

Shanghai, December 10, 2019

slide-3
SLIDE 3

Viral protein mutations and spiked models

Quadeer et. al. PLOS Comp. Bio. 2018

slide-4
SLIDE 4

Viral protein mutations and spiked models

Quadeer et. al. PLOS Comp. Bio. 2018

slide-5
SLIDE 5

A suggestive simulation on correlation matrices

[David Morales, Matt McKay]

6

2nd eigenvalue

2-st Leading Eigenvalue c = 0.2, N = 300, N1 = 10, simple spks = [2.8, 1.9], deg. spks = [0.8, 0.9]

2.1 2.15 2.2 2.25 2.3 2.35 2.4 2.45 2.5 1 2 3 4 5 6 7 8 9 Histogram of the sample eigenvalue

mean = 2.305 [2.322] std = 0.053 [0.054] (0.89 xPaul)

γ

2-st Leading Eigenvalue c = 0.2, N = 300, N1 = 30, simple spks = [6.8, 3.9], deg. spks = [0.8, 0.9]

3.7 3.8 3.9 4 4.1 4.2 4.3 4.4 4.5 4.6 4.7 0.5 1 1.5 2 2.5 3 3.5 Histogram of the sample eigenvalue

mean = 4.145 [4.169] std = 0.126 [0.125] (0.89 xPaul)

γ ρ1 = 0.2 ; ρ2 = 0.1 Theoretical variance is pretty accurate, but there seems to be a shift in the mean (similar to what we’ve seen before in the eigenvector projections of sample covariance when spikes were close to each other)

slide-6
SLIDE 6

Outline Background on spiked covariance model Edgeworth correction - single spike Edgeworth for multiple spikes Explaining the repulsion correction Confidence intervals after selection

slide-7
SLIDE 7

High dimensional spiked PCA model

◮ Data : X = [x1 · · · xn]′ with x1, · · · , xn

i.i.d.

∼ Np+1(0, Σ) ◮ Large dimensional asymptotic regime : as n → ∞, γn := p/n → γ ∈ (0, ∞) ◮ Spiked eigenstructure of Σ : for a fixed r, ℓ1 > · · · > ℓr

  • Spikes

> 1 = ℓr+1 = · · · = ℓp+1 ◮ Statistics : eigenvalues of sample covariance matrix X ′X/n ˆ ρ1 ≥ · · · ≥ ˆ ρp+1 → w.l.o.g. Σ is diagonal

slide-8
SLIDE 8

Largest Eigenvalue ˆ ρ1: Numerical illustration

p = 200, n = 800 [i.e. γn = p/n = 0.25] subcritical critical supercritical Spike h = ℓ − 1 : 0, 0.25, h+ = 0.5, 0.75, 1.

2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 5 10 15

slide-9
SLIDE 9

Finite rank model, K = 1: phase transition

Σ = diag(ℓ1, 1, . . . , 1) p/n → γ . Interior point transition at ℓ1 = 1 + √γ:

[Baik–Ben Arous–Pech´ e,05]

fluctuation

3 = {2

n

1

` Critical point: ° 1+

2

) ° (1+

.

1

¸ Tracy-Widom

slide-10
SLIDE 10

Finite rank model, K = 1: phase transition

Σ = diag(ℓ1, 1, . . . , 1) p/n → γ . Interior point transition at ℓ1 = 1 + √γ:

[Baik–Ben Arous–Pech´ e,05]

)

1

` ( ¸ Critical point: ° 1+

2

) ° (1+

Gaussian

1

` fluctuation

2 = {1

n bias

1

¸

.

slide-11
SLIDE 11

Largest Eigenvalue ˆ ρ1: Numerical illustration

p = 200, n = 800 [i.e. γn = p/n = 0.25] subcritical critical supercritical Spike h = 0, 0.25, h+ = 0.5, 0.75, 1.

2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 5 10 15

Edge: (1 + √γn)2 = 2.25

slide-12
SLIDE 12

Largest eigenvalue: Phase transition

Different rates, limit distributions: For h < √γ : n2/3 ˆ ρ1 − µ(γn) τ(γn)

  • D

⇒ TWβ, For h > √γ : n1/2 ˆ ρ1 − ρ(h, γn) σ(h, γn)

  • D

⇒ N(0, 1)

h =1+

º

` ° 1+

2

) ° (1+

2

) ° (1+ ° 1 )

º

` ( ½

slide-13
SLIDE 13

Largest eigenvalue: Phase transition

Different rates, limit distributions: For h < √γ : n2/3 ˆ ρ1 − µ(γn) τ(γn)

  • D

⇒ TWβ, For h > √γ : n1/2 ˆ ρ1 − ρ(h, γn) σ(h, γn)

  • D

⇒ N(0, 1) with ρ(h, γ) = (1 + h)

  • 1 + γ

h

  • σ2(h, γ) = 2(1 + h)2

1 − γ h2

  • h

=1+

º

` ° 1+

bias (bulk)

2

) ° (1+

2

) ° (1+ ° 1 )

º

` ( ½ Statistical physics lit, 94- Baik-Ben Arous-Peche(05) , Paul (07) Baik-Silverstein (06), Bloemendal-Virag (11) Mo (11) , Wang (12) Benaych-Georges-Guionnet- Maida (11)

slide-14
SLIDE 14

Normal approximation – multiple spikes

◮ Assume that all spikes are simple, supercritical : ℓ1 > · · · > ℓr > 1 + √γ ◮ Asymptotic mutual independence: with ρkn := ρ(ℓk, γn), σkn := σ(ℓk, γn), (ˆ zkn)k=1,··· ,r :=

  • n1/2 (ˆ

ρk − ρkn) σkn

  • k=1,··· ,r

⇒ N(0, Ir)

Shi (2013)

slide-15
SLIDE 15

Edgeworth approximations

slide-16
SLIDE 16

Inaccuracy of approximations : ˆ zkn associated with ℓk = 2.7

(n,γn,l) = (400,1,(2.7))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

(n,γn,l) = (400,1,(2.7,2.2))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

(n,γn,l) = (400,1,(3.2,2.7))

z ^2n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

(n,γn,l) = (400,1,(2.7,2.4))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

slide-17
SLIDE 17

Traditional Edgeworth

(Smooth function of) means model:

Petrov, 1975, Hall, 1992

Sn = 1 √nκ2n

n

  • i=1

Xni indep, mean 0, ∈ Rd, d fixed κjn = 1 n

n

  • 1

EX j

ni

moments First order expansion: P (Sn ≤ x) = Φ(x) + n−1/2p(x)φ(x) + o(n−1/2) p(x) = −κ3n κ3/2

2n

H2(x) 6 , H2(x) = x2 − 1. skewness correction

slide-18
SLIDE 18

Single spike, first order expansion for ˆ ρ1

ˆ z1n = n1/2(ˆ ρ1 − ρ1n)/σ1n Theorem In spiked model, h1 = ℓ1 − 1 > √γ, γn = p/n, P (ˆ z1n ≤ x) = Φ(x) + n−1/2p1n(x)φ(x) + o(n−1/2), uniformly in x ∈ R, with p1n(x) = −α2nH2(x)−α0n α2n = α2(h1, γn) = √ 2 3 h3

1 + γn

(h2

1 − γn)3/2 ,

α0n = α0(h1, γn) = γn √ 2 h1 + 1 (h2

1 − γn)3/2

slide-19
SLIDE 19

Coefficients of Edgeworth expansion for single-spike

α2(h1, γn) = √ 2 3 h3

1 + γn

(h2

1 − γn)3/2 ,

α0(h1, γn) = γn √ 2 h1 + 1 (h2

1 − γn)3/2

◮ Larger for “harder” cases i.e. larger γ and smaller h (> √γ) ◮ Larger than the fixed p case i.e. γ = 0, α2 = √ 2/3, α0 = 0

Muirhead-Chikuse (1975)

◮ Empirically reasonable if 9 2 α2

2

n = (h3

1 + γ)2

n(h2

1 − γ)3 ≤ 0.2

slide-20
SLIDE 20

Single Spike Simulation

(n, γ, l−factor) = (50,0.1,0.3)

l ^ Density 1.0 1.5 2.0 2.5 3.0 3.5 0.0 0.4 0.8 1.2 Edgeworth Normal Upper Edge

(n, γ, l−factor) = (50,1,0.3)

l ^ Density 4 5 6 7 0.0 0.2 0.4 0.6 0.8 1.0 Edgeworth Normal Upper Edge

(n, γ, l−factor) = (100,0.1,0.3)

l ^ Density 1.5 2.0 2.5 3.0 0.0 0.5 1.0 1.5 2.0 Edgeworth Normal Upper Edge

(n, γ, l−factor) = (100,1,0.3)

l ^ Density 3.5 4.0 4.5 5.0 5.5 6.0 0.0 0.5 1.0 1.5 Edgeworth Normal Upper Edge

slide-21
SLIDE 21

Edgeworth for multiple spikes

slide-22
SLIDE 22

Eigenvalues are repulsive!

(n,γn,l) = (400,1,(2.7,2.2))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

(n,γn,l) = (400,1,(2.7,2.4))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

(n,γn,l) = (400,1,(3.2,2.7))

z ^2n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal

◮ joint density of (ˆ ρ1, · · · , ˆ ρn∧(p+1)) has a Jacobian factor

  • i<j

|ˆ ρi − ˆ ρj| → pushes eigenvalues apart ◮ But, not visible at leading order (for supercritical spikes:) (ˆ zkn)k=1,··· ,r ⇒ N(0, Ir)

slide-23
SLIDE 23

Multi spike, first order expansion for ˆ ρk

ˆ zkn = n1/2(ˆ ρk − ρkn)/σkn Theorem In spiked model, hk = ℓk − 1 > √γ, γn = p/n, P (ˆ zkn ≤ x) = Φ(x) + n−1/2pkn(x)φ(x) + o(n−1/2), uniformly in x ∈ R, with pkn(x) = −α2(hk, γn)H2(x) − α0,k(h,γn) α2(hk, γn) = √ 2 3 h3

k + γn

(h2

k − γn)3/2 ,

α0,k(h, γ) = 1 √ 2 hk + 1 (h2

k − γ)1/2

  • γ

h2

k − γ +

  • j=k

hj hk − hj

slide-24
SLIDE 24

Interpretation

Edgeworth corrected density φ + n−1/2(α2H3 + α0H1)φ Relative to single spike case: α2 unchanged, but ∆α0 = α0,k(h, γn) − α0(hk, γn) = 1 √ 2 hk + 1 (h2

k − γn)1/2

  • j=k

hj hk − hj ◮ ∆α0 > 0, e.g. smaller spikes hj < hk, push density to right, conversely for ∆α0 < 0 ◮ closer spikes ⇒ larger effect ◮ additive in ℓj, j = k

slide-25
SLIDE 25

Repulsion example 1 : ˆ zkn associated with ℓk = 2.7

(n,γn,l) = (400,1,(2.7))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal Edgeworth

(n,γn,l) = (400,1,(2.7,2.2))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal Edgeworth

(n,γn,l) = (400,1,(3.2,2.7))

z ^2n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal Edgeworth

(n,γn,l) = (400,1,(2.7,2.4))

z ^1n Density −3 −2 −1 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5

Normal Edgeworth

Figure: Density of ˆ zkn associated with ℓk = 2.7

slide-26
SLIDE 26

Repulsion example 2 : histograms of (ˆ ρk)k=1,··· ,r together

(n,γn,l) = (400,0.5,(2.7,2.2))

ρ ^ Density 2.8 3.0 3.2 3.4 3.6 3.8 4.0 4.2 0.0 1.0 2.0 3.0

Normal Edgeworth

(n,γn,l) = (400,0.5,(3.2,2.7,2.2))

ρ ^ Density 3.0 3.5 4.0 4.5 1 2 3

Normal Edgeworth

Blue, red, green vertical lines correspond to ρ1n, ρ2n, ρ3n, respectively.

slide-27
SLIDE 27

Explaining the Repulsion Correction

slide-28
SLIDE 28

Perturbation setup

Recall ℓ1 > · · · > ℓr > 1 + √γ > 1 = ℓr+1 = · · · = ℓp+1 Focus on ℓk: n−1X ′Xvk = ˆ ρkvk, ˆ ρk → ρkn = ρ(ℓk, γn) = ℓk + γ ℓk ℓk − 1 Permute columns: X = [

  • ℓkZ1, Z2Σ1/2

2

] Σ2 = diag(ℓ(k), 1, · · · , 1) Population eigenvalues of Σ2: H(k) =

  • 1 − r − 1

p

  • δ1 + 1

p

  • j=k

δℓj = δ1 + p−1H∆

slide-29
SLIDE 29

Standard first steps

n−1X ′Xvk = ˆ ρkvk X = [

  • ℓkZ1, Z2Σ1/2

2

] n−1Z2Σ2Σ′

2 = UΛU′

U ∈ O(n), Λ = diag(λ1 ≥ · · · λn) z = U′Z1 ∼ N(0, In) z ⊥ ⊥ Λ (Gaussian assumptions!) Schur complement, Woodbury formula, resolvent,.. R(x) = (Λ − xIn)−1 ⇒ Key equation: (ˆ ρk − ρkn)[1 + ℓkn−1z′ ˜ Rknz] = −ℓkρkn[n−1z′R(ρkn)z + ℓ−1

k ]

slide-30
SLIDE 30

The Forward Map H → Fγ,H

Silverstein equation: H probability measure on R, γ > 0, z(m) = − 1 m + γ

  • t

1 + tmdH(t), m ∈ C+ z(m) = z has unique solution m(z) for z ∈ C+, and m(z) =

  • 1

λ − z dF(λ) = mF(z) defines (Stieltjes transform of) a probability distribution F = Fγ,H. Population: Σp Hp = F Σp = 1

p

δσi Sample: Bn = n−1ZpΣpZ ′

p

F Bn = 1

n

δλi If Hp ⇒ H, p/n → γ F Bn ⇒ Fγ,H

(Marcenko-Pastur-Bai-Silverstein)

slide-31
SLIDE 31

Stochastic Decomposition

n−1z′f (Λ)z = n−1 f (λi)z2

i = n−1

f (λi) + n−1/2Sn(f ) Sn(f ) = n−1/2 f (λi)(z2

i − 1)

(Λ ⊥ ⊥ z) n−1 f (λi) =

  • f (λi) dFγn,Hn(λ)

+ n−1

  • f (λi) − n
  • f dFγn,Hn
  • = Fγn,Hn(f )

+ n−1Gn(f ) deterministic equiv. Bai-Silverstein CLT ⇒ n−1z′f (Λ)z = Fγn,Hn(f ) + n−1/2Sn(f ) + n−1Gn(f )

slide-32
SLIDE 32

Perturbing the centering

H = δ1 + p−1H∆ From Wang-Silverstein-Yao, 2014 Fγ,H(f ) = Fγ(f ) + n−1A(f ) + O(n−2) A(f ) = 1 2πi

  • C

f (z0(m)w(m)dm z0(m) = − 1 m + γ 1 + m w(m) =

  • t

1 + tmdH∆(t)

..

,

  • ,•
  • • • • •

i)f

.

  • .

I

  • I

..

slide-33
SLIDE 33

Evaluating An(gkn)

In WSY 14, set H ← H(k)n = δ1 + 1

p

  • j=k(δℓj − δ1)

f (z) ← gkn(z) = (ρkn − z)−1, w(m) =

j=k

  • ℓj

1+ℓjm − 1 1+m

  • An(gkn) =

1 2πi

  • C
  • j=k

tj(m)dm = hk (h2

k − γ)

  • j=k

hj hk − hj repulsion term

,

  • (,

~ - ....

.. eir

  • , ii!

·-<.

.I,

"I , ·.......

. .

  • . T· .•

,, ..

0(.

~-

~

..

  • .,.

A

  • (

"

slide-34
SLIDE 34

Back to n−1z′R(ρkn)z

−R(ρkn) = −(Λ − ρknIn)−1 = gkn(Λ) Decomposition: −n−1z′R(ρkn)z ≈ Fγn(gkn) + n−1/2Sn(gkn) + n−1Dn(gkn) Dn(gkn) = Gn(gkn) + An(gkn) + O(n−1) = ˜ α0,k(h, γn) + Zkn, since, from Bai-Silverstein CLT Gn(gkn) = µγn(gkn) + Zkn, µγn(gkn) = γnhk (h2

k − γn)2

bulk term

slide-35
SLIDE 35

Key linearization

ˆ zkn = n1/2(ˆ ρk − ρkn) σkn ≈ Sn(gkn) + n−1/2Dn(gkn) σknFγn(g2

kn) + h.o.t.

Delta method for Edgeworth expansion, + conditioning P{ˆ zkn ≤ x} = E

  • P{Sn(gkn) ≤ yn(x)|Λ}
  • + o(n−1/2)

Final steps: ◮ Edgeworth expansion (conditional on Λ) ◮ uncondition; identify terms

slide-36
SLIDE 36

Edgeworth (conditional on Λ )

Sn(gkn) = n−1 Xni Xni = cni(z2

i − 1)

cni = gkn(λi) From e.g. Petrov 1975, n.i.d. case: P

  • 1

¯ κ2n √n

  • Xni ≤ y
  • Λ
  • = Φ(y)−

¯ κjn ¯ κ3/2

2n

√n H2(y) 6 φ(y)+o(n−1/2) Cumulants: ¯ κjn = κjn−1 n

1 cj ni = κjFγn(gj kn) + O(n−1/2)

quadratic term: ¯ κjn ¯ κ3/2

2n

= √ 2 3 h3

k + γn

(h2

k − γn)3/2 + O(n−1/2) = α2(hk, γn) + O(n−1/2)

slide-37
SLIDE 37

Assembling pieces

P{ˆ zkn ≤ x} = E

  • Φ(yn) − α2(hk, γn)

√n H2(yn) 6 φ(yn) + o(n−1/2)

  • yn = yn(x) = x −

1 ¯ κ2n √nDn(gkn) repulsive shift = x − 1 ¯ κ2n √n[˜ α0,k(h, γn) + Zkn] EΦ(yn) ≈ Φ(x) − α0k(h, γn) √n φ(x) Final result: P{ˆ zkn ≤ x} = Φ(x)− 1 √n

  • α2(hk, γn)H2(y)

6 + α0k(h, γn)

  • φ(x)+o(n−1/2)
slide-38
SLIDE 38

Confidence intervals after selection

slide-39
SLIDE 39

Inference for supercritical spikes

Below bulk edge: Even for supercritical ℓk, P{ˆ ρk < b(γ)} can be significant!

2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 5 10 15

→ inference after selection of supercritical spikes Selection rule: select all ˆ ρk, k = 1, · · · , ˆ r such that ˆ ρk > θn := b(γn) + n−1/3√γn Consistent: P (ˆ r = r) = 1 − o(n−m), m ∈ N Minimal conditioning: Liu-Markovic-Tibshirani (2018) ˆ ρk | ˆ ρk > θn

slide-40
SLIDE 40

Pivots

Exact distribution of ˆ ρk: F kn(x, ℓ) = Pℓ(ˆ ρk > x) Exact pivot given ˆ ρk > θn: ukn(ˆ ρk, ℓ) := F kn(ˆ ρk, ℓ) F kn(θn, ℓ) ∼ U(0, 1) for all ℓ Approach:

  • 1. Approximate F kn by Gaussian, Edgeworth, ...
  • 2. Form approximate pivots

uA

kn(ˆ

ρk, ℓ) ≈ U(0, 1)

  • 3. Confidence intervals:

{ℓk > 1 + √γ : uA

kn(ˆ

ρk, ℓ) ∈ I}

slide-41
SLIDE 41

Pivots ctd.

{ℓk > 1 + √γ : uA

kn(ˆ

ρk, ℓ) ∈ I} I =

  • [0, 1 − α]

upper [α/2, 1 − α/2] two-sided... Usually ℓk → uA

kn(ˆ

ρk, ℓ) is monotone ր

.. ,-

  • '

t'~ --

'

  • C
  • ,....
  • '1))
  • V

,._

'-"'

slide-42
SLIDE 42

Pivots ctd.

{ℓk > 1 + √γ : uA

kn(ˆ

ρk, ℓ) ∈ I} I =

  • [0, 1 − α]

upper [α/2, 1 − α/2] two-sided... Usually ℓk → uA

kn(ˆ

ρk, ℓ) is monotone ր

.. ,-

  • '

t'~ --

'

  • C
  • ,....
  • '1))
  • V

,._

'-"'

Gaussian example: F kn(x, ℓ) ≈ Φ(zn(x, ℓk)), zn(x, ℓ) = n1/2 x − ρ(ℓ, γn) σ(ℓ, γn) → Selective Z pivot: uz

n(ˆ

ρk, ℓk) := Φ(zn(ˆ ρk, ℓk)) Φ(zn(θn, ℓk))

slide-43
SLIDE 43

Edgeworth pivots

Edgeworth approximation ΦE

kn(x, ℓ) = Φ(x) + n−1/2pk(x; ℓ, γn)φ(x)

→ Selective E pivot: [estimated ˆ ℓ : ˆ ℓj = ρ−1

n (ˆ

ρj)] uE

kn(ˆ

ρ, ℓk) := Φ

E kn(zn(ˆ

ρk, ℓk), ˆ ℓ) Φ

E kn(zn(θn, ℓk), ˆ

ℓ) Positive (E) pivot: uP

kn(ˆ

ρ, ℓk) :=

  • uE

kn(ˆ

ρ, ℓk) if Φ

E kn(zn(ˆ

ρk, ℓk), ˆ ℓ) > 0 uz

n(ˆ

ρ, ℓk)

  • therwise
slide-44
SLIDE 44

Coverage accuracy

Theorem: Uniformly in α ∈ [0, 1], for any 1 ≤ k ≤ r, P{u(ˆ ρ) ≤ α | ˆ ρk > θn} − α =

  • O(n−1/2)

for u(ˆ ρ) = uz

n(ˆ

ρk, ℓk),

  • (n−1/2)

for u(ˆ ρ) = uE

kn(ˆ

ρ, ℓk), uP

kn(ˆ

ρ, ℓk) ◮ Consequence of the Edgeworth expansion ◮ also holds for clipped pivots ((u(ˆ ρ) ∨ 0) ∧ 1)

slide-45
SLIDE 45

Numerical coverage – 2 spikes

0.00 0.04 0.08

K−S : hk=1.5 the other hj

None 2.5 2.0

  • 0.90

0.94 0.98

Upper : hk=1.5 the other hj coverage

None 2.5 2.0

  • 0.90

0.94 0.98

Lower : hk=1.5 the other hj coverage

None 2.5 2.0

  • 0.00

0.04 0.08

K−S : hk=2 the other hj

None 1.5 2.5

  • 0.90

0.94 0.98

Upper : hk=2 the other hj coverage

None 1.5 2.5

  • 0.90

0.94 0.98

Lower : hk=2 the other hj coverage

None 1.5 2.5

  • (n,γn) = (400,1)
  • selec_Z

selec_E positive

slide-46
SLIDE 46

Numerical coverage – 2 spikes

0.00 0.04 0.08

K−S : hk=2.5 the other hj

None 1.5 2.0

  • 0.90

0.94 0.98

Upper : hk=2.5 the other hj coverage

None 1.5 2.0

  • 0.90

0.94 0.98

Lower : hk=2.5 the other hj coverage

None 1.5 2.0

  • (n,γn) = (400,1)
  • selec_Z

selec_E positive

◮ Repulsion stronger for closer spikes → worse approximations ◮ selective E(o) has Φ

E < 0 with prob > 5% in tough cases:

h = (2.0, 1.5), (2.5, 2.0) ◮ Positive pivot(+) usually fixes this!

slide-47
SLIDE 47

Future work

◮ Other models, e.g. low rank denoising X =

r

  • k=1

ℓkuku′

k + Z

◮ corrections for joint distributions ◮ non-Gaussian data ◮ second order expansions: LSS obstacle

Reference: (single spike) Yang & J., Statistica Sinica 2018. (multispike) in preparation.

THANK YOU!