SLIDE 1

Selective Inference via the Condition on Selection Framework: Inference after Variable Selection

Jason D. Lee

Stanford University

Collaborators: Yuekai Sun, Dennis Sun, Qiang Liu, and Jonathan Taylor. Slides at http://web.stanford.edu/~jdl17/selective_inference_and_debiasing.pdf

SLIDE 2

Selective Inference

Selective Inference is about testing hypotheses suggested by the data.

Selective inference is common (Yoav Benjamini's talk). In many applications there is no hypothesis specified before data collection and exploratory analysis.

Inference after variable selection: confidence intervals and p-values are only reported for the selected variables.

Exploratory Data Analysis (Tukey) emphasized using data to suggest hypotheses, and post-hoc analysis to test these.

Screening in genomics: only select genes with a large t-statistic or correlation.

Peak/bump hunting in neuroscience: only study the process where $X_t > \tau$, or at critical points of the process.

SLIDE 3

Selective Inference

Conventional Wisdom (Data Dredging, Wikipedia)
A key point in proper statistical analysis is to test a hypothesis with data that was not used in constructing the hypothesis. (Data splitting)

This talk
The Condition on Selection framework allows you to specify and test hypotheses using the same dataset.

SLIDE 4

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 5

Table of Contents

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 6

Motivation: Linear regression in high dimensions

1 Select relevant variables $\hat M$ via a variable selection procedure ($k$ most correlated, lasso, forward stepwise, ...).
2 Fit a linear model using only the variables in $\hat M$: $\hat\beta_{\hat M} = X_{\hat M}^\dagger y$.
3 Construct 90% z-intervals $(\hat\beta_j - 1.65\sigma_j,\ \hat\beta_j + 1.65\sigma_j)$ for the selected variables $j \in \hat M$.

Are these confidence intervals correct?

SLIDE 7

Check by Simulation

Generate a design matrix $X \in \mathbb{R}^{n \times p}$ from a standard normal with $n = 20$ and $p = 200$. Let $y \sim N(X\beta^0, I)$, where $\beta^0$ is 2-sparse with $\beta^0_1 = \beta^0_2 = \mathrm{SNR}$.

Use marginal screening to select $k = 2$ variables, and then fit linear regression over the selected variables. Construct 90% confidence intervals for the selected regression coefficients and check the coverage proportion.
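Below is a minimal sketch of this simulation (our own code, not from the talk), assuming $\sigma = 1$ is known and that marginal screening keeps the $k$ variables with the largest $|X_j^T y|$; the trial count and seed are arbitrary.

```python
# Minimal sketch of the coverage simulation described above (assumptions:
# sigma = 1 known; screening keeps the k largest |X_j^T y|).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, p, k, snr, alpha = 20, 200, 2, 5.0, 0.10
z = stats.norm.ppf(1 - alpha / 2)                  # ~1.645, the "1.65" above
covered, total = 0, 0

for trial in range(1000):
    X = rng.standard_normal((n, p))
    beta0 = np.zeros(p)
    beta0[:2] = snr                                # 2-sparse signal
    y = X @ beta0 + rng.standard_normal(n)

    M = np.argsort(-np.abs(X.T @ y))[:k]           # marginal screening
    XM = X[:, M]
    beta_hat = np.linalg.lstsq(XM, y, rcond=None)[0]    # refit OLS
    se = np.sqrt(np.diag(np.linalg.inv(XM.T @ XM)))     # std errors, sigma = 1

    # Target: the submodel parameter beta_M = X_M^dagger mu, with mu = X beta0.
    target = np.linalg.lstsq(XM, X @ beta0, rcond=None)[0]
    covered += np.sum(np.abs(beta_hat - target) <= z * se)
    total += k

print("coverage proportion:", covered / total)     # well below the nominal 0.9
```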

SLIDE 8

Simulation

Figure: Coverage proportion of the adjusted intervals and the z-test across a range of SNR (x-axis: $\log_{10}$ SNR; y-axis: coverage proportion).

The coverage proportion of the z-intervals $(\hat\beta_j \pm 1.65\sigma_j)$ is far below the nominal level of $1 - \alpha = 0.9$, even at SNR = 5. The selective intervals (our method) always have coverage proportion $0.9$.

Warning!!!! Unadjusted confidence intervals are NOT selectively valid.

SLIDE 9

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 10

Valid Selective Inference

Notation
The selection function $\hat H$ selects the hypothesis of interest, $\hat H : \mathcal{Y} \to \mathcal{H}$.
Let $\phi(y; H)$ be a test of hypothesis $H$; reject if $\phi(y; H) = 1$.
$\phi(y; H)$ is a valid test of $H$ if $P_0(\phi(y; H) = 1) \le \alpha$.
$\{y : \hat H(y) = H\}$ is the selection event.
$F \in N(H)$ if $F$ is a null distribution with respect to $H$.

Definition
$\phi(y; \hat H)$ is a valid selective test if
$$P_F(\phi(y; \hat H(y)) = 1 \mid F \in N(\hat H)) \le \alpha.$$

SLIDE 11

Condition on Selection Framework

Conditioning for Selective Type 1 Error Control
We can design a valid selective test $\phi$ by ensuring $\phi$ is a valid test with respect to the distribution conditioned on the selection event, meaning $\forall F \in N(H_i)$, $P_F(\phi(y; H_i) = 1 \mid \hat H = H_i) \le \alpha$. Then
$$P_F(\phi(y; \hat H(y)) = 1 \mid F \in N(\hat H)) = \sum_{i : F \in N(H_i)} P_F(\phi(y; H_i) = 1 \mid \hat H = H_i)\, P_F(\hat H = H_i \mid F \in N(\hat H))$$
$$\le \alpha \sum_{i : F \in N(H_i)} P_F(\hat H = H_i \mid F \in N(\hat H)) \le \alpha.$$

SLIDE 12

Existing methods for Selective Inference

Reduction to Simultaneous Inference: Assume there is an a priori set of hypotheses $\mathcal{H}$ that could be tested. We can simultaneously control the type 1 error over all of $\mathcal{H}$, which implies selective type 1 error control for the selected $\hat H(y) \in \mathcal{H}$ (e.g., Scheffe's method and PoSI).

Data Splitting: Split the dataset $y = (y_1, y_2)$. Let $\hat H(y_1)$ be the selected hypothesis, and construct the test of $\hat H(y_1)$ using only $y_2$. Data splitting is "wasteful" in the sense that it does not use all the information in the first half of the data.

SLIDE 13

Setup

Model
Assume $y_i = \mu(x_i) + \epsilon_i$ with $\epsilon_i \sim N(0, \sigma^2)$, where $x_i \in \mathbb{R}^p$, $y \in \mathbb{R}^n$, and $\mu = (\mu(x_1), \ldots, \mu(x_n))^T$. The design matrix is $X = (x_1, \ldots, x_n)^T \in \mathbb{R}^{n \times p}$, with rows $x_i^T$.

SLIDE 14

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 15

Related Work

Lockhart et al. 2013 test whether all signal variables have been found. Our framework allows us to test the same thing with no assumptions on $X$, and is completely non-asymptotic and exact. Taylor et al. 2014 show the significance test result can be recovered from the selective inference framework, and Taylor et al. 2014 generalize to testing the global null for (almost) any regularizer.

PoSI (Berk et al. 2013) widens intervals to simultaneously cover all coefficients of all possible submodels.

Asymptotic normality by debiasing (Zhang and Zhang 2012, van de Geer et al. 2013, Javanmard and Montanari 2013, Chernozhukov et al. 2013).

Oracle property and non-convex regularizers (Loh 2014): under a beta-min condition, the solution to the non-convex problem has a Gaussian distribution.

Knockoffs for FDR control in linear regression (Foygel and Candes 2014) allow exact FDR control for $n \ge p$.

SLIDE 16

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 17

Lasso selection event

Lasso Selection Event
$$\hat\beta = \arg\min_{\beta}\ \tfrac{1}{2}\|y - X\beta\|^2 + \lambda\|\beta\|_1$$
From the KKT conditions, the set of variables $\hat M$ is selected with $\mathrm{sign}(\hat\beta_{\hat M}) = \hat s$ iff
$$\{y : \mathrm{sign}(\beta(\hat M, \hat s)) = \hat s,\ \|Z(\hat M, \hat s)\|_\infty < 1\} = \{y : Ay \le b\}.$$
This says that the inactive subgradients are strictly dual feasible, and the signs of the active subgradients agree with the signs of the lasso estimate, where
$$\beta(M, s) := (X_M^T X_M)^{-1}(X_M^T y - \lambda s)$$
$$Z(M, s) := X_{M^c}^T X_M (X_M^T X_M)^{-1} s + \tfrac{1}{\lambda} X_{M^c}^T \left(I - X_M (X_M^T X_M)^{-1} X_M^T\right) y.$$
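To make the affine representation concrete, here is a sketch (our own code, not the authors') that assembles $(A, b)$ from a lasso solution at a fixed $\lambda$, following the KKT characterization above; the function name and tolerance are ours.

```python
# Sketch: assemble the affine selection event {y : Ay <= b} for the lasso,
# following the KKT characterization above. Names and structure are ours.
import numpy as np

def lasso_selection_event(X, y, lam, beta_hat, tol=1e-8):
    """Return (A, b) encoding that the active set and signs of beta_hat are fixed."""
    n, p = X.shape
    M = np.abs(beta_hat) > tol                 # active set \hat M
    s = np.sign(beta_hat[M])                   # active signs \hat s
    XM, XMc = X[:, M], X[:, ~M]
    XMi = np.linalg.inv(XM.T @ XM)
    P_M = XM @ XMi @ XM.T                      # projection onto col(X_M)

    # Active constraint: sign((X_M^T X_M)^{-1}(X_M^T y - lam*s)) = s, i.e.
    # -diag(s) (X_M^T X_M)^{-1} X_M^T y <= -lam * diag(s) (X_M^T X_M)^{-1} s.
    A0 = -np.diag(s) @ XMi @ XM.T
    b0 = -lam * np.diag(s) @ (XMi @ s)

    # Inactive constraint: ||c + G y||_inf < 1 with
    # c = X_{M^c}^T X_M (X_M^T X_M)^{-1} s and G = (1/lam) X_{M^c}^T (I - P_M).
    c = XMc.T @ XM @ XMi @ s
    G = XMc.T @ (np.eye(n) - P_M) / lam
    A1 = np.vstack([G, -G])
    b1 = np.concatenate([1 - c, 1 + c])

    return np.vstack([A0, A1]), np.concatenate([b0, b1])
```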

SLIDE 18

Selection event

Selection events correspond to affine regions:
$$\{\hat M(y) = M\} = \{Ay \le b\}, \qquad y \mid \{\hat M(y) = M\} \overset{d}{=} N(\mu, \Sigma) \mid \{Ay \le b\}.$$

Figure: $(n, p) = (2, 3)$. White, red, and blue shaded regions correspond to different selection events. The shaded region that $y$ falls into is where the lasso selects variable 1 with positive sign. See http://naftaliharris.com/blog/lasso-polytope-geometry/ .

SLIDE 19

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 20

Constrained Gaussian

Constrained Gaussians
The distribution of $y \sim N(\mu, \sigma^2 I)$ conditional on $\{y : Ay \le b\}$ has density
$$\frac{1}{\Pr(Ay \le b)}\ \phi(y; \mu, \Sigma)\ \mathbf{1}(Ay \le b).$$
Ideally, we would like to sample from this density to approximate the sampling distribution of our statistic under the null. This is computationally expensive and sensitive to the values of nuisance parameters. For testing regression coefficients, we only need the distribution of $\eta^T y \mid \{Ay \le b\}$.

Computationally Tractable Inference
It turns out that
$$\eta^T y \mid \{Ay \le b,\ P_{\eta^\perp} y\} \overset{d}{=} \text{TruncatedNormal}.$$
Using this distributional result, we avoid sampling and integration.

SLIDE 21

Geometric Intuition

Figure: A picture demonstrating that the set $\{Ay \le b\}$ can be characterized by $\{\mathcal{V}^- \le \eta^T y \le \mathcal{V}^+\}$. Assuming $\Sigma = I$ and $\|\eta\|_2 = 1$, $\mathcal{V}^-$ and $\mathcal{V}^+$ are functions of $P_{\eta^\perp} y$ only, which is independent of $\eta^T y$.
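A sketch of the computation the picture illustrates: the truncation limits $\mathcal{V}^-$ and $\mathcal{V}^+$, and the truncated-Gaussian CDF $F$ used as the pivot on the next slides. This follows the standard polyhedral-lemma formulas; the helper names are ours.

```python
# Sketch: truncation limits V-, V+ for eta^T y on {Ay <= b}, plus the
# truncated-Gaussian CDF pivot F used on the following slides.
import numpy as np
from scipy.stats import norm

def truncation_limits(A, b, eta, y, Sigma=None):
    """V-, V+ such that {Ay <= b} = {V- <= eta^T y <= V+}, given P_{eta^perp} y."""
    Sigma = np.eye(len(y)) if Sigma is None else Sigma
    Seta = Sigma @ eta
    c = Seta / (eta @ Seta)            # direction along which eta^T y moves y
    z = y - c * (eta @ y)              # the part of y independent of eta^T y
    alpha = A @ c
    resid = b - A @ z
    ub = resid[alpha > 0] / alpha[alpha > 0]   # upper bounds on eta^T y
    lb = resid[alpha < 0] / alpha[alpha < 0]   # lower bounds on eta^T y
    V_plus = ub.min() if ub.size else np.inf
    V_minus = lb.max() if lb.size else -np.inf
    return V_minus, V_plus

def trunc_gauss_cdf(x, mean, sd, V_minus, V_plus):
    """F(x; mean, sd^2, V-, V+): CDF of N(mean, sd^2) truncated to [V-, V+]."""
    num = norm.cdf(x, mean, sd) - norm.cdf(V_minus, mean, sd)
    den = norm.cdf(V_plus, mean, sd) - norm.cdf(V_minus, mean, sd)
    return num / den
```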

SLIDE 22

Testing regression coefficients

Theorem
Let $H_0 : \eta(\hat M(y))^T \mu = \gamma$. The test that rejects when
$$F(\eta(\hat M(y))^T y;\ \gamma,\ \sigma^2\|\eta\|^2,\ \mathcal{V}^-, \mathcal{V}^+) \notin \left(\tfrac{\alpha}{2},\ 1 - \tfrac{\alpha}{2}\right)$$
is an $\alpha$-level selective test of $H_0$. The choice of $(\tfrac{\alpha}{2}, 1 - \tfrac{\alpha}{2})$ is arbitrary; we can optimize the endpoints to $(a, 1 - \alpha + a)$ so that the test is UMPU, at the cost of more computation.

Coefficients of selected variables are adaptive linear functions
Recall $\beta_{\hat M} = X_{\hat M}^\dagger \mu$ and $\hat\beta_{\hat M} = X_{\hat M}^\dagger y$. By choosing $\eta_j = X_{\hat M}^{\dagger T} e_j$, we have $\eta_j^T y = \hat\beta_{\hat M, j}$.

SLIDE 23

Confidence Intervals by Inverting

Confidence Intervals
The confidence interval $C_j$ is the set of all $\beta_j$ for which a test of $H_0 : \beta_{\hat M, j} = \beta_j$ fails to reject at level $\alpha$:
$$C_j = \left\{\beta_j : \tfrac{\alpha}{2} \le F(\hat\beta_{\hat M, j};\ \beta_j,\ \sigma^2\|\eta_j\|^2,\ \mathcal{V}^-, \mathcal{V}^+) \le 1 - \tfrac{\alpha}{2}\right\}.$$
The interval $[L_j, U_j]$ is found by univariate root-finding on a monotone function: solve
$$F(\hat\beta_{\hat M, j};\ L_j,\ \sigma^2\|\eta_j\|^2,\ \mathcal{V}^-, \mathcal{V}^+) = 1 - \tfrac{\alpha}{2}, \qquad F(\hat\beta_{\hat M, j};\ U_j,\ \sigma^2\|\eta_j\|^2,\ \mathcal{V}^-, \mathcal{V}^+) = \tfrac{\alpha}{2}.$$
(The pivot is decreasing in its mean parameter, so the lower endpoint pairs with $1 - \tfrac{\alpha}{2}$.) Similarly, the endpoints are arbitrary and can be chosen to make the interval UMAU.
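Continuing the sketch above (trunc_gauss_cdf is the hypothetical helper from the previous code block), the endpoints can be found by bisection. The bracket width is an arbitrary choice, and a production version would need to guard against numerical underflow far in the tails.

```python
# Sketch: selective confidence interval for eta^T mu by inverting the pivot.
# Uses trunc_gauss_cdf from the earlier sketch; F is decreasing in `mean`.
from scipy.optimize import brentq

def selective_interval(stat, sd, V_minus, V_plus, alpha=0.10, width=100.0):
    """Return [L, U] with F(stat; L) = 1 - alpha/2 and F(stat; U) = alpha/2."""
    lo, hi = stat - width * sd, stat + width * sd   # bracket for root-finding
    L = brentq(lambda m: trunc_gauss_cdf(stat, m, sd, V_minus, V_plus)
               - (1 - alpha / 2), lo, hi)
    U = brentq(lambda m: trunc_gauss_cdf(stat, m, sd, V_minus, V_plus)
               - alpha / 2, lo, hi)
    return L, U
```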

SLIDE 24

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 25

Addressing some concerns.

THE question we get
Why should I care about covering $\beta_{\hat M}$, i.e.
$$\Pr(\beta_{\hat M} \in C) = 1 - \alpha?$$
The statement that variable $j \in \hat M$ is significant means that variable $j$ is significant after adjusting for the other variables in $\hat M$.

What some people would like
Instead, we could ask for
$$\Pr(\beta^0_{\hat M} \in C) = 1 - \alpha.$$

SLIDE 26

Three possible population parameters

Population parameters
1 Sub-model parameter: $\beta_M = (X_M^T X_M)^{-1} X_M^T \mu = X_M^\dagger \mu$ (advocated by the PoSI group).
2 OLS parameter: in the $n \ge p$ regime, without the linear model assumption, $\beta^0 = (X^T X)^{-1} X^T \mu = X^\dagger \mu$ is the best linear approximation.
3 The "true" parameter for $p > n$: assuming a sparse linear model $\mu = X\beta^0$, the parameter of interest is $\beta^0$.

SLIDE 27

Selective Inference in Linear Regression

Selective Inference reduces to testing $\eta(\hat M(y))^T \mu$.

1 Sub-model parameter: $\beta_{\hat M, j} = e_j^T X_{\hat M}^\dagger \mu = \eta(\hat M(y))^T \mu$, where $\eta(\hat M(y))^T$ is a row of $X_{\hat M}^\dagger$.
2 OLS parameter: $e_j^T \beta^\star_{\hat M} = e_{j'}^T X^\dagger \mu = \eta(\hat M(y))^T \mu$.
3 True parameter: under the scaling $n \gg s^2 \log^2 p$ and restricted eigenvalue assumptions, there is a parameter $\beta^d$ that satisfies $\sqrt{n}\,\|\beta^d(\hat M) - \beta^0\|_\infty = o_P(1)$ and $\beta^d = B\mu + h$. Valid selective inference for $\beta^d$ implies asymptotically valid selective inference for $\beta^0$.

Testing regression coefficients reduces to testing an adaptive/selected linear function of $\mu$:
$$H_0 : \eta(\hat M(y))^T \mu = \gamma.$$

SLIDE 28

n ≥ p regime

Selective confidence interval for $\beta^0$
$\beta^0_{j, \hat M} = \eta(\hat M)^T \mu$, where $\eta$ comes from the least squares estimator, so the Condition on Selection framework allows you to construct $C_j$ such that
$$\Pr(\beta^0_{j, \hat M} \in C_j) = 1 - \alpha.$$
The constructed confidence interval covers $\beta^0$ like the standard z-interval; the only difference is that we make intervals for the selected coordinates $\beta^0_{\hat M}$.

SLIDE 29

High-dimensional case

Assume that $y = X\beta^0 + \epsilon$. What $\eta$ should we use? We can test any $\eta^T \mu = \gamma$, so how should we choose $\eta$?

Answer: the Debiased Estimator,
$$\hat\beta^d := \hat\beta + \tfrac{1}{n}\Theta X^T (y - X\hat\beta).$$

Observation 1: If $n \ge p$ and $\Theta = \hat\Sigma^{-1}$, then $\hat\beta^d = \hat\beta^{LS}$. This suggests that we should choose an $\eta$ corresponding "somehow" to the debiased estimator, because this worked in the low-dimensional regime.

Observation 2: The debiased estimator is affine in $y$ if the active set and the signs of the active set are considered fixed.
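A minimal sketch of the estimator just defined, assuming the approximate inverse $\Theta$ is supplied (e.g., by nodewise lasso regressions, which are not reproduced here); sklearn's Lasso is used purely for illustration.

```python
# Sketch: debiased lasso  beta_d = beta_hat + (1/n) Theta X^T (y - X beta_hat).
import numpy as np
from sklearn.linear_model import Lasso

def debiased_lasso(X, y, lam, Theta):
    """Theta is an approximate inverse of Sigma_hat = X^T X / n (assumed given)."""
    n, _ = X.shape
    # sklearn's Lasso minimizes (1/2n)||y - Xb||_2^2 + alpha*||b||_1.
    beta_hat = Lasso(alpha=lam, fit_intercept=False).fit(X, y).coef_
    # Observation 1 sanity check: with n >= p and Theta = inv(X^T X / n),
    # this expression reduces to the least squares estimator.
    return beta_hat + Theta @ X.T @ (y - X @ beta_hat) / n
```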

SLIDE 30

Recall that
$$\hat\beta_{\hat M} = (X_{\hat M}^T X_{\hat M})^{-1} X_{\hat M}^T y - \lambda\,(\tfrac{1}{n} X_{\hat M}^T X_{\hat M})^{-1} s_{\hat M}.$$
Plug this into $\hat\beta^d = \hat\beta + \tfrac{1}{n}\Theta X^T (y - X\hat\beta)$ to get
$$\hat\beta^d = \tfrac{1}{n}\Theta X^T y + (I - \Theta\hat\Sigma)\left[(X_{\hat M}^T X_{\hat M})^{-1} X_{\hat M}^T y - \lambda\,(\tfrac{1}{n} X_{\hat M}^T X_{\hat M})^{-1} s_{\hat M}\right].$$

Main Idea
Replace $y$ with $\mu$ to make a population version:
$$\beta^d(\hat M, \hat s) := \tfrac{1}{n}\Theta X^T \mu + (I - \Theta\hat\Sigma)\left[X_{\hat M}^\dagger \mu - \lambda\,(\tfrac{1}{n} X_{\hat M}^T X_{\hat M})^{-1} s_{\hat M}\right] = B\mu + h.$$
$\beta^d$ is an affine function of $\mu$.

SLIDE 31

Selective inference for $\beta^d$

The Condition on Selection framework allows you to make a selective confidence interval for $\beta^d_{\hat M}$.

Selective intervals for $\beta^d$
Choose $\eta^T = e_j^T B$. We would like to test $\beta^d_j = \gamma$, which is equivalent to $\eta^T \mu = \gamma - e_j^T h =: \tilde\gamma$. Thus, using the framework, we get
$$\Pr(\beta^d_{j, \hat M} \in C_j) = 1 - \alpha.$$

SLIDE 32

Why should you care about covering $\beta^d$???

Theorem
Under $X_i \sim N(0, \Sigma)$ and $n > s^2 \log^2 p$ (same assumptions as Javanmard & Montanari 2013, Zhang and Zhang 2012, and van de Geer et al. 2014),
$$\|\beta^d(\hat M, \hat s) - \beta^0\|_\infty \le C\, \frac{s \log p}{n}.$$

Theorem
Under the same conditions as above and for any $\delta > 0$,
$$\Pr\left(\beta^d_{j, \hat M} \in C_j \pm \tfrac{\delta}{\sqrt n}\right) \ge 1 - \alpha.$$
SLIDE 33

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 34

Selective Intervals for sparse β0 in p > n

Figure: Selective intervals vs. debiased z-intervals (x-axis: variable index; y-axis: coefficient; legend: true signal, Javanmard and Montanari, post-selection intervals). $(n, p, s) = (25, 50, 5)$, with only the first 20 coefficients plotted. Data is generated from $y = X\beta^0 + \epsilon$ with an SNR of 2. Selective intervals (blue) control selective type 1 error; the z-intervals (red) do not.

SLIDE 35

1 Selective Inference
2 Reviewing the Condition on Selection Framework
    Motivation: Inference after variable selection
    Formalizing Selective Inference
    Related Work
    Selection Events in Variable Selection
    Truncated Gaussian Pivotal Quantity
3 Beyond submodel parameters
4 Experiments
5 Extensions
6 Debiased lasso for communication-efficient regression

SLIDE 36

Follow-up work aka Roadmap to Jonathan Taylor's recent work

Testing $H_0 : (I - P_{\hat M})\mu = 0$ (Lee et al. 2013).

Non-affine regions: we only need to intersect a ray with the region to design exact conditional tests, which can be done by root-finding for "nice" sets (Lee et al. 2013, Loftus and Taylor 2014).

Marginal screening followed by lasso, forward stepwise regression, isotonic regression, elastic net, AIC/BIC criterion with subset selection, $\lambda$ chosen via hold-out set, square-root lasso, unknown $\sigma^2$, non-Gaussian noise, and PCA (Lee & Taylor 2014, Tian and Taylor 2015, Reid et al. 2014, Choi et al. 2014, Loftus, Tian, and Taylor 2015+, Taylor et al. 2014, Loftus & Taylor 2014).

Use the first half of the data to select the model, then do inference using the entire dataset by putting constraints only on the first half. This variant of Condition on Selection selects the same model as data splitting, but is more powerful under a screening assumption (Fithian, Sun, Taylor 2014).

SLIDE 37

Improving Power

Intuition: Condition on less.

Fithian, Sun, Taylor 2014
If $P^\perp_{\hat M}\mu = 0$ (screening), then we can condition on only $P_{\hat M \setminus j}\, y$ instead of $P^\perp_\eta y$. This results in exactly the same test, since $\eta^T y$ is conditionally independent of $P^\perp_{\hat M}\, y$. If you run the selection procedure (lasso) on only half the data ($A_1 y_1 \le b_1$) and use all of the data for inference, then the sampling test benefits from conditioning on less. This test statistic can be more powerful, but requires MCMC. If screening is violated, type 1 error is not controlled, so this modification should only be used when the user is confident in the screening property.

Union over signs (Lee et al. 2013)
For lasso and screening, we conditioned on the signs and the selected variables. We can union over all $2^{|M|}$ signs to condition on a larger set. $\eta^T y \mid \{P_{\eta^\perp} y,\ \hat M = M\}$ is a truncated Gaussian on a union of intervals. Union over signs makes a huge difference for the lasso.

SLIDE 38

Figure: When we take the union over signs, the conditional distribution of $\eta^T y$ is truncated to a union of disjoint intervals. In this case, the Gaussian is truncated to the set $(-\infty, \mathcal{V}^+_{\{-,-\}}] \cup [\mathcal{V}^-_{\{+,+\}}, \infty)$.

SLIDE 39

Figure: Two panels ($\lambda = 22$ left, $\lambda = 15$ right; x-axis: variable index; y-axis: coefficient; legend: true signal, minimal intervals, simple intervals). Light blue intervals use the coarsest selection event (the union of regions); dark blue intervals use the selection event that is a single region. The simulated data featured $n = 25$, $p = 50$, and 5 true non-zero coefficients; only the first 20 coefficients are shown. The simple intervals are as good as the minimal intervals in the left plot; the advantage of the minimal intervals is realized when the estimate is unstable and the simple intervals are very long, as in the right plot.

SLIDE 40

Beyond Selective Inference: Combinatorial Detection

Motivating example: the Submatrix Detection/Localization problem (Ma and Wu 2014, Balakrishnan and Kolar 2012) with scan statistic
$$y^\star = \max_{C \in S} \sum_{i \in C} y_i.$$
Exact tests can be designed for the intractable global-maximizer statistic and for the tractable sum-test. The tests have type 1 error exactly $\alpha$ and detection thresholds that match the minimax analysis.

Heuristic greedy algorithm: Shabalin and Nobel 2013 propose a greedy algorithm to approximate the global maximizer. By conditioning on the "path" of the greedy algorithm, we obtain an exact test for the output of the greedy algorithm!
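For concreteness, a brute-force sketch of the scan statistic above, assuming $S$ indexes all $k_1 \times k_2$ submatrices of a data matrix $Y$; the exponential enumeration is exactly why the global maximizer is described as intractable.

```python
# Sketch: brute-force scan statistic y* = max_{C in S} sum_{i in C} y_i,
# with S = all k1 x k2 submatrices (illustrates why this is intractable).
import itertools
import numpy as np

def scan_statistic(Y, k1, k2):
    n1, n2 = Y.shape
    best = -np.inf
    for rows in itertools.combinations(range(n1), k1):
        for cols in itertools.combinations(range(n2), k2):
            best = max(best, Y[np.ix_(rows, cols)].sum())
    return best

# Usage: scan_statistic(np.random.default_rng(0).standard_normal((8, 8)), 2, 2)
```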

SLIDE 41

Future Work

Non-convex regularizers (SCAD, MCP): the selection event depends on the optimization algorithm and the optimality conditions.

Given a single dataset and a class of queries/tests, can we control the validity of an adaptive sequence of queries/tests? Implication: this would allow different research groups to share a dataset and formulate hypotheses after observing the outcome of a previous group's study.
SLIDE 42

The distributed setting

Given data $\{(x_i, y_i)\}_{i \in [N]}$ split (evenly) among $m$ machines:
$$X = \begin{pmatrix} X_1 \\ \vdots \\ X_m \end{pmatrix},\ X_k \in \mathbb{R}^{n \times p}; \qquad y = \begin{pmatrix} y_1 \\ \vdots \\ y_m \end{pmatrix},\ y_k \in \mathbb{R}^n.$$
Notation: $m$ machines; the subscript $k \in [m]$ indexes machines and denotes local quantities; $N$ total samples; $n = N/m$ samples per machine.

SLIDE 43

The costs of computing in distributed settings

floating point operations (FLOPs)
bandwidth costs: ∝ total bits transferred
latency costs: ∝ rounds of communication
$$\mathrm{FLOPs}^{-1} \ll \mathrm{bandwidth}^{-1} \ll \mathrm{latency}$$

SLIDE 44

The lasso in distributed settings

The lasso:
$$\hat\beta := \arg\min_{\beta \in \mathbb{R}^p}\ \tfrac{1}{2N}\|y - X\beta\|_2^2 + \lambda\|\beta\|_1$$
iterative: optimization generally requires iteration
communication intensive: evaluating the gradient (of the loss) at each iteration requires $O(mp)$ communication

1 Each node forms its local gradient: $\tfrac{1}{n} X_k^T(y_k - X_k\beta)$.
2 The master node averages the local gradients (a sketch of this exchange follows below):
$$\tfrac{1}{N} X^T(y - X\beta) = \tfrac{1}{m}\sum_{k=1}^m \tfrac{1}{n} X_k^T(y_k - X_k\beta).$$

Q: Distributed sparse regression with 1 round of communication?
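A sketch of the gradient exchange in steps 1 and 2 above (helper names are ours):

```python
# Sketch: one gradient-averaging round for the iterative approach above.
import numpy as np

def averaged_gradient(Xs, ys, beta):
    """Average the m local gradients (1/n) X_k^T (y_k - X_k beta); the result
    equals the full-data gradient (1/N) X^T (y - X beta)."""
    grads = [Xk.T @ (yk - Xk @ beta) / len(yk) for Xk, yk in zip(Xs, ys)]
    return np.mean(grads, axis=0)
```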

SLIDE 45

Averaging debiased lassos

1 Each node computes a local "debiased" lasso estimator:
$$\hat\beta_k \leftarrow \arg\min_{\beta \in \mathbb{R}^p}\ \tfrac{1}{2n}\|y_k - X_k\beta\|_2^2 + \lambda_k\|\beta\|_1$$
$$\hat\beta^d_k \leftarrow \hat\beta_k + \tfrac{1}{n}\hat\Theta_k X_k^T(y_k - X_k\hat\beta_k), \qquad \hat\Theta_k\left(\tfrac{1}{n} X_k^T X_k\right) \approx I_p.$$
2 The master node averages the debiased local lasso estimators:
$$\hat\beta^d \leftarrow \tfrac{1}{m}\sum_{k=1}^m \hat\beta^d_k.$$
3 Hard-threshold the averaged estimator to obtain a sparse estimator $\hat\beta^{d,ht}$.
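A one-shot sketch of the three steps above, reusing the hypothetical debiased_lasso helper from the earlier high-dimensional slide; the hard threshold is a tuning parameter.

```python
# Sketch: one round of communication. Each machine sends its debiased lasso;
# the master averages and hard-thresholds. Reuses debiased_lasso from above.
import numpy as np

def distributed_debiased_lasso(Xs, ys, lams, Thetas, threshold):
    # Step 1 (local): debiased lasso on each machine's (X_k, y_k).
    betas_d = [debiased_lasso(Xk, yk, lam, Tk)
               for Xk, yk, lam, Tk in zip(Xs, ys, lams, Thetas)]
    # Step 2 (master): average the m local debiased estimators.
    beta_d = np.mean(betas_d, axis=0)
    # Step 3 (master): hard-threshold to recover sparsity.
    return np.where(np.abs(beta_d) > threshold, beta_d, 0.0)
```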

SLIDE 46

Thresholding averaged debiased lassos

Theorem (Lee, Liu, Sun, Taylor 2015+)
When $\|\hat\beta_k - \beta^0\|_1 \sim s\sqrt{\tfrac{\log p}{n}}$ and $|||\hat\Theta_k\hat\Sigma_k - I_p|||_\infty \le \sqrt{\tfrac{\log p}{n}}$ for $k \in [m]$:

For $m \lesssim \tfrac{N}{s^2 \log p}$: $\|\hat\beta^{d,ht} - \beta^0\|_\infty \sim \sqrt{\tfrac{\log p}{nm}}$,

For $m \lesssim \tfrac{N}{s \log p}$: $\|\hat\beta^{d,ht} - \beta^0\|_2 \sim \sqrt{\tfrac{s \log p}{nm}}$,

For $m \lesssim \tfrac{N}{s \log p}$: $\|\hat\beta^{d,ht} - \beta^0\|_1 \sim s\sqrt{\tfrac{\log p}{nm}}$.

SLIDE 47

Communication-efficiency (Tengyu Ma)
This algorithm is communication-optimal: any algorithm that achieves $\ell_2$ estimation error of $\sqrt{\tfrac{s \log p}{nm}}$ needs $O(pm)$ communication.

Computation
The bottleneck is the computation of $\Theta$, which requires $p$ lassos. Can we get something similar to the debiased estimator at lower computational cost? Computing $\Theta$ is very wasteful, since we only need $\tfrac{1}{n}\Theta X^T(y - X\hat\beta)$ to form the debiased estimator.

SLIDE 48

Averaging debiased lassos

Figure: $(N, p, s) = (10000, 4000, 20)$. The red curve is the lasso on all the data; the blue curve is our proposed estimator. The blue curve achieves the same estimation error as the red curve until $m \ge 5$.

SLIDE 49

Acknowledgments

Selective Inference:
1 Jason D. Lee, Dennis L. Sun, Yuekai Sun, and Jonathan Taylor, Exact post-selection inference with the lasso. (Version 4 on arXiv is the most complete.)
2 Jason D. Lee and Jonathan Taylor, Exact statistical inference after marginal screening.
3 Jason D. Lee, Yuekai Sun, and Jonathan Taylor, Evaluating the statistical significance of submatrices.

Communication-efficient regression:
1 Jason D. Lee, Yuekai Sun, Qiang Liu, and Jonathan Taylor, Communication-efficient sparse regression, forthcoming.

Papers available at http://stanford.edu/~jdl17/

Thanks for Listening!
