SLIDE 1 Variable Selection is Hard
Dean P. Foster¹, Howard Karloff, and Justin Thaler²
¹Amazon NYC   ²Yahoo Labs New York
July 2015
SLIDE 2
Problem Formulation: (g, h)-Sparse Regression
Given: An m × p Boolean matrix B and a positive integer k such that there is a real p-dimensional vector x∗ with ‖x∗‖₀ ≤ k satisfying Bx∗ = 1 (the all-ones vector). Goal: Output a p-dimensional vector x with ‖x‖₀ ≤ k · g(p) such that ‖Bx − 1‖₂² ≤ h(m, p). This problem and its noisy variants are central to model design in statistics: sparse solutions are simple, and they generalize well.
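For concreteness, here is a minimal checker (our own illustration in Python/NumPy; the function name is ours, not from the talk) for what counts as an acceptable output:

    import numpy as np

    def is_valid_solution(B, x, k, g, h):
        """Check whether x is an acceptable answer for (g, h)-Sparse Regression.

        B : (m, p) Boolean matrix, x : candidate p-vector,
        k : promised sparsity of the planted solution x*,
        g : sparsity-inflation function g(p), h : error budget h(m, p).
        """
        m, p = B.shape
        sparsity_ok = np.count_nonzero(x) <= k * g(p)   # ||x||_0 <= k * g(p)
        residual = B @ x - np.ones(m)                   # Bx - 1
        error_ok = residual @ residual <= h(m, p)       # ||Bx - 1||_2^2 <= h(m, p)
        return sparsity_ok and error_ok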
SLIDE 3
An Inefficient Algorithm for (1, 0)-Sparse Regression
For every k-sparse vector x, check whether Bx = 1. Runs in time n^O(k). The algorithm does not “cheat” on either the sparsity or the accuracy of the solution.
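A sketch of this brute-force enumeration (our illustration, not code from the talk), solving a least-squares problem on each candidate support to handle real-valued coefficients:

    import itertools
    import numpy as np

    def exact_sparse_regression(B, k, tol=1e-9):
        """Brute force for (1, 0)-Sparse Regression: try every size-k support
        and solve the restricted least-squares problem exactly.
        Runs in time roughly (p choose k) * poly(m, k), i.e. n^O(k)."""
        m, p = B.shape
        ones = np.ones(m)
        for support in itertools.combinations(range(p), k):
            cols = B[:, list(support)].astype(float)
            # Best real coefficients on this support (least squares).
            coeffs, *_ = np.linalg.lstsq(cols, ones, rcond=None)
            if np.linalg.norm(cols @ coeffs - ones) ** 2 <= tol:
                x = np.zeros(p)
                x[list(support)] = coeffs
                return x          # exact k-sparse solution found
        return None               # no k-sparse x with Bx = 1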
SLIDE 4
An Inefficient Algorithm for (1, 0)-Sparse Regression
For every k-sparse vector x, check whether Bx = 1. Runs in time n^O(k). The algorithm does not “cheat” on either the sparsity or the accuracy of the solution. There are many efficient algorithms (e.g., LASSO) that “cheat” only on the accuracy. There are other efficient algorithms that cheat only on the sparsity. But all known algorithms may cheat a whole lot if B is ill-conditioned.
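As a hedged illustration of the “cheat on accuracy” route, here is how one might invoke LASSO via scikit-learn on such an instance (the regularization weight alpha is an arbitrary choice of ours):

    import numpy as np
    from sklearn.linear_model import Lasso

    def lasso_relaxation(B, alpha=0.01):
        """One of the efficient relaxations mentioned on the slide: LASSO
        replaces the ||x||_0 constraint with an l1 penalty, so it runs in
        polynomial time but may "cheat" on accuracy, and the trade-off
        degrades badly when B is ill-conditioned."""
        m, _ = B.shape
        model = Lasso(alpha=alpha, fit_intercept=False, max_iter=10_000)
        model.fit(B.astype(float), np.ones(m))  # regress all-ones target on B
        return model.coef_                      # dense in general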
SLIDE 5
An Inefficient Algorithm for (1, 0)-Sparse Regression
For every k-sparse vector x, check whether Bx = 1. Runs in time n^O(k). The algorithm does not “cheat” on either the sparsity or the accuracy of the solution. There are many efficient algorithms (e.g., LASSO) that “cheat” only on the accuracy. There are other efficient algorithms that cheat only on the sparsity. But all known algorithms may cheat a whole lot if B is ill-conditioned. Main Result of this work: Under a standard complexity assumption, there is no efficient algorithm that works for general matrices, not even one that is allowed to cheat (a lot) on both the sparsity and the accuracy.
SLIDE 6
Precise Statement of Hardness Result
Informal Statement: There is no efficient algorithm for (g, h)-Sparse Regression, even if g grows “nearly polynomially quickly” with p, and even if h grows polynomially quickly in p and nearly linearly in m.
SLIDE 7 Precise Statement of Hardness Result
Informal Statement: There is no efficient algorithm for (g, h)-Sparse Regression, even if g grows “nearly polynomially quickly” with p, and even if h grows polynomially quickly in p and nearly linearly in m. Formal Statement: Assume NP ⊄ BPTIME(n^{polylog(n)}). Then for any positive constants δ, C₁, C₂, there exist a g(p) ∈ 2^{Ω(log^{1−δ}(p))} and an h(m, p) ∈ Ω(p^{C₁} · m^{1−C₂}) such that there is no quasipolynomial-time randomized algorithm for (g, h)-SPARSE REGRESSION.
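To get a feel for the growth rate of g, a throwaway computation (our numbers, with δ = 0.5 chosen arbitrarily; not from the talk):

    import math

    # g(p) = 2^(log2(p)^(1-delta)): asymptotically larger than every
    # polylog(p) yet smaller than p^eps for every fixed eps > 0
    # (the crossover points can be astronomically large).
    def g(p, delta=0.5):
        return 2 ** (math.log2(p) ** (1 - delta))

    for p in (10**3, 10**6, 10**9, 10**12):
        print(f"p = {p:>15,}   g(p) = {g(p):6.1f}")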
SLIDE 8 Precise Statement of Hardness Result
Informal Statement: There is no efficient algorithm for (g, h)-Sparse Regression, even if g grows “nearly polynomially quickly” with p, and even if h grows polynomially quickly in p and nearly linearly in m. Formal Statement: Assume NP ⊄ BPTIME(n^{polylog(n)}). Then for any positive constants δ, C₁, C₂, there exist a g(p) ∈ 2^{Ω(log^{1−δ}(p))} and an h(m, p) ∈ Ω(p^{C₁} · m^{1−C₂}) such that there is no quasipolynomial-time randomized algorithm for (g, h)-SPARSE REGRESSION. Assuming a reasonable conjecture about PCPs, the problem is hard even for some g(p) ∈ p^{Ω(1)}.
SLIDE 9
Prior Hardness Results
Natarajan [1995] and Davis et al. [1997] showed roughly that (1, 0)-Sparse Regression is NP-Hard.
“Hardness if algorithm cannot cheat on sparsity or accuracy.”
SLIDE 10
Prior Hardness Results
Natarajan [1995] and Davis et al. [1997] showed roughly that (1, 0)-Sparse Regression is NP-Hard.
“Hardness if algorithm cannot cheat on sparsity or accuracy.”
Arora et al. [1997] and Amaldi and Kann [1998] showed that there is no polynomial time algorithm for (2^{log^{1−δ}(p)}, 1)-Sparse Regression, assuming that NP ⊄ DTIME(n^{polylog(n)}).
“Hardness if algorithm cannot cheat on accuracy.”
SLIDE 11
Prior Hardness Results
Natarajan [1995] and Davis et al. [1997] showed roughly that (1, 0)-Sparse Regression is NP-Hard.
“Hardness if algorithm cannot cheat on sparsity or accuracy.”
Arora et al. [1997] and Amaldi and Kann [1998] showed that there is no polynomial time algorithm for (2^{log^{1−δ}(p)}, 1)-Sparse Regression, assuming that NP ⊄ DTIME(n^{polylog(n)}).
“Hardness if algorithm cannot cheat on accuracy.”
Zhang et al. [2014] showed, roughly, that LASSO’s accuracy guarantees in the noisy setting are optimal among all polynomial time algorithms that do not cheat on the sparsity, assuming NP ⊄ P/poly.
“Hardness if algorithm cannot cheat on sparsity.”
SLIDE 12
Proof Sketch of Toy Result
Claim: Any polynomial-time algorithm for (g(p), 1)-SPARSE REGRESSION implies an n^{O(log log n)}-time algorithm for SAT, where g(p) = (1 − δ) ln p.
SLIDE 13 Proof Sketch of Toy Result
Claim: Any polynomial-time algorithm for (g(p), 1)-SPARSE REGRESSION implies an n^{O(log log n)}-time algorithm for SAT, where g(p) = (1 − δ) ln p. Proof: Feige gives a reduction from SAT, running in time n^{O(log log n)} on SAT instances of size n, to SET COVER, in which the resulting incidence matrix B (whose rows are elements and columns are sets) has the following properties. There is a (known) k such that:
- If a formula φ ∈ SAT, then there is a collection of k disjoint sets that covers the universe, i.e., Bx = 1 for some k-sparse x.
- If φ ∉ SAT, then no collection of at most k · [(1 − δ) ln p] sets covers the universe, i.e., Bx has at least one entry equal to 0 for any x with ‖x‖₀ ≤ k · [(1 − δ) ln p]. Hence, ‖Bx − 1‖₂² ≥ 1.
Any algorithm for (g(p), 1)-SPARSE REGRESSION can distinguish these two cases, as the sketch below illustrates.
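A schematic of that distinguishing step, assuming a hypothetical black-box solver for (g, 1)-SPARSE REGRESSION (the oracle interface and the set-cover instance format are ours, for illustration only):

    import numpy as np

    def decide_sat_via_regression(sets, universe_size, k, solver):
        """Schematic final step of the reduction sketched above.

        sets   : list of subsets of {0, ..., universe_size-1} produced by
                 Feige's SAT -> SET COVER reduction
        k      : the known cover size in the YES case
        solver : hypothetical oracle for (g, 1)-Sparse Regression with
                 g(p) = (1 - delta) * ln(p)
        """
        m, p = universe_size, len(sets)
        B = np.zeros((m, p))                 # element/set incidence matrix
        for j, s in enumerate(sets):
            for e in s:
                B[e, j] = 1.0
        x = solver(B, k)                     # oracle's candidate solution
        residual_sq = np.sum((B @ x - 1.0) ** 2)
        # YES case: k disjoint sets cover the universe, so Bx = 1 is exactly
        #   achievable within the sparsity budget and the oracle's residual
        #   stays below 1.
        # NO case: any x with ||x||_0 <= k(1 - delta)ln(p) leaves some element
        #   uncovered, so (Bx)_i = 0 for some i and ||Bx - 1||_2^2 >= 1.
        return residual_sq < 1.0             # True  =>  φ is satisfiable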