SLIDE 1

Screening Rules for Lasso with Non-Convex Sparse Regularizers

  • A. Rakotomamonjy

Joint work with G. Gasso and J. Salmon ICML 2019

This work benefited from the support of the project OATMIL ANR-17-CE23-0012 of the French National Research Agency (ANR), the Normandie Projet GRR-DAISI, and European funding FEDER DAISI.

SLIDE 2

Objective of the paper

Lasso and screening

Learning sparsity-inducing linear models from high-dimensional data X ∈ R^{n×d}, y ∈ R^n:

    min_{w ∈ R^d}  (1/2) ‖y − Xw‖₂² + Σ_{j=1}^d λ |w_j|

Screening rule: identify vanishing variables of w⋆. Example, with (ŵ, ŝ) an intermediate primal-dual pair:

    |x_j^⊤ ŝ| + r(ŵ, ŝ) ‖x_j‖ < 1  ⟹  w⋆_j = 0,

obtained by exploiting sparsity, convexity and duality.

Extension to non-convex regularizers

Non-convex regularizers lead to statistically better models, but how can screening be done when the regularizer is non-convex?
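The convex screening rule above can be sketched numerically. A minimal Gap Safe-style implementation, assuming the safe-ball radius takes the form r(ŵ, ŝ) = √(2·gap)/λ (the slide leaves the radius unspecified, so treat this as an illustrative choice):

```python
import numpy as np

def gap_safe_screen(X, y, w_hat, lam):
    """Sketch of a Gap Safe screening test for the Lasso.

    Returns a boolean mask: True means the rule certifies w*_j = 0.
    Assumes the safe-ball radius sqrt(2 * gap) / lam; the exact radius
    used in the paper may differ.
    """
    residual = y - X @ w_hat
    # Dual-feasible point s_hat: rescale the residual so ||X^T s_hat||_inf <= 1.
    s_hat = residual / max(lam, np.max(np.abs(X.T @ residual)))
    primal = 0.5 * residual @ residual + lam * np.abs(w_hat).sum()
    dual = 0.5 * y @ y - 0.5 * lam**2 * np.sum((s_hat - y / lam) ** 2)
    r = np.sqrt(2 * max(primal - dual, 0.0)) / lam
    col_norms = np.linalg.norm(X, axis=0)
    # Safe rule: |x_j^T s_hat| + r * ||x_j|| < 1  =>  w*_j = 0.
    return np.abs(X.T @ s_hat) + r * col_norms < 1
```

For λ ≥ ‖X^⊤ y‖_∞ with ŵ = 0 the duality gap vanishes, the radius is zero, and every coordinate is screened, consistent with w⋆ = 0 in that regime.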

[Figure: the ℓ1, log-sum, and MCP penalties plotted on [−2, 2].]

SLIDE 3

Non-convex Lasso

The problem

    min_{w ∈ R^d}  (1/2) ‖y − Xw‖₂² + Σ_{j=1}^d r_λ(|w_j|)

with the regularizer r_λ(·) smooth and concave on [0, ∞).

The proposed screening strategy

Solve by majorization-minimization:

    w^{k+1} = argmin_{w ∈ R^d}  (1/2) ‖y − Xw‖₂² + (1/(2α)) ‖w − w^k‖₂² + Σ_{j=1}^d λ_j |w_j|,   with λ_j = r′_λ(|w^k_j|)

Screen at two levels:

  • within each weighted Lasso;
  • propagate screened-variable information between two successive weighted Lassos.
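One concrete instance of the MM weight update, using the log-sum penalty r_λ(t) = λ log(1 + t/θ) as the concave regularizer (θ is an illustrative hyperparameter, not fixed by the slide):

```python
import numpy as np

def logsum_weights(w_k, lam, theta=1.0):
    """Weights lam_j = r'_lam(|w_j^k|) for the next weighted Lasso,
    with the log-sum penalty r_lam(t) = lam * log(1 + t / theta)."""
    # Derivative of lam * log(1 + t / theta) at t = |w_j^k|.
    return lam / (theta + np.abs(w_k))
```

Large |w^k_j| yields a small weight λ_j, so strong coefficients are shrunk less at the next MM iteration, which is the mechanism behind the statistical gain over a plain ℓ1 penalty.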


SLIDE 4

Screening weighted Lasso

Optimization problem and screening condition

    min_{w ∈ R^d}  (1/2) ‖y − Xw‖₂² + (1/(2α)) ‖w − w^k‖₂² + Σ_{j=1}^d λ_j |w_j|

    |x_j^⊤ s⋆ − v⋆_j| − λ_j < 0  ⟹  w⋆_j = 0

with s and v the dual variables, s⋆ = y − Xw⋆ and w⋆ − w^k = αv⋆.

Our screening test

    T_j^{(λ_j)}(ŵ, ŝ, v̂) := |x_j^⊤ ŝ − v̂_j| + √(2 G_Λ) (‖x_j‖ + 1/√α)  <  λ_j

given a primal-dual intermediate solution (ŵ, ŝ, v̂), with duality gap G_Λ.
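This test translates almost directly into code. A sketch, with the √(2 G_Λ) factor and the 1/√α scaling read off the slide (the paper's exact constants may differ); it returns the values T_j as well as the mask, since T_j is exactly what the propagation step on the next slide reuses:

```python
import numpy as np

def weighted_lasso_screen(X, s_hat, v_hat, lam_vec, gap, alpha):
    """Screening test for one weighted-Lasso subproblem (sketch).

    T_j = |x_j^T s_hat - v_hat_j| + sqrt(2 * gap) * (||x_j|| + 1 / sqrt(alpha));
    coordinate j is screened when T_j < lam_j.
    """
    col_norms = np.linalg.norm(X, axis=0)
    T = np.abs(X.T @ s_hat - v_hat) + np.sqrt(2 * gap) * (col_norms + 1 / np.sqrt(alpha))
    return T, T < lam_vec
```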


SLIDE 5

Screened variables propagation

Setting

After iteration k, we have a weighted Lasso with weights {λ_j} and approximate solutions ŵ, ŝ and v̂. Screened variables are those with

    T_j^{(λ_j)}(ŵ, ŝ, v̂) < λ_j

Before iteration k + 1:

  • change of weights {λ^ν_j}_{j=1,…,d};
  • new primal-dual triplet (ŵ^ν, ŝ^ν, v̂^ν).

Screening propagation test

    T_j^{(λ_j)}(ŵ, ŝ, v̂) + ‖x_j‖ (a + √(2b)) + c + √(2b)/√α  <  λ^ν_j

with ‖ŝ^ν − ŝ‖₂ ≤ a, |G_Λ(ŵ, ŝ, v̂) − G_{Λ^ν}(ŵ^ν, ŝ^ν, v̂^ν)| ≤ b and |v̂^ν_j − v̂_j| ≤ c.
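The propagation inequality also translates directly; a, b and c are the bounds defined above, and the √(2b)/√α term follows the same radius convention as the per-subproblem test (a sketch, not the paper's exact constants):

```python
import numpy as np

def propagate_screening(T_vals, col_norms, lam_new, a, b, c, alpha):
    """Propagate screening to the next weighted Lasso without a fresh test.

    T_vals are the previous test values T_j^{(lam_j)}(w_hat, s_hat, v_hat);
    a >= ||s_hat_new - s_hat||_2, b bounds the change in duality gap,
    c >= |v_hat_new_j - v_hat_j| per coordinate.
    """
    lhs = T_vals + col_norms * (a + np.sqrt(2 * b)) + c + np.sqrt(2 * b) / np.sqrt(alpha)
    return lhs < lam_new
```

With a = b = c = 0 the left-hand side reduces to T_j, i.e. the test collapses to the previous slide's condition against the new weights, as expected.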


SLIDE 6

Summary

  • First approach for screening with non-convex regularizers: convexification and propagation.
  • At poster #190, Pacific Ballroom: more technical details and experimental results on the computational gain and on the propagation strategy.

[Figure: running time as a percentage of ncxCD's, over a regularization path (n = 50, d = 100, p = 5, σ = 2.00), for tolerances 10⁻³, 10⁻⁴, 10⁻⁵; methods: ncxCD, GIST, MM genuine, MM screening.]

[Figure: ratio of screened variables over iterations, Pre-PWL vs. Post-PWL.]
