dimensionality reduction for tukey regression
play

Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 - PowerPoint PPT Presentation

Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 Ruosong Wang 2 David P. Woodruff 2 1 IBM Research - Almaden 2 Carnegie Mellon University Motivation A number of problems in numerical linear algebra have witnessed remarkable


  1. Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 Ruosong Wang 2 David P. Woodruff 2 1 IBM Research - Almaden 2 Carnegie Mellon University

  2. Motivation ◮ A number of problems in numerical linear algebra have witnessed remarkable speedups via linear sketching. ◮ For linear regression, we have nnz ( A ) + poly ( d /ε ) time algorithms for a variety of convex loss functions. ◮ Can we apply the technique of linear sketching to non-convex loss functions, e.g., the Tukey loss function? M ( x ) � x 2 | x | ≤ 1 M ( x ) = 1 | x | > 1 x

  3. Row Sampling Algorithm ◮ Theorem 1 For a matrix A ∈ R n × d and b ∈ R n , there is a row sampling algorithm that returns a weight vector w ∈ R n , such that for n � ˆ x = argmin w i M (( Ax − b ) i ) , i =1 we have n n � � M (( A ˆ x − b ) i ) ≤ (1 + ε ) min M (( Ax − b ) i ) . i =1 i =1 The weight vector w has at most poly ( d log n /ε ) non-zero entries and can be computed in � O ( nnz ( A ) + poly ( d log n /ε )) time.

  4. Oblivious Sketch ◮ Theorem 2 There is a distribution S ∈ R poly ( d log n ) × n over sketching matrices and weight vector w ∈ R n , such that for n � ˆ x = argmin w i M (( SAx − Sb ) i ) , i =1 we have n n � � M (( A ˆ x − b ) i ) ≤ O (log n ) min M (( Ax − b ) i ) . i =1 i =1 ◮ Calculating SA and Sb requires nnz ( A ) time. ◮ The sketch can be readily implemented in streaming and distributed settings.

  5. Technical Lemma ◮ Structural Lemma for Tukey Loss Function ◮ Lemma 1 For a given matrix A ∈ R n × d , there is a set of indices I ⊆ [ n ] with size | I | ≤ poly ( d α ), such that for any y = Ax with � n i =1 M ( y i ) ≤ α , for all i ∈ [ n ] with | y i | ≥ 1, we have i ∈ I . ◮ The set I can be efficiently constructed. ◮ Net Argument For Tukey Loss Function

  6. For more details, hardness results, provable algorithms and experiments, please come to poster #208!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend