Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 - - PowerPoint PPT Presentation

dimensionality reduction for tukey regression
SMART_READER_LITE
LIVE PREVIEW

Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 - - PowerPoint PPT Presentation

Dimensionality Reduction for Tukey Regression Kenneth L. Clarkson 1 Ruosong Wang 2 David P. Woodruff 2 1 IBM Research - Almaden 2 Carnegie Mellon University Motivation A number of problems in numerical linear algebra have witnessed remarkable


slide-1
SLIDE 1

Dimensionality Reduction for Tukey Regression

Kenneth L. Clarkson1 Ruosong Wang2 David P. Woodruff2

1IBM Research - Almaden 2Carnegie Mellon University

slide-2
SLIDE 2

Motivation

◮ A number of problems in numerical linear algebra have

witnessed remarkable speedups via linear sketching.

◮ For linear regression, we have nnz(A) + poly(d/ε) time

algorithms for a variety of convex loss functions.

◮ Can we apply the technique of linear sketching to non-convex

loss functions, e.g., the Tukey loss function? M(x) =

  • x2

|x| ≤ 1 1 |x| > 1

x M(x)

slide-3
SLIDE 3

Row Sampling Algorithm

◮ Theorem 1 For a matrix A ∈ Rn×d and b ∈ Rn, there is a

row sampling algorithm that returns a weight vector w ∈ Rn, such that for ˆ x = argmin

n

  • i=1

wiM((Ax − b)i), we have

n

  • i=1

M((Aˆ x − b)i) ≤ (1 + ε) min

n

  • i=1

M((Ax − b)i). The weight vector w has at most poly(d log n/ε) non-zero entries and can be computed in O(nnz(A) + poly(d log n/ε)) time.

slide-4
SLIDE 4

Oblivious Sketch

◮ Theorem 2 There is a distribution S ∈ Rpoly(d log n)×n over

sketching matrices and weight vector w ∈ Rn, such that for ˆ x = argmin

n

  • i=1

wiM((SAx − Sb)i), we have

n

  • i=1

M((Aˆ x − b)i) ≤ O(log n) min

n

  • i=1

M((Ax − b)i).

◮ Calculating SA and Sb requires nnz(A) time. ◮ The sketch can be readily implemented in streaming and

distributed settings.

slide-5
SLIDE 5

Technical Lemma

◮ Structural Lemma for Tukey Loss Function

◮ Lemma 1 For a given matrix A ∈ Rn×d, there is a set of

indices I ⊆ [n] with size |I| ≤ poly(dα), such that for any y = Ax with n

i=1 M(yi) ≤ α, for all i ∈ [n] with |yi| ≥ 1, we

have i ∈ I.

◮ The set I can be efficiently constructed.

◮ Net Argument For Tukey Loss Function

slide-6
SLIDE 6

For more details, hardness results, provable algorithms and experiments, please come to poster #208!