Leveraged volume sampling for linear regression Micha l Derezi - PowerPoint PPT Presentation

Leveraged volume sampling for linear regression Micha� l Derezi´ nski Manfred K. Warmuth Daniel Hsu UC Berkeley UC Santa Cruz Columbia University Linear regression d y X n � i w − y i ) 2 ( x ⊤ Loss: L ( w ) = i w ∗ = argmin Optimum: L ( w ) w

Leveraged volume sampling for linear regression Micha� l Derezi´ nski Manfred K. Warmuth Daniel Hsu UC Berkeley UC Santa Cruz Columbia University Linear regression with hidden responses d y X Sample S = { 4 , 6 , 9 } y 4 . x ⊤ 4 y 6 . x ⊤ 6 Receive x ⊤ y 9 9 y 4 , y 6 , y 9 � i w − y i ) 2 Loss: L ( w ) = ( x ⊤ i w ∗ = argmin Optimum: L ( w ) w

Leveraged volume sampling for linear regression Micha� l Derezi´ nski Manfred K. Warmuth Daniel Hsu UC Berkeley UC Santa Cruz Columbia University Linear regression with hidden responses d y X Sample Goal: Best unbiased estimator � w ( S ) S = { 4 , 6 , 9 } y 4 . x ⊤ �� 4 y 6 . w ( S ) = w ∗ E x ⊤ 6 �� Receive ≤ (1 + ǫ ) L ( w ∗ ) L w ( S ) x ⊤ y 9 9 y 4 , y 6 , y 9 � i w − y i ) 2 Existing sampling methods: Loss: L ( w ) = ( x ⊤ 1. leverage score sampling: i.i.d., biased i w ∗ = argmin Optimum: L ( w ) 2. volume sampling: joint, unbiased w

Leveraged volume sampling Volume sampling Jointly choose set S of k ≥ d indices s.t. � � � Pr ( S ) ∝ det x i x ⊤ i i ∈ S

Leveraged volume sampling Volume sampling Jointly choose set S of k ≥ d indices s.t. � � � Pr ( S ) ∝ det x i x ⊤ i i ∈ S Theorem [DW17] �� = w ∗ E w ( S ) � i w − y i ) 2 . w ( S ) = argmin � ( x ⊤ where w i ∈ S

Leveraged volume sampling Volume sampling Jointly choose set S of k ≥ d indices s.t. � � � Pr ( S ) ∝ det x i x ⊤ i i ∈ S Theorem [DW17] �� = w ∗ w ( S ) E New Lower Bound Volume sampling may need a sample of size k = Ω( n ) to get a (3 / 2) -approximation � �� ǫ =1 / 2

Leveraged volume sampling Solution: Use i.i.d. and joint sampling Volume sampling Jointly choose set S of k ≥ d indices s.t. leverage scores volume sampling � �� 1 Pr ( S ) ∝ det x i x ⊤ Pr ( S ) ∝ ℓ i det ℓ i x i x ⊤ i i i ∈ S i ∈ S i ∈ S Theorem [DW17] �� = w ∗ w ( S ) E New Lower Bound Volume sampling may need a sample of size k = Ω( n ) to get a (3 / 2) -approximation � �� ǫ =1 / 2

Leveraged volume sampling Solution: Use i.i.d. and joint sampling Volume sampling Jointly choose set S of k ≥ d indices s.t. leverage scores volume sampling � �� 1 Pr ( S ) ∝ det x i x ⊤ Pr ( S ) ∝ ℓ i det ℓ i x i x ⊤ i i i ∈ S i ∈ S i ∈ S � � � 2 Theorem [DW17] 1 x ⊤ w ( S ) = argmin � i w − y i �� ℓ i w = w ∗ w ( S ) E i ∈ S New Lower Bound New Theorem For k = O ( d log d + d /ǫ ) Volume sampling may need a sample of size �� = w ∗ w ( S ) and E k = Ω( n ) to get a (3 / 2) -approximation �� w.h.p. w ( S ) ≤ (1 + ǫ ) L ( w ∗ ) L ǫ =1 / 2

New volume sampling algorithm Determinantal rejection sampling trick repeat Sample i 1 , . . . , i s i.i.d. ∼ ( ℓ 1 , . . . , ℓ n ) � det ( � s 1 x it x ⊤ it ) � t =1 ℓ it Sample Accept ∼ Bernoulli det( X ⊤ X ) until Accept = true preprocessing O ( nd 2 ) + sampling O ( d 4 ) � �� improvable to � no dependence on n O ( nd +poly( d ))

New volume sampling algorithm Experiments – 7 datasets from Libsvm Determinantal rejection sampling trick repeat Sample i 1 , . . . , i s i.i.d. ∼ ( ℓ 1 , . . . , ℓ n ) � det ( � s 1 x it x ⊤ it ) � t =1 ℓ it Sample Accept ∼ Bernoulli det( X ⊤ X ) until Accept = true preprocessing O ( nd 2 ) + sampling O ( d 4 ) � �� improvable to � no dependence on n O ( nd +poly( d )) Check out poster #151

Leveraged volume sampling for linear regression Micha l Derezi - PowerPoint PPT Presentation

Leveraged volume sampling for linear regression Micha l Derezi nski Manfred K. Warmuth Daniel Hsu UC Berkeley UC Santa Cruz Columbia University Linear regression d y X n i w y i ) 2 ( x Loss: L ( w ) = i w = argmin

Private Equity: Leveraged Expertise or Leveraged Bets? Ulf Axelson London School of Economics

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Scheduling RTI and Special Services in Elementary Schools: No More "When can I have your

User-level Andreas Zoor & scheduling Nikolai Nagibin Whats User-level scheduling

TinyOS Hardw are Evolution Miniature hardware devices manufactured economically in large

Including some slides modified

Appendix Reconciliation of GAAP to Non-GAAP Financial Measures The Companys presentations may

Sketched Ridge Regression: Optimization and Statistical Perspectives Shusen Wang Alex Gittens

"The future is already here it is just not evenly distributed." William Gibson

How To Leverage Adversity and Turn Setbacks Into Springboards Claire Nana LMFT

Leveraged volume sampling for linear regression Micha l Derezi - PowerPoint PPT Presentation

Leveraged volume sampling for linear regression Micha l Derezi nski Manfred K. Warmuth Daniel Hsu UC Berkeley UC Santa Cruz Columbia University Linear regression d y X n i w y i ) 2 ( x Loss: L ( w ) = i w = argmin

Private Equity: Leveraged Expertise or Leveraged Bets? Ulf Axelson London School of Economics

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Scheduling RTI and Special Services in Elementary Schools: No More &quot;When can I have your

User-level Andreas Zoor &amp; scheduling Nikolai Nagibin Whats User-level scheduling

TinyOS Hardw are Evolution Miniature hardware devices manufactured economically in large

Including some slides modified

Appendix Reconciliation of GAAP to Non-GAAP Financial Measures The Companys presentations may

Sketched Ridge Regression: Optimization and Statistical Perspectives Shusen Wang Alex Gittens

&quot;The future is already here it is just not evenly distributed.&quot; William Gibson

How To Leverage Adversity and Turn Setbacks Into Springboards Claire Nana LMFT

Scheduling RTI and Special Services in Elementary Schools: No More "When can I have your

User-level Andreas Zoor & scheduling Nikolai Nagibin Whats User-level scheduling

"The future is already here it is just not evenly distributed." William Gibson