Importance Sampling via Locality Sensitive Hashing.
Rice University
Anshumali Shrivastava
anshumali@rice.edu
7th March 2019
Anshumali Shrivastava (Rice University) COMP 480/580 7th March 2019 1 / 23
Motivating Problem: Stochastic Gradient
1. With bounded norm assumption.
2. Allowing asymmetry.
Footnote 1: SL [NIPS 14 (Best Paper), UAI 15, WWW 15], APRS [PODS 16].
1. Approximate (or inexact) answer suffices.
2. We are allowed to pre-process C once (a costly offline step).
[Figure: hash tables, each with columns of hash values and a Buckets column]
1. Two knobs K and L to control.
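The two knobs can be illustrated with a minimal sketch. It assumes the usual reading of the (K, L) scheme: K hash bits are concatenated into one bucket key, and L independent tables are built. The hash family here (signed random projections) and the names LSHIndex, build, and query are assumptions chosen for illustration; they are not taken from the slides.

```python
import random
from collections import defaultdict


class LSHIndex:
    """Hypothetical (K, L) LSH index; signed random projections are an assumed hash family."""

    def __init__(self, dim, K, L, seed=0):
        rng = random.Random(seed)
        self.K, self.L = K, L
        # L tables, each with its own K random Gaussian hyperplanes.
        self.planes = [
            [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(K)]
            for _ in range(L)
        ]
        self.tables = [defaultdict(list) for _ in range(L)]

    def _key(self, t, x):
        # Concatenate K sign bits into the bucket key for table t.
        return tuple(
            int(sum(w * v for w, v in zip(plane, x)) >= 0.0)
            for plane in self.planes[t]
        )

    def build(self, points):
        # One-time offline preprocessing: insert every point into all L tables.
        for idx, x in enumerate(points):
            for t in range(self.L):
                self.tables[t][self._key(t, x)].append(idx)

    def query(self, q):
        # Candidates are the union of the L buckets that q hashes to.
        candidates = set()
        for t in range(self.L):
            candidates.update(self.tables[t].get(self._key(t, q), []))
        return candidates


data_rng = random.Random(1)
points = [[data_rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(100)]
index = LSHIndex(dim=8, K=4, L=5)
index.build(points)
candidates = index.query(points[0])
```

In this sketch, a larger K makes each bucket more selective (fewer candidates per table), while a larger L adds more chances for a near neighbor to collide with the query in at least one table.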
Footnote 2: Li et al., NIPS 2011.
[Figure: MAE vs. #Samples on PTB. Series: Uniform, LSH, Exact Gumbel, MIPS Gumbel]
Running Time:

Samples   Uniform   LSH     Exact Gumbel   MIPS Gumbel
50        0.13      0.23    531.37         260.75
400       0.92      1.66    3,962.25       1,946.22
1500      3.41      6.14    14,686.73      7,253.44
5000      9.69      17.40   42,034.58      20,668.61
[Figure: four panels of Training Objective (log scale, 10^1 to 10^4), each with train and test curves. Panels: LSD+adaGrad vs. SGD+adaGrad against Time (ms); LSD vs. SGD against Time (ms); LSD+adaGrad vs. SGD+adaGrad against Epoch; LSD vs. SGD against Epoch]