Dimension Reduction and Nearest Neighbor Search - PowerPoint PPT Presentation



SLIDE 1

Dimension Reduction and Nearest Neighbor Search

Advanced Algorithms, Nanjing University, Fall 2018

SLIDE 2

Dimension reduction: Why we care?

  β€’ High-dimensional data are common, yet working on them directly is expensive.

SLIDE 3

Dimension reduction: Why we care?

  β€’ High-dimensional data are common, yet working on them directly is expensive.
  β€’ Or, we just want a map! (a low-distortion metric embedding)

SLIDE 4

Dimension reduction: What do we want?

SLIDE 5

Dimension reduction: What do we want?

  β€’ Usually we want k β‰ͺ d.
  β€’ How small can k be?
  β€’ For what distance β€–Β·β€–?
  β€’ The embedding should be efficiently constructible.

SLIDE 6

The JLT (Johnson-Lindenstrauss Theorem)

β€œIn Euclidean space, it is always possible to embed a set of n points in arbitrary dimension into O(log n) dimensions with constant distortion.”

SLIDE 7

The JLT (Johnson-Lindenstrauss Theorem)

β€œIn Euclidean space, it is always possible to embed a set of n points in arbitrary dimension into O(log n) dimensions with constant distortion.”

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

SLIDE 8

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

β€œEven better, it is very easy to find such an f.”

SLIDE 9

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

β€œEven better, it is very easy to find such an f.” β€œJust sample a random k Γ— d matrix A.”

SLIDE 10

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

β€œEven better, it is very easy to find such an f.” β€œJust sample a random k Γ— d matrix A.”

Images extracted from https://graphics.stanford.edu/courses/cs468-06-fall/Slides/aneesh-michael.pdf

SLIDE 11

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

β€œEven better, it is very easy to find such an f.” β€œJust sample a random k Γ— d matrix A.”

How to construct (sample) A?

  β€’ Project onto a uniformly random k-dimensional subspace; (Johnson-Lindenstrauss; Dasgupta-Gupta)
  β€’ Independent Gaussian entries; (Indyk-Motwani)
  β€’ Simply i.i.d. +1/βˆ’1 entries. (Achlioptas)

SLIDE 13

Choose some suitable k = O(Ρ⁻² Β· log n).
For each entry a_{ij} of A ∈ ℝ^{kΓ—d}: independently sample a_{ij} from the Gaussian distribution N(0, 1/k).
For each i ∈ [n]: let z_i = A x_i.

SLIDE 14

Choose some suitable k = O(Ρ⁻² Β· log n).
For each entry a_{ij} of A ∈ ℝ^{kΓ—d}: independently sample a_{ij} from the Gaussian distribution N(0, 1/k).
For each i ∈ [n]: let z_i = A x_i.

Gaussian distribution (a.k.a. normal distribution) N(ΞΌ, σ²): 𝔼[X] = ΞΌ, Var[X] = σ²,
Pr[X ≀ a] = ∫_{βˆ’βˆž}^{a} (1/√(2πσ²)) e^{βˆ’(xβˆ’ΞΌ)Β²/(2σ²)} dx
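The sampling procedure above fits in a few lines of NumPy; this is a toy illustration, not the lecture's code, and the constant 8 in the choice of k is just one common pick for the O(Ρ⁻² Β· log n) bound:

```python
import numpy as np
from itertools import combinations

def jl_project(X, eps, seed=0):
    """Map the n rows of X from d dimensions down to k = O(eps^-2 * log n)."""
    n, d = X.shape
    k = int(np.ceil(8 * np.log(n) / eps**2))            # a suitable k = O(eps^-2 log n)
    rng = np.random.default_rng(seed)
    A = rng.normal(0.0, np.sqrt(1.0 / k), size=(k, d))  # entries i.i.d. N(0, 1/k)
    return X @ A.T                                      # row i becomes z_i = A x_i

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 1000))                         # n = 50 points in d = 1000
Z = jl_project(X, eps=0.5)
ratios = [np.sum((Z[i] - Z[j])**2) / np.sum((X[i] - X[j])**2)
          for i, j in combinations(range(50), 2)]
print(Z.shape[1], min(ratios), max(ratios))             # most ratios fall in [1-eps, 1+eps]
```

Note that k depends on n and Ξ΅ but not on the ambient dimension d, which is the whole point of the theorem.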

SLIDE 15

βˆ€ 0 < Ξ΅ < 1: let A be a k Γ— d matrix with k = O(Ρ⁻² Β· log n); if each entry of A is independently drawn from N(0, 1/k), then with high probability, for all x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–Ax_i βˆ’ Ax_jβ€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

SLIDE 16

βˆ€ 0 < Ξ΅ < 1: let A be a k Γ— d matrix with k = O(Ρ⁻² Β· log n); if each entry of A is independently drawn from N(0, 1/k), then with high probability, for all x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–Ax_i βˆ’ Ax_jβ€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

Equivalently: 1 βˆ’ Ξ΅ ≀ β€–A (x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚β€–β‚‚Β² ≀ 1 + Ξ΅

SLIDE 17

βˆ€ 0 < Ξ΅ < 1: let A be a k Γ— d matrix with k = O(Ρ⁻² Β· log n); if each entry of A is independently drawn from N(0, 1/k), then with high probability, for all x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–Ax_i βˆ’ Ax_jβ€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

Equivalently: 1 βˆ’ Ξ΅ ≀ β€–A (x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚β€–β‚‚Β² ≀ 1 + Ξ΅
((x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚ is a unit vector!)

SLIDE 18

Norm Preservation

βˆ€ 0 < Ξ΅ < 1: let A be a k Γ— d matrix with k = O(Ρ⁻² Β· log n); if each entry of A is independently drawn from N(0, 1/k), then with high probability, for all x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–Ax_i βˆ’ Ax_jβ€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

Equivalently: 1 βˆ’ Ξ΅ ≀ β€–A (x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚β€–β‚‚Β² ≀ 1 + Ξ΅
((x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚ is a unit vector!)

SLIDE 19

Norm Preservation

βˆ€ 0 < Ξ΅ < 1: let A be a k Γ— d matrix with k = O(Ρ⁻² Β· log n); if each entry of A is independently drawn from N(0, 1/k), then with high probability, for all x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–Ax_i βˆ’ Ax_jβ€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

Equivalently: 1 βˆ’ Ξ΅ ≀ β€–A (x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚β€–β‚‚Β² ≀ 1 + Ξ΅
((x_i βˆ’ x_j)/β€–x_i βˆ’ x_jβ€–β‚‚ is a unit vector!)

It suffices to show: for any unit vector u ∈ ℝ^d, Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³, plus a union bound over the O(nΒ²) pairs x_i, x_j ∈ S.

SLIDE 20

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³


SLIDE 23

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

  β€’ Each entry A_{ij} is chosen i.i.d. from N(0, 1/k)
  β€’ A linear combination of independent Gaussian r.v.s is also Gaussian:
  β€’ X ~ N(ΞΌ_X, Οƒ_XΒ²), Y ~ N(ΞΌ_Y, Οƒ_YΒ²) β‡’ aX + bY ~ N(aΞΌ_X + bΞΌ_Y, aΒ²Οƒ_XΒ² + bΒ²Οƒ_YΒ²)


SLIDE 25

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

  β€’ Each entry A_{ij} is chosen i.i.d. from N(0, 1/k)
  β€’ A linear combination of independent Gaussian r.v.s is also Gaussian:
  β€’ X ~ N(ΞΌ_X, Οƒ_XΒ²), Y ~ N(ΞΌ_Y, Οƒ_YΒ²) β‡’ aX + bY ~ N(aΞΌ_X + bΞΌ_Y, aΒ²Οƒ_XΒ² + bΒ²Οƒ_YΒ²)

Since u is a unit vector, each coordinate (Au)_i ~ N(0, 1/k).

SLIDE 26

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

  β€’ Each entry A_{ij} is chosen i.i.d. from N(0, 1/k)
  β€’ A linear combination of independent Gaussian r.v.s is also Gaussian:
  β€’ X ~ N(ΞΌ_X, Οƒ_XΒ²), Y ~ N(ΞΌ_Y, Οƒ_YΒ²) β‡’ aX + bY ~ N(aΞΌ_X + bΞΌ_Y, aΒ²Οƒ_XΒ² + bΒ²Οƒ_YΒ²)

Since u is a unit vector, each coordinate (Au)_i ~ N(0, 1/k).

Moreover, these (Au)_i are mutually independent!

SLIDE 27

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³


SLIDE 30

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

In terms of expectation we are fine (𝔼[β€–Auβ€–β‚‚Β²] = k Β· (1/k) = 1), but how fast do we deviate from the expectation?


SLIDE 34

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

SLIDE 35

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

Notice X_i = √k Β· (Au)_i, so the X_i are indeed i.i.d. N(0,1).


SLIDE 37

Let k = O(Ρ⁻² Β· log n) and A ∈ ℝ^{kΓ—d}, where each entry of A is chosen i.i.d. from N(0, 1/k). Then for any unit vector u ∈ ℝ^d:
Pr[ |β€–Auβ€–β‚‚Β² βˆ’ 1| > Ξ΅ ] < 1/nΒ³

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

For a suitable k = O(Ρ⁻² Β· log n), the bound 2e^{βˆ’kΡ²/8} drops below 1/nΒ³.

SLIDE 38

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}


SLIDE 43

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

If X ~ N(0,1) and t < 1/2, then 𝔼[e^{tXΒ²}] = 1/√(1 βˆ’ 2t)
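The moment-generating identity follows from completing the square in the Gaussian integral:

```latex
\mathbb{E}\!\left[e^{tX^2}\right]
  = \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{tx^2}\, e^{-x^2/2}\, dx
  = \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-(1-2t)x^2/2}\, dx
  = \frac{1}{\sqrt{1-2t}}, \qquad t < \tfrac{1}{2}
```

where the last step substitutes y = √(1 βˆ’ 2t) Β· x into the standard normal integral.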


SLIDE 47

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

If X ~ N(0,1) and t < 1/2, then 𝔼[e^{tXΒ²}] = 1/√(1 βˆ’ 2t)

(valid when Ξ» ≀ 1/4)

SLIDE 48

Chernoff bound for the χ²-distribution: for i.i.d. X₁, Xβ‚‚, β‹―, X_k ~ N(0,1) and 0 < Ξ΅ < 1:
Pr[ |(1/k) Ξ£_{i=1}^{k} X_iΒ² βˆ’ 1| > Ξ΅ ] < 2e^{βˆ’kΡ²/8}

If X ~ N(0,1) and t < 1/2, then 𝔼[e^{tXΒ²}] = 1/√(1 βˆ’ 2t)

(valid when Ξ» ≀ 1/4; let Ξ» = Ξ΅/4)

SLIDE 49

Theorem (Johnson-Lindenstrauss 1984):
βˆ€ 0 < Ξ΅ < 1: for any set S of n points from ℝ^d, there is a map f : ℝ^d β†’ ℝ^k with k = O(Ρ⁻² Β· log n), such that βˆ€ x_i, x_j ∈ S:
(1 βˆ’ Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β² ≀ β€–f(x_i) βˆ’ f(x_j)β€–β‚‚Β² ≀ (1 + Ξ΅)β€–x_i βˆ’ x_jβ€–β‚‚Β²

β€œEven better, it is very easy to find such an f: just sample a random k Γ— d matrix A.” β€œThe JLT states that in Euclidean space, it is always possible to embed a set of n points in arbitrary dimension into O(log n) dimensions with constant distortion.”

SLIDE 50

Nearest Neighbor Search (NNS)

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X, find the y_i closest to x

SLIDE 51

Nearest Neighbor Search (NNS)

Setup: metric space (X, dist), where X is a set and dist is a distance function satisfying the triangle inequality
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X, find the y_i closest to x


SLIDE 53

Nearest Neighbor Search (NNS)

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X, find the y_i closest to x

Applications include:
  β€’ database systems
  β€’ pattern recognition
  β€’ machine learning
  β€’ bioinformatics
  β€’ …


SLIDE 55

Nearest Neighbor Search (NNS)

Data: n points y₁, yβ‚‚, β‹―, y_n ∈ U^d for some finite U
Query: given a point x ∈ U^d, find the y_i closest to x
Goal: efficiently answer the query

Which efficiency do we care about?
  β€’ Usually space and time

Trivial solution:
  β€’ No preprocessing, just linear search

When the dimension d is small:
  β€’ Binary search when d = 1
  β€’ k-d tree
  β€’ Voronoi diagram
  β€’ …

[figures: a k-d tree; a Voronoi diagram]

SLIDE 56

Nearest Neighbor Search (NNS)

Data: n points y₁, yβ‚‚, β‹―, y_n ∈ U^d for some finite U
Query: given a point x ∈ U^d, find the y_i closest to x
Goal: efficiently answer the query

What if the dimension d is large, say d ≫ log n?

SLIDE 57

Nearest Neighbor Search (NNS)

Data: n points y₁, yβ‚‚, β‹―, y_n ∈ U^d for some finite U
Query: given a point x ∈ U^d, find the y_i closest to x
Goal: efficiently answer the query

What if the dimension d is large, say d ≫ log n?
Curse of dimensionality: it is conjectured that solving NNS in high dimension requires either super-polynomial (in n) space or super-polynomial (in d) time.

SLIDE 58

Nearest Neighbor Search (NNS)

Data: n points y₁, yβ‚‚, β‹―, y_n ∈ U^d for some finite U
Query: given a point x ∈ U^d, find the y_i closest to x
Goal: efficiently answer the query

What if the dimension d is large, say d ≫ log n?
Curse of dimensionality: it is conjectured that solving NNS in high dimension requires either super-polynomial (in n) space or super-polynomial (in d) time.
Blessing: Randomization + Approximation

SLIDE 59

Approximate Near(est) Neighbor (ANN)

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

SLIDE 61

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

If we can solve (c, r)-ANN, then we can solve c-ANN with little overhead.

SLIDE 62

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

If we can solve (c, r)-ANN, then we can solve c-ANN with little overhead.
Dmin = min_{1≀i<j≀n} dist(y_i, y_j)
Dmax = max_{1≀i<j≀n} dist(y_i, y_j)
R = { (Dmin/2)Β·(√c)⁻¹, (Dmin/2)Β·(√c)⁰, (Dmin/2)Β·(√c)ΒΉ, β‹―, Dmax }

SLIDE 63

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

If we can solve (c, r)-ANN, then we can solve c-ANN with little overhead.
Dmin = min_{1≀i<j≀n} dist(y_i, y_j)
Dmax = max_{1≀i<j≀n} dist(y_i, y_j)
R = { (Dmin/2)Β·(√c)⁻¹, (Dmin/2)Β·(√c)⁰, (Dmin/2)Β·(√c)ΒΉ, β‹―, Dmax }
Let r* be the minimum scale in R s.t. (√c, r*)-ANN returns β€œyes” with some point y*.

SLIDE 64

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

If we can solve (c, r)-ANN, then we can solve c-ANN with little overhead.
Dmin = min_{1≀i<j≀n} dist(y_i, y_j)
Dmax = max_{1≀i<j≀n} dist(y_i, y_j)
R = { (Dmin/2)Β·(√c)⁻¹, (Dmin/2)Β·(√c)⁰, (Dmin/2)Β·(√c)ΒΉ, β‹―, Dmax }
Let r* be the minimum scale in R s.t. (√c, r*)-ANN returns β€œyes” with some point y*.
Then dist(x, y*) ≀ √c Β· r*, while βˆ€ y_j: dist(x, y_j) > r*/√c (the previous scale answered β€œno”); since √c Β· r* = c Β· (r*/√c), y* is a valid c-ANN answer.

SLIDE 65

Setup: metric space (X, dist)
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ X
Query: given a point x ∈ X,

c-ANN (Approximate Nearest Neighbor):
  Return a y_i s.t. dist(x, y_i) ≀ c Β· min_{1≀j≀n} dist(x, y_j)

(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

If we can solve (c, r)-ANN, then we can solve c-ANN with little overhead.
βˆ€r: if (√c, r)-ANN can be solved with space s and query time t, then c-ANN can be solved with space O(s Β· log_c(Dmax/Dmin)) and query time O(t Β· logβ‚‚ log_c(Dmax/Dmin)).
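The reduction can be sketched as follows; `near_oracle` is an assumed black-box (√c, r)-ANN solver, and the binary search over the scale set is what produces the log log query overhead:

```python
import math

def c_ann(query, points, dist, near_oracle, c):
    """Solve c-ANN by binary search over geometric scales, given a
    (sqrt(c), r)-ANN oracle: near_oracle(query, r) returns a point within
    distance sqrt(c)*r when some point lies within distance r, returns
    None when every point is farther than sqrt(c)*r, and may return
    either answer in between."""
    ds = [dist(p, q) for i, p in enumerate(points) for q in points[i + 1:]]
    dmin, dmax = min(ds), max(ds)
    sc = math.sqrt(c)
    scales, r = [], dmin / (2 * sc)      # R = {(Dmin/2)*sc^-1, (Dmin/2)*sc^0, ...}
    while r < dmax:
        scales.append(r)
        r *= sc
    scales.append(dmax)
    lo, hi, best = 0, len(scales) - 1, None   # find smallest scale answering "yes"
    while lo <= hi:
        mid = (lo + hi) // 2
        ans = near_oracle(query, scales[mid])
        if ans is not None:
            best, hi = ans, mid - 1
        else:
            lo = mid + 1
    return best
```

A brute-force `near_oracle` (return the exact nearest point whenever it lies within √cΒ·r, else None) satisfies the specification and can be used to sanity-check the sketch.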


SLIDE 67

Setup: consider the Hamming space {0,1}^d
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ {0,1}^d
Query: given a point x ∈ {0,1}^d,
(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

Let k, p and s be fixed later.
Sample a k Γ— d Boolean matrix A with i.i.d. entries from Bernoulli(p).
For i = 1, 2, β‹―, n: let u_i = A y_i ∈ {0,1}^k over the finite field GF(2).
Store the s-balls B_s(v) = { y_i : dist(v, u_i) ≀ s } for all v ∈ {0,1}^k.
Upon a query x ∈ {0,1}^d: retrieve B_s(Ax); if B_s(Ax) = βˆ… return β€œno”, else return any y_i ∈ B_s(Ax).

GF(2): two elements {0,1}, XOR as addition, AND as multiplication. Therefore (A y_i)_j = Ξ£_{l=1}^{d} A_{jl} Β· (y_i)_l mod 2.
Space: O(n Β· 2^k). Query time: O(dk) computation + O(1) memory accesses.


SLIDE 69

Setup: consider the Hamming space {0,1}^d
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ {0,1}^d
Query: given a point x ∈ {0,1}^d,
(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

Let k, p and s be fixed later.
Sample a k Γ— d Boolean matrix A with i.i.d. entries from Bernoulli(p).
For i = 1, 2, β‹―, n: let u_i = A y_i ∈ {0,1}^k over the finite field GF(2).
Store the s-balls B_s(v) = { y_i : dist(v, u_i) ≀ s } for all v ∈ {0,1}^k.
Upon a query x ∈ {0,1}^d: retrieve B_s(Ax); if B_s(Ax) = βˆ… return β€œno”, else return any y_i ∈ B_s(Ax).

For suitable k = O(log n), p and s, and βˆ€ x, y ∈ {0,1}^d:

SLIDE 70

Setup: consider the Hamming space {0,1}^d
Data: n points y₁, yβ‚‚, β‹―, y_n ∈ {0,1}^d
Query: given a point x ∈ {0,1}^d,
(c, r)-ANN (Approximate Near Neighbor):
  β€’ Return a y_i s.t. dist(x, y_i) ≀ cr, if βˆƒ y_j: dist(x, y_j) ≀ r
  β€’ Return β€œno” if βˆ€ y_j: dist(x, y_j) > cr
  β€’ Arbitrary answer otherwise

Let k, p and s be fixed later.
Sample a k Γ— d Boolean matrix A with i.i.d. entries from Bernoulli(p).
For i = 1, 2, β‹―, n: let u_i = A y_i ∈ {0,1}^k over the finite field GF(2).
Store the s-balls B_s(v) = { y_i : dist(v, u_i) ≀ s } for all v ∈ {0,1}^k.
Upon a query x ∈ {0,1}^d: retrieve B_s(Ax); if B_s(Ax) = βˆ… return β€œno”, else return any y_i ∈ B_s(Ax).

For suitable k = O(log n), p and s, and βˆ€ x, y ∈ {0,1}^d:
(c, r)-ANN is solved w.h.p.

SLIDE 71

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:

SLIDE 72

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
each row vector A_j of A has i.i.d. entries ~ Bernoulli(p)


SLIDE 74

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
each row vector A_j of A has i.i.d. entries ~ Bernoulli(p)

an alternative view of the generation of A_j:
  β€’ build C βŠ† [d] s.t. each element of [d] is chosen independently with pr. 2p
  β€’ each coordinate in C is independently set to 0 or 1, each with pr. 1/2

SLIDE 75

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
each row vector A_j of A has i.i.d. entries ~ Bernoulli(p)

an alternative view of the generation of A_j:
  β€’ build C βŠ† [d] s.t. each element of [d] is chosen independently with pr. 2p
  β€’ each coordinate in C is independently set to 0 or 1, each with pr. 1/2

Observations:
  β€’ if l βˆ‰ C for all coordinates l where x_l β‰  y_l, then (Ax)_j = (Ay)_j
  β€’ otherwise, if such an l ∈ C exists, then once all other entries of A_j are fixed, exactly one of the two choices for A_{jl} will make (Ax)_j = (Ay)_j
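The two observations give a closed form for a row's collision probability: if x and y differ in h coordinates, then Pr[(Ax)_j = (Ay)_j] = (1 + (1 βˆ’ 2p)^h)/2, since equality holds iff an even number of the h differing coordinates receive a 1 entry. A small exact check of this parity identity:

```python
import math

def collide_prob(h, p):
    """Pr[(Ax)_j == (Ay)_j] when x, y differ in exactly h coordinates:
    sum over the events where an even number m of the h differing
    coordinates get a 1 entry in row A_j (each entry is 1 w.p. p)."""
    return sum(math.comb(h, m) * p**m * (1 - p)**(h - m)
               for m in range(0, h + 1, 2))

for h in (1, 3, 7):
    for p in (0.1, 0.25, 0.4):
        closed_form = (1 + (1 - 2 * p)**h) / 2
        assert abs(collide_prob(h, p) - closed_form) < 1e-12
```

With the choice 1 βˆ’ 2p = 2^{βˆ’1/r} on the next slide, a row differs with probability (1 βˆ’ 2^{βˆ’h/r})/2, which is at most 1/4 for near points (h ≀ r) and strictly larger for far points (h > cr).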

SLIDE 76

choose p to satisfy 1 βˆ’ 2p = 2^{βˆ’1/r}

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
each row vector A_j of A has i.i.d. entries ~ Bernoulli(p)


SLIDE 78

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
choose p to satisfy 1 βˆ’ 2p = 2^{βˆ’1/r}

dist(Ax, Ay) = X = Ξ£_{j=1}^{k} X_j, where X_j = 1 if (Ax)_j β‰  (Ay)_j and X_j = 0 otherwise

SLIDE 80

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
choose p to satisfy 1 βˆ’ 2p = 2^{βˆ’1/r}

dist(Ax, Ay) = X = Ξ£_{j=1}^{k} X_j, where X_j = 1 if (Ax)_j β‰  (Ay)_j and X_j = 0 otherwise

choose s = (1/4 + 1/2 βˆ’ 2^{βˆ’(c+1)}) Β· k/2 = (3/8 βˆ’ 2^{βˆ’(c+2)}) Β· k


SLIDE 82

random k Γ— d Boolean matrix A with i.i.d. entries ~ Bernoulli(p)
computation over GF(2): (Ax)_j = Ξ£_{l=1}^{d} A_{jl} Β· x_l mod 2
for suitable k = O(log n), p and s; βˆ€ x, y ∈ {0,1}^d:
choose p to satisfy 1 βˆ’ 2p = 2^{βˆ’1/r}

dist(Ax, Ay) = X = Ξ£_{j=1}^{k} X_j, where X_j = 1 if (Ax)_j β‰  (Ay)_j and X_j = 0 otherwise

choose s = (1/4 + 1/2 βˆ’ 2^{βˆ’(c+1)}) Β· k/2 = (3/8 βˆ’ 2^{βˆ’(c+2)}) Β· k

the indicators X_j are mutually independent

slide-83
SLIDE 83

Chernoff bound: let π‘Œ1, π‘Œ2, β‹― , π‘Œπ‘™ ∈ {0,1} be independent random variables and let π‘Œ = Ξ£_{𝑗=1}^{𝑙} π‘Œπ‘—. Then for any 𝑑 > 0:

Pr[π‘Œ β‰₯ 𝔼[π‘Œ] + 𝑑] ≀ exp(βˆ’2𝑑²/𝑙)  and  Pr[π‘Œ ≀ 𝔼[π‘Œ] βˆ’ 𝑑] ≀ exp(βˆ’2𝑑²/𝑙)

slide-86
SLIDE 86

Setup: the Hamming space {0,1}^𝑒.
Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ {0,1}^𝑒.
Query: given a point 𝑦 ∈ {0,1}^𝑒, (𝒅, 𝒔)-ANN (Approximate Near Neighbor):

  β€’ Return a 𝑧𝑗 s.t. dist(𝑦, 𝑧𝑗) ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist(𝑦, π‘§π‘˜) ≀ 𝑠
  β€’ Return β€œno” if βˆ€π‘§π‘˜: dist(𝑦, π‘§π‘˜) > 𝑑𝑠
  β€’ Arbitrary answer otherwise

Data structure: let 𝑙 = ln π‘œ / (1/8 βˆ’ 2^{βˆ’(𝑑+2)})Β², π‘ž = (1 βˆ’ 2^{βˆ’1/𝑠})/2 and 𝑑 = (3/8 βˆ’ 2^{βˆ’(𝑑+2)})𝑙. Sample an 𝑙 Γ— 𝑒 Boolean matrix 𝐡 with i.i.d. entries from Bernoulli(π‘ž). For 𝑗 = 1,2, β‹― , π‘œ: let 𝑨𝑗 = 𝐡𝑧𝑗 ∈ {0,1}^𝑙 over the finite field GF(2). Store all 𝑑-balls 𝐢𝑑(𝑣) = { 𝑧𝑗 | dist(𝑣, 𝑨𝑗) ≀ 𝑑 } for all 𝑣 ∈ {0,1}^𝑙.
Query algorithm: upon a query 𝑦 ∈ {0,1}^𝑒, retrieve 𝐢𝑑(𝐡𝑦). If 𝐢𝑑(𝐡𝑦) = βˆ…, return β€œno”; else return any 𝑧𝑗 ∈ 𝐢𝑑(𝐡𝑦).
Space: 𝑃(π‘œ β‹… 2^𝑙). Query time: 𝑃(𝑒𝑙) computation + 𝑃(1) memory accesses.

slide-87
SLIDE 87

With 𝑙 ∈ 𝑃(log π‘œ), the bounds above become: Space: π‘œ^{𝑃(1)}. Query time: 𝑃(𝑒 log π‘œ).

This solves (𝑑, 𝑠)-ANN w.h.p.
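The random GF(2) embedding above can be sketched in a few lines of Python (a minimal illustration; function names are ours, and the parameter choice for π‘ž follows the slide):

```python
import random

def random_gf2_projection(d, k, q, seed=0):
    """Sample a k x d Boolean matrix B with i.i.d. Bernoulli(q) entries.

    The embedding maps x in {0,1}^d to Bx mod 2 in {0,1}^k. Over GF(2) it is
    linear, so B(x XOR y) = Bx XOR By and the image distance of a pair depends
    only on their Hamming distance, as the analysis on the slides uses.
    """
    rng = random.Random(seed)
    B = [[1 if rng.random() < q else 0 for _ in range(d)] for _ in range(k)]
    def project(x):
        return tuple(sum(bij * xj for bij, xj in zip(row, x)) % 2 for row in B)
    return project

# Toy usage: embed two nearby points of {0,1}^64 into {0,1}^32.
d, k, s = 64, 32, 4
q = (1 - 2 ** (-1 / s)) / 2          # so that 1 - 2q = 2^(-1/s)
project = random_gf2_projection(d, k, q)
x = [0] * d
y = [1] * 3 + [0] * (d - 3)          # dist(x, y) = 3 <= s
print(sum(a != b for a, b in zip(project(x), project(y))))
```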

slide-88
SLIDE 88

Locality-Sensitive Hashing (LSH)

Given a metric space (π‘Œ, dist), a random β„Ž: π‘Œ β†’ 𝑉 drawn from a family β„‹ is an (𝑠, 𝑑𝑠, π‘ž, π‘Ÿ)-LSH if, for all 𝑦, 𝑧 ∈ π‘Œ:

  β€’ dist(𝑦, 𝑧) ≀ 𝑠 β‡’ Pr[β„Ž(𝑦) = β„Ž(𝑧)] β‰₯ π‘ž
  β€’ dist(𝑦, 𝑧) β‰₯ 𝑑𝑠 β‡’ Pr[β„Ž(𝑦) = β„Ž(𝑧)] ≀ π‘Ÿ

slide-89
SLIDE 89

Locality-Sensitive Hashing (LSH)

A useful LSH has π‘ž > π‘Ÿ: near points collide more often than far points.

slide-90
SLIDE 90

Locality-Sensitive Hashing (LSH)

Amplification: if there exists an (𝑠, 𝑑𝑠, π‘ž, π‘Ÿ)-LSH β„Ž: π‘Œ β†’ 𝑉, then there exists an (𝑠, 𝑑𝑠, π‘ž^𝑙, π‘Ÿ^𝑙)-LSH 𝑕: π‘Œ β†’ 𝑉^𝑙.

slide-91
SLIDE 91

Locality-Sensitive Hashing (LSH)

Construction of the amplified (𝑠, 𝑑𝑠, π‘ž^𝑙, π‘Ÿ^𝑙)-LSH 𝑕: π‘Œ β†’ 𝑉^𝑙: independently draw β„Ž1, β„Ž2, β‹― , β„Žπ‘™ according to the distribution of β„Ž and set 𝑕(𝑦) = (β„Ž1(𝑦), β„Ž2(𝑦), β‹― , β„Žπ‘™(𝑦)) ∈ 𝑉^𝑙. A collision of 𝑕 requires all 𝑙 coordinates to collide, so the collision probabilities multiply.
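The amplification step can be sketched as follows (the base family and all names here are illustrative, not from the slides):

```python
import random

def amplify(draw_h, l, seed=0):
    """Concatenate l independent draws of a base LSH into one hash into V^l.

    draw_h(rng) must return one hash function h: X -> V. The amplified hash
    g(x) = (h1(x), ..., hl(x)) collides only if every coordinate collides,
    so near/far collision probabilities q, r become q**l, r**l.
    """
    rng = random.Random(seed)
    hs = [draw_h(rng) for _ in range(l)]
    return lambda x: tuple(h(x) for h in hs)

# Toy base family: bit sampling on {0,1}^d, i.e. h_j(x) = x[j].
d = 16
draw_bit = lambda rng: (lambda x, j=rng.randrange(d): x[j])
g = amplify(draw_bit, l=4)
print(g([1] * d))   # every coordinate reads a 1
```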

slide-92
SLIDE 92

Setup: a metric space (π‘Œ, dist).
Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ.
Query ((𝒅, 𝒔)-ANN): given a point 𝑦 ∈ π‘Œ:

  β€’ Return a 𝑧𝑗 s.t. dist(𝑦, 𝑧𝑗) ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist(𝑦, π‘§π‘˜) ≀ 𝑠
  β€’ Return β€œno” if βˆ€π‘§π‘˜: dist(𝑦, π‘§π‘˜) > 𝑑𝑠
  β€’ Arbitrary answer otherwise

Suppose we have an (𝑠, 𝑑𝑠, π‘žβˆ—, 1/π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉, i.e. for all 𝑦, 𝑧 ∈ π‘Œ: dist(𝑦, 𝑧) ≀ 𝑠 β‡’ Pr[𝑕(𝑦) = 𝑕(𝑧)] β‰₯ π‘žβˆ—, and dist(𝑦, 𝑧) β‰₯ 𝑑𝑠 β‡’ Pr[𝑕(𝑦) = 𝑕(𝑧)] ≀ 1/π‘œ.

slide-93
SLIDE 93

Single-table scheme: store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in nondecreasing order of 𝑕(𝑧𝑗). Upon query 𝑦 ∈ π‘Œ: find all 𝑧𝑗 such that 𝑕(𝑦) = 𝑕(𝑧𝑗) by binary search. If some 𝑧𝑗 with dist(𝑦, 𝑧𝑗) ≀ 𝑑𝑠 is encountered, return this 𝑧𝑗; otherwise return β€œno”.
slide-94
SLIDE 94

If the real answer is β€œno”: always correct. If the real answer is β€œyes”: correct with probability at least π‘žβˆ—, the probability that a near point collides with 𝑦.

slide-95
SLIDE 95

Space: 𝑃(π‘œ). Time: 𝑃(log π‘œ).

slide-96
SLIDE 96

Space: 𝑃(π‘œ). Time: 𝑃(log π‘œ) + 𝑃(1) in expectation, since the expected number of far points colliding with 𝑦 is at most π‘œ β‹… 1/π‘œ = 1.
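The single-table scheme above can be sketched directly (toy hash and all names chosen for illustration; `cs` stands for the slide's 𝑑𝑠):

```python
import bisect

def build_table(points, g):
    """Sort points by hash value g(z); return parallel (keys, points) lists."""
    rows = sorted((g(z), z) for z in points)
    return [key for key, _ in rows], [z for _, z in rows]

def query(keys, pts, g, y, dist, cs):
    """Binary-search the collision range of y; return one point within cs."""
    key = g(y)
    lo = bisect.bisect_left(keys, key)
    hi = bisect.bisect_right(keys, key)
    for z in pts[lo:hi]:
        if dist(y, z) <= cs:
            return z
    return None  # answer "no"

# Toy usage in Hamming space with a fixed (illustrative) hash: first two bits.
ham = lambda a, b: sum(u != v for u, v in zip(a, b))
g = lambda x: x[:2]
data = [(0, 0, 0, 0), (0, 0, 1, 1), (1, 1, 1, 1)]
keys, pts = build_table(data, g)
print(query(keys, pts, g, (0, 0, 0, 1), ham, cs=2))
```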

slide-97
SLIDE 97

(𝒅, 𝒔)-ANN in metric space (π‘Œ, dist), with data 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ and query point 𝑦 ∈ π‘Œ. Suppose we have an (𝑠, 𝑑𝑠, π‘žβˆ—, 1/π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉.

Let π‘š = 1/π‘žβˆ— and independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š.
Maintain π’Ž sorted tables: for π‘˜ = 1,2, β‹― , π‘š, store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗).
Upon query π’š ∈ 𝒀: find the first 10 β‹… π‘š points 𝑧𝑗 with βˆƒπ‘˜: π‘•π‘˜(𝑦) = π‘•π‘˜(𝑧𝑗) by binary search. If some 𝑧𝑗 with dist(𝑦, 𝑧𝑗) ≀ 𝑑𝑠 is encountered, return this 𝑧𝑗; otherwise return β€œno”.

slide-99
SLIDE 99

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct.

slide-100
SLIDE 100

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" ≀

slide-101
SLIDE 101

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" ≀ Pr[βˆ€π‘˜, π‘•π‘˜ Τ¦ 𝑦 β‰  π‘•π‘˜ 𝑧𝑑 ] + Pr[exist 10π‘š bad 𝑧𝑗 that dist Τ¦ 𝑦, 𝑧𝑗 > 𝑑𝑠 yet βˆƒπ‘˜ s.t. π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗)]

slide-103
SLIDE 103

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" ≀ Pr[βˆ€π‘˜, π‘•π‘˜ Τ¦ 𝑦 β‰  π‘•π‘˜ 𝑧𝑑 ] + Pr[exist 10π‘š bad 𝑧𝑗 that dist Τ¦ 𝑦, 𝑧𝑗 > 𝑑𝑠 yet βˆƒπ‘˜ s.t. π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗)] ≀ 1 βˆ’ π‘žβˆ— π‘š ≀ Ξ€ 1 𝑓

slide-105
SLIDE 105

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" ≀ Pr[βˆ€π‘˜, π‘•π‘˜ Τ¦ 𝑦 β‰  π‘•π‘˜ 𝑧𝑑 ] + Pr[exist 10π‘š bad 𝑧𝑗 that dist Τ¦ 𝑦, 𝑧𝑗 > 𝑑𝑠 yet βˆƒπ‘˜ s.t. π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗)] ≀ 1 βˆ’ π‘žβˆ— π‘š ≀ Ξ€ 1 𝑓 ≀ 𝔽 number of such bad 𝑧𝑗 10π‘š ≀ π‘œ β‹… π‘š β‹… Ξ€ 1 π‘œ 10π‘š = 0.1

slide-106
SLIDE 106

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" ≀ Pr[βˆ€π‘˜, π‘•π‘˜ Τ¦ 𝑦 β‰  π‘•π‘˜ 𝑧𝑑 ] + Pr[exist 10π‘š bad 𝑧𝑗 that dist Τ¦ 𝑦, 𝑧𝑗 > 𝑑𝑠 yet βˆƒπ‘˜ s.t. π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗)] ≀ 1 βˆ’ π‘žβˆ— π‘š ≀ Ξ€ 1 𝑓 ≀ 𝔽 number of such bad 𝑧𝑗 10π‘š ≀ π‘œ β‹… π‘š β‹… Ξ€ 1 π‘œ 10π‘š = 0.1

+ ≀ Ξ€ 1 𝑓 + 0.1 < 0.5

slide-107
SLIDE 107

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ

If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" < 0.5

slide-108
SLIDE 108

Let π‘š = Ξ€ 1 π‘žβˆ—, independently draw 𝑕1, 𝑕2, β‹― , π‘•π‘š. Maintain π’Ž sorted tables: For π‘˜ = 1,2, β‹― , π‘š: Store 𝑧1, 𝑧2, β‹― , π‘§π‘œ in table-π‘˜ in nondecreasing order of π‘•π‘˜(𝑧𝑗). Upon query π’š ∈ 𝒀: Find first 10 β‹… π‘š such 𝑧𝑗 that βˆƒπ‘˜: π‘•π‘˜ Τ¦ 𝑦 = π‘•π‘˜(𝑧𝑗) by binary search. If encounter some 𝑧𝑗 such that dist Τ¦ 𝑦, 𝑧𝑗 ≀ 𝑑𝑠 then return this 𝑧𝑗;

  • therwise return β€œno”.

(𝑑, 𝑠)-ANN in metric space (π‘Œ, dist) (𝑑, 𝑑𝑠, π‘žβˆ—, Ξ€ 1 π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉 Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ π‘Œ Query: some point Τ¦ 𝑦 ∈ π‘Œ

If the real answer is β€œno”: always correct. If exists 𝑧𝑑 such that dist Τ¦ 𝑦, 𝑧𝑑 ≀ 𝑠, then Pr answer "no" < 0.5 Space: 𝑃 π‘œπ‘š = 𝑃( Ξ€ π‘œ π‘žβˆ—) Time: 𝑃 π‘š β‹… log π‘œ = 𝑃 Ξ€ (log π‘œ) π‘žβˆ—
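The π‘š-table scheme can be sketched as runnable Python (hash family, names, and parameters are illustrative; `cs` stands for the slide's 𝑑𝑠):

```python
import bisect
import random

def build_tables(points, draw_g, m, seed=0):
    """Build m sorted tables, each keyed by an independent draw of the hash."""
    rng = random.Random(seed)
    tables = []
    for _ in range(m):
        g = draw_g(rng)
        rows = sorted((g(z), z) for z in points)
        tables.append((g, [key for key, _ in rows], [z for _, z in rows]))
    return tables

def query(tables, y, dist, cs, m):
    """Scan at most 10*m colliding candidates over all tables, as on the slide."""
    budget = 10 * m
    for g, keys, pts in tables:
        key = g(y)
        lo = bisect.bisect_left(keys, key)
        hi = bisect.bisect_right(keys, key)
        for z in pts[lo:hi]:
            if dist(y, z) <= cs:
                return z
            budget -= 1
            if budget == 0:
                return None   # exhausted the 10*m candidate budget
    return None               # answer "no"

# Toy usage: bit-sampling hashes concatenated 3 at a time on {0,1}^8.
d = 8
def draw_g(rng):
    idx = [rng.randrange(d) for _ in range(3)]
    return lambda x: tuple(x[i] for i in idx)

ham = lambda a, b: sum(u != v for u, v in zip(a, b))
data = [tuple([0] * d), tuple([1] * d)]
tables = build_tables(data, draw_g, m=4)
print(query(tables, tuple([0] * d), ham, cs=2, m=4))
```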

slide-109
SLIDE 109

(𝒅, 𝒔)-ANN: suppose we have an (𝑠, 𝑑𝑠, π‘ž, π‘Ÿ)-LSH β„Ž: π‘Œ β†’ 𝑉. Then we have an (𝑠, 𝑑𝑠, π‘ž^𝑙, 1/π‘œ)-LSH 𝑕: π‘Œ β†’ 𝑉^𝑙 where 𝑙 = log_{1/π‘Ÿ} π‘œ, implying π‘ž^𝑙 = π‘ž^{log_{1/π‘Ÿ} π‘œ} = π‘œ^{βˆ’πœ} with 𝜍 = log π‘ž / log π‘Ÿ. Hence we can solve (𝑑, 𝑠)-ANN with space 𝑃(π‘œ^{1+𝜍}), query time 𝑃(π‘œ^𝜍 β‹… log π‘œ), and one-sided error < 0.5.

slide-110
SLIDE 110

Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ 0,1 𝑒 Query: given a point Τ¦ 𝑦 ∈ 0,1 𝑒:

  • Return a 𝑧𝑗 s.t. dist Τ¦

𝑦, 𝑧𝑗 ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist Τ¦ 𝑦, π‘§π‘˜ ≀ 𝑠

  • Return β€œno” if βˆ€π‘§π‘˜: dist Τ¦

𝑦, π‘§π‘˜ > 𝑑𝑠

  • Arbitrary answer otherwise

(𝒅, 𝒔)-ANN in Hamming Space 𝟏, 𝟐 𝒆

slide-111
SLIDE 111

Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ 0,1 𝑒 Query: given a point Τ¦ 𝑦 ∈ 0,1 𝑒:

  • Return a 𝑧𝑗 s.t. dist Τ¦

𝑦, 𝑧𝑗 ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist Τ¦ 𝑦, π‘§π‘˜ ≀ 𝑠

  • Return β€œno” if βˆ€π‘§π‘˜: dist Τ¦

𝑦, π‘§π‘˜ > 𝑑𝑠

  • Arbitrary answer otherwise

(𝒅, 𝒔)-ANN in Hamming Space 𝟏, 𝟐 𝒆 β„‹ = β„Žπ‘— | β„Žπ‘— Τ¦ 𝑦 = Τ¦ 𝑦 𝑗 for 𝑗 = 1,2, β‹― , 𝑒 β„Ž is chosen uniformly at random from β„‹

slide-113
SLIDE 113

Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ 0,1 𝑒 Query: given a point Τ¦ 𝑦 ∈ 0,1 𝑒:

  • Return a 𝑧𝑗 s.t. dist Τ¦

𝑦, 𝑧𝑗 ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist Τ¦ 𝑦, π‘§π‘˜ ≀ 𝑠

  • Return β€œno” if βˆ€π‘§π‘˜: dist Τ¦

𝑦, π‘§π‘˜ > 𝑑𝑠

  • Arbitrary answer otherwise

(𝒅, 𝒔)-ANN in Hamming Space 𝟏, 𝟐 𝒆 β„‹ = β„Žπ‘— | β„Žπ‘— Τ¦ 𝑦 = Τ¦ 𝑦 𝑗 for 𝑗 = 1,2, β‹― , 𝑒 β„Ž is chosen uniformly at random from β„‹ We have a (𝑠, 𝑑𝑠, 1 βˆ’ Ξ€ 𝑠 𝑒 , 1 βˆ’ Ξ€ 𝑑𝑠 𝑒)-LSH β„Ž: 0,1 𝑒 β†’ {0,1}

slide-114
SLIDE 114

Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ 0,1 𝑒 Query: given a point Τ¦ 𝑦 ∈ 0,1 𝑒:

  • Return a 𝑧𝑗 s.t. dist Τ¦

𝑦, 𝑧𝑗 ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist Τ¦ 𝑦, π‘§π‘˜ ≀ 𝑠

  • Return β€œno” if βˆ€π‘§π‘˜: dist Τ¦

𝑦, π‘§π‘˜ > 𝑑𝑠

  • Arbitrary answer otherwise

(𝒅, 𝒔)-ANN in Hamming Space 𝟏, 𝟐 𝒆 β„‹ = β„Žπ‘— | β„Žπ‘— Τ¦ 𝑦 = Τ¦ 𝑦 𝑗 for 𝑗 = 1,2, β‹― , 𝑒 β„Ž is chosen uniformly at random from β„‹ We have a (𝑠, 𝑑𝑠, 1 βˆ’ Ξ€ 𝑠 𝑒 , 1 βˆ’ Ξ€ 𝑑𝑠 𝑒)-LSH β„Ž: 0,1 𝑒 β†’ {0,1}

𝜍 = log 1 βˆ’ Ξ€ 𝑠 𝑒 log(1 βˆ’ Ξ€ 𝑑𝑠 𝑒) ≀ 1 𝑑
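The bound 𝜍 ≀ 1/𝑑 is Bernoulli's inequality in disguise; writing the slide's 𝑠, 𝑑, 𝑒 as s, d, e, a short derivation:

```latex
% Bernoulli's inequality: (1-x)^d \ge 1-dx for x \in [0,1], d \ge 1, hence
\left(1-\frac{s}{e}\right)^{d} \ge 1-\frac{ds}{e}
\quad\Longrightarrow\quad
d \log\left(1-\frac{s}{e}\right) \ge \log\left(1-\frac{ds}{e}\right),
% and dividing by \log(1-ds/e) < 0 flips the inequality:
\varsigma = \frac{\log(1-s/e)}{\log(1-ds/e)} \le \frac{1}{d}.
```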

slide-115
SLIDE 115

Data: π‘œ points 𝑧1, 𝑧2, β‹― , π‘§π‘œ ∈ 0,1 𝑒 Query: given a point Τ¦ 𝑦 ∈ 0,1 𝑒:

  • Return a 𝑧𝑗 s.t. dist Τ¦

𝑦, 𝑧𝑗 ≀ 𝑑𝑠 if βˆƒπ‘§π‘˜: dist Τ¦ 𝑦, π‘§π‘˜ ≀ 𝑠

  • Return β€œno” if βˆ€π‘§π‘˜: dist Τ¦

𝑦, π‘§π‘˜ > 𝑑𝑠

  • Arbitrary answer otherwise

(𝒅, 𝒔)-ANN in Hamming Space 𝟏, 𝟐 𝒆 β„‹ = β„Žπ‘— | β„Žπ‘— Τ¦ 𝑦 = Τ¦ 𝑦 𝑗 for 𝑗 = 1,2, β‹― , 𝑒 β„Ž is chosen uniformly at random from β„‹ We have a (𝑠, 𝑑𝑠, 1 βˆ’ Ξ€ 𝑠 𝑒 , 1 βˆ’ Ξ€ 𝑑𝑠 𝑒)-LSH β„Ž: 0,1 𝑒 β†’ {0,1}

𝜍 = log 1 βˆ’ Ξ€ 𝑠 𝑒 log(1 βˆ’ Ξ€ 𝑑𝑠 𝑒) ≀ 1 𝑑

We can solve (𝑑, 𝑠)-ANN in Hamming space with space 𝑃(π‘œ1+1/𝑑), query time 𝑃(π‘œ1/𝑑 β‹… log π‘œ), and one-sided error < 0.5

slide-116
SLIDE 116

Recap

Dimension reduction (low-distortion metric embedding)

  β€’ Johnson-Lindenstrauss Theorem: in Euclidean space, it is always possible to embed a set of π‘œ points in arbitrary dimension into 𝑃(log π‘œ) dimensions with constant distortion.

Nearest neighbor search

  β€’ The exact version is difficult in high-dimensional space.
  β€’ Approximation and randomization help.
  β€’ If we can solve (𝒅, 𝒔)-ANN (Approximate Near Neighbor), then we can solve 𝒅-ANN (Approximate Nearest Neighbor) with limited overhead.
  β€’ Locality-sensitive hashing (LSH) is a powerful tool for solving (𝑑, 𝑠)-ANN. (Collisions can be helpful for hashing.)