New directions in approximate nearest neighbors for the angular - PowerPoint PPT Presentation

Structured filters Construct concatenated code O

Structured filters Normalize (only for example) O

Structured filters Construct Voronoi cells O

Structured filters Defines partition O

Structured filters ...with efficient decoding O

Structured filters Techniques • Idea 1: Increase number of regions to 2 Θ( d ) ◮ Number of hash tables increases to 2 Θ( d ) – ok for n = 2 Θ( d ) ◮ Decoding cost potentially too large O

Structured filters Techniques • Idea 1: Increase number of regions to 2 Θ( d ) ◮ Number of hash tables increases to 2 Θ( d ) – ok for n = 2 Θ( d ) ◮ Decoding cost potentially too large • Idea 2: Use structured codes for random regions ◮ Spherical/Voronoi LSH with dependent random points ◮ Concatenated code of log d low-dim. spherical codes O ◮ Allows for efficient list-decoding

Structured filters Techniques • Idea 1: Increase number of regions to 2 Θ( d ) ◮ Number of hash tables increases to 2 Θ( d ) – ok for n = 2 Θ( d ) ◮ Decoding cost potentially too large • Idea 2: Use structured codes for random regions ◮ Spherical/Voronoi LSH with dependent random points ◮ Concatenated code of log d low-dim. spherical codes O ◮ Allows for efficient list-decoding • Idea 3: Replace partitions with filters ◮ Relaxation: filters need not partition the space ◮ Simplified analysis ◮ Might not be needed to achieve improvement

Structured filters Results For random sparse settings ( n = 2 o ( d ) ), query time O ( n ρ ) with 1 � � ρ = 1 + o d (1) . 2 c 2 − 1 O

Structured filters Results For random sparse settings ( n = 2 o ( d ) ), query time O ( n ρ ) with 1 � � ρ = 1 + o d (1) . 2 c 2 − 1 For random dense settings ( n = 2 κ d with small κ ), we obtain 1 − κ � � ρ = 1 + o d ,κ (1) . 2 c 2 − 1 O

Structured filters Results For random sparse settings ( n = 2 o ( d ) ), query time O ( n ρ ) with 1 � � ρ = 1 + o d (1) . 2 c 2 − 1 For random dense settings ( n = 2 κ d with small κ ), we obtain 1 − κ � � ρ = 1 + o d ,κ (1) . 2 c 2 − 1 O For random dense settings ( n = 2 κ d with large κ ), we obtain ρ = − 1 1 � � � � 2 κ log 1 − 1 + o d (1) . 2 c 2 − 1

Asymmetric nearest neighbors Previous results: symmetric NNS • Query time: O ( n ρ ) • Update time: O ( n ρ ) • Preprocessing time: O ( n 1+ ρ ) • Space complexity: O ( n 1+ ρ )

Asymmetric nearest neighbors Previous results: symmetric NNS • Query time: O ( n ρ ) • Update time: O ( n ρ ) • Preprocessing time: O ( n 1+ ρ ) • Space complexity: O ( n 1+ ρ ) Can we get a tradeoff between these costs?

Asymmetric nearest neighbors Voronoi regions O

Asymmetric nearest neighbors Spherical cap

Asymmetric nearest neighbors Cap height α α

Asymmetric nearest neighbors Smaller α = ⇒ Larger caps, more work α

Asymmetric nearest neighbors Larger α = ⇒ Smaller caps, less work α

Asymmetric nearest neighbors α q > α u = ⇒ Faster queries, slower updates α u α q

Asymmetric nearest neighbors α q < α u = ⇒ Slower queries, faster updates α q α u

Asymmetric nearest neighbors Results General expressions ρ q = ( 2c 2 − 1 ) / c 4 Minimize space ( α q /α u = cos θ ) ρ u = 0 ρ q = 1 / ( 2c 2 − 1 ) Balance costs α q α u ρ u = 1 / ( 2c 2 − 1 ) ( α q /α u = 1) Minimize time ρ q = 0 ρ u = ( 2c 2 − 1 ) / ( c 2 − 1 ) 2 ( α q /α u = 1 / cos θ ) Query time O ( n ρ q ), update time O ( n ρ u ), preprocessing time O ( n 1+ ρ u ), space complexity O ( n 1+ ρ u )

Asymmetric nearest neighbors Results General expressions Small c = 1 + ε ρ q = ( 2c 2 − 1 ) / c 4 ρ q = 1 − 4 ε 2 + O ( ε 3 ) Minimize space ( α q /α u = cos θ ) ρ u = 0 ρ u = 0 ρ q = 1 / ( 2c 2 − 1 ) ρ q = 1 − 4 ε + O ( ε 2 ) Balance costs α q α u ρ u = 1 / ( 2c 2 − 1 ) ρ u = 1 − 4 ε + O ( ε 2 ) ( α q /α u = 1) Minimize time ρ q = 0 ρ q = 0 ρ u = ( 2c 2 − 1 ) / ( c 2 − 1 ) 2 ρ u = 1 / (4 ε 2 ) + O (1 /ε ) ( α q /α u = 1 / cos θ ) Query time O ( n ρ q ), update time O ( n ρ u ), preprocessing time O ( n 1+ ρ u ), space complexity O ( n 1+ ρ u )

Asymmetric nearest neighbors Results General expressions Large c → ∞ ρ q = ( 2c 2 − 1 ) / c 4 ρ q = 2 / c 2 + O (1 / c 4 ) Minimize space ( α q /α u = cos θ ) ρ u = 0 ρ u = 0 ρ q = 1 / ( 2c 2 − 1 ) ρ q = 1 / (2 c 2 ) + O (1 / c 4 ) Balance costs α q α u ρ u = 1 / ( 2c 2 − 1 ) ρ u = 1 / (2 c 2 ) + O (1 / c 4 ) ( α q /α u = 1) Minimize time ρ q = 0 ρ q = 0 ρ u = 2 / c 2 + O (1 / c 4 ) ρ u = ( 2c 2 − 1 ) / ( c 2 − 1 ) 2 ( α q /α u = 1 / cos θ ) Query time O ( n ρ q ), update time O ( n ρ u ), preprocessing time O ( n 1+ ρ u ), space complexity O ( n 1+ ρ u )

Asymmetric nearest neighbors Tradeoffs α q α u

Conclusions Main result: Allow using more regions with list-decodable codes • For n = 2 o ( d ) , non-asymptotic improvement • For n = 2 Θ( d ) , asymptotic improvement • Corollary: Lower bounds for n = 2 o ( d ) do not hold for n = 2 Θ( d ) • Improved tradeoffs between query and update complexities

New directions in approximate nearest neighbors for the angular - PowerPoint PPT Presentation

New directions in approximate nearest neighbors for the angular distance Thijs Laarhoven mail@thijs.com http://www.thijs.com/ Proximity Workshop, College Park (MD), USA (January 13, 2016) Nearest neighbor searching O Nearest neighbor

Approximate Nearest Neighbors Search Approximate Nearest Neighbors Search in High Dimensions in

Approximate Nearest Neighbors Sariel Har Peled: Notes Arya, Mount, Netenyahu, Silverman, Wu An

K-Nearest Neighbors Nicolas Indelicato K-Nearest Neighbors Dataset Background How the

k-Nearest Neighbors Lecture 2 k-Nearest Neighbors September 16, 2015 1 Wentworth Institute of

FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION Marius Muja, David G.

Simple and Fast Nearest Neighbor Search Marcel Birn, Manuel Holtgrewe, Peter Sanders , Johannes

c i,j max k,m c k,m 4 Wednesday, 2 Oct. 2019 Machine Learning (COMP 135) 3 Wednesday, 2

CSC 411: Lecture 05: Nearest Neighbors Class based on Raquel Urtasun & Rich Zemels lectures

c i,j max k,m c k,m 4 Wednesday, 26 Feb. 2020 Machine Learning (COMP 135) 3 Wednesday, 26

Inference and Estimation Using Nearest Neighbors 2019 The Second Korea-Japan Machine Learning

G. G. Stokes 1857 Stokes diagram with Stokes directions Halo at with singular directions

G. G. Stokes 1857 Stokes diagram with Stokes directions Halo at with singular directions

Nearest Neighbor and Locality-Sensitive Hashing Nearest Neighbor Set Similarity

Graph-based timespace trade-offs for approximate near neighbors Thijs Laarhoven

Approximate Nearest Neighbors via Point Location Among Balls Method of Har-Peled (improved

Proximity in the Age of Distraction: Robust Approximate Nearest Neighbor Search Sariel Har-Peled

Data-driven Inference, Reconstruction, and Observational Completeness of Quantum Devices Based on

A method for efficiently calculating head-related transfer functions (HRTFs) directly from head

Announcements Please turn in Assignment 3 and pick up Assignment 4 You can also email

Statistical aspects of determinantal point processes Fr ed eric Lavancier , Laboratoire de

Search for Cosmic Ray Sources Using Deep Learning on Spherical Data Niklas Langner Martin

Extrapolating Solar Dynamo Models throughout the Heliosphere Taylor Cox Bridgewater College

Analyzing the spatial structure of the Internet topology Sndor Laki ETOMIC TEAM Etvs

From Math 2220 Class 30 Change of Coordinates Polar/Sph/Cyl Dr. Allen Back Problems Inverses

New directions in approximate nearest neighbors for the angular - PowerPoint PPT Presentation

New directions in approximate nearest neighbors for the angular distance Thijs Laarhoven mail@thijs.com http://www.thijs.com/ Proximity Workshop, College Park (MD), USA (January 13, 2016) Nearest neighbor searching O Nearest neighbor

Approximate Nearest Neighbors Search Approximate Nearest Neighbors Search in High Dimensions in

Approximate Nearest Neighbors Sariel Har Peled: Notes Arya, Mount, Netenyahu, Silverman, Wu An

K-Nearest Neighbors Nicolas Indelicato K-Nearest Neighbors Dataset Background How the

k-Nearest Neighbors Lecture 2 k-Nearest Neighbors September 16, 2015 1 Wentworth Institute of

FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION Marius Muja, David G.

Simple and Fast Nearest Neighbor Search Marcel Birn, Manuel Holtgrewe, Peter Sanders , Johannes

c i,j max k,m c k,m 4 Wednesday, 2 Oct. 2019 Machine Learning (COMP 135) 3 Wednesday, 2

CSC 411: Lecture 05: Nearest Neighbors Class based on Raquel Urtasun &amp; Rich Zemels lectures

c i,j max k,m c k,m 4 Wednesday, 26 Feb. 2020 Machine Learning (COMP 135) 3 Wednesday, 26

Inference and Estimation Using Nearest Neighbors 2019 The Second Korea-Japan Machine Learning

G. G. Stokes 1857 Stokes diagram with Stokes directions Halo at with singular directions

G. G. Stokes 1857 Stokes diagram with Stokes directions Halo at with singular directions

Nearest Neighbor and Locality-Sensitive Hashing Nearest Neighbor Set Similarity

Graph-based timespace trade-offs for approximate near neighbors Thijs Laarhoven

Approximate Nearest Neighbors via Point Location Among Balls Method of Har-Peled (improved

Proximity in the Age of Distraction: Robust Approximate Nearest Neighbor Search Sariel Har-Peled

Data-driven Inference, Reconstruction, and Observational Completeness of Quantum Devices Based on

A method for efficiently calculating head-related transfer functions (HRTFs) directly from head

Announcements Please turn in Assignment 3 and pick up Assignment 4 You can also email

Statistical aspects of determinantal point processes Fr ed eric Lavancier , Laboratoire de

Search for Cosmic Ray Sources Using Deep Learning on Spherical Data Niklas Langner Martin

Extrapolating Solar Dynamo Models throughout the Heliosphere Taylor Cox Bridgewater College

Analyzing the spatial structure of the Internet topology Sndor Laki ETOMIC TEAM Etvs

From Math 2220 Class 30 Change of Coordinates Polar/Sph/Cyl Dr. Allen Back Problems Inverses

CSC 411: Lecture 05: Nearest Neighbors Class based on Raquel Urtasun & Rich Zemels lectures