today
play

Today. Cuckoo hashing. Today. Cuckoo hashing. - PowerPoint PPT Presentation

Today. Cuckoo hashing. Today. Cuckoo hashing. Johnson-Lindenstrass. Cuckoo hashing. Hashing with two choices: max load O ( loglog n ) . Cuckoo hashing. Hashing with two choices: max load O ( loglog n ) . Cuckoo hashing: Cuckoo hashing.


  1. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions.

  2. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k

  3. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i

  4. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x �

  5. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux �

  6. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux �

  7. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux � = � e i | z �

  8. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux � = � e i | z � Inverse of U maps e i to random vector v i

  9. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux � = � e i | z � Inverse of U maps e i to random vector v i

  10. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux � = � e i | z � Inverse of U maps e i to random vector v i z = Ux is uniformly distributed on d sphere for unit x ∈ R d .

  11. Projections. Project x into subspace spanned by v 1 , v 2 , ··· , v k . y 1 = x · v 1 , y 2 = x · , v 2 , ··· , y k = x · v k Projection: ( y 1 ,..., y k ) . Have: Arbitrary vector, random k -dimensional subspace. View As: Random vector, standard basis for k dimensions. Orthogonal U - rotates v 1 ,..., v k onto e 1 ,..., e k y i = � v i | x � = � Uv i | Ux � = � e i | Ux � = � e i | z � Inverse of U maps e i to random vector v i z = Ux is uniformly distributed on d sphere for unit x ∈ R d . y i is i th coordinate of random vector z .

  12. Expected value of y i . Random projection: first k coordinates of random unit vector, z i .

  13. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1.

  14. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation.

  15. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed.

  16. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d .

  17. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d . Linearity of Expectation.

  18. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d . Linearity of Expectation. � k Expected length is d .

  19. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d . Linearity of Expectation. � k Expected length is d . Johnson-Lindenstrass: close to expectation.

  20. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d . Linearity of Expectation. � k Expected length is d . Johnson-Lindenstrass: close to expectation. k is large enough →

  21. Expected value of y i . Random projection: first k coordinates of random unit vector, z i . E [ ∑ i ∈ [ d ] z 2 i ] = 1. Linearity of Expectation. By symmetry, each z i is identically distributed. i ] = k E [ ∑ i ∈ [ k ] z 2 d . Linearity of Expectation. � k Expected length is d . Johnson-Lindenstrass: close to expectation. k is large enough → � k ≈ ( 1 ± ε ) d with decent probability.

  22. Concentration Bounds. z is uniformly random unit vector.

  23. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d .

  24. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √

  25. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 .

  26. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . ∆

  27. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if ∆

  28. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. ∆

  29. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. ∆

  30. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps ∆

  31. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2

  32. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2 ∝ r d = 1 − ∆ 2 � d / 2 �

  33. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2 ∝ r d = 1 − ∆ 2 � d / 2 � � d / 2 � 1 − t 2 ∝ d

  34. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2 ∝ r d = 1 − ∆ 2 � d / 2 � � d / 2 − t 2 � 1 − t 2 ∝ ≈ e 2 d

  35. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2 ∝ r d = 1 − ∆ 2 � d / 2 � � d / 2 − t 2 � 1 − t 2 ∝ ≈ e 2 d Constant of ∝ is unit sphere area.

  36. Concentration Bounds. z is uniformly random unit vector. Random point on the unit sphere. E [ ∑ i ∈ [ k ] z 2 i ] = k d . d ] ≤ e − t 2 / 2 t Claim: Pr [ | z 1 | > √ Sphere view: surface “far” from equator defined by e 1 . | z 1 | ≥ ∆ if z ≥ ∆ from equator of sphere. Point on “ ∆ -spherical cap”. Area of caps √ ∆ ≤ S.A. of sphere of radius 1 − ∆ 2 ∝ r d = 1 − ∆ 2 � d / 2 � � d / 2 − t 2 � 1 − t 2 ∝ ≈ e 2 d Constant of ∝ is unit sphere area. Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small.

  37. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small.

  38. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. Total Length?

  39. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k .

  40. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d �

  41. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 .

  42. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k − � � d � �

  43. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε d ] � � d �

  44. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � d �

  45. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d �

  46. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above.

  47. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector.

  48. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit.

  49. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j |

  50. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j | with probability ≤ 1 n c

  51. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j | with probability ≤ 1 n c Scaled vector length also preserved.

  52. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j | with probability ≤ 1 n c Scaled vector length also preserved. ≤ n 2 pairs

  53. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j | with probability ≤ 1 n c Scaled vector length also preserved. ≤ n 2 pairs plus union bound

  54. Many coordinates. Argued Pr [ any z 2 i > ( 2log d ) E [ z 2 i ]] is small. � z 2 1 + z 2 2 + ··· z 2 Total Length? z = k . � � � � � > t ] ≤ e − t 2 d / 2 � ( z 2 1 + z 2 2 + ··· + z 2 k � Pr [ k ) − � � d � � d , k = c log n k Substituting t = ε ε 2 . � � � � � d ] ≤ e − ε 2 k = e − c log n = 1 � z 2 1 + z 2 2 + ··· + z 2 k � k Pr [ k − � > ε � � n c d � Johnson-Lindenstraus: For n points, x 1 ,..., x n , all distances � k preserved to within 1 ± ε under d -scaled projection above. View one pair x i − x j as vector. Scale to unit. Projection fails to preserve | x i − x j | with probability ≤ 1 n c Scaled vector length also preserved. ≤ n 2 pairs plus union bound 1 → prob any pair fails to be preserved with ≤ n c − 2 .

  55. Locality Preserving Hashing Find nearby points in high dimensional space.

  56. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images!

  57. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images! Hash function h ( · ) s.t. h ( x i ) = h ( x j ) if d ( x i , x j ) ≤ δ .

  58. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images! Hash function h ( · ) s.t. h ( x i ) = h ( x j ) if d ( x i , x j ) ≤ δ . √ Low dimensions: grid cells give d -approximation.

  59. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images! Hash function h ( · ) s.t. h ( x i ) = h ( x j ) if d ( x i , x j ) ≤ δ . √ Low dimensions: grid cells give d -approximation. Not quite a solution.

  60. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images! Hash function h ( · ) s.t. h ( x i ) = h ( x j ) if d ( x i , x j ) ≤ δ . √ Low dimensions: grid cells give d -approximation. Not quite a solution. Why?

  61. Locality Preserving Hashing Find nearby points in high dimensional space. Points could be images! Hash function h ( · ) s.t. h ( x i ) = h ( x j ) if d ( x i , x j ) ≤ δ . √ Low dimensions: grid cells give d -approximation. Not quite a solution. Why? Close to grid boundary.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend