Dimensionality Reduction embedding Distortion L Norm Corollaries - - PDF document

dimensionality reduction
SMART_READER_LITE
LIVE PREVIEW

Dimensionality Reduction embedding Distortion L Norm Corollaries - - PDF document

Dimensionality Reduction Metric Space Isometric Dimensionality Reduction embedding Distortion L Norm Corollaries Anil Maheshwari Euclidean Norm anil@scs.carleton.ca School of Computer Science Carleton University Canada Metric Space


slide-1
SLIDE 1

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Dimensionality Reduction

Anil Maheshwari

anil@scs.carleton.ca School of Computer Science Carleton University Canada

slide-2
SLIDE 2

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Metric Space hX, di

Let X be a set of n-points and let d be a distance measure associated with pairs of elements in X. We say that hX, di is a finite metric space if the function d satisfies metric properties, i.e. (a) 8x 2 X, d(x, x) = 0, (b) 8x, y 2 X, x 6= y, d(x, y) > 0, (c) 8x, y 2 X, d(x, y) = d(y, x) (symmetry), and (d) 8x, y, z 2 X, d(x, y)  d(x, z) + d(z, y) (triangle inequality).

slide-3
SLIDE 3

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Embeddings

Let hX, di and hX0, d0i be two metric spaces. Embedding: A map f : X ! X0 is called an embedding. Isometric embedding (i.e., distance preserving) if for all x, y 2 X, d(x, y) = d0(f(x), f(y)).

slide-4
SLIDE 4

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Motivating Problem

Input: X=Set of n-points in k-dimensional space, where n >> 2k Output: A pair of points that maximize L1-distance. Naive Solution: O(k n

2

  • ) = O(kn2) time

Better algorithm via isometric embedding of Lk

1 ! L2k 1

running in O(2kn) time

slide-5
SLIDE 5

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Universality of L1-metric

L1-metric

Let hX, di be any finite metric space, where n = |X|. X can be isometrically embedded into L1-metric space of appropriate dimension.

slide-6
SLIDE 6

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Euclidean Metric

Input: Metric Space defined by K4, C4, and star-Y w.r.t. unweighted SP . Question: Can one embed 4-points in Euclidean space isometrically?

slide-7
SLIDE 7

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Distortion

Contraction: Is the maximum factor by which the distances shrink and it equals maxx,y2X

d(x,y) d0(f(x),f(y)).

Expansion: Is the maximum factor by which the distances are stretched and it equals maxx,y2X

d0(f(x),f(y)) d(x,y)

. Distortion: of an embedding is the product of its expansion and contraction factor.

slide-8
SLIDE 8

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

hX, di

D

, ! Lk=O(Dn

2 D log n)

1

Input: A metric space hX, di, where X is a set of n-points and let d satisfies the metric properties. Output: An embedding of X in a k = O(Dn

2 D log n)

dimensional space such that such that the distances gets distorted (actually contracted) by a factor of at most D under L1 norm.

slide-9
SLIDE 9

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

hX, di

D

, ! Lk=O(Dn

2 D log n)

1

(contd.)

Let x, y 2 X and let f(x), f(y) be their embedding in the k-dimensional space, respectively.

Property

The distances gets contracted by a factor of at most D 1. Formally, maxx,y2X

d(x,y) ||f(x)f(y)||1  D

Example: If D = O(log n), k = O(log2 n), i.e. hX, di

O(log n)

, ! LO(log2 n)

1

Meaning: Any metric space hX, di can be embedded in a O(log2 n)-dimensional space and the distances may distort (contract) by a factor of at most O(log n). Applications ?

slide-10
SLIDE 10

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Proof of hX, di

D

, ! Lk=O(Dn

2 D log n)

1

Constructive proof via a randomized algorithm.

Definition

Let S ✓ X. For x 2 X, define distance of x from S as d(x, S) = min

z2S d(x, z)

Claim

Let x, y 2 X. For all S ✓ X, |d(x, S) d(y, S)|  d(x, y).

slide-11
SLIDE 11

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Proof Contd.

Definition

(Mapping) Let x 2 X. Let S1, S2, · · · , Sk ✓ X. The mapping f maps x to the point f(x) = {d(x, S1), d(x, S2), · · · , d(x, Sk)}. Observation: Let S1, S2, · · · , Sk ✓ X. For x, y 2 X, ||f(x) f(y)||1  d(x, y).

slide-12
SLIDE 12

Proof Contd.

Definition (Mapping) Let x 2 X. Let S1, S2, · · · , Sk ✓ X. The mapping f maps x to the point f(x) = {d(x, S1), d(x, S2), · · · , d(x, Sk)}. Observation: Let S1, S2, · · · , Sk ✓ X. For x, y 2 X, ||f(x) f(y)||1  d(x, y).

2020-10-19

Dimensionality Reduction L∞ Norm Proof Contd.

Proof.

Follows from the above claim, as for each 1  i  k, |d(x, Si) d(y, Si)|  d(x, y).

slide-13
SLIDE 13

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Randomized Algorithm

Input: Metric space X and parameter D. Output: A set of O(Dm) subsets of X.

1

p min( 1

2, n 2

D ) 2

m O(n

2 D log n) 3

For j 1 to d D

2 e and

For i 1 to m: Choose set Sij by sampling each element of X independently with probability pj

4

For each x 2 X return f(x) = [d(x, S11), · · · d(x, Sm1), d(x, S12), · · · , d(x, Sm2), · · · d(x, S1d D

2 e), · · · , d(x, Smd D 2 e)]

slide-14
SLIDE 14

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

An Observation

Let x, y be two distinct points of X. Let B(x, r) be the set

  • f points of X that are within a distance of r from x (think
  • f B(x, r) as a ball of radius r centred at x). Similarly, let

B(y, r + ∆) be the set of points of X that are within a distance of r + ∆ from y. Consider a subset S ⇢ X such that S \ B(x, r) 6= ; and S \ B(y, r + ∆) = ;. Then |d(x, S) d(y, S)| ∆.

slide-15
SLIDE 15

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Key Lemma

Lemma

Let x, y be two distinct points of X. There exists an index j 2 {1, · · · , d D

2 e} such that if Sij is as chosen in the

Algorithm, than Pr ⇥ ||f(x) f(y)||1 d(x,y)

D

⇤ p

12

1

p min( 1

2, n 2

D ) 2

m O(n

2 D log n) 3

For j 1 to d D

2 e and

For i 1 to m: Choose set Sij by sampling each element of X independently with probability pj

4

For each x 2 X return f(x) = [d(x, S11), · · · d(x, Sm1), d(x, S12), · · · , d(x, Sm2), · · · d(x, S1d D

2 e), · · · , d(x, Smd D 2 e)]

slide-16
SLIDE 16

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Ball Properties

Set ∆ = d(x,y)

D

. For i = 0, · · · , d D

2 e, define balls of radius i∆ as follows.

Let B0 = {x}. B1 be the ball of radius ∆ centred at y. B2 is the ball of radius 2∆ centred at x. B3 is the ball centred at y of radius 3∆ and so on.

Property I

No even ball overlaps with an odd ball.

slide-17
SLIDE 17

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Ball Properties (contd.)

For even (odd) i, let |Bi| denote the number of points of X that are within a distance of at most i∆ from x (respectively, y).

Property II

There is an index t 2 {0, · · · , d D

2 e 1}, such that

|Bt| n

2t D and |Bt+1|  n 2(t+1) D

slide-18
SLIDE 18

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Ball Properties (contd.)

Let t be the index such that |Bt| n

2t D and

|Bt+1|  n

2(t+1) D

Consider when j = t + 1 in the Algorithm.

Property III

The set Sij chosen by the algorithm has non-empty intersection with Bt with probability at least p/3, and it will avoid Bt+1 with probability at least 1/4. Define: Event E1: Sij \ Bt 6= ;. Event E2: Sij \ Bt+1 = ;.

slide-19
SLIDE 19

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Event E1

Pr(Sij \ Bt 6= ;) p/3

slide-20
SLIDE 20

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Event E2

Pr(Sij \ Bt+1 = ;) 1/4

slide-21
SLIDE 21

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Main Theorem

hX, di

D

, ! Lk=O(Dn

2 D log n)

1

slide-22
SLIDE 22

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Corollary 1: hX, di

Θ(log n)

, ! LO(log2 n)

1

Set D = Θ(log n), in the Theorem hX, di

D

, ! Lk=O(Dn

2 D log n)

1

and we obtain hX, di

Θ(log n)

, ! LO(log2 n)

1

.

slide-23
SLIDE 23

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Corollary 2: hX, di

log2 n

, ! LO(log2 n)

1

Let k = O(log2 n) be the dimension of embedding. For a pair of points x, y 2 X, we have ||f(x) f(y)||1  kd(x, y) (it holds for each coordinate). In the Theorem, for a pair x, y 2 X, we know that there is at least one set which is good, i.e., with probability 1 1/n2, ||f(x) f(y)||1

d(x,y) Θ(log n).

Extend the machinery in the Theorem to show that with high probability there are log n sets that are good by choosing slightly larger value for m (but still of order of O(log n)). If this is the case, then ||f(x) f(y)||1 log n d(x,y)

Θ(log n) = Θ(d(x, y))

Thus we have Θ(d(x, y))  ||f(x) f(y)||1  kd(x, y), and hence we have a mapping with distortion O(log2 n).

slide-24
SLIDE 24

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Corollary 3: hX, di

log1.5 n

, ! LO(log2 n)

2

Let k = O(log2 n) be the dimension of embedding. Observe that for the same embedding as in Corollary 1, for a pair of points x, y 2 X, we have ||f(x) f(y)||2 = qX (d(x, Sij) d(y, Sij))2  p kd(x, y) We can show, ||f(x) f(y)||2 = qX (d(x, Sij) d(y, Sij))2

  • s

log n( d(x, y) Θ(log n))2

  • d(x, y)

Θ(plog n) This results in a total distortion of O(log1.5 n).

slide-25
SLIDE 25

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Mapping under Euclidean Norm

Let X be a set of n points in d-dimensional space, where d  n. We will map points of X to a O( ln n

✏2 )-dimensional

space such that the distortion is within a factor of 1 ± ✏. Distances are measured with respect to Euclidean distance.

slide-26
SLIDE 26

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Johnson-Lindenstrauss Theorem

Let V be a set of n points in d-dimensions. A mapping f : Rd ! Rk can be computed, in randomized polynomial time, so that for all pairs of points u, v 2 V , (1 ✏)||u v||2  ||f(u) f(v)||2  (1 + ✏)||u v||2, where 0 < ✏ < 1 and n, d, and k 4( ✏2

2 ✏3 3 )1 ln n are

positive integers. Comments: || . . . || is with respect to Euclidean distance Function f is defined in terms of a matrix Ak⇥d with entries from standardized normal distribution.

slide-27
SLIDE 27

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Normal Distribution

Random variable X has a Normal Distribution N(µ, 2), with mean µ and standard deviation > 0, if its probability density function is of the form f(x) =

1 p 2⇡e 1

2 ( xµ σ ) 2

, 1 < x < 1 If X has a Normal distribution N(µ, 2), than aX + b has a Normal distribution N(aµ + b, a22), for constants a, b. The distribution N(0, 1), with pdf

1 p 2⇡e x2

2 , is referred to

as a standardized normal distribution.

Sum of Normal Distributions

Let X and Y be independent r.v. with Normal distributions N(µ1, 2

1) and N(µ2, 2 2). Let r.v. Z = X + Y .

Z has a Normal distribution N(µ1 + µ2, 2

1 + 2 2).

The sum of two independent Normal distributions is a Normal distribution.

slide-28
SLIDE 28

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

A Random Matrix

Consider a k ⇥ d matrix A where its entries are chosen independently from N(0, 1

k).

Let x be a vector in Rd. Consider the k-dimensional vector Ax

Expected squared length

E[||Ax||2] = ||x||2

slide-29
SLIDE 29

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

How good is E[||Ax||2] = ||x||2?

Estimate Pr(||Ax||2 (1 + ✏)||x||2) and Pr(||Ax||2  (1 ✏)||x||2), for ✏ 2 (0, 1). Pr(||Ax||2 (1 + ✏)||x||2) = Pr(

k

P

i=1

Z2

i (1 + ✏)||x||2),

where Zi is a random variable with distribution N(0, ||x||2

k )

Divide by ||x||2

k , and we obtain

Pr(

k

P

i=1

Y 2

i (1 + ✏)k),

where Yi has a N(0, 1) distribution.

New Problem

Estimate Pr(

k

P

i=1

Y 2

i (1 + ✏)k), where Yi has a N(0, 1)

distribution.

slide-30
SLIDE 30

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Estimating Pr(

k

P

i=1

Y 2

i )

Pr(

k

P

i=1

Y 2

i (1 + ✏)k)  e k

4 (✏2✏3)

Pr(

k

P

i=1

Y 2

i  (1 ✏)k)  e k

4 (✏2✏3)

slide-31
SLIDE 31

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Estimating Pr(

k

P

i=1

Y 2

i )

If k = 20log n

✏2 ,

Pr((1 ✏)k 

k

P

i=1

Y 2

i  (1 + ✏)k) 1 1 n3

slide-32
SLIDE 32

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Back to J-L Theorem

Let V be a set of n points in d-dimensions. A mapping f : Rd ! Rk can be computed, in randomized polynomial time, so that for all pairs of points u, v 2 V , (1 ✏)||u v||2  ||f(u) f(v)||2  (1 + ✏)||u v||2, where 0 < ✏ < 1 and n, d, and k 4( ✏2

2 ✏3 3 )1 ln n are

positive integers. By choosing matrix Ak⇥d consisting of independent values from N(0, 1

k), we show that 8u, v 2 V

Pr((1✏)||uv||2  ||AuAv||2  (1+✏)||uv||2) 1 1

n

slide-33
SLIDE 33

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Proof of J-L Theorem

We know that for any vector x 2 Rd, Pr((1 ✏)||x||2  ||Ax||2  (1 + ✏)||x||2) 1 1

n3

Consider any pair of points u, v 2 V . Set x = u v. Then Pr((1✏)||uv||2  ||A(uv)||2  (1+✏)||uv||2) 1 1

n3

There are in all n

2

  • pairs of points in V .

By union bound, we have that 8u, v 2 V Pr((1✏)||uv||2  ||AuAv||2  (1+✏)||uv||2) 1 1

n

slide-34
SLIDE 34

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

Comments

1

Choice of matrix A doesn’t depend on points in V

2

What properties A needed to satisfy?

3

E[||Ax||2] = ||x||2

4

A is dense = ) Av takes more computation time

5

Can we find sparse matrix A? Choose entries of A from {1, 1, 0} with probabilities 1/6,1/6, and 2/3, respectively and normalize.

6

. . .

slide-35
SLIDE 35

Dimensionality Reduction Metric Space Isometric embedding Distortion L∞ Norm Corollaries Euclidean Norm

References

1

Johnson and Lindenstrauss, Extensions of Lipschitz mappings into a Hilbert space, Contemporary Mathematics 26:189-206, 1984.

2

Achlioptas, Database-friendly random projections, JCSS 66(4): 671-687, 2003.

3

Dasgupta, and Gupta, An elementary proof of a theorem of Johnson and Lindenstrauss" Random Structures & Algorithms, 22 (1): 60-65, 2003.

4

Dubhashi and Panconesi, Concentration of Measure for the Analysis of Randomized Algorithms, Cambridge University Press, 2009.

5

Matousek, Lectures on Discrete Geometry, Volume 212 of Graduate Texts in Mathematics. Springer, New York, 2002.

6

Ankush Moitra Notes