. . . 1 / 5 The curse of dimensionality . many applications - - PowerPoint PPT Presentation

1 5 the curse of dimensionality many applications require
SMART_READER_LITE
LIVE PREVIEW

. . . 1 / 5 The curse of dimensionality . many applications - - PowerPoint PPT Presentation

The curse of dimensionality . . . 1 / 5 The curse of dimensionality . many applications require high dimensional data . . 1 / 5 The curse of dimensionality . many applications require high dimensional data many algorithms become


slide-1
SLIDE 1

. .

.

The curse of dimensionality

1 / 5

slide-2
SLIDE 2

. .

.

The curse of dimensionality

many applications require high dimensional data

1 / 5

slide-3
SLIDE 3

. .

.

The curse of dimensionality

many applications require high dimensional data many algorithms become inefficient with high dimensional

1 / 5

slide-4
SLIDE 4

. .

.

The curse of dimensionality

many applications require high dimensional data many algorithms become inefficient with high dimensional like to replace high dimensional data by smaller dimensional data without losing too much information

1 / 5

slide-5
SLIDE 5

. .

.

The curse of dimensionality

many applications require high dimensional data many algorithms become inefficient with high dimensional like to replace high dimensional data by smaller dimensional data without losing too much information see two techniques for this task

1 / 5

slide-6
SLIDE 6

. .

.

The curse of dimensionality

many applications require high dimensional data many algorithms become inefficient with high dimensional like to replace high dimensional data by smaller dimensional data without losing too much information see two techniques for this task

1 Johnson-Lindenstrauss lemma

1 / 5

slide-7
SLIDE 7

. .

.

The curse of dimensionality

many applications require high dimensional data many algorithms become inefficient with high dimensional like to replace high dimensional data by smaller dimensional data without losing too much information see two techniques for this task

1 Johnson-Lindenstrauss lemma 2 singular value decomposition / principal component analysis

another technique is feature selection

1 / 5

slide-8
SLIDE 8

. .

.

The Johnson-Lindenstrauss lemma

.

Theorem 5.1

. . Let P be a set of n points in Rd and 0 < ϵ < 1. Then, for c large enough, there is an embedding π : P → Rc log(n)/ϵ2, such that for all p, q ∈ P (1 − ϵ) · Dl2(p, q) ≤ Dl2(π(p), π(q)) ≤ (1 + ϵ) · Dl2(p, q).

2 / 5

slide-9
SLIDE 9

. .

.

The Johnson-Lindenstrauss lemma - the construction

.

Gaussian distribution

. . µ ∈ R, σ ∈ R>0 density function N(·|µ, σ2) : R → R>0 N(x|µ, σ2) → 1 √ 2πσ2 · exp(−(x − µ)2 2σ2 ) distribution with density function N(· | µ, σ2) called Gaussian

  • r normal distribution N(µ, σ2) with mean µ and standard

deviation σ,i.e. ∀l ∈ R : Pr[x ≤ l] = ∫ l

−∞

N(x | µ, σ2)dx.

3 / 5

slide-10
SLIDE 10

. .

.

The Johnson-Lindenstrauss lemma - the construction

.

Density function of Gaussian distribution

. .

4 / 5

slide-11
SLIDE 11

. .

.

The Johnson-Lindenstrauss lemma - the construction

.

Random mapping

. . A = (rij)1≤i≤k,1≤j≤d ∈ Rk×d, where each rij is chosen according to N(0, 1). ∀x ∈ Rd : πA(x) =

1 √ k · A · x.

. . .

5 / 5

slide-12
SLIDE 12

. .

.

The Johnson-Lindenstrauss lemma - the construction

.

Random mapping

. . A = (rij)1≤i≤k,1≤j≤d ∈ Rk×d, where each rij is chosen according to N(0, 1). ∀x ∈ Rd : πA(x) =

1 √ k · A · x.

.

Lemma 5.2

. . Let πA : Rd → Rk be a chosen as above, let u ∈ Rd be a vector, and let 0 < ϵ < 1. Then, for c large enough and k = c · log(n)/ϵ2: Pr [ (1 − ϵ) ≤ ∥πA(u)∥2 ∥u∥2 ≤ (1 + ϵ) ] ≥ 1 − 1 3n2 .

5 / 5