Multiresolution Analysis DS-GA 1013 / MATH-GA 2824 - - PowerPoint PPT Presentation

multiresolution analysis
SMART_READER_LITE
LIVE PREVIEW

Multiresolution Analysis DS-GA 1013 / MATH-GA 2824 - - PowerPoint PPT Presentation

Multiresolution Analysis DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/OBDA_fall17/index.html Carlos Fernandez-Granda Frames Short-time Fourier transform (STFT) Wavelets Thresholding


slide-1
SLIDE 1

Multiresolution Analysis

DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis

http://www.cims.nyu.edu/~cfgranda/pages/OBDA_fall17/index.html Carlos Fernandez-Granda

slide-2
SLIDE 2

Frames Short-time Fourier transform (STFT) Wavelets Thresholding

slide-3
SLIDE 3

Definition

Let V be an inner-product space A frame of V is a set of vectors F := { v1, v2, . . .} such that for every x ∈ V cL || x||2

·,· ≤

  • v∈F

| x, v|2 ≤ cU || x||2

·,·

for fixed positive constants cU ≥ cL ≥ 0 The frame is a tight frame if cL = cU

slide-4
SLIDE 4

Frames span the whole space

Any frame F := { v1, v2, . . .} of V spans V Proof: Assume y / ∈ span ( v1, v2, . . .)

slide-5
SLIDE 5

Frames span the whole space

Any frame F := { v1, v2, . . .} of V spans V Proof: Assume y / ∈ span ( v1, v2, . . .) Then Pspan(

v1, v2,...)⊥

y is nonzero and

  • v∈F
  • Pspan(

v1, v2,...)⊥

y, v

  • 2

=

slide-6
SLIDE 6

Frames span the whole space

Any frame F := { v1, v2, . . .} of V spans V Proof: Assume y / ∈ span ( v1, v2, . . .) Then Pspan(

v1, v2,...)⊥

y is nonzero and

  • v∈F
  • Pspan(

v1, v2,...)⊥

y, v

  • 2

= 0

slide-7
SLIDE 7

Orthogonal bases are tight frames

Any orthonormal basis B :=

  • b1,

b2, . . .

  • is a tight frame

Proof:

slide-8
SLIDE 8

Orthogonal bases are tight frames

Any orthonormal basis B :=

  • b1,

b2, . . .

  • is a tight frame

Proof: For any vector x ∈ V || x||2

·,·

slide-9
SLIDE 9

Orthogonal bases are tight frames

Any orthonormal basis B :=

  • b1,

b2, . . .

  • is a tight frame

Proof: For any vector x ∈ V || x||2

·,· =

  • b∈B
  • x,

b

  • b
  • 2

·,·

slide-10
SLIDE 10

Orthogonal bases are tight frames

Any orthonormal basis B :=

  • b1,

b2, . . .

  • is a tight frame

Proof: For any vector x ∈ V || x||2

·,· =

  • b∈B
  • x,

b

  • b
  • 2

·,·

=

  • b∈B
  • x,

b

  • 2
  • b
  • 2

·,·

slide-11
SLIDE 11

Orthogonal bases are tight frames

Any orthonormal basis B :=

  • b1,

b2, . . .

  • is a tight frame

Proof: For any vector x ∈ V || x||2

·,· =

  • b∈B
  • x,

b

  • b
  • 2

·,·

=

  • b∈B
  • x,

b

  • 2
  • b
  • 2

·,·

=

  • b∈B
  • x,

b

  • 2
slide-12
SLIDE 12

Analysis operator

The analysis operator Φ of a frame maps a vector to its coefficients Φ ( x) [k] = x, vk For any finite frame { v1, v2, . . . , vm} of Cn the analysis operator is F :=    

  • v∗

1

  • v∗

2

. . .

  • v∗

m

   

slide-13
SLIDE 13

Frames in finite-dimensional spaces

  • v1,

v2, . . . , vm are a frame of Cn if and only F is full rank In that case, cU = σ2

1

cL = σ2

n

Proof: σ2

n ≤ ||F

x||2

2 = m

  • j=1
  • x,

vj2 ≤ σ2

1

slide-14
SLIDE 14

Pseudoinverse

If an n × m tall matrix A, m ≥ n, is full rank, then its pseudoinverse A† := (A∗A)−1 A∗ is well defined, is a left inverse of A A†A = I and equals A† = VS−1U∗ where A = USV ∗ is the SVD of A

slide-15
SLIDE 15

Proof

A† := (A∗A)−1 A∗

slide-16
SLIDE 16

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗

slide-17
SLIDE 17

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗
slide-18
SLIDE 18

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗

= VS−2V ∗VSU∗

slide-19
SLIDE 19

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗

= VS−2V ∗VSU∗ = VS−1U

slide-20
SLIDE 20

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗

= VS−2V ∗VSU∗ = VS−1U A†A

slide-21
SLIDE 21

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗

= VS−2V ∗VSU∗ = VS−1U A†A = VS−1UV ∗USV ∗

slide-22
SLIDE 22

Proof

A† := (A∗A)−1 A∗ = (VSU∗USV ∗A)−1 VSU∗ =

  • VS2V ∗−1 VSU∗

= VS−2V ∗VSU∗ = VS−1U A†A = VS−1UV ∗USV ∗ = I

slide-23
SLIDE 23

Frames Short-time Fourier transform (STFT) Wavelets Thresholding

slide-24
SLIDE 24

Motivation

Spectrum of speech, music, etc. varies over time Idea: Compute frequency representation of time segments of the signal

slide-25
SLIDE 25

Short-time Fourier transform

The short-time Fourier transform (STFT) of a function f ∈ L2[−1/2, 1/2] is STFT {f } (k, τ) := 1/2

−1/2

f (t) w (t − τ)e−i2πkt dt where w ∈ L2[−1/2, 1/2] is a window function Frame vectors: vk,τ (t) := w (t − τ) ei2πkt

slide-26
SLIDE 26

Discrete short-time Fourier transform

The STFT of a vector x ∈ Cn is STFT {f } (k, l) :=

  • x ◦

w[l], hk

  • where w ∈ Cn is a window vector

Frame vectors: vk,l (t) := w[l] ◦ hk

slide-27
SLIDE 27

STFT

Length of window and shifts are chosen so that shifted windows overlap In that case the STFT is a frame We can invert it using fast algorithms based on the FFT Window should not produce spurious high-frequency artifacts

slide-28
SLIDE 28

Rectangular window

Signal Window × = Spectrum ∗ =

slide-29
SLIDE 29

Hann window

Signal Window × = Spectrum ∗ =

slide-30
SLIDE 30

Frame vector l = 0, k = 0

Real part Imaginary part Spectrum

slide-31
SLIDE 31

Frame vector l = 1/32, k = 0

Real part Imaginary part Spectrum

slide-32
SLIDE 32

Frame vector l = 0, k = 64

Real part Imaginary part Spectrum

slide-33
SLIDE 33

Frame vector l = 1/32, k = 64

Real part Imaginary part Spectrum

slide-34
SLIDE 34

Speech signal

slide-35
SLIDE 35

Spectrum

slide-36
SLIDE 36

Spectrogram (log magnitude of STFT coefficients)

Time Frequency

slide-37
SLIDE 37

Frames Short-time Fourier transform (STFT) Wavelets Thresholding

slide-38
SLIDE 38

Wavelet transform

Motivation: Extracting features at different scales Idea: Frame vectors are scaled, shifted copies of a fixed function An additional function captures low-pass component at largest scale

slide-39
SLIDE 39

Wavelet transform

The wavelet transform of a function f ∈ L2[−1/2, 1/2] depends on a choice of scaling function (or father wavelet) φ and wavelet function (or mother wavelet) ψ The scaling coefficients are Wφ {f } (τ) := 1 √s

  • f (t) φ (t − τ) dt

The wavelet coefficients are Wψ {f } (s, τ) := 1 √s 1 f (t) ψ t − τ s

  • dt

Wavelets can be designed to be bases or frames

slide-40
SLIDE 40

Haar wavelet

Scaling function Mother wavelet Wavelets are band-pass filters, scaling functions are low-pass filters

slide-41
SLIDE 41

Discrete wavelet transform

The discrete wavelet transform depends on a choice of scaling vector φ and wavelet ψ The scaling coefficients are W

φ {f } (l) :=

  • x,

φ[l]

  • The wavelet coefficients are

W

ψ {f } (s, l) :=

  • x,

ψ[s,l]

  • ,

where

  • ψ[s,l][j] :=

ψ j − l s

  • Wavelets can be designed to be bases or frames
slide-42
SLIDE 42

Orthonormal wavelet basis

Scale Basis functions 20

slide-43
SLIDE 43

Orthonormal wavelet basis

Scale Basis functions 20

slide-44
SLIDE 44

Orthonormal wavelet basis

Scale Basis functions 20

slide-45
SLIDE 45

Orthonormal wavelet basis

Scale Basis functions 20 21

slide-46
SLIDE 46

Orthonormal wavelet basis

Scale Basis functions 20 21

slide-47
SLIDE 47

Orthonormal wavelet basis

Scale Basis functions 20 21

slide-48
SLIDE 48

Orthonormal wavelet basis

Scale Basis functions 20 21 22

slide-49
SLIDE 49

Orthonormal wavelet basis

Scale Basis functions 20 21 22

slide-50
SLIDE 50

Orthonormal wavelet basis

Scale Basis functions 20 21 22 23

slide-51
SLIDE 51

Orthonormal wavelet basis

Scale Basis functions 20 21 22 23 (scaling vector)

slide-52
SLIDE 52

Multiresolution decomposition

Sequence of subspaces V0, V1, . . . , VK representing different scales Fix a scaling vector φ and a wavelet ψ

slide-53
SLIDE 53

Multiresolution decomposition

Sequence of subspaces V0, V1, . . . , VK representing different scales Fix a scaling vector φ and a wavelet ψ VK is the span of φ

slide-54
SLIDE 54

Multiresolution decomposition

Sequence of subspaces V0, V1, . . . , VK representing different scales Fix a scaling vector φ and a wavelet ψ VK is the span of φ VK

slide-55
SLIDE 55

Multiresolution decomposition

Vk := Wk ⊕ Vk+1 Wk is the span of ψ dilated by 2k and shifted by multiples of 2k+1

slide-56
SLIDE 56

Multiresolution decomposition

Vk := Wk ⊕ Vk+1 Wk is the span of ψ dilated by 2k and shifted by multiples of 2k+1 W0

slide-57
SLIDE 57

Multiresolution decomposition

Vk := Wk ⊕ Vk+1 Wk is the span of ψ dilated by 2k and shifted by multiples of 2k+1 W0 W2

slide-58
SLIDE 58

Multiresolution decomposition

Vk := Wk ⊕ Vk+1 Wk is the span of ψ dilated by 2k and shifted by multiples of 2k+1 W0 W2 PVk x is an approximation of x at scale 2k

slide-59
SLIDE 59

Multiresolution decomposition

Properties

◮ V0 = Cn (approximation at scale 20 is perfect) ◮ Vk is invariant to translations of scale 2k ◮ Dilating vectors in Vj by 2 yields vectors in Vj+1

slide-60
SLIDE 60

Electrocardiogram

Signal Haar transform

slide-61
SLIDE 61

Scale 29

PW9 x PV9 x

slide-62
SLIDE 62

Scale 28

PW8 x PV8 x

slide-63
SLIDE 63

Scale 27

PW7 x PV7 x

slide-64
SLIDE 64

Scale 26

PW6 x PV6 x

slide-65
SLIDE 65

Scale 25

PW5 x PV5 x

slide-66
SLIDE 66

Scale 24

PW4 x PV4 x

slide-67
SLIDE 67

Scale 23

PW3 x PV3 x

slide-68
SLIDE 68

Scale 22

PW2 x PV2 x

slide-69
SLIDE 69

Scale 21

PW1 x PV1 x

slide-70
SLIDE 70

Scale 20

PW0 x PV0 x

slide-71
SLIDE 71

2D Wavelets

Extension to 2D by using outer products of 1D atoms ξ2D

s1,s2,k1,k2 := ξ1D s1,k1

  • ξ1D

s2,k2

T The JPEG 2000 compression standard is based on 2D wavelets Many extensions: Steerable pyramid, ridgelets, curvelets, bandlets, . . .

slide-72
SLIDE 72

2D Haar transform

slide-73
SLIDE 73

2D wavelet transform

slide-74
SLIDE 74

2D wavelet transform

slide-75
SLIDE 75

Frames Short-time Fourier transform (STFT) Wavelets Thresholding

slide-76
SLIDE 76

Denoising

Aim: Extracting information (signal) from data in the presence of uninformative perturbations (noise) Additive noise model data = signal + noise

  • y =

x + z Prior knowledge about structure of signal vs structure of noise is required

slide-77
SLIDE 77

Assumption

◮ Signal is a sparse superposition of basis/frame vectors ◮ Noise is not

slide-78
SLIDE 78

Assumption

◮ Signal is a sparse superposition of basis/frame vectors ◮ Noise is not

Example: Gaussian noise z with covariance matrix σ2I, distribution of F z?

slide-79
SLIDE 79

Example

Data Signal

slide-80
SLIDE 80

Thresholding

Hard-thresholding operator Hη ( v) [j] :=

  • v [j]

if | v [j]| > η

  • therwise
slide-81
SLIDE 81

Denoising via hard thresholding

Estimate Signal

slide-82
SLIDE 82

Multisinusoidal signal

  • y

F y

Data Signal Data

slide-83
SLIDE 83

Denoising via hard thresholding

Data: y = x + z Assumption: F x is sparse, F z is not

  • 1. Apply the hard-thresholding operator Hη to F

y

  • 2. If F is a basis, then
  • xest := F −1Hη (F

y) If F is a frame,

  • xest := F †Hη (F

y) , where F † is the pseudoinverse of F (other left inverses of F also work)

slide-84
SLIDE 84

Denoising via hard thresholding in Fourier basis

  • y

F y

Data Signal Data

slide-85
SLIDE 85

Denoising via hard thresholding in Fourier basis

F −1Hη (F y) Hη (F y)

Estimate Signal Estimate

slide-86
SLIDE 86

Image denoising

  • x

F x

slide-87
SLIDE 87

Image denoising

  • z

F z

slide-88
SLIDE 88

Data (SNR=2.5)

  • y

F y

slide-89
SLIDE 89

F y

slide-90
SLIDE 90

Hη (F y)

slide-91
SLIDE 91

F −1Hη (F y)

slide-92
SLIDE 92
  • y
slide-93
SLIDE 93

Data (SNR=1)

  • y

F y

slide-94
SLIDE 94

F y

slide-95
SLIDE 95

Hη (F y)

slide-96
SLIDE 96

F −1Hη (F y)

slide-97
SLIDE 97
  • y
slide-98
SLIDE 98

Image denoising

  • y

F −1Hη (F y)

  • x
slide-99
SLIDE 99

Denoising via thresholding

  • y

F −1Hη (F y)

  • x
slide-100
SLIDE 100

Speech denoising

slide-101
SLIDE 101

Time thresholding

slide-102
SLIDE 102

Spectrum

slide-103
SLIDE 103

Frequency thresholding

slide-104
SLIDE 104

Frequency thresholding

Data DFT thresholding

slide-105
SLIDE 105

Spectrogram (STFT)

Time Frequency

slide-106
SLIDE 106

STFT thresholding

Time Frequency

slide-107
SLIDE 107

STFT thresholding

Data STFT thresholding

slide-108
SLIDE 108

Coefficients are structured

Time Frequency

slide-109
SLIDE 109

Coefficients are structured

  • x

F x

slide-110
SLIDE 110

Block thresholding

Assumption: Coefficients are group sparse, nonzero coefficients cluster together Partition coefficients into blocks I1, I2, . . . , Ik and threshold whole blocks Bη ( v) [j] :=

  • v [j]

if j ∈ Ij such that

  • vIj
  • 2 > η, ,
  • therwise,
slide-111
SLIDE 111

Denoising via block thresholding

  • 1. Apply the hard-thresholding operator Bη to F

y

  • 2. If F is a basis, then
  • xest := F −1Bη (F

y) If F is a frame,

  • xest := F †Bη (F

y) , where F † is the pseudoinverse of F (other left inverses of F also work)

slide-112
SLIDE 112

Image denoising (SNR=2.5)

  • y

F y

slide-113
SLIDE 113

F y

slide-114
SLIDE 114

Hη (F y)

slide-115
SLIDE 115

Bη (F y)

slide-116
SLIDE 116

F −1Hη (F y)

slide-117
SLIDE 117

F −1Bη (F y)

slide-118
SLIDE 118

Image denoising (SNR=1)

  • y

F y

slide-119
SLIDE 119

F y

slide-120
SLIDE 120

Hη (F y)

slide-121
SLIDE 121

Bη (F y)

slide-122
SLIDE 122

F −1Hη (F y)

slide-123
SLIDE 123

F −1Bη (F y)

slide-124
SLIDE 124

Denoising via thresholding

  • y

F −1Hη (F y)

  • x
slide-125
SLIDE 125

Denoising via thresholding

  • y

F −1Bη (F y)

  • x
slide-126
SLIDE 126

Denoising via thresholding

  • y

F −1Hη (F y)

  • x
slide-127
SLIDE 127

Denoising via thresholding

  • y

F −1Bη (F y)

  • x
slide-128
SLIDE 128

Speech denoising

slide-129
SLIDE 129

Spectrogram (STFT)

Time Frequency

slide-130
SLIDE 130

STFT thresholding

Time Frequency

slide-131
SLIDE 131

STFT thresholding

Data STFT thresholding

slide-132
SLIDE 132

STFT block thresholding

Time Frequency

slide-133
SLIDE 133

STFT block thresholding

Data STFT block thresh.