Sparse signal representation and the tunable Q-factor wavelet transform
Ivan Selesnick Polytechnic Institute of New York University Brooklyn, New York
1
Sparse signal representation and the tunable Q-factor wavelet - - PowerPoint PPT Presentation
Sparse signal representation and the tunable Q-factor wavelet transform Ivan Selesnick Polytechnic Institute of New York University Brooklyn, New York 1 Introduction Problem: Decomposition of a signal into the sum of two components: 1.
1
2
1 2 3 4 5 6 7 8 9 10 −40 −20 20 40 EEG SIGNAL TIME (SECONDS)
3
100 200 300 400 500 −1 1 PULSE 1 100 200 300 400 500 −1 1 PULSE 2 100 200 300 400 500 −1 1 PULSE 3 100 200 300 400 500 −1 1 PULSE 4 TIME (SAMPLES) 0.02 0.04 0.06 0.08 0.1 0.12 5 10 SPECTRUM
Q−factor = 1.15
0.02 0.04 0.06 0.08 0.1 0.12 20 40
Q−factor = 4.6
0.02 0.04 0.06 0.08 0.1 0.12 10 20
Q−factor = 1.15
0.02 0.04 0.06 0.08 0.1 0.12 50 100
Q−factor = 4.6
FREQUENCY (CYCLES/SAMPLE)
Figure 1: The resonance of an isolated pulse can be quantified by its Q-factor, defined as the ratio of its center frequency to its bandwidth. Pulses 1 and 3, essentially a single cycle in duration, are low-resonance pulses. Pulses 2 and 4, whose oscillations are more sustained, are high-resonance pulses. 4
100 200 300 400 500 −2 −1 1 2 TEST SIGNAL 100 200 300 400 500 −2 −1 1 2 HIGH−RESONANCE COMPONENT 100 200 300 400 500 −2 −1 1 2 LOW−RESONANCE COMPONENT 100 200 300 400 500 −2 −1 1 2 RESIDUAL TIME (SAMPLES) 100 200 300 400 500 −2 −1 1 2 TEST SIGNAL 100 200 300 400 500 −2 −1 1 2 LOW−PASS FILTERED 100 200 300 400 500 −2 −1 1 2 BAND−PASS FILTERED 100 200 300 400 500 −2 −1 1 2 HIGH−PASS FILTERED TIME (SAMPLES)
(a) Resonance-based decomposition. (b) Frequency-based filtering. Figure 2: Resonance- and frequency-based filtering. (a) Decomposition of a test signal into high- and low-resonance components. The high-resonance signal component is sparsely represented using a high Q-factor WT. Similarly, the low-resonance signal component is sparsely represented using a low Q-factor WT. (b) Decomposition of a test signal into low, mid, and high frequency components using LTI discrete-time filters. 5
SIGNAL LOW−RESONANCE COMPONENT
HIGH−RESONANCE COMPONENT Figure 3: Resonance-based signal decomposition must be nonlinear: The signal in the bottom left panel is the sum of the signals above it; however, the low-resonance component of a sum is not the sum of the low-resonance components. The same is true for the high-resonance component. Neither the low- nor high-resonance components satisfy the superposition property.
6
7
−π −απ απ π 0.5 1 FREQUENCY (ω) X(ω) −π π 0.5 1 FREQUENCY (ω) Y(ω) −π π 0.5 1 FREQUENCY (ω) X(ω) −π −π/α π/α π 0.5 1 FREQUENCY (ω) Y(ω)
8
−π −(1−β)π (1−β)π π 0.5 1 FREQUENCY (ω) X(ω) −π π 0.5 1 FREQUENCY (ω) Y(ω) −π π 0.5 1 FREQUENCY (ω) X(ω) −π −(1−1/β)π (1−1/β)π π 0.5 1 FREQUENCY (ω) Y(ω)
9
0(!)
+
1(!)
10
−π −απ −(1−β)π (1−β)π απ π 0.5 1 FREQUENCY (ω) H0(ω) −π −απ −(1−β)π (1−β)π απ π 0.5 1 FREQUENCY (ω) H1(ω)
11
12
−π −απ −(1−β)π (1−β)π απ π 0.5 1 ω X(ω)
(a) Fourier transform of input signal, X(ω).
−π −απ −(1−β)π (1−β)π απ π 0.5 1 FREQUENCY (ω) H0(ω) −π −απ −(1−β)π (1−β)π απ π 0.5 1 FREQUENCY (ω) H1(ω)
(b) Frequency responses H0(ω) and H1(ω).
−π −απ −(1−β)π (1−β)π απ π 0.5 1 ω H0(ω) X(ω) −π −απ −(1−β)π (1−β)π απ π 0.5 1 ω H1(ω) X(ω)
(c) Fourier transforms of input signal after filtering.
−π −0.5π 0.5π π 0.5 1 ω V0(ω) −π −0.5π 0.5π π 0.5 1 ω V1(ω)
(d) Fourier transforms after scaling. 13
1 (!)
14
1 (!) :=
j2
m=0
1(!)
15
π/4 π/2 3π/4 π 0.5 1 FREQUENCY RESPONSES − 4 LEVELS alpha = 0.67, beta = 1.00, Q = 1.00, R = 3.00 FREQUENCY (ω) 50 100 150 200 250 300 −0.1 −0.05 0.05 0.1 WAVELET TIME (SAMPLES) π/4 π/2 3π/4 π 0.5 1 FREQUENCY RESPONSES − 13 LEVELS alpha = 0.87, beta = 0.40, Q = 4.00, R = 3.00 FREQUENCY (ω) 50 100 150 200 250 300 −0.1 −0.05 0.05 0.1 WAVELET TIME (SAMPLES)
16
0(k)
+
1(k)
17
N/2 N − 1 X(k) N0/2 N0 − 1 Y (k) N/2 N − 1 X(k) N0/2 N0 − 1 Y (k)
18
N/2 N − 1 X(k) N1/2 N1 − 1 Y (k) N/2 N − 1 X(k) N1/2 N1 − 1 Y (k)
19
N/2 N − 1 X(k)
(a) DFT of input signal, X(k).
N/2 N − 1 H0(k)
T T
N/2 N − 1 H1(k)
T T
(b) Filters H0(k) and H1(k). The transitions-bands are indicated by ‘T’.
N/2 N − 1 H0(k)X(k) N/2 N − 1 H1(k)X(k)
(c) DFT of input signal after filtering.
N0/2 N0 − 1 V0(k) N1/2 N1 − 1 V1(k)
(d) DFT after scaling. 20
100 200 300 400 500 9 8 7 6 5 4 3 2 1
1.89% 6.14% 11.72% 16.33% 22.82% 26.80% 13.92% 0.38% 0.00%
SUBBANDS (LOW−Q RADWT) TIME (SAMPLES) SUBBAND
P = 2, Q = 3, S = 1, Levels = 10 Dilation = 1.50, Redundancy = 3.00
100 200 300 400 500 17 16 15 14 13 12 11 10 9 8 7
2.43% 7.97% 3.52% 0.54% 7.30% 16.03% 4.72% 1.74% 20.12% 29.69% 5.75%
SUBBANDS (HIGH−Q RADWT) TIME (SAMPLES) SUBBAND
P = 5, Q = 6, S = 2, Levels = 20 Dilation = 1.20, Redundancy = 3.00
21
100 200 300 400 500 9 8 7 6 5 4 3 2 1
0.00% 2.07% 8.78% 34.39% 0.79% 53.29% 0.68% 0.00% 0.00%
SUBBANDS (LOW−Q RADWT) TIME (SAMPLES) SUBBAND
P = 2, Q = 3, S = 1, Levels = 10 Dilation = 1.50, Redundancy = 3.00
100 200 300 400 500 17 16 15 14 13 12 11 10 9 8 7
0.01% 15.64% 0.11% 0.00% 0.02% 38.66% 0.02% 0.02% 6.26% 39.25% 0.00%
SUBBANDS (HIGH−Q RADWT) TIME (SAMPLES) SUBBAND
P = 5, Q = 6, S = 2, Levels = 20 Dilation = 1.20, Redundancy = 3.00
22
100 200 300 400 500 9 8 7 6 5 4 3 2 1
2.58% 6.31% 10.70% 16.11% 21.44% 22.50% 13.51% 4.87% 1.42%
SUBBANDS (LOW−Q RADWT) TIME (SAMPLES) SUBBAND
P = 2, Q = 3, S = 1, Levels = 10 Dilation = 1.50, Redundancy = 3.00
100 200 300 400 500 19 18 17 16 15 14 13 12 11 10 9 8 7 6
1.82% 2.75% 3.02% 3.82% 5.65% 6.82% 6.95% 8.72% 12.37% 13.98% 12.04% 8.44% 5.41% 3.07%
SUBBANDS (HIGH−Q RADWT) TIME (SAMPLES) SUBBAND
P = 5, Q = 6, S = 2, Levels = 20 Dilation = 1.20, Redundancy = 3.00
23
100 200 300 400 500 9 8 7 6 5 4 3 2 1
0.00% 3.12% 10.99% 27.98% 10.60% 40.59% 3.53% 2.64% 0.23%
SUBBANDS (LOW−Q RADWT) TIME (SAMPLES) SUBBAND
P = 2, Q = 3, S = 1, Levels = 10 Dilation = 1.50, Redundancy = 3.00
100 200 300 400 500 19 18 17 16 15 14 13 12 11 10 9 8 7 6
2.43% 3.78% 2.35% 2.16% 4.12% 5.87% 6.25% 4.01% 18.71% 12.72% 9.69% 19.51% 1.36% 3.11%
SUBBANDS (HIGH−Q RADWT) TIME (SAMPLES) SUBBAND
P = 5, Q = 6, S = 2, Levels = 20 Dilation = 1.20, Redundancy = 3.00
24
0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.5 1 FREQUENCY (CYCLES/SAMPLE) FREQUENCY RESPONSES 50 100 150 200 250 300 −1 1 ANALYSIS FUNCTION TIME (SAMPLES) 50 100 150 200 250 300 −1 1 ANALYSIS FUNCTION TIME (SAMPLES) 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.5 1 FREQUENCY (CYCLES/SAMPLE) FREQUENCY RESPONSES 50 100 150 200 250 300 −1 1 TIME (SAMPLES) ANALYSIS FUNCTION 50 100 150 200 250 300 −1 1 TIME (SAMPLES) ANALYSIS FUNCTION
25
26
27
28
2 + 2kw2k2 2
1 = Φ2Φt 2 = I, the w1 and w2 can be found in closed form:
2
1 + 2 2
1 x,
1
1 + 2 2
2 x
2
1 + 2 2
1
1 + 2 2
29
w1,w2
w
Reference: Afonso, Bioucas-Dias, Figueiredo. Fast Image Recovery Using Variable Splitting and Constrained Optimization. IEEE Trans. on Image Processing, 2010.
30
i c
31
10 20 30 40 50 60 70 80 90 100 5.5 6 6.5 7 7.5 8 8.5 9 9.5 ITERATION OBJECTIVE FUNCTION ISTA SALSA
Figure 4: Reduction of objective function during the first 100 iterations. SALSA converges faster than ISTA. 32
100 200 300 400 −2 −1 1 2 (a) TEST SIGNAL TIME (SAMPLES) 0.1 0.2 0.3 0.4 0.5 0.5 1 FREQUENCY (CYCLES/SAMPLE) (b) TWO BAND−PASS FILTERS BAND−PASS FILTER 1 BAND−PASS FILTER 2 100 200 300 400 −2 −1 1 2 (c) OUTPUT OF BAND−PASS FILTER 1 TIME (SAMPLES) 100 200 300 400 −2 −1 1 2 (d) OUTPUT OF BAND−PASS FILTER 2 TIME (SAMPLES)
Figure 5: LTI band-pass filtering. The test signal (a) consists of a sinusoidal pulse of frequency 0.1 cycles/sample and a transient. Band-pass filters 1 and 2 in (b) are tuned to the frequencies 0.07 and 0.10 cycles/second respectively. The output signals, obtained by filtering the test signal with each of the two band-pass filters, are shown in (c) and (d). The output of band-pass filter 1, illustrated in (c), contains oscillations due to the transient in the test signal. Moreover, the transient oscillations in (c) have a frequency of 0.07 Hz even though the test signal (a) contains no sustained oscillatory behavior at this frequency. 33
100 200 300 400 −2 −1 1 2 (a) HIGH−RESONANCE COMPONENT TIME (SAMPLES) 100 200 300 400 −2 −1 1 2 (b) LOW−RESONANCE COMPONENT TIME (SAMPLES) 100 200 300 400 −2 −1 1 2 (c) OUTPUT OF BAND−PASS FILTER 1 TIME (SAMPLES) 100 200 300 400 −2 −1 1 2 (d) OUTPUT OF BAND−PASS FILTER 2 TIME (SAMPLES)
Figure 6: Resonance-based decomposition and band-pass filtering. When resonance-based analysis method is applied to the test signal in Fig. 5a, it yields the high- and low-resonance components illustrated in (a) and (b). The output signals, obtained by filtering the high-resonance component (a) with each of the two band-pass filters shown in Fig. 5b, are illustrated in (c) and (d). The transient oscillations in (c) are substantially reduced compared to Fig. 5c. 34
0.05 0.1 0.15 −0.4 −0.2 0.2 0.4 (a) SPEECH SIGNAL [Fs = 16,000 SAMPLES/SECOND] 0.05 0.1 0.15 −0.4 −0.2 0.2 0.4 (b) HIGH−RESONANCE COMPONENT 0.05 0.1 0.15 −0.4 −0.2 0.2 0.4 (c) LOW−RESONANCE COMPONENT TIME (SECONDS) Figure 7: Decomposition of a speech signal (“I’m”) into high- and low-resonance components. The high-resonance component (b) contains the sustained oscillations present in the speech signal, while the low-resonance component (c) contains non-oscillatory transients. (The residual is not shown.) 35
500 1000 1500 2000 2500 3000 0.005 0.01 (a) ORIGINAL SPEECH 500 1000 1500 2000 2500 3000 0.005 0.01 (b) HIGH−RESONANCE COMPONENT 500 1000 1500 2000 2500 3000 0.005 0.01 (c) LOW−RESONANCE COMPONENT FREQUENCY (Hz)
Figure 8: Frequency spectra of the speech signal in Fig. 7 and of the extracted high- and low-resonance components. The spectra are computed using the 50 msec segment from 0.05 to 0.10 seconds. The energy of each resonance component is widely distributed in frequency and their frequency-spectra overlap. 36
0.05 0.1 0.15 −0.2 0.2 (a) RECONSTRUCTION FROM SUBBANDS 9, 10 0.05 0.1 0.15 −0.2 0.2 (b) RECONSTRUCTION FROM SUBBANDS 18, 19 0.05 0.1 0.15 −0.2 0.2 (c) RECONSTRUCTION FROM SUBBANDS 21−24 0.05 0.1 0.15 −0.2 0.2 (D) SUM OF ABOVE 3 SIGNALS TIME (SECONDS)
Figure 9: Frequency decomposition of high-resonance component in Fig. 7. Reconstructing the high-resonance component from a few subbands of the high Q-factor WT at a time, yields an efficient AM/FM decomposition. 37
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT BANDWITH FREQUENCY 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT Q−FACTOR FREQUENCY
38
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT Q−FACTOR (HIGH Q−FACTOR) FREQUENCY 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT Q−FACTOR (LOW Q−FACTOR) FREQUENCY
39
Figure 10: For reliable resonance-based decomposition, the inner product between the low-Q and high-Q wavelets should be small for all dilations and translations. The computation of the maximum inner product is simplified by assuming the wavelets are ideal band-pass functions and expressing the inner product in the frequency domain.
40
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT BANDWITH (NARROW−BAND) FREQUENCY 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.5 1 CONSTANT BANDWITH (WIDE−BAND) FREQUENCY
41
42
43
Po-Yu Chen
Basis pursuit denoising (no group structure): argmin
c
ky Φck2
2 + λkck1
Generalized basis pursuit denoising (overlapping group sparsity): argmin
c
ky Φck2
2 + λ N−3
X
i=0
p |c(i)|2 + |c(i + 1)|2 + |c(i + 2)|2 (group size 3)
1
TIME (SECONDS) FREQUENCY 0.5 1 1.5 2 2.5 3 1000 2000 3000 4000 5000 6000 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5 TIME (SECONDS) FREQUENCY 0.5 1 1.5 2 2.5 3 1000 2000 3000 4000 5000 6000 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
Noise-free signal spectrogram Noisy signal spectrogram
36
(K1, K2) = (1, 1) Basis pursuit denoising using `1 norm, i.e. no group structure.
TIME (SECONDS) FREQUENCY 0.5 1 1.5 2 2.5 3 1000 2000 3000 4000 5000 6000 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5 TIME (SECONDS) FREQUENCY 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 4000 4500 5000 5500 6000 6500 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
The spectrogram exhibits many spurious noise spikes which produces ‘musical noise’
37
(K1, K2) = (2, 8)
TIME (SECONDS) FREQUENCY 0.5 1 1.5 2 2.5 3 1000 2000 3000 4000 5000 6000 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
→
TIME (SECONDS) FREQUENCY 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 4000 4500 5000 5500 6000 6500 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
The spectrogram does not exhibit spurious noise spikes. The denoised signal is free of ‘musical noise’ artifact
38
(K1, K2) = (8, 2)
TIME (SECONDS) FREQUENCY 0.5 1 1.5 2 2.5 3 1000 2000 3000 4000 5000 6000 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
→
TIME (SECONDS) FREQUENCY 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 4000 4500 5000 5500 6000 6500 7000 −50 −45 −40 −35 −30 −25 −20 −15 −10 −5
The spectrogram does not exhibit spurious noise spikes. The denoised signal is free of ‘musical noise’ artifact
39
Xiaoran Ning
y = x + noise Total variation denoising: argmin
x
ky xk2
2 + λkDxk1
Dual-component polynomial denoising: argmin
x1,x2
ky x1 x2k2
2 + λ1kDx1k1 + λ2kD2x2k1
2
First order difference (derivative) Second order derivative
100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 The clean signal 100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 Y: Noisy signal, SNR = 15 dB 100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 The recovered signal: SNR = 29.1205 dB
Total Variation Denoising Stair case artifact
100 200 300 400 500 600 700 800 900 100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 Y: Noisy signal, SNR = 15 dB 100 200 300 400 500 600 700 800 900 −1 1 R1: Estimate of X1 100 200 300 400 500 600 700 800 900 0.5 1 R2: Estimate of X2 1.5 The recovered signal = R1 + R2, SNR = 31.2347 dB
Quasi-piecewise constant componet Quasi-piecewise linear component
100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 The clean signal 100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 Sparse Derivative Denoising: SNR = 31.2347 dB 100 200 300 400 500 600 700 800 900 −0.5 0.5 1 1.5 Total Variation Filtering: SNR = 29.1205 dB
Figure: Sparse Derivative Denoising and TV Filtering.
24
44
45
0.02 0.04 0.06 0.08 0.1 0.12 1 2 3 4 5 6 7 Empirical mode decomposition (EMD) Time (seconds) IMF index
46
0.02 0.04 0.06 0.08 0.1 0.12 recon residual band 4 band 3 band 2 band 1 Time (seconds) Sparse TQWT Representation
47
2 + 1kw1k1 + 2kw2k1
48
2 + 1kw1k2 2 + 2kw2k2 2,
1 = Φ2Φt 2 = I, the minimizing w1 and w2 can be found in closed form:
1 x
2 x
49
2 + 1kw1k1 + 2kw2k1
2 + kλ wk1
50
2 + kwk1
u
u f1(u) + µku v(k) d(k)k2 2
v f2(v) + µku(k+1) v d(k)k2 2
51
i c
52
0.02 0.04 0.06 0.08 0.1 0.12 −0.2 −0.1 0.1 0.2 NOISY SPEECH WAVEFORM 0.02 0.04 0.06 0.08 0.1 0.12 −0.2 −0.1 0.1 0.2 HIGH Q−FACTOR COMPONENT 0.02 0.04 0.06 0.08 0.1 0.12 −0.2 −0.1 0.1 0.2 LOW Q−FACTOR COMPONENT 0.02 0.04 0.06 0.08 0.1 0.12 −0.2 −0.1 0.1 0.2 RESIDUAL TIME (SECONDS)
53