GCT535- Sound Technology for Multimedia Fourier Representations of Audio
Graduate School of Culture Technology KAIST Juhan Nam
1
GCT535- Sound Technology for Multimedia Fourier Representations of - - PowerPoint PPT Presentation
GCT535- Sound Technology for Multimedia Fourier Representations of Audio Graduate School of Culture Technology KAIST Juhan Nam 1 Waveforms The basic audio representation that computers can take x(n) = [a1, a2, a3, ...] Great to
1
2
3
Inertial force
Restoration force
1 2 3 4 5 6 7 8 x 10
−3
−1 −0.5 0.5 1
angular frequency frequency period
4
. . . String oscillation
$ % ∑
* 𝑜 %+$ *,-
* 𝑜 = cos
34*5 %
5
6
34* % radian or * % 𝐺 G Hz (𝐺 G: the sampling rate) ( 𝐿 = 0, 1, 2, … , 𝑂 − 1)
7
%
Figures are from https://ccrma.stanford.edu/~jos/dft/
8
N=8
Figures are from https://ccrma.stanford.edu/~jos/dft/
9
𝑦 𝑜 = 1 𝑂 M 𝐵 𝑙 cos 2𝜌𝑙𝑜 𝑂 + 𝜚(𝑙)
%+$ *,-
= 1 𝑂 M 𝐵 𝑙 (𝑓=(34*5
% OP * )+𝑓+=(34*5 % OP * ))/2 %+$ *,-
= 1 𝑂 M(𝐵 𝑙 𝑓=P(*)𝑓=34*5
%
+ 𝐵 𝑙 𝑓+=P(*)𝑓+=34*5
% )/2 %+$ *,-
= 1 𝑂 M(𝑌 𝑙 𝑓=34*5
%
+ 𝑌 𝑙 𝑓+=34*5
% )/2 %+$ *,-
= Real{1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
} = 1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
𝑌 𝑙 = 𝐵(𝑙)𝑓=X * = 𝐵 𝑙 cos ϕ 𝑙 + 𝑘 sin ϕ 𝑙
10
𝑡Y 𝑜 Z 𝑡[
∗ 𝑜 = M 𝑓=34Y5 %
Z 𝑓+=34[5
% %+$ 5,-
= ] 𝑂 if 𝑞 = 𝑟 0 otherwise
n=0 N−1
n=0 N−1
n=0 N−1
11
𝑦 𝑜 Z 𝑡*(𝑜) = M 𝑦 𝑜 𝑓+=34[5
%
= M(1 𝑂 M 𝑌 𝑙 𝑓=34Y5
% %+$ Y,-
)𝑓+=34[5
% %+$ 5,- %+$ 5,-
= 1 𝑂 M 𝑌 𝑙 (M 𝑓=34Y5
% %+$ 5,-
𝑓+=34[5
% ) %+$ Y,-
= 1 𝑂 𝑌 𝑙 𝑂 = 𝑌 𝑙 = 𝐵 𝑙 𝑓=X *
12
𝑦(𝑜) = 1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
𝑌 𝑙 = M 𝑦 𝑜 𝑓+=34*5
% %+$ 5,-
= 𝑌e 𝑙 + 𝑘𝑌f 𝑙 = 𝐵(𝑙)=X * 𝑌 𝑙 = 𝐵 𝑙 = 𝑌e
3 𝑙 + 𝑌f 3 𝑙
𝑌e(𝑙))
50 100 150 200 250 300 −0.2 −0.1 0.1 0.2 time−seconds amplitude
20 40 60 80 −0.2 −0.1 0.1 0.2 time−seconds amplitude
13
14
15
5 10 15 20 25 30
0.5 1
Waveform
5 10 15 20 25 30 5 10 15
Magnitude (N=32)
5 10 15 20 25 30
2 4
Phase (N=32)
5 10 15 20 25 30
0.5 1
Waveform
5 10 15 5 10 15
Magnitude (N=32)
5 10 15
2 4
Phase (N=32)
𝑌 𝑙 = 𝑌 𝑂 − 𝑙 𝑌 𝑙 = 𝑌 −𝑙 ∠𝑌 𝑙 =−∠𝑌 𝑂 − 𝑙 ∠𝑌 𝑙 = −∠𝑌 −𝑙
16
fs
N
N/2
fs /2
17
𝑌 𝑙 = M 𝑦 𝑜 𝑓+=34*5
% %+$ 5,-
= M 𝑓=l5𝑓+=34*5
% %+$ 5,-
= M 𝑓=(l+34*
% )5 %+$ 5,-
= 1 − 𝑓=(l+34*
% )%
1 − 𝑓=(l+34*
% ) = 𝑓= l+34* % %/3 sin
((𝜕 − 2𝜌𝑙 𝑂 )𝑂/2) sin ((𝜕 − 2𝜌𝑙 𝑂 )/2)
5 10 15 20 25 30 −2 −1 1 2 Amplitude
2 4 6 8 10 5 10 15 20 Magntude 5 10 15 20 25 30 −2 −1 1 2
2 4 6 8 10 5 10 15 20 Magntude
18 sin ((𝜕 − 2𝜌𝑙 𝑂 )𝑂/2) sin ((𝜕 − 2𝜌𝑙 𝑂 )/2)
𝜕
sin ((𝜕 − 2𝜌𝑙 𝑂 )𝑂/2) sin ((𝜕 − 2𝜌𝑙 𝑂 )/2)
𝜕
50 100 150 200 250 −2 −1 1 2
Amplitude Before Zeropadding
2 4 6 8 10 12 14 16 20 40 60 80 100
Magntude
200 400 600 800 1000 1200 −2 −1 1 2
After Zeropadding (x4)
10 20 30 40 50 60 50 100 150
19
20
Sine: waveform
5 10 15 20 25 30 35 40 45 50 −0.5 0.5 time−milliseconds amplitude 500 1000 1500 2000 2500 3000 3500 4000 50 100 150 freqeuncy magnitude
Sine: spectrum
20 40 60 80 100 120 140 160 −0.5 0.5 time−milliseconds amplitude 0.5 1 1.5 2 2.5 x 10
4
5 10 15 freqeuncy magnitude
Drum: waveform Drum: spectrum
50 52 54 56 58 60 −0.4 −0.2 0.2 0.4 time−milliseconds amplitude 0.5 1 1.5 2 2.5 x 10
4
10 20 30 40 freqeuncy magnitude
Flute: waveform Flute: spectrum
21
22
% %+$ 5,-
23 Main-lobe width Side-lobe level
24
50% overlap
Source: the JOS SASP book
25
Piano C4 Note Flute A4 Note
26
Piano C4 Note Flute A4 Note
27
Piano C4 Note Flute A4 Note
28
29
30
< Long window > high freq.-resolution low time-resolution < Short window > low freq.-resolution high time-resolution
31
32