CTP431- Music and Audio Computing Spectral Analysis
Graduate School of Culture Technology KAIST Juhan Nam
1
CTP431- Music and Audio Computing Spectral Analysis Graduate School - - PowerPoint PPT Presentation
CTP431- Music and Audio Computing Spectral Analysis Graduate School of Culture Technology KAIST Juhan Nam 1 Outlines Time-domain representation of sound Waveform Time-Frequency domain representation of sound Discrete Fourier
1
2
3
4
Piano C4 Note Flute A4 Note
5
6
Piano C4 Note Flute A4 Note
7
Piano C4 Note Flute A4 Note
$ % ∑
* 𝑜 %+$ *,-
* 𝑜 = cos
34*5 %
8
9
34* % radian or * % 𝐺 G Hz (𝐺 G: the sampling rate) ( 𝐿 = 0, 1, 2, … , 𝑂 − 1)
10
%
Figures are from https://ccrma.stanford.edu/~jos/dft/
11
N=8
Figures are from https://ccrma.stanford.edu/~jos/dft/
12
𝑦 𝑜 = 1 𝑂 M 𝐵 𝑙 cos 2𝜌𝑙𝑜 𝑂 + 𝜚(𝑙)
%+$ *,-
= 1 𝑂 M 𝐵 𝑙 (𝑓=(34*5
% OP * )+𝑓+=(34*5 % OP * ))/2 %+$ *,-
= 1 𝑂 M(𝐵 𝑙 𝑓=P(*)𝑓=34*5
%
+ 𝐵 𝑙 𝑓+=P(*)𝑓+=34*5
% )/2 %+$ *,-
= 1 𝑂 M(𝑌 𝑙 𝑓=34*5
%
+ 𝑌 𝑙 𝑓+=34*5
% )/2 %+$ *,-
= Real{1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
} = 1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
𝑌 𝑙 = 𝐵(𝑙)𝑓=X * = 𝐵 𝑙 cos ϕ 𝑙 + 𝑘 sin ϕ 𝑙
13
𝑡Y 𝑜 Z 𝑡[
∗ 𝑜 = M 𝑓=34Y5 %
Z 𝑓+=34[5
% %+$ 5,-
= ] 𝑂 if 𝑞 = 𝑟 0 otherwise
n=0 N−1
n=0 N−1
n=0 N−1
14
𝑦 𝑜 Z 𝑡[(𝑜) = M 𝑦 𝑜 𝑓+=34[5
%
= M(1 𝑂 M 𝑌 𝑙 𝑓=34Y5
% %+$ Y,-
)𝑓+=34[5
% %+$ 5,- %+$ 5,-
= 1 𝑂 M 𝑌 𝑙 (M 𝑓=34Y5
% %+$ 5,-
𝑓+=34[5
% ) %+$ Y,-
= 1 𝑂 𝑌 𝑙 𝑂 = 𝑌 𝑙 = 𝐵 𝑙 𝑓=X *
15
𝑦(𝑜) = 1 𝑂 M 𝑌 𝑙 𝑓=34*5
% %+$ *,-
𝑌 𝑙 = M 𝑦 𝑜 𝑓+=34*5
% %+$ 5,-
= 𝑌e 𝑙 + 𝑘𝑌f 𝑙 = 𝐵(𝑙)=X * 𝑌 𝑙 = 𝐵 𝑙 = 𝑌e
3 𝑙 + 𝑌f 3 𝑙
𝑌e(𝑙))
50 100 150 200 250 300 −0.2 −0.1 0.1 0.2 time−seconds amplitude
20 40 60 80 −0.2 −0.1 0.1 0.2 time−seconds amplitude
16
17
18
5 10 15 20 25 30
0.5 1
Waveform
5 10 15 20 25 30 5 10 15
Magnitude (N=32)
5 10 15 20 25 30
2 4
Phase (N=32)
5 10 15 20 25 30
0.5 1
Waveform
5 10 15 5 10 15
Magnitude (N=32)
5 10 15
2 4
Phase (N=32)
𝑌 𝑙 = 𝑌 𝑂 − 𝑙 𝑌 𝑙 = 𝑌 −𝑙 ∠𝑌 𝑙 =−∠𝑌 𝑂 − 𝑙 ∠𝑌 𝑙 = −∠𝑌 −𝑙
19
fs
N
N/2
fs /2
20
Sine: waveform
5 10 15 20 25 30 35 40 45 50 −0.5 0.5 time−milliseconds amplitude 500 1000 1500 2000 2500 3000 3500 4000 50 100 150 freqeuncy magnitude
Sine: spectrum
20 40 60 80 100 120 140 160 −0.5 0.5 time−milliseconds amplitude 0.5 1 1.5 2 2.5 x 10
4
5 10 15 freqeuncy magnitude
Drum: waveform Drum: spectrum
50 52 54 56 58 60 −0.4 −0.2 0.2 0.4 time−milliseconds amplitude 0.5 1 1.5 2 2.5 x 10
4
10 20 30 40 freqeuncy magnitude
Flute: waveform Flute: spectrum
21
𝑌(0) 𝑌(1) 𝑌(2) 𝑌(3) ⋮ 𝑌(𝑂 − 2) 𝑌(𝑂 − 1) = 1 1 1 1 1 ⋮ 1 𝑋
%
𝑋
% 3
𝑋
% m
⋮ 𝑋
% %+$
1 ⋯ 𝑋
% 3
𝑋
%
% p
⋮ 𝑋
% 3(%+$)
⋯ ⋯ ⋯ ⋯ ⋯ ⋯ 1 𝑋
% %+$
𝑋
% 3(%+$)
𝑋
% m(%+$)
𝑋
% (%+$)(%+$)
𝑦(0) 𝑦(1) 𝑦(2) 𝑦(3) ⋮ 𝑦(𝑂 − 2) 𝑦(𝑂 − 1)
22
% %+$ 5,-
23 Main-lobe width Side-lobe level
24
50% overlap
Source: the JOS SASP book
25
26
27
< Long window > high freq.-resolution low time-resolution < Short window > low freq.-resolution high time-resolution