GCT535- Sound Technology for Multimedia Timbre Analysis
Graduate School of Culture Technology KAIST Juhan Nam
1
GCT535- Sound Technology for Multimedia Timbre Analysis Graduate - - PowerPoint PPT Presentation
GCT535- Sound Technology for Multimedia Timbre Analysis Graduate School of Culture Technology KAIST Juhan Nam 1 Outlines Timbre Analysis Definition of Timbre Timbre Features Zero-crossing rate Spectral summary features
1
2
3
4
5
Changes of spectral envelope ADSR
6
7
(Grey, 1977)
8
(T. Rossing’s music150 slides)
9
10
Voiced Unvoiced
k
k
11
k Rt
k N
12
k
k
k
K
k
13
time [sec] frequency [Hz]
0.5 1 1.5 2 2.5 3 3.5 4 4.5 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
time [sec] frequency [Hz]
0.5 1 1.5 2 2.5 3 3.5 4 4.5 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
Classical: “Beethoven String Quartet” Pop: “Video killed the radio star”
14
15
Spectrum Spectrum (mel-scaled)
16
Spectrum (mel-scaled) MFCC
n=1 N−1
17
Frequency spectrum (512 bins) Frequency spectrum (mel-scaled, 60 bins) MFCC (13 dim) Reconstructed Frequency Spectrum (mel-scaled) Reconstructed Frequency spectrum
18
19
20
21
22