GCT535- Sound Technology for Multimedia Digital Audio
Graduate School of Culture Technology KAIST Juhan Nam
1
GCT535- Sound Technology for Multimedia Digital Audio Graduate - - PowerPoint PPT Presentation
GCT535- Sound Technology for Multimedia Digital Audio Graduate School of Culture Technology KAIST Juhan Nam 1 Digital Representations 0 1 1 0 1 1 0 Sound 1 0 0 1 1 0 1 Image 0 0 1 1 0 1 1 Text Digital
1
… 0 1 1 0 1 1 0 … … 1 0 0 1 1 0 1 … … 0 0 1 1 0 1 1 …
5
6
…0 0 1 0 1 0 …
7
8
Ts
Signal notation
9
10
fm
fm
fs+fm fs-fm fs
*𝑢
*𝑜𝑈
*𝑜/𝑔
*𝑜/𝑔
* ± 𝑙𝑔
Audible range Audible range
11
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x 10
4
−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
*
12
fm
fm
fs+fm fs-fm fs
Audible range Audible range
In order to avoid aliasing
13
fm
fs+fm fs-fm fs
Audible range
14
fs/2 fs
fm
fs+fm fs-fm fs
Lowpass Filter
15
Frequency sweep of the trivial sawtooth wave
Time (s) Frequency (Hz) 1 1.5 2 2.5 3 3.5 4 4.5 0.5 1 1.5 2 x 10
4
Bandlimited sawtooth wave spectrum
5 10 15 20 −60 −40 −20 Frequency (kHz) Magnitude (dB)
5 10 15 20 −60 −40 −20 Frequency (kHz) Magnitude (dB)
Trivial sawtooth wave spectrum
16
( Note that video frame rate corresponds to the sampling rate )
17
18
fm
fm
fs
fs/2
19
sinc functions!
sinc(x) = sin(π x) π x
Before sampling After sampling Reconstruction Time domain Frequency domain
20
– Unsigned: 0 ~ 2^B-1
– Signed: -2^(B-1) ~ 2^(B-1)-1
– 8 or 16 bits are popular choices – WAV file format
– Composed of sign, exponent and mantissa – The represented number is (-1)s x m x 2e (base 2) or (-1)s x m x 10e (base 10) – Examples
– The floating point can represent a much wider range of numbers – 32 or 64 bits are popular choices – Internal processing in DAW
21
…
B bits
Sine Mantissa Exponent
e m s
22
1/2
P(e)
1
−1/2 1/2
Root mean square (RMS) of noise 20log10 Srms Nrms = 20log10 2B−1 / 2 112 = 6.02B+1.76 dB
(With 16bits, SNR = 98.08dB)
20log10 Smax Nmax = 20log10 2B−1 12 = 6.02B dB (With 16bits, SNR = 96.32 dB) RMS of full-scale sine wave
23
20log10 Srms,max Srms,min = 20log10 2B−1 / 2 1/ 2 = 6.02B − 6 (With 16bits, DR = 90.31 dB) Equal Loudness Curve
Again, RMS of full-scale sine wave for both loudest and softest
24
0 dB
Noise floor (By quantization) Max level Head room Clipping Min level
B = 16 bits In digital audio, 0dB is regarded as the maximum level
– As the signal level goes down, SNR decreases – Low-level signals can have colored noise
– Adding a small white noise to the signal before sampling (or high to low bit conversion) – This adds white noise but coloration is prevented – The amount is the order of 3dB
25
No dithering With dithering
See the added white noise. This is less annoying than the colored noise by quantization
26
27
log freq
Intensity / dB
Borrowed from D. Ellis’ E4896 slides
28
Probability