digital audio
play

Digital Audio Graduate School of Culture Technology, KAIST Juhan - PowerPoint PPT Presentation

2018 Fall CTP431: Music and Audio Computing Digital Audio Graduate School of Culture Technology, KAIST Juhan Nam Outlines Introduction Sampling Quantization Digital audio standards Playback Rate Conversion / Resampling


  1. 2018 Fall CTP431: Music and Audio Computing Digital Audio Graduate School of Culture Technology, KAIST Juhan Nam

  2. Outlines • Introduction • Sampling • Quantization • Digital audio standards • Playback Rate Conversion / Resampling

  3. Introduction … 0 1 1 0 1 1 0 … Sound … 1 0 0 1 1 0 1 … Image Text … 0 0 1 1 0 1 1 …

  4. Digital Representations • Sampling and Quantization - Sound (samples) - Image (pixels) • Trade-off - Between quality and data size

  5. Digital Audio Chain

  6. Microphone and Speakers • Microphones - Sound to electrical signal - Dynamic mic: Fleming’s right-hand rule - Condenser mic: Q = CV, C = A/d Condenser mic Dynamic mic - Pre-amp (A= area, D= distance) • Loudspeakers - Electrical signal to sound - Similar to dynamic mic in principle - Fleming’s left-hand rule - Crossover networks: woofer / tweeter - Power amp Source: http://www.shure.com/americas/support/find-an-answer/difference-between-a-dynamic-and-condenser-microphone Source: http://hyperphysics.phy-astr.gsu.edu/hbase/Audio/cross.html

  7. Sampling • Convert continuous-time signals to discrete-time signals by periodically picking up the instantaneous values - Represented as a sequence of numbers: pulse code modulation (PCM) - Sampling period ( T s ): the amount of time between samples - Sampling rate ( f s = 1/ T s ) T s Signal notation x ( t ) → x ( nT s )

  8. Sampling Theorem • What is an appropriate sampling rate? - Too high: increase data rate - Too low: become hard to reconstruct the original signal • Sampling Theorem - In order for a band-limited signal to be reconstructed fully, the sampling rate must be greater than twice the maximum frequency in the signal 𝑔 " : sampling rate 𝑔 " > 2 & 𝑔 ' 𝑔 ' : maximum frequency - Half the sampling rate is called Nyquist frequency ( 𝑔 " /2 )

  9. Aliasing • If the sampling rate is less than twice the maximum frequency, the high-frequency content is folded over to lower frequency range 1 0.8 0.6 0.4 0.2 0 − 0.2 − 0.4 − 0.6 − 0.8 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 4 x 10

  10. Aliasing in Frequency Domain • For general signals, high-frequency content is folded over to lower frequency range Audible range -f m f s -f s f s -f m f m f s +f m

  11. To avoid Aliasing • Increase sampling rate f s > 2 ⋅ f m • Or use lowpass filters before sampling -f s -f m f s -f m f s f m f s +f m Lowpass Filter f s /2 f s -f s -f s /2

  12. Example of Aliasing 0 0 Magnitude (dB) Magnitude (dB) − 20 − 20 − 40 − 40 − 60 − 60 5 10 15 20 5 10 15 20 Frequency (kHz) Frequency (kHz) Trivial sawtooth wave spectrum Bandlimited sawtooth wave spectrum 4 x 10 2 1.5 Frequency (Hz) 1 0.5 Frequency sweep of the trivial sawtooth wave 12 0 1 1.5 2 2.5 3 3.5 4 4.5 Time (s)

  13. Example of Aliasing in Video https://www.youtube.com/watch?v=jHS9JGkEOmA

  14. Quantization • Discretizing the amplitude of real-valued signals - Round the amplitude to the nearest discrete steps - The discrete steps are determined by the number of bit bits - N bits can range from -2 N -1 to 2 N -1 -1 2 N -1 -1 Quantization step -2 N -1

  15. Quantization Error • Quantization causes noise - Average power of quantization noise: obtained from the probability density function (PDF) of the error P ( e ) Root mean square (RMS) of noise 1 1/2 112 x 2 p ( e ) dx ∫ = − 1/2 -1/2 1/2 • Signal to Noise Ratio (SNR) - Based on average power RMS of full-scale sine wave 2 B − 1 / S rms 2 20log 10 = 20log 10 = 6.02 B + 1.76 dB N rms 112

  16. Digital Audio Standards • Determined by the limit in human hearing - Maximum audible frequency (bandwidth): 20kHz - Dynamic range: depends on frequency (the maximum is about 120dB) Sound Level (dB) Human hearing range

  17. Digital Audio Standards • Compact disc - Sampling rate: 44.1 kHz: > 2 x 20 kHz - Bit depth: 16 bits: SNR = 98.08dB • Blu-ray disc / professional audio - Sampling rate: 48 / 96 /192kHz: > 2 x 20 kHz - Bit depth: 16 / 20 / 24 bits • Telephone - Sampling rate: 8 / 16 kHz - Bit depth: 8 bits (with companding)

  18. Playback Rate Conversion • Playback rate does not have to be the same as the recording rate • Adjusting the playback rate given the recorded audio creates different tones - Sliding tapes on the magnetic header in a variable speed - Speeding down: “monster-like” - Speeding up: “chipmunk-like” - It can be even negative rate: reverse playback

  19. Demo: Playback Rate Conversion • https://musiclab.chromeexperiments.com/Voice-Spinner

  20. Resampling • Reconstruct the original signal and sample it with a new sampling rate • For a digital system with a constant playback rate - Up-sampling makes the original have slower speed and lower pitch - Down-sampling makes the original have faster speed and higher pitch

  21. Resampling • Resampling changes pitch, length and timbre at the same time! Original Speed Down (Up-sampling) Speed Up (Down-sampling) [The DaFX book]

  22. Practice: Audacity • Recording • Editing • Digital Audio Effects

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend