Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound - PowerPoint PPT Presentation

COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1

Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2

Wave equation 𝑄𝑠𝑓𝑡𝑡𝑣𝑠𝑓 = 𝐽 𝑏𝑢𝑛 + 𝐽(𝑌, 𝑍, 𝑎, 𝑢) 𝐽(𝑌, 𝑍, 𝑎, 𝑢) is not an arbitrary function. Rather: 𝜖𝑌 2 + 𝜖 2 𝜖 2 𝜖𝑍 2 + 𝜖 2 𝜖 2 1 𝐽 𝑌, 𝑍, 𝑎, 𝑢 = 𝜖𝑢 2 𝐽 𝑌, 𝑍, 𝑎, 𝑢 𝜖𝑎 2 𝑤 2 𝑤 = 340 m/s 3

The wave equation + boundary conditions give complicated shadow and reflection effects. What happens when sound enters the ear ? plane wave + single slit sea waves + islands 4

Musical sounds (brief introduction) 5

Example: guitar Write one string displacement at t = 0 as sum of sines. 𝜌 Modes are sin( 𝑀 𝑘𝑦) where 𝑀 is the length of the string, 𝑘 is an integer. 6

𝑀 Physics says: 𝜕 = 𝑑 𝑀 where constant 𝑑 depends on physical properties of string (mass density, tension) 7

Modes of a vibrating string each have fixed points which reduce the effective length. 𝑀 𝑀 𝑀 𝑀 2 3 4 Physics says: 𝜕 = 𝑑 2𝑑 3𝑑 4𝑑 𝑀 𝑀 𝑀 𝑀 8

𝜕 = 𝑑 2𝑑 3𝑑 4𝑑 𝑀 𝑀 𝑀 𝑀 𝜕 0 “fundamental” “overtones” (1 st harmonic) The temporal frequency 𝑛 𝜕 0 is called the 𝑛 -th harmonic. 9

For stringed instruments, most of the sound is produced by vibrations of the instrument body (neck, front and back plates). http://www.acs.psu.edu/drussell/guitars/hummingbird.html The lines in the sketches below are the nodal points. They don't move. These are vibration modes , not harmonics. The guitar sound is a sum of these modes. 10

Difference of two frequencies 𝜕 1 and 𝜕 2 : 𝜕 2 𝑚𝑝𝑕 2 octaves. 𝜕 1 e.g. 1 octave is a doubling of frequency. 11

(Western) Musical Notes Each “octave” ABCDEFGA is divided into 12 “semitones”, separated into 1/12 octave. C-D, D-E, F-G, G-A, A-B are two semitones each E-F, B-C are one semitone each. 12

Q: How many semi-tones are there from 𝜕 0 to 𝜕 ? 13

Q: How many semi-tones are there from 𝜕 0 to 𝜕 ? 𝜕 A: 12 𝑚𝑝𝑕 2 𝜕 0 𝜕 𝜕 0 Fundamental frequency of note 14

88 fundamental frequencies (Hz) on a keyboard The fundamental frequencies of successive notes define a geometric progression. This is different from the harmonics of a vibrating string which define an arithmetic progression . 15

Speech Sounds 16

What determines speech sounds? • voiced vs. unvoiced ‘zzzz’ vs. ‘ ssss ’, ‘ vvvv ’ vs. ‘ ffff ’ • articulators (jaw, tongue, lips) ‘ aaaa ’, ‘ eeee ’, ‘ oooo ’, … 17

Voiced sounds are produced by “glottal pulses”. 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑕 𝑢 − 𝑘 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 18

Exercise 16 Q7. 𝑕 𝑢 − 𝑢 0 = 𝑕 𝑢 ∗ 𝜀(𝑢 − 𝑢 0 ) 19

Voiced sounds are produced by “glottal pulses”. 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑕 𝑢 − 𝑘 𝑈 = 𝑕 𝑢 ∗ 𝜀 𝑢 − 𝑘 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 𝑘=0 20

𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑕 𝑢 − 𝑘 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 decrease 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 by increasing tension in vocal cords ≡ increase frequency of pulses 21

Let 𝑏 𝑢 be the impulse response function of the articulators. (jaw, tongue,lips) 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝐽 𝑢 = 𝑏 𝑢 ∗ 𝑕 𝑢 ∗ 𝜀 𝑢 − 𝑘 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 22

𝑜 𝑞𝑣𝑚𝑡𝑓 −1 𝑜 𝑞𝑣𝑚𝑡𝑓 −1 23

𝑜 𝑞𝑣𝑚𝑡𝑓 −1 𝑜 𝑞𝑣𝑚𝑡𝑓 −1 24

Oral and nasal cavity have resonant modes of vibration, like air cavity in guitar does. 26

Time domain Temporal frequency domain Peaks are called “formants” 27

𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝐆 𝜀 𝑢 − 𝑘 𝑈 = ? 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 𝑈 𝑕 is the period of the glottal pulse train. The pulse train has 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 pulses in 𝑈 time steps, i.e. 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 = 𝑈 . Assume that the Fourier transform is taken over 𝑈 samples. 28

Assignment 3: Show 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 −1 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 −1 𝐆 𝜀 𝑢 − 𝑘 𝑈 = 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝜀 𝜕 − 𝑛 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑘=0 𝑛=0 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 29

Units of temporal frequency 𝜕 𝑈 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 is the period of the glottal pulse train. 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 pulses in 𝑈 time samples. To convert 𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 to ‘pulses per second’, we divide 𝑈 (to get pulses per sample) and then multiply by ‘time samples per second’. High quality audio uses 44,100 samples per second. 30

𝑜 𝑕𝑚𝑝𝑢𝑢𝑏𝑚 is the fundamental frequency of the voiced sound. It determines the "pitch". Adult males : 100-150 Adult females : 150-250 Hz Children: over 250 Hz 31

glottal pulse spectrum “formants” sound spectrum 𝜕 0 = 100 𝐼𝑨 𝜕 0 = 200 𝐼𝑨 glottal pulse spectrum formant spectrum sound spectrum 32

Voiced vowel sounds 33

Unvoiced sounds noise instead of glottal pulses 34

Unvoiced sounds noise instead of glottal pulses Flat amplitude spectrum on average ( ‘white noise’) 35

Consonants Restrict flow of air by moving tongue, lips into contact with the teeth & palate. Fricatives - voiced z, v, zh, th (the) - unvoiced ? Stops - voiced b, d, g - unvoiced ? Nasals (closed mouth) - m, n, ng 36

Consonants Restrict flow of air by moving tongue, lips into contact with the teeth & palate. Fricatives - voiced z, v, zh, th (the) - unvoiced s, f, sh, th (theta) Stops - voiced b, d, g - unvoiced p, t, k Nasals (closed mouth) - m, n, ng 37

I did not have time to cover the following slides properly. I will present them again in lecture 22. 38

Spectrogram Partition a sound signal into 𝐶 blocks of 𝑈 samples each (i.e. the sound has 𝐶𝑈 samples in total). Take the Fourier transform of each block. 39

Spectrogram Partition a sound signal into 𝐶 blocks of 𝑈 samples each (i.e. the sound has 𝐶𝑈 samples in total). Take the Fourier transform of each block. Let 𝑐 be the block number, and 𝜕 units be cycles per block. 40

Cycles per second (Hz) 𝜕 0 = Time (samples) 41

e.g. T = 512 samples (12 ms), 𝜕 0 = 86 Hz T = 2048 samples (48 ms) 𝜕 0 = 21 Hz 42

e.g. T = 512 samples (12 ms), 𝜕 0 = 86 Hz T = 2048 samples (48 ms), 𝜕 0 = 21 Hz You cannot simultaneously localize the frequency and the time. This is a fundamental tradeoff. We have seen it before (recall the Gaussian). 43

Narrowband (good frequency resolution, poor temporal resolution … ~50ms) Wideband (poor frequency resolution, good temporal resolution) 44

Examples: Spectrograms of 10 vowel sounds 45

Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound - PowerPoint PPT Presentation

COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation =

Frequency Decomposition The base frequency or the fundamental frequency is the lowest frequency.

7. Sound CHAPTER HIGHLIGHTS Nature of sound Sine waves amplitude frequency Sine waves,

SOUND SOUND Wha hat is t is sound sound? Click on the image below to find out. Sounds are

? Message sound Message P(wolf|sound) P(sound| wolf) x P(wolf) 1 9/4/19 P(sound| wolf)

Sonification - Sound of Science VU, WS 2013 Lecture 8 - Parameter Mapping Visda Goudarzi

Time-Frequency Analysis Time Frequency Analysis in Visual Signal Yetmen Wang AnCAD, Inc.

SYNTHESIZING 3D SOUND SYNTHESIZING 3D SOUND AND AND SOUND LOCALIZATION SOUND LOCALIZATION

Sound & Editing Lily, Matt, Mei, Michaela Sound WHAT IS SOUND? An audible vibration of the

Sound 1 Sound "50% of the movie experience is sound - George Lucas Sound is used

Hearing and other senses Sound Sound: sensed variations in air pressure Frequency:

The Ear and Hearing Sound and sensations: Physical attributes of sound: intensity, frequency,

Computer Graphics Spectral Analysis Philipp Slusallek Spatial Frequency Frequency

Sound Slide 2 / 50 Characteristics of Sound Sound can travel through any kind of matter, but

AES 116th, Workshop 14 The role of multiple low-frequency signals in the perception of

Time-frequency multipliers for sound synthesis Ph. Depalle , R. Kronland-Martinet and B.

Chapter 7 Audition Sound Sound is the compression and rarefaction of air, or, in other

Phonetics & Phonology Jrgen Trouvain Areas of phonetics Speech production Speech

Nested Dichotomous Models Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center

Pronunciation Variation: TTS & Probabilistic Models CMSC 35100 Natural Language Processing

AIRWAY - BREATHING - HABITS AIRWAY - BREATHING - HABITS & & MYOFUNCTIONAL

Nutrition Diplomacy: Promoting Health and Peace, One Plate at a Time Dr. Johanna Mendelson

Are you ready for ICD 10? Denesecia Green, Senior Health Insurance Specialist Centers for

for Adult Patients with Advanced Illnesses and their Caregivers PCORI Applicant Town Hall Session

Palliative Care in the ED: Understand how to integrate Palliative Care into the emergency

Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound - PowerPoint PPT Presentation

COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation =

Frequency Decomposition The base frequency or the fundamental frequency is the lowest frequency.

7. Sound CHAPTER HIGHLIGHTS Nature of sound Sine waves amplitude frequency Sine waves,

SOUND SOUND Wha hat is t is sound sound? Click on the image below to find out. Sounds are

? Message sound Message P(wolf|sound) P(sound| wolf) x P(wolf) 1 9/4/19 P(sound| wolf)

Sonification - Sound of Science VU, WS 2013 Lecture 8 - Parameter Mapping Visda Goudarzi

Time-Frequency Analysis Time Frequency Analysis in Visual Signal Yetmen Wang AnCAD, Inc.

SYNTHESIZING 3D SOUND SYNTHESIZING 3D SOUND AND AND SOUND LOCALIZATION SOUND LOCALIZATION

Sound &amp; Editing Lily, Matt, Mei, Michaela Sound WHAT IS SOUND? An audible vibration of the

Sound 1 Sound &quot;50% of the movie experience is sound - George Lucas Sound is used

Hearing and other senses Sound Sound: sensed variations in air pressure Frequency:

The Ear and Hearing Sound and sensations: Physical attributes of sound: intensity, frequency,

Computer Graphics Spectral Analysis Philipp Slusallek Spatial Frequency Frequency

Sound Slide 2 / 50 Characteristics of Sound Sound can travel through any kind of matter, but

AES 116th, Workshop 14 The role of multiple low-frequency signals in the perception of

Time-frequency multipliers for sound synthesis Ph. Depalle , R. Kronland-Martinet and B.

Chapter 7 Audition Sound Sound is the compression and rarefaction of air, or, in other

Phonetics &amp; Phonology Jrgen Trouvain Areas of phonetics Speech production Speech

Nested Dichotomous Models Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center

Pronunciation Variation: TTS &amp; Probabilistic Models CMSC 35100 Natural Language Processing

AIRWAY - BREATHING - HABITS AIRWAY - BREATHING - HABITS &amp; &amp; MYOFUNCTIONAL

Nutrition Diplomacy: Promoting Health and Peace, One Plate at a Time Dr. Johanna Mendelson

Are you ready for ICD 10? Denesecia Green, Senior Health Insurance Specialist Centers for

for Adult Patients with Advanced Illnesses and their Caregivers PCORI Applicant Town Hall Session

Palliative Care in the ED: Understand how to integrate Palliative Care into the emergency

Sound & Editing Lily, Matt, Mei, Michaela Sound WHAT IS SOUND? An audible vibration of the

Sound 1 Sound "50% of the movie experience is sound - George Lucas Sound is used

Phonetics & Phonology Jrgen Trouvain Areas of phonetics Speech production Speech

Pronunciation Variation: TTS & Probabilistic Models CMSC 35100 Natural Language Processing

AIRWAY - BREATHING - HABITS AIRWAY - BREATHING - HABITS & & MYOFUNCTIONAL