COMP 546 Lecture 22 Spectrograms (revisited), Auditory filters - PowerPoint PPT Presentation

COMP 546 Lecture 22 Spectrograms (revisited), Auditory filters Thurs. April 5, 2018 1

Spectrogram Partition a sound signal into B blocks of T samples each (i.e. the sound has BT samples in total). 2

Spectrogram Partition a sound signal into B blocks of T samples each (i.e. the sound has BT samples in total). Take the Fourier transform of each block. Let b be the block number, and let the units of ω be cycles per block. [I will convert ω to cycles per second a few slides from now.] 3
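A minimal sketch of the block-wise Fourier transform described above, using only the Python standard library. The function names and the test signal are illustrative assumptions, not from the lecture. Blocks are non-overlapping and unwindowed, exactly as the slide describes; practical spectrograms often use overlapping, windowed blocks.

```python
import cmath
import math

def dft_magnitudes(block):
    """Return |X[w]| for w = 0..T/2, where w is in cycles per block."""
    T = len(block)
    mags = []
    for w in range(T // 2 + 1):
        acc = sum(block[t] * cmath.exp(-2j * math.pi * w * t / T) for t in range(T))
        mags.append(abs(acc))
    return mags

def spectrogram(signal, T):
    """Split the signal into B = len(signal) // T blocks and transform each one."""
    B = len(signal) // T
    return [dft_magnitudes(signal[b * T:(b + 1) * T]) for b in range(B)]

# Hypothetical test signal: 4 cycles per block for two blocks, then 8 cycles per block.
T = 32
signal = [math.sin(2 * math.pi * 4 * t / T) for t in range(2 * T)]
signal += [math.sin(2 * math.pi * 8 * t / T) for t in range(2 * T)]

S = spectrogram(signal, T)  # S[b][w]: block number b, frequency w in cycles per block
peaks = [max(range(len(row)), key=row.__getitem__) for row in S]
print(peaks)  # [4, 4, 8, 8]
```

Each row of S is one column of the spectrogram picture: block number b on the time axis, ω from 0 to T/2 cycles per block on the frequency axis.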

[Figure: spectrogram grid. Vertical axis: frequency ω in cycles per block (1, 2, …, T/2). Horizontal axis: block number b (1, 2, 3, …).] 4

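[Figure: spectrogram grid with the frequency axis converted to cycles per second. ω0 has units of blocks/sec, so ω (cycles/block) × ω0 (blocks/sec) has units of cycles/sec. Vertical axis: ω0, 2ω0, …, (T/2)ω0. Horizontal axis: block number b (1, 2, 3, …).] 5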

[Figure: the same grid with the time axis converted to seconds. 1/ω0 has units of sec/block. Vertical axis: ω0, 2ω0, …, (T/2)ω0 cycles per second. Horizontal axis: time (sec) at 1/ω0, 2/ω0, 3/ω0, …, b/ω0.] 6

High quality audio: 44,100 samples/sec. 1/ω0 has units of sec/block; multiply by 44,100 samples/sec to get T samples per block. [Figure: the same grid. Vertical axis: ω0, 2ω0, …, (T/2)ω0 cycles per second. Horizontal axis: time (sec) at 1/ω0, 2/ω0, 3/ω0, …, b/ω0.] 7

e.g. T = 512 samples (~12 ms), ω0 ≈ 86 Hz; T = 2048 samples (~46 ms), ω0 ≈ 21.5 Hz. You cannot have high precision in both frequency and time. 8
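The figures on this slide are approximate; they follow from the 44,100 samples/sec rate quoted earlier. A small sketch of the arithmetic (the function name is an assumption made for illustration):

```python
SAMPLE_RATE = 44100  # samples/sec, the rate quoted in the lecture

def block_stats(T, sample_rate=SAMPLE_RATE):
    """Return (block duration in ms, frequency spacing omega0 in Hz) for T samples per block."""
    duration_ms = 1000.0 * T / sample_rate
    omega0_hz = sample_rate / T
    return duration_ms, omega0_hz

for T in (512, 2048):
    d, w0 = block_stats(T)
    print(f"T = {T}: {d:.1f} ms per block, omega0 = {w0:.1f} Hz")
# T = 512: 11.6 ms per block, omega0 = 86.1 Hz
# T = 2048: 46.4 ms per block, omega0 = 21.5 Hz
```

The block duration in seconds times ω0 in Hz is always 1, which is the trade-off stated on the slide: shrinking one necessarily grows the other.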

Narrowband (good frequency resolution, poor temporal resolution … ~46 ms) Wideband (poor frequency resolution, good temporal resolution … ~12 ms) 9

Example: Wideband spectrograms of 10 vowel sounds [Figure label: formants] 10

Spectrogram time scales capture auditory events in the world (e.g. parts of speech, impacts, …) at relatively large time scales. e.g. period of 12 ms, ω0 = 86 Hz, λ ~ 4 meters. These low frequencies play little role in spatial hearing (last lecture). 11
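The wavelength figure follows from wavelength = speed of sound / frequency. A short check, assuming a speed of sound of 343 m/s (air at roughly 20 °C; this value is not given on the slide):

```python
SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 degrees C

def wavelength_m(freq_hz, c=SPEED_OF_SOUND):
    """Wavelength in meters of a tone at freq_hz."""
    return c / freq_hz

print(round(wavelength_m(86.0), 2))  # 3.99
```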

What are the impulse response functions of auditory filters? (durations, bandwidths and center frequencies) 12

Auditory filters • head related impulse response • basilar membrane http://www.neurosci.info/courses/systems/Nobels/1961%20von%20Bekesy/bekesy-lecture.pdf • hair cells and ganglion cells in cochlea • brainstem e.g. MSO, LSO • cortex A1 (later today … larger time scales) 13

Auditory filters Classical experiments used pure tones and/or noise. (starting in the 1940s and going for 50 years) • recording from single cells (BM, nerve fibres in cochlear nerve, brainstem) • psychophysics e.g. masking 14

Example: Frequency tuning curves (thresholds) for different ganglion cells to pure tone stimuli 15

Psychophysical Masking How does presence of one frequency component affect our ability to hear other frequency components? Two similar frequencies mask each other more than two different frequencies. 16

Example Masking Experiment [Figure: frequency versus time, showing ω_test and ω_mask across Interval 1 and Interval 2.] Task: Which interval contains the test tone? 17

For each test frequency ω0 with some given SPL, for each masking frequency ω_M, measure a masking threshold I_M(ω_M). Define “critical bandwidth” for ω0 by Δω. [Figure: masking threshold I_M plotted against ω_M, with the width Δω marked around ω0.] 18

Auditory filters: typical bandwidth model. [Figure: Δω plotted against center frequency, axis from 0 to 22,000 Hz.] Δω is ~100 Hz for center frequency up to 1000 Hz. Δω is ~1/3 octave from 1000 Hz up to 22,000 Hz. 19
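A rough sketch of the two-piece bandwidth model above. It assumes that a "1/3 octave" band around center frequency f runs from f·2^(-1/6) to f·2^(1/6); that reading, and the function name, are assumptions rather than something the slide states.

```python
def critical_bandwidth_hz(center_hz):
    """Approximate critical bandwidth for a given center frequency (Hz)."""
    if center_hz <= 1000.0:
        return 100.0  # roughly constant bandwidth at low center frequencies
    # 1/3 octave band: edges one sixth of an octave below and above the center
    lower = center_hz * 2 ** (-1 / 6)
    upper = center_hz * 2 ** (1 / 6)
    return upper - lower

for f in (200, 1000, 2000, 4000):
    print(f, round(critical_bandwidth_hz(f)))
# 200 100
# 1000 100
# 2000 463
# 4000 926
```

Under this reading the bandwidth above 1000 Hz is about 23% of the center frequency, so the two pieces do not meet at 1000 Hz. The model is a coarse summary, not a fitted curve.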

Gammatone filter model Similar to Gabor filters but the window is asymmetric. (Also, note it is shifted in time to enforce causality.) [Figure: gammatone impulse responses for center frequencies 400, 700, 1000, 3000, 5000 and 10000.] 20
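For concreteness, here is a sketch of the commonly used gammatone form g(t) = t^(n-1) · exp(-2π b t) · cos(2π f t) for t ≥ 0 and zero for t < 0. The order n = 4 and the bandwidth parameter b = 100 Hz used below are illustrative assumptions; the slide does not give parameter values.

```python
import math

def gammatone(t, center_hz, bandwidth_hz, order=4):
    """Gammatone impulse response at time t (seconds); zero before t = 0 (causal)."""
    if t < 0:
        return 0.0
    # Gamma-shaped envelope: fast rise, slow decay, so the window is asymmetric
    envelope = t ** (order - 1) * math.exp(-2 * math.pi * bandwidth_hz * t)
    return envelope * math.cos(2 * math.pi * center_hz * t)

def envelope_peak_time(bandwidth_hz, order=4):
    """Time (seconds) at which the envelope t^(n-1) exp(-2 pi b t) is largest."""
    return (order - 1) / (2 * math.pi * bandwidth_hz)

# Hypothetical filter: 700 Hz center frequency, b = 100 Hz
print(round(1000 * envelope_peak_time(100.0), 2))  # 4.77 (ms)
```

A Gabor filter would use a Gaussian window symmetric about its peak. Here the envelope peaks a few milliseconds after onset and then decays more slowly than it rose.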

Auditory filters • head related impulse response • basilar membrane • hair cells and ganglion cells in cochlea • brainstem e.g. MSO, LSO • cortex (A1 and beyond) 21

V1: recall Hubel and Wiesel (1962) Such a stimulus works well if you already know the cell is orientation and motion selective. 22

Q: What to do if you don’t know anything about the receptive field? A: Compute “spike triggered average”. 23

Use random input (often white noise). What is the average spatio-temporal stimulus that preceded the spikes? e.g. XT illustration = ‘spike triggered average’ 24
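A minimal one-dimensional (time only) sketch of the spike triggered average. The model cell, a made-up linear filter followed by a threshold, is purely hypothetical and stands in for a real recording; the kernel values, threshold and seed are arbitrary choices.

```python
import math
import random

def spike_triggered_average(stimulus, spikes, n_lags):
    """sta[j] = mean stimulus value j samples before a spike (j = 0 is the spike time)."""
    sta = [0.0] * n_lags
    count = 0
    for t in range(n_lags - 1, len(stimulus)):
        if spikes[t]:
            for j in range(n_lags):
                sta[j] += stimulus[t - j]
            count += 1
    return [v / count for v in sta] if count else sta

# Hypothetical model cell: linear temporal filter followed by a threshold.
rng = random.Random(0)
kernel = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -0.25, 0.0]    # made-up receptive field
stimulus = [rng.gauss(0.0, 1.0) for _ in range(20000)]  # white noise input
spikes = [False] * len(stimulus)
for t in range(len(kernel) - 1, len(stimulus)):
    drive = sum(kernel[j] * stimulus[t - j] for j in range(len(kernel)))
    spikes[t] = drive > 2.0  # spike when the filtered input exceeds a threshold

sta = spike_triggered_average(stimulus, spikes, len(kernel))

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))

print(cosine(sta, kernel))  # close to 1: the STA recovers the shape of the filter
```

The experimenter never sees `kernel`; only the stimulus and the spike times go into the average. For white noise input, the average stimulus preceding a spike is proportional to the cell's linear filter, which is why the method works without prior knowledge of the receptive field.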

Real data for V1 receptive field (XYT) Spike triggered average stimulus (backwards in time). Spike at t=0. [Figure legend: negative and positive values.] [DeAngelis 1995] 25

Auditory Cortex Receptive Fields Inputs to A1 have been spectrally bandpass filtered. There is almost no phase locking to the stimulus sound any more. 26

Example of responses of 8 auditory nerve fibres to a voice sound. Spectrogram of voice saying “Joe took father’s green shoe bench out”. Spike histograms of auditory nerve fibres (cat) with different peak (“characteristic”) frequency sensitivities. [Delgutte 1997] 27

What stimuli to use? (Cats don’t understand human speech, so it is unlikely we would find cells tuned for it.) Recall Hubel and Wiesel had first tried using center-surround stimuli for cells in V1. The analogy in audition would be to use the same bandpass stimuli used for auditory fibres. Any other ideas? 28

Random “chord” stimuli [deCharms, 1998] [Figure axis: frequency ω.] 29

What spike triggered average should we expect from a bandpass cell? [Figure: (ω, t) plane with a single positive (+) region.] 30

Do we find more interesting cells such as…? [Figure: three (ω, t) panels with positive (+) and negative (-) subregions.] 31

Examples: Spectro-temporal receptive fields of A1 neurons [deCharms, 1998] 32

Orientation selective in (ω, t)? Verify the responses of the above cell to a tone and its harmonics, changing over time: 33

ASIDE: Two Applications 34

Cochlear implants are used for profoundly deaf people whose hair cells have been destroyed by disease but whose auditory nerve is intact. Microphone + speech/sound processor. Electrode array (inserted into cochlea). 35

MP3: Data Compression. Simultaneous masking: what I mentioned earlier. Forward masking: Sound at time t can mask sound at time t + Δt in nearby frequency bands, even if Δt is greater than the duration of the auditory (gammatone) filter. In both cases, you can use fewer bits to code sound and listeners won’t notice. 36
