MIN-Fakultät Fachbereich Informatik
Indoor Sound Localization
Fares Abawi
Universität Hamburg, Fakultät für Mathematik, Informatik und Naturwissenschaften, Fachbereich Informatik, Technische Aspekte Multimodaler Systeme
Monday, 12-12-2016


SLIDE 1

MIN-Fakultät Fachbereich Informatik

Indoor Sound Localization

Universität Hamburg Fakultät für Mathematik, Informatik und Naturwissenschaften Fachbereich Informatik Technische Aspekte Multimodaler Systeme

Fares Abawi Monday, 12-12-2016

SLIDE 2

Contents

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 3

Introduction

Definition: Sound localization is …

SLIDE 4

Introduction

[4]

SLIDE 5

Introduction

The Jeffress Model – an oversimplified model of the mammalian MSO [VIDEO]

[6]
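The delay-line idea behind the Jeffress model can be sketched as a toy NumPy simulation (an illustrative sketch, not from the slides; the pulse waveform, the 3-sample ITD, and the 8-detector bank are invented for the demo). Each "coincidence detector" k delays the left-ear input by k samples and integrates the product with the right-ear input; the detector whose internal delay compensates the ITD responds most strongly:

```python
import numpy as np

n = 256
t = np.arange(n, dtype=float)
left = np.exp(-((t - 120.0) / 10.0) ** 2) * np.sin(0.6 * t)  # left-ear input (a pulse)
itd = 3                                                      # interaural time difference
right = np.zeros(n)
right[itd:] = left[:-itd]                                    # right ear hears it 3 samples later

# Detector k delays the left input by k samples and integrates the coincidence
# (the product) with the right input -- a bank of delay-compensated detectors.
response = [np.sum(left[: n - k] * right[k:]) for k in range(8)]
best = int(np.argmax(response))
assert best == itd   # the winning detector's internal delay equals the ITD
```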

SLIDE 6

Introduction

Medial Superior Olive (MSO): where ITD processing is performed. Lateral Superior Olive (LSO): where ILD processing is performed.

[4]

SLIDE 7

Introduction

Binaural cues [VIDEOS]:
• Varying ITD
• Varying ILD
• Varying ITD & ILD
• Trading ITD off against ILD

[7]

SLIDE 8

Checkpoint

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 9

Cross-Correlation

Get the delay between two signals by shifting one against the other: multiply → sum → shift → repeat!
Convolution Theorem: convolution in the time domain is simply multiplication in the frequency domain, and vice versa.
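The shift–multiply–sum recipe and its frequency-domain shortcut can be compared directly in NumPy (a minimal sketch, not from the slides; the test signals are arbitrary):

```python
import numpy as np

def xcorr_time(a, b):
    """Naive view: shift, multiply, sum, repeat (O(n^2) pairwise products)."""
    return np.correlate(a, b, mode="full")

def xcorr_fft(a, b):
    """Same result via the convolution theorem: multiply spectra, O(n log n)."""
    n = len(a) + len(b) - 1                      # full correlation length
    r = np.fft.ifft(np.fft.fft(a, n) * np.conj(np.fft.fft(b, n))).real
    # reorder the circular lags into [-(len(b)-1), ..., len(a)-1]
    return np.concatenate((r[-(len(b) - 1):], r[: len(a)]))

a = np.sin(0.3 * np.arange(64))
b = np.cos(0.3 * np.arange(64))
assert np.allclose(xcorr_time(a, b), xcorr_fft(a, b))
```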

SLIDE 10

Cross-Correlation

  • The sampling frequency must be at least twice the maximum frequency the system needs to acquire, according to the Nyquist theorem, in order to avoid aliasing.
  • A windowing function (analysis window) must be applied to the signal before transformation to avoid frequency leakage and smearing. The window can be a Hann window, a Hamming window, or the like.
  • Keep in mind: the cross-correlation of two signals produces a vector whose length is the sum of both signal lengths minus 1. If this is ignored, the cross-correlation will be distorted by circular convolution.

Notes on time-to-frequency-domain transformation complexity:
  • Cooley–Tukey FFT: O(n log n)
  • Time-domain cross-correlation: O(n²)
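The effect of the analysis window can be shown with a short NumPy experiment (illustrative, not from the slides; the 128-point length and the between-bins frequency of 10.5 are chosen to provoke worst-case leakage):

```python
import numpy as np

n = 128
t = np.arange(n)
x = np.sin(2 * np.pi * 10.5 * t / n)        # frequency between bins: worst-case leakage

rect = np.abs(np.fft.rfft(x))               # no analysis window (rectangular)
hann = np.abs(np.fft.rfft(x * np.hanning(n)))

peak = int(np.argmax(rect))
far = np.r_[0 : peak - 3, peak + 4 : len(rect)]  # bins well away from the component
# The Hann window trades a wider main lobe for far less leakage into distant bins
assert hann[far].max() < rect[far].max()
```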

SLIDE 11

Cross-Correlation

Two sinusoids with a delay of 7 samples between them: after performing cross-correlation, the peak is detected at x = −7.
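This experiment is easy to reproduce (a sketch; a Gaussian envelope is added to the sinusoids, an assumption made so the correlation peak is unique rather than repeating every period):

```python
import numpy as np

n = 128
t = np.arange(n, dtype=float)
env = np.exp(-((t - 50.0) / 10.0) ** 2)                  # Gaussian envelope (assumption)
sig_a = env * np.sin(0.3 * t)
sig_b = np.exp(-((t - 57.0) / 10.0) ** 2) * np.sin(0.3 * (t - 7.0))  # sig_a delayed by 7

c = np.correlate(sig_a, sig_b, mode="full")
lag = int(np.argmax(c)) - (n - 1)                        # peak index -> signed lag
assert lag == -7                                         # peak at x = -7, as on the slide
```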

SLIDE 12

Checkpoint

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 13

Quality Affecting Factors

Echo and Reverb [ANIMATION]

[8]

SLIDE 14

Quality Affecting Factors

Noise power spectral densities can be estimated by finding the minima over time-frequency bins that do not contain speech.

Could this work for any sound signal? In any environment?

[4]
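The minima-tracking idea can be sketched with a plain STFT in NumPy (a minimal, minimum-statistics-style illustration, not the estimator of [4]; the frame and hop sizes are arbitrary):

```python
import numpy as np

def noise_psd_min(signal, frame=256, hop=128):
    """Estimate the noise PSD by tracking per-bin minima over time frames."""
    win = np.hanning(frame)
    frames = [signal[i : i + frame] * win
              for i in range(0, len(signal) - frame + 1, hop)]
    psd = np.abs(np.fft.rfft(frames, axis=1)) ** 2       # time-frequency power bins
    # Foreground events are sparse in time, so the per-bin minimum over frames
    # approximates the stationary noise floor; the mean is returned for contrast.
    return psd.min(axis=0), psd.mean(axis=0)

rng = np.random.default_rng(0)
x = 0.1 * rng.standard_normal(8000)                      # stationary background noise
x[3000:3400] += np.sin(0.4 * np.arange(400))             # a loud transient "event"
noise_floor, avg = noise_psd_min(x)
assert noise_floor.shape == (129,)
assert np.all(noise_floor <= avg)                        # minima never exceed the mean
```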

SLIDE 15

Quality Affecting Factors

Doppler shift [VIDEO] [9]

SLIDE 16

Checkpoint

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 17

Time Difference of Arrival

In-house Alert Sounds Detection and Direction of Arrival Estimation to Assist People with Hearing Difficulties

[1]

SLIDE 18

Time Difference of Arrival

[1]

SLIDE 19

Time Difference of Arrival

τ(l, j) = 2·S·D · sin((θ_l − θ_j) / 2) · sin((θ_l − θ_j) / 2 + θ_j − φ_t)

This gives the delay at which sound from direction φ_t arrives between microphones l and j of the circular microphone array. The angle is approximated by incrementing φ_t from 0° to 360° and selecting the angle that minimizes the difference between the analytical delay and the delay acquired through cross-correlation.

[1]
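The angle search can be sketched as follows (a toy reconstruction, not the system of [1]: the delay formula mirrors the slide's structure with the 2·S·D factor lumped into an arbitrary constant K, the four microphone angles are invented, and the "measured" delays are synthesized from the same formula, so exact recovery is expected):

```python
import numpy as np

MIC_DEG = np.array([0.0, 90.0, 180.0, 270.0])   # invented circular-array mic angles
PAIRS = [(0, 1), (0, 2), (0, 3)]                # microphone pairs used for delays
K = 10.0                                        # the 2*S*D factor, lumped (illustrative)

def analytic_delay(l, j, phi_deg):
    """Delay between mics l and j for a source at angle phi (slide's formula)."""
    tl, tj, phi = np.radians((MIC_DEG[l], MIC_DEG[j], phi_deg))
    half = (tl - tj) / 2.0
    return K * np.sin(half) * np.sin(half + tj - phi)

def estimate_angle(measured):
    """Increment phi_t over 0..359 degrees; keep the angle with the least mismatch."""
    cost = [sum(abs(analytic_delay(l, j, phi) - measured[(l, j)]) for l, j in PAIRS)
            for phi in range(360)]
    return int(np.argmin(cost))

true_angle = 120
measured = {(l, j): analytic_delay(l, j, true_angle) for l, j in PAIRS}
assert estimate_angle(measured) == true_angle
```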

SLIDE 20

Steered Beamforming

Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering

[2]

SLIDE 21

Steered Beamforming

  • Detect the sound with an array of omnidirectional microphones
  • Steer the beam towards all possible angles
  • Use particle filtering to predict the motion of the sound source
  • Can detect both angle and position!

[2]
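The steer-and-measure-power loop can be illustrated with a two-microphone, integer-delay toy (a sketch, not the system of [2]: no particle filter, and a 5-sample inter-microphone delay stands in for a steering angle):

```python
import numpy as np

def steered_power(x1, x2, k):
    """Advance x2 by k samples (one steering direction), sum, return output power."""
    y = x1[: len(x1) - k] + x2[k:]
    return float(np.mean(y ** 2))

n = 512
t = np.arange(n, dtype=float)
pulse = np.exp(-((t - 200.0) / 15.0) ** 2) * np.sin(0.5 * t)
x1 = pulse                                   # reference microphone
x2 = np.zeros(n)
x2[5:] = pulse[:-5]                          # the same wavefront, 5 samples later

# Steer over candidate delays: the aligned steering maximizes the summed power
powers = [steered_power(x1, x2, k) for k in range(11)]
assert int(np.argmax(powers)) == 5
```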

SLIDE 22

Steered Beamforming

[5]

SLIDE 23

Bio-Inspired Sound Localization

Neural and Statistical Processing of Spatial Cues for Sound Source Localisation

[3]

SLIDE 24

Bio-Inspired Sound Localization

  • Detect the direction of incoming sound
  • Filter the sound signal (gammatone filterbank)
  • Detect ITD and ILD
  • Reduce the dimensionality (Inferior Colliculus → Naïve Bayes)
  • Classify (feed-forward neural network, FFNN)
  • Rotate the robot’s head in the direction of the sound, aligning a single microphone with the sound source

[3]
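The ITD/ILD extraction step can be sketched in NumPy (illustrative only, not the pipeline of [3]: the ear signals, the 7-sample ITD, and the half-amplitude ILD are synthetic, and real systems extract these cues per gammatone band rather than broadband):

```python
import numpy as np

def binaural_cues(left, right):
    """Broadband ILD (dB) and ITD (samples, via the cross-correlation peak)."""
    rms = lambda s: np.sqrt(np.mean(s ** 2))
    ild = 20.0 * np.log10(rms(right) / rms(left))    # level difference in dB
    c = np.correlate(left, right, mode="full")
    itd = int(np.argmax(c)) - (len(right) - 1)       # negative: right is delayed
    return itd, ild

n = 256
t = np.arange(n, dtype=float)
left = np.exp(-((t - 100.0) / 12.0) ** 2) * np.sin(0.4 * t)
# Source on the left: the right ear hears it 7 samples later and at half amplitude
right = 0.5 * np.exp(-((t - 107.0) / 12.0) ** 2) * np.sin(0.4 * (t - 7.0))

itd, ild = binaural_cues(left, right)
assert itd == -7                                     # 7-sample interaural delay
assert abs(ild - 20.0 * np.log10(0.5)) < 0.05        # about -6 dB level difference
```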

SLIDE 25

Checkpoint

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 26

Comparison

          | TDOA                           | Beamforming                                   | Bio-Inspired SSL
Steps     | Cross-correlate, measure delay | Shift, cross-correlate, sum and measure power | Cross-correlate, reduce dimensionality, feed to network and predict
Speed     | Fast                           | Moderate                                      | Slow
Accuracy  | Lowest                         | Moderate                                      | Best
Resources | Low                            | High                                          | High
Training  | Not required                   | Not required                                  | Required

SLIDE 27

Checkpoint

► Introduction
► Cross-Correlation
► Quality Affecting Factors
► Sound Localization:
  ► Time Difference of Arrival
  ► Steered Beamforming
  ► Bio-Inspired Sound Localization
► Comparison
► Summary
► References

SLIDE 28

Summary

  • Mammals localize sound through binaural and monaural cues
  • Interaural level difference (ILD): the difference in sound level/loudness between the two inputs
  • Interaural time difference (ITD): the difference in arrival time of the sound between the two inputs
  • The Lateral Superior Olive (LSO): where ILD is measured in the brain
  • The Medial Superior Olive (MSO): where ITD is measured in the brain
  • Cross-correlation measures the delay between two signals
  • Cross-correlation is performed efficiently in the frequency domain
  • Quality affecting factors:
    • Echo
    • Reverb
    • Noise
    • Doppler shift
SLIDE 29

Summary

  • Computerized systems can measure the direction of sound by:
    • Time difference of arrival (phase delay)
    • Steered beamforming
    • Heuristic and statistical methods
  • Beamforming can detect more than a single sound source
  • Sound can be detected by binaural or multi-microphone array systems (circular or linear)

SLIDE 30

References

[1] M. Daoud, M. Al-Ashi, F. Abawi, and A. Khalifeh, “In-house alert sounds detection and direction of arrival estimation to assist people with hearing difficulties,” in IEEE/ACIS 14th International Conference on Computer and Information Science (ICIS), pp. 297–302, Nevada, US, June 2015.
[2] J.-M. Valin, F. Michaud, and J. Rouat, “Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering,” Robotics and Autonomous Systems, vol. 55, pp. 216–228, 2007.
[3] J. Davila-Chacon, S. Magg, J. Liu, and S. Wermter, “Neural and statistical processing of spatial cues for sound source localisation,” in International Joint Conference on Neural Networks (IJCNN-13), pp. 1–8, Dallas, US, 2013.

SLIDE 31

References

[4] B. Grothe, M. Pecka, and D. McAlpine, “Mechanisms of sound localization in mammals,” Physiological Reviews, vol. 90, no. 3, pp. 983–1012, July 2010. http://physrev.physiology.org/content/90/3/983
[5] A. Greensted, “Delay Sum Beamforming,” The Lab Book Pages, 2012. http://www.labbookpages.co.uk/audio/beamforming/delaySum.html
[6] J. Schnupp, E. Nelken, and A. King, “The Jeffress Model – Animation,” Auditory Neuroscience. https://auditoryneuroscience.com/topics/jeffress-model-animation
[7] J. Schnupp, E. Nelken, and A. King, “Binaural Cues,” Auditory Neuroscience. https://auditoryneuroscience.com/topics/binaural-cue-demos
[8] “Echo and Reverb animation,” The Physics Classroom. http://www.physicsclassroom.com/mmedia/waves/er.gif
[9] “Waves and Sound: The Doppler Effect,” PHYSCLIPS, UNSW School of Physics, Sydney. http://www.animations.physics.unsw.edu.au/jw/doppler.htm

SLIDE 32

Further Reading

[10] B. Clénet and H. Romsdorfer, “Circular microphone array based beamforming and source localization on reconfigurable hardware,” Master’s thesis, Graz University of Technology, 2010.
[11] J. Davila-Chacon, J. Twiefel, J. Liu, and S. Wermter, “Improving humanoid robot speech recognition with sound source localisation,” in International Conference on Artificial Neural Networks, Springer International Publishing, 2014.
SLIDE 33

Questions?

Thank you !