time of arrival estimation for blind beamforming
play

Time-of-arrival estimation for blind beamforming Pasi Pertil , - PowerPoint PPT Presentation

Time-of-arrival estimation for blind beamforming Pasi Pertil , pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland Presentation outline 1) Traditional


  1. Time-of-arrival estimation for blind beamforming Pasi Pertilä , pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland

  2. Presentation outline 1) Traditional beamforming / beam steering 2) Ad-hoc microphone arrays 3) Three ad-hoc array beam steering methods – Time-of-Arrival (TOA) based solutions 4) Simulation of TOA accuracy 5) Measurements with an array of smartphones – Accuracy of TOA estimation – Obtained beamforming quality DSP 2013, Santorini 7/6/13 2

  3. Traditional Beamforming • Linear combination of microphone signals X i ( ω ), where i =1,…,M • Requirements for steering the beam: 1) Array shape is known (mic. position matrix M ) 2) Sensors are synchronous (time offset is zero/known) 3) Direction/position to steer the array is known or can be scanned e.g. based on energy. • Simple Delay-and-Sum Beamformer (DSB) M − 1 Y ( ω ) = exp( i ωτ i )  X i ( ω ) ∑     i = 0 time-shifting DSP 2013, Santorini 7/6/13 3

  4. Signal observation (near field)  • Sound Time-of-Flight (TOF) is τ i = m i − s c − 1  • Align signals by advancing x i (t) by τ i x 0 ( t ) = s ( t − τ 0 ) s ( t ) x 1 ( t ) = s ( t − τ 1 ) x M − 1 ( t ) = s ( t − τ M − 1 ) DSP 2013, Santorini 7/6/13 4

  5. Ad-Hoc microphone array • Independent devices equipped with a microphone • Traditional beamforming requirements unfulfilled 1. Array geometry is unknown ( M is unknown) 2. Devices aren’t synchronized (unknown time offsets Δ i ) 3. The space cannot be easily panned to find source direction θ to steer the beam into DSP 2013, Santorini 7/6/13 5

  6. Time of Arrival (TOA) • Signal time-of-arrival (TOA) for and ad-hoc array c − 1 s − m i τ i = + Δ i       time offset propagation delay • Time-difference-of-Arrival (TDOA) for mics i, j τ i , j = τ i − τ j • TDOAs τ i,j can be measured using e.g. correlation • Previously considered as source spatial information A. Brutti and F. Nesta, “Tracking of multidimensional TDOA for multiple sources with distributed microphone pairs,” Computer Speech & Language, vol. 27, • TDOA and TOA vectors are written as P=M(M-1)/2 DSP 2013, Santorini 7/6/13 6

  7. Time of Arrival (TOA) • By defining an observation matrix – E.g. for three microphones H = • The linear model between TOA and TDOA is • TOA proposed as source spatial representation DSP 2013, Santorini 7/6/13 7

  8. Time of Arrival (TOA) – 1st • Baseline method (TDOA subset) : 1. Select a reference microphone (e.g. 1 st mic) 2. Use relative delays τ i,j between the reference (i =1) and rest (j =2,…,M) as TOA - Does not utilize TDOA information between all sensors DSP 2013, Santorini 7/6/13 8

  9. Time of Arrival (TOA) – 2nd • Moore-Penrose inverse solution for TOA • H 0 is H without the first column to account for one missing degree of freedom, i.e. the TOA is relative to 1 st sensor (which is set to zero). + Utilizes TDOA information between all sensors DSP 2013, Santorini 7/6/13 9

  10. Time of Arrival (TOA) – 3rd • Kalman filtering based TOA estimation (state eq.) (measurement eq.) ! $ τ – x consists of TOA and TOA velocity, x = # &  τ " % – A is transition matrix, q, r are noise – Predict p ( x t | y t-1 ) and update p ( x t | y t ) steps. – Outlier rejection based on projected measurement likelihood + Utilizes TDOA information between all pairs + Can track speaker during noise contaminated segments. DSP 2013, Santorini 7/6/13 10

  11. TOA Estimation simulation • 3 microphones 48kHz • Source rotates around the array • Gaussian noise added to TDOA observations τ ij , σ = 20 • Gaussian noise in Δ i , σ 2 = 1 0 offset values DSP 2013, Santorini 7/6/13 11

  12. Simulation – TOA accuracy Baseline (subset Moore-Penrose Kalman of TDOAs) Inverse filter TOA RMS error (samples@48k, 100 trials) Baseline 19.9 Moore-Penrose 16.2 Kalman filter 8.7 DSP 2013, Santorini 7/6/13 12

  13. Measurements • 10 smartphones were used to capture audio • 9 and 12 second sentences were used – Speaker walked around the array • Reverberation time T60 ~ 370 ms • Room size: 5.1m × 6.6m • TDOAs were manually annotated to obtain ground truth TOA. • Reference signal was captured with headworn microphones. DSP 2013, Santorini 7/6/13 13

  14. Performance of TOA estimators in measurements (samples @ 48kHz) 500 461 437 Rec 1 450 RMS Error Rec 2 400 350 300 232 223 250 200 150 110 100 47 50 0 Baseline Moore-Penrose Kalman filter DSP 2013, Santorini 7/6/13 14

  15. Obtained beamforming quality • We used estimated TOAs to steer DSB • Output y(t) quality was evaluated with BSS- metric “Signal-to-Artifacts-Ratio” or SAR *) ( ) SAR = 20log 10 s target e artifacts y ( t ) = s target ( t ) + e artifacts ( t ) – Scored in segments due to speaker movement (gain variation) – Only active segments considered (with VAD) – Modified metric: Segmental Signal-to-Artifacts Ratio Arithmetic mean (SSARA) *) http://bass-db.gforge.inria.fr/bss eval/ DSP 2013, Santorini 7/6/13 15

  16. Objective speech quality 8 Best Mic. 7 6 TDOA SSARA (dB) 5 Moore-Pensore 4 inverse 3 Kalman filter 2 Ground Truth 1 TOA 0 Rec #1 Rec #2 DSP 2013, Santorini 7/6/13 16

  17. Conclusions • Proposed TOA as the spatial source information of an ad-hoc microphone array – Previous research only considered TDOA – Dimension of TOA is M-1 , for TDOA M(M-1)/2 • Three TOA estimation solutions considered – TDOA subset (baseline), pseudo-inverse, and Kalman filtering à most accurate • TOA allows beam-steering towards source – w/o mic. positions / synchronization: blindly – Kalman filter based TOA provided best objective signal quality for beamforming DSP 2013, Santorini 7/6/13 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend