Time-of-arrival estimation for blind beamforming Pasi Pertil , - - PowerPoint PPT Presentation

time of arrival estimation for blind beamforming
SMART_READER_LITE
LIVE PREVIEW

Time-of-arrival estimation for blind beamforming Pasi Pertil , - - PowerPoint PPT Presentation

Time-of-arrival estimation for blind beamforming Pasi Pertil , pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland Presentation outline 1) Traditional


slide-1
SLIDE 1

Time-of-arrival estimation for blind beamforming

Pasi Pertilä, pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland

slide-2
SLIDE 2

Presentation outline

1) Traditional beamforming / beam steering 2) Ad-hoc microphone arrays 3) Three ad-hoc array beam steering methods

– Time-of-Arrival (TOA) based solutions

4) Simulation of TOA accuracy 5) Measurements with an array of smartphones

– Accuracy of TOA estimation – Obtained beamforming quality

7/6/13 DSP 2013, Santorini 2

slide-3
SLIDE 3
  • Linear combination of microphone signals

Xi(ω), where i =1,…,M

  • Requirements for steering the beam:

1) Array shape is known (mic. position matrix M) 2) Sensors are synchronous (time offset is zero/known) 3) Direction/position to steer the array is known or can be scanned e.g. based on energy.

  • Simple Delay-and-Sum Beamformer (DSB)

Traditional Beamforming

7/6/13 DSP 2013, Santorini 3

Y(ω) = exp(iωτ i)

time-shifting

     Xi(ω)

i=0 M−1

slide-4
SLIDE 4
  • Sound Time-of-Flight (TOF) is
  • Align signals by advancing xi(t) by

Signal observation (near field)

7/6/13 DSP 2013, Santorini 4

 τ i = mi −s c−1

s(t) x0(t) = s(t −τ 0) x1(t) = s(t −τ1) xM−1(t) = s(t −τ M−1)

 τ i

slide-5
SLIDE 5

Ad-Hoc microphone array

  • Independent devices equipped with a microphone
  • Traditional beamforming requirements unfulfilled
  • 1. Array geometry is unknown (M is unknown)
  • 2. Devices aren’t synchronized (unknown time offsets Δi )
  • 3. The space cannot be easily panned to find source

direction θ to steer the beam into

7/6/13 DSP 2013, Santorini 5

slide-6
SLIDE 6
  • Signal time-of-arrival (TOA) for and ad-hoc array
  • Time-difference-of-Arrival (TDOA) for mics i, j
  • TDOAs τi,j can be measured using e.g. correlation
  • Previously considered as source spatial information
  • TDOA and TOA vectors are written as

P=M(M-1)/2

Time of Arrival (TOA)

7/6/13 DSP 2013, Santorini 6

τ i, j = τ i −τ j

τ i = c−1 s− mi propagation delay      + Δi time offset 

  • A. Brutti and F. Nesta, “Tracking of multidimensional TDOA for multiple sources

with distributed microphone pairs,” Computer Speech & Language, vol. 27,

slide-7
SLIDE 7
  • By defining an observation matrix

– E.g. for three microphones H =

  • The linear model between TOA and TDOA is
  • TOA proposed as source spatial representation

Time of Arrival (TOA)

7/6/13 DSP 2013, Santorini 7

slide-8
SLIDE 8

Time of Arrival (TOA) – 1st

  • Baseline method (TDOA subset):
  • 1. Select a reference microphone (e.g. 1st mic)
  • 2. Use relative delays τi,j between the

reference (i =1) and rest (j =2,…,M) as TOA

  • Does not utilize TDOA information between

all sensors

7/6/13 DSP 2013, Santorini 8

slide-9
SLIDE 9

Time of Arrival (TOA) – 2nd

  • Moore-Penrose inverse solution for TOA
  • H0 is H without the first column to account for
  • ne missing degree of freedom, i.e. the TOA

is relative to 1st sensor (which is set to zero). + Utilizes TDOA information between all sensors

7/6/13 DSP 2013, Santorini 9

slide-10
SLIDE 10
  • Kalman filtering based TOA estimation

(state eq.)

(measurement eq.) – x consists of TOA and TOA velocity, – A is transition matrix, q, r are noise – Predict p(xt|yt-1) and update p(xt|yt) steps. – Outlier rejection based on projected measurement likelihood

+ Utilizes TDOA information between all pairs + Can track speaker during noise contaminated segments.

Time of Arrival (TOA) – 3rd

7/6/13 DSP 2013, Santorini 10

x = τ  τ ! " # $ % &

slide-11
SLIDE 11

TOA Estimation simulation

7/6/13 DSP 2013, Santorini 11

  • 3 microphones 48kHz
  • Source rotates

around the array

  • Gaussian noise

added to TDOA

  • bservations
  • Gaussian noise in
  • ffset values

τ ij, σ = 20

Δi, σ 2 =10

slide-12
SLIDE 12

Simulation – TOA accuracy

Baseline (subset

Moore-Penrose Kalman

  • f TDOAs)

Inverse filter

7/6/13 DSP 2013, Santorini 12

8.7 16.2 19.9

Kalman filter Moore-Penrose Baseline TOA RMS error (samples@48k, 100 trials)

slide-13
SLIDE 13

Measurements

  • 10 smartphones were used to capture audio
  • 9 and 12 second sentences were used

– Speaker walked around the array

  • Reverberation time T60 ~ 370 ms
  • Room size: 5.1m × 6.6m
  • TDOAs were manually annotated to obtain

ground truth TOA.

  • Reference signal was captured with

headworn microphones.

7/6/13 DSP 2013, Santorini 13

slide-14
SLIDE 14

Performance of TOA estimators in measurements

7/6/13 DSP 2013, Santorini 14

437 223 47 461 232 110 50 100 150 200 250 300 350 400 450 500

Baseline Moore-Penrose Kalman filter

RMS Error (samples @ 48kHz)

Rec 1 Rec 2

slide-15
SLIDE 15

Obtained beamforming quality

  • We used estimated TOAs to steer DSB
  • Output y(t) quality was evaluated with BSS-

metric “Signal-to-Artifacts-Ratio” or SAR*)

– Scored in segments due to speaker movement (gain variation) – Only active segments considered (with VAD) – Modified metric: Segmental Signal-to-Artifacts Ratio Arithmetic mean (SSARA)

7/6/13 DSP 2013, Santorini 15

*) http://bass-db.gforge.inria.fr/bss eval/

SAR= 20log10 starget eartifacts

( )

y(t) = starget(t)+eartifacts(t)

slide-16
SLIDE 16

Objective speech quality

1 2 3 4 5 6 7 8 Rec #1 Rec #2

SSARA (dB)

Best Mic. TDOA Moore-Pensore inverse Kalman filter Ground Truth TOA

7/6/13 DSP 2013, Santorini 16

slide-17
SLIDE 17

Conclusions

  • Proposed TOA as the spatial source

information of an ad-hoc microphone array

– Previous research only considered TDOA – Dimension of TOA is M-1, for TDOA M(M-1)/2

  • Three TOA estimation solutions considered

– TDOA subset (baseline), pseudo-inverse, and Kalman filtering à most accurate

  • TOA allows beam-steering towards source

– w/o mic. positions / synchronization: blindly – Kalman filter based TOA provided best

  • bjective signal quality for beamforming

7/6/13 DSP 2013, Santorini 17