Capturing the elevation dependence of ITD using an extension of the - - PowerPoint PPT Presentation

capturing the elevation dependence of itd using an
SMART_READER_LITE
LIVE PREVIEW

Capturing the elevation dependence of ITD using an extension of the - - PowerPoint PPT Presentation

Capturing the elevation dependence of ITD using an extension of the spherical-head model Rahulram Sridhar 3D3A Laboratory, Princeton University This talk is based on the work by Sridhar and Choueiri, published under the same title as paper


slide-1
SLIDE 1

Capturing the elevation dependence

  • f ITD using an extension of the

spherical-head model

Rahulram Sridhar 3D3A Laboratory, Princeton University

This talk is based on the work by Sridhar and Choueiri, published under the same title as paper number 9447 in the proceedings of the AES 139th convention

slide-2
SLIDE 2

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-3
SLIDE 3

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-4
SLIDE 4

Introduction & motivation

What is ITD?

Top view of human head

sound source

slide-5
SLIDE 5

Introduction & motivation

Why must it be individualized?

Top view of two different human heads showing variations in head shape and size

Images taken from Ball et al. “A comparison between Chinese and Caucasian head shapes,” Applied Ergonomics 41 (2010) 832-839

slide-6
SLIDE 6

Introduction & motivation

How should we individualize it?

slide-7
SLIDE 7

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-8
SLIDE 8

Background & Previous Work

From measured HRIRs3,5

Techniques for estimating individualized ITDs

3D scan + acoustic ray tracing6 Anthropometry-based model based on spatial variation of ITD7-10 Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

slide-9
SLIDE 9

Background & Previous Work

From measured HRIRs3,5

Techniques for estimating individualized ITDs

Numbered references correspond to those in paper 9447 on which this talk is based

Sound localization facility at Wright Patterson Air Force Base in Dayton, Ohio

Image taken from Hartmann, W. M., “How We Localize Sound,” Physics Today, pp. 24-29, 1999

slide-10
SLIDE 10

Background & Previous Work

Techniques for estimating individualized ITDs

3D scan + acoustic ray tracing6

Propagation delay visualization. Sound source is 39 deg. to the right of the dummy head

Image take from Gamper, H. et al., “Estimation of multi path propagation delays and interaural time differences from 3-D head scans,” ICASSP 2015

Numbered references correspond to those in paper 9447 on which this talk is based

slide-11
SLIDE 11

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

slide-12
SLIDE 12

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

Ellipsoidal-head model4 fairly simple and accurate

slide-13
SLIDE 13

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

Ellipsoidal-head model4 fairly simple and accurate tricky mapping to anthropometric features

slide-14
SLIDE 14

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

Ellipsoidal-head model4 Spherical-head model12 fairly simple and accurate tricky mapping to anthropometric features

slide-15
SLIDE 15

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

Ellipsoidal-head model4 Spherical-head model12 fairly simple and accurate tricky mapping to anthropometric features easy mapping to anthropometric features

slide-16
SLIDE 16

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on simple geometric shape of head4,12

References correspond to those in paper 9447 on which this talk is based

Ellipsoidal-head model4 Spherical-head model12 fairly simple and accurate tricky mapping to anthropometric features easy mapping to anthropometric features not so accurate

slide-17
SLIDE 17

Background & Previous Work

Techniques for estimating individualized ITDs

Anthropometry-based model based on spatial variation of ITD7-10

References correspond to those in paper 9447 on which this talk is based

slide-18
SLIDE 18

Background & Previous Work

Summary

References correspond to those in paper 9447 on which this talk is based

slide-19
SLIDE 19

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-20
SLIDE 20

Spherical-head model for ITD estimation

Woodworth & Schlosberg formula11 Introduction

References correspond to those in paper 9447 on which this talk is based

Spherical-head (Top View)

sound source

is speed of sound

slide-21
SLIDE 21

Spherical-head model for ITD estimation

Model formulation

slide-22
SLIDE 22

Spherical-head model for ITD estimation

Model formulation

Fit single optimized head radius per listener for many listeners

slide-23
SLIDE 23

Spherical-head model for ITD estimation

Model formulation

measured anthropometric data

width depth height

Fit single optimized head radius per listener for many listeners

slide-24
SLIDE 24

Spherical-head model for ITD estimation

Model formulation

measured anthropometric data

width depth height

Fit Fit single optimized head radius per listener for many listeners

slide-25
SLIDE 25

Spherical-head model for ITD estimation

Model formulation

measured anthropometric data

width depth height

Fit Fit single optimized head radius per listener for many listeners individualized

substitute for

slide-26
SLIDE 26

Spherical-head model for ITD estimation

Limitations

▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

45 90 135 180 325 350 375 400 425 450 Elevation (deg.) Mean ITD (μs)

Azimuth = 45°

Measured

slide-27
SLIDE 27

Spherical-head model for ITD estimation

Limitations

▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

  • 45

90 135 180 325 350 375 400 425 450 Elevation (deg.) Mean ITD (μs)

Azimuth = 45°

Measured Spherical-head model

slide-28
SLIDE 28

Spherical-head model for ITD estimation

Limitations

slide-29
SLIDE 29

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-30
SLIDE 30

Proposed extension to spherical-head model

Woodworth & Schlosberg formula11 Introduction

References correspond to those in paper 9447 on which this talk is based

Spherical-head (Top View)

sound source

slide-31
SLIDE 31

Proposed extension to spherical-head model

Woodworth & Schlosberg formula11 Introduction Formula for estimating low-frequency ITD1

References correspond to those in paper 9447 on which this talk is based

Spherical-head (Top View)

sound source

slide-32
SLIDE 32

Proposed extension to spherical-head model

Introduction

Interaural coordinate system azimuth elevation

interaural axis frontal axis

half-plane

slide-33
SLIDE 33

Model formulation

Proposed extension to spherical-head model

slide-34
SLIDE 34

Model formulation

Proposed extension to spherical-head model

Fit single optimized head radius per listener per half-plane for many listeners

performed per half-plane

slide-35
SLIDE 35

Model formulation

Proposed extension to spherical-head model

Fit single optimized head radius per listener per half-plane for many listeners

performed per half-plane

measured anthropometric data

width depth height arc length (approx.)

slide-36
SLIDE 36

Model formulation

Proposed extension to spherical-head model

Fit single optimized head radius per listener per half-plane for many listeners

performed per half-plane

measured anthropometric data

width depth height arc length (approx.)

Fit

performed per half-plane

slide-37
SLIDE 37

Model formulation

Proposed extension to spherical-head model

Fit single optimized head radius per listener per half-plane for many listeners

performed per half-plane

measured anthropometric data

width depth height arc length (approx.)

Fit individualized

substitute for performed per half-plane

slide-38
SLIDE 38

Model formulation (low-frequency ITD)

Proposed extension to spherical-head model

Fit single optimized head radius per listener per half-plane for many listeners

performed per half-plane

measured anthropometric data

width depth height arc length (approx.)

Fit individualized

substitute for performed per half-plane

slide-39
SLIDE 39

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-40
SLIDE 40

High-frequency ITD estimation accuracy

Evaluation of models

  • 80
  • 40

40 80 20 40 60 Azimuth (deg.) Mean Absolute ITD Error (μs)

Spherical-head model

High-frequency ITDs

slide-41
SLIDE 41

High-frequency ITD estimation accuracy

Evaluation of models

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • 80
  • 40

40 80 20 40 60 Azimuth (deg.) Mean Absolute ITD Error (μs)

Spherical-head model Proposed model Points offset horizontally by ±0.5° for clarity

High-frequency ITDs

slide-42
SLIDE 42

Low-frequency ITD estimation accuracy

Evaluation of models

  • 80
  • 40

40 80 80 160 240 Azimuth (deg.) Mean Absolute ITD Error (μs)

Spherical-head model

Low-frequency ITDs

slide-43
SLIDE 43

Low-frequency ITD estimation accuracy

Evaluation of models

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • 80
  • 40

40 80 80 160 240 Azimuth (deg.) Mean Absolute ITD Error (μs)

Spherical-head model Proposed model

Low-frequency ITDs

slide-44
SLIDE 44

Ability of proposed model to capture elevation

Evaluation of models

dependence of measured ITDs

▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

  • 45

90 135 180 325 350 375 400 425 450 Elevation (deg.) Mean ITD (μs)

Azimuth = 45°

Measured Spherical-head model

slide-45
SLIDE 45

Ability of proposed model to capture elevation

Evaluation of models

dependence of measured ITDs

▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

  • ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

45 90 135 180 325 350 375 400 425 450 Elevation (deg.) Mean ITD (μs)

Azimuth = 45°

Measured Spherical-head model Proposed model

High-frequency ITDs

slide-46
SLIDE 46

Contribution of anthropometric features to head radius

Proposed extension to spherical-head model

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

45 90 135 180 2 4 6 Elevation (deg.) Contribution to spherical-head radius (cm)

Head width

High-frequency ITD

slide-47
SLIDE 47

Contribution of anthropometric features to head radius

Proposed extension to spherical-head model

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • 45

90 135 180 2 4 6 Elevation (deg.) Contribution to spherical-head radius (cm)

Head width Head depth

High-frequency ITD

slide-48
SLIDE 48

Contribution of anthropometric features to head radius

Proposed extension to spherical-head model

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

45 90 135 180 2 4 6 Elevation (deg.) Contribution to spherical-head radius (cm)

Head width Head depth Pinnae mean arc length

High-frequency ITD

slide-49
SLIDE 49

Contribution of anthropometric features to head radius

Proposed extension to spherical-head model

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

45 90 135 180 2 4 6 Elevation (deg.) Contribution to spherical-head radius (cm)

Head width Head depth Pinnae mean arc length Head width

High-frequency ITD Low-frequency ITD

slide-50
SLIDE 50

Contribution of anthropometric features to head radius

Proposed extension to spherical-head model

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲ ▲

■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■ ■

  • 45

90 135 180 2 4 6 Elevation (deg.) Contribution to spherical-head radius (cm)

Head width Head depth Pinnae mean arc length Head width Head depth

High-frequency ITD Low-frequency ITD

slide-51
SLIDE 51

Outline

  • Introduction & motivation
  • Background & previous work
  • Spherical-head model for ITD estimation
  • Proposed extension to spherical-head model
  • Evaluation of models
  • Conclusions
slide-52
SLIDE 52

Conclusions

  • Spherical-head model doesn’t capture elevation

dependence of ITDs

  • Results in inaccurate estimates of ITD at large

azimuths

  • Proposed extension to spherical-head model

captures elevation dependence of ITDs

  • Results in up to 9% (47%) reduction in avg. error

when estimating high- (low-) frequency ITDs.

slide-53
SLIDE 53

Conclusions

  • Influence of anthropometric features on ITD

estimate varies based on sound source location

  • Pinnae mean arc length influences high-

frequency ITD for sources behind the listener

  • Head depth influences high-frequency ITD for

sources in front of the listener