type of artificial neural network techniques Amir Shokri - - PowerPoint PPT Presentation

type of artificial neural
SMART_READER_LITE
LIVE PREVIEW

type of artificial neural network techniques Amir Shokri - - PowerPoint PPT Presentation

Fast auralization using radial basis functions type of artificial neural network techniques Amir Shokri Amirsh.nll@gmail.com a b s t r a c t This work presents a new technique to produce fast and reliable auralizations with a computer code


slide-1
SLIDE 1

Amir Shokri Amirsh.nll@gmail.com

Fast auralization using radial basis functions type of artificial neural network techniques

slide-2
SLIDE 2

This work presents a new technique to produce fast and reliable auralizations with a computer code for room acoustics simulation. It discusses the binaural room impulse responses generation classic method and presents a new technique using radial basis functions type of artificial neural networks. The radial basis functions type of artificial neural networks is briefly presented and its training and testing proce-dures are

  • discussed. The artificial neural network models the filtered head-related impulse responses for 64,442

directions uniformly distributed around the head with a significant reduction in computa-tional cost of around 90% in the generation of binaural impulse responses. It is shown that the filtered head-related impulse responses calculated with the classical convolution method and with the artificial neural network technique are almost indistinguishable. It is concluded that the new technique produces fastest and reliable binaural room impulse responses for auralization purposes.

a b s t r a c t

slide-3
SLIDE 3

This work deals with room acoustics computer simulation and its techniques to generate auralization at selected seats in the room. In general, the room acoustics simulation follows the requirements of geometrical acoustics. This means that the sound waves can be treated as acoustic rays that leave the sound source and propagate in the room, reflecting and refracting on their internal surfaces. There are two main ways of modeling acous-tic rays: the ray-tracing method and the image source method. There are also hybrid algorithms that use the image source method for the calculation of first specular reflections and the ray-tracing method for the calculus of the remaining ones. However, as already pointed out by several authors, diffuse reflection plays an important role in room acoustics, providing a greater uniformity in the sound field. Having in mind the room’s auralization, the diffuse reflections are also fundamental, leading to greater authenticity. In this case, it seems essential to have a good model to deal with diffuse reflections, since the ray-tracing technique cannot handle properly. One of the ways to approxi-mately model diffuse reflections is the radiosity technique.

Introduction

slide-4
SLIDE 4

○ Radial basis functions type of artificial neural network ○ Training and testing the artificial neural network set ○ Fast auralization with ann technique ○ Computational cost ○ Comparative results for filtered HRIRs ○ Conclusion remarks

Index

slide-5
SLIDE 5

An artificial neural network (ANN) is an information processing system based on simplified mathematical models of biological neurons whose learning process results from experience. The knowledge gained by the network through the examples are stored in the form of connection synaptic weights that are adjusted in order to make the correct decisions when presented to new entries. In other words, the network has the ability to generalize the learned information. The process of adjusting synaptic weights is performed by the learning algorithm. Artificial neural networks are useful tools for solving many types of problems as, for instance, classification, grouping, optimization, approximation and forecast-ing. One of the main applications of ANNs is on pattern recognition, and this is the application under consideration here: the ANNs are trained to learn the HRIRs patterns.

Radial basis functions type of artificial neural network

slide-6
SLIDE 6

Fig 1

slide-7
SLIDE 7

Fig 2

slide-8
SLIDE 8

The RBF parameters were calculated as follows: first the centers are obtained using the non-supervised K- means algorithm. Once the centers have been calculated, the widths are deter-mined. Finally, after defining the parameters of the radial func-tions, the free parameters of the output layer are computed using the same procedures that are used for the output layer of other types of neural networks.

Training and testing the artificial neural network set

slide-9
SLIDE 9

Fig 3

slide-10
SLIDE 10

Fig 4

slide-11
SLIDE 11

Fig 5

slide-12
SLIDE 12

Once the acoustic field in the room simulation is completed, the goal in sequel consists in the determination of the room impulse mono (RIRs) and binaural (BRIRs) responses at selected points. As regards the calculation of RIRs, it is about converting the energy arrival, via Hilbert’s transform [48] and filtering in octave bands, obtaining filtered impulse responses, whose computational cost is relatively

  • small. In order to compute the BRIRs, however, it is necessary to take into account the head-related

impulse responses (HRIRs) – or their corresponding in frequency domain, the so-called head-related transfer functions (HRTFs). In the computa-tional codes that generate auralization, this is usually done via the convolution procedure.

Fast auralization with ann technique

slide-13
SLIDE 13

Fig 6

slide-14
SLIDE 14

In order to examine the convolution method (CM) numerical efficiency and that of the artificial neural network method (ANNM), a comparison is made as to the number of arithmetic operations that each technique requires. The number of operations in the convolution method to com-pute the filtered HRIRs equals the sum of two

  • parts. The first one corresponds to the number of multiplications between the ray spectrum (in octave

bands) and the HRTF of the considered direc-tion. Note, however, that due to the HRTF symmetry, only l/2 prod-ucts, with l being the number of samples, are necessary. The second one corresponds to the number

  • f operations for calculating the inverse Fourier transform.

Computational cost

slide-15
SLIDE 15

Fig 7

slide-16
SLIDE 16

Fig8

slide-17
SLIDE 17

Table 1

slide-18
SLIDE 18

As mentioned, the convolution technique is the classic BRIR generation method and it is present in almost all acoustic field sim-ulation software with auralization, to our knowledge. Therefore, in order to verify the reliability of the method of generating BRIRs with artificial neural networks of the radial basis function type, a comparison between the two methods is presented in the sequel. Since, once the filtered HRIRs are generated, the procedure is iden-tical, involving the delays and sum to generate the BRIRs, the com- parison between the two methods will be done among the filtered HRIRs. In other words, since the procedures of delay and sum of the filtered HRIRs are exactly the same in the two techniques, if the filtered HRIRs computed by both techniques are almost identical, the resulting BRIR will be also the same.

Comparative results for filtered HRIRs

slide-19
SLIDE 19

Fig 9 & 10

slide-20
SLIDE 20

List of references: [1] Kuttruff H. Room acoustics. 5th ed. London: Spon Press; 2009. [2] Savioja L, Svensson UP. Overview of geometrical room acoustic modeling techniques. J Acoust Soc Am 2015;138(2):708–30. [3] Ondet M, Barbry JL. Modeling of sound propagation in fitted workshops using ray

  • tracing. J Acoust Soc Am 1989;85(2):787–96.

[4] Borish J. Extension of the image model to arbitrary polyhedra. J Acoust Soc Am 1984;75:1827. [5] Vorländer M. Simulation of the transient and steady-state sound propagation in rooms using a new combined ray-tracing/image-source algorithm. J Acoust Soc Am 1989;86(1):172–8.

Refrences

slide-21
SLIDE 21

List of references: [6] D’Antonio P, Cox TJ. Diffusor application in rooms. Appl Acoust 2000;60:113–42. [7] Cox TJ, Dalenbäck BI, D’Antonio P, Embrechts JJ, Jeon JY, Mommertz E, et al. A tutorial

  • n scattering and diffusion coefficients for room acoustic surfaces. Acta Acustica United

Acustica 2006;92(1):1–15. [8] Alarcão D, Bento Coelho JL, Tenenbaum RA, On modeling of room acoustics by a sound energy transition approach. In: Proceedings of EEA Symposium on Architectural Acoustics, 2000. [9] Tenenbaum RA, Camilo TS, Torres JCB, Gerges SNY. Hybrid method for numerical simulation of room acoustics: part 1 – theoretical and numerical aspects. J Braz Soc Mech Sci Eng 2007;29(2):211–21.

Refrences

slide-22
SLIDE 22

List of references: [10] Schröeder M, Digital computers in room acoustics. In: Proc. 4th ICA, Copenhagen, 1962. [11] Blauert J, Lehnert H, Pompetzki W, Xiang N. Binaural room simulation. Acustica 1990;72:295–6. [12] Ahnert W, Feistel R, Binaural auralization from a sound system simulation programme. In: Proc. 91th AES Convention, New York, 1991. [13] Lehnert H, Blauert J. Principles of binaural room simulation. Appl Acoust 1992;36:259. [14] Møller H. Fundamentals of binaural technology. Appl Acoust 1992;36:171. [15] Vian J-P, Martin J. Binaural room acoustics simulation: practical uses and applications. Appl Acoust 1992;36:293.

Refrences

slide-23
SLIDE 23

List of references: [16] Kleiner M, Dalenbäck B-I, Svensson P. Auralization – an overview. J Audio Eng Soc 1993;41:861. [17] Begault D. 3-D sound for virtual reality and multimedia. Cambridge: Academic Press Professional; 1994. [18] Dalenbäck B-I, McGrath D. Narrowing the gap between virtual reality and auralization. In: Proc. 15th ICA, Trondheim, 1995. [19] Sottek R, Virtual binaural auralization of product sound quality: importance and application in practice. Proc. EURONOISE, Naples, 2003. [20] Rindel JH, Otondo F, Christensen CL. Sound source representation for auralization. In:

  • Proc. Int. Symp. on Room Acoust., Awaji, 2004.

Refrences

slide-24
SLIDE 24

List of references: [21] Torres JCB, Petraglia MR, Tenenbaum RA. An efficient wavelet-based HRTF for

  • auralization. Acta Acustica United Acustica 2004;90:108.

[22] Otondo F, Rindel JH. A new method for the radiation representation of musical instruments in auralizations. Acta Acustica United Acustica 2005;91:902. [23] Dalenbäck B-I, Strömberg M. Real time walkthrough auralization – the first year. In: Proc. IOA Spring Conference, Copenhagen, 2006. [24] Summers JE. What exactly is meant by the term ‘auralization?’. J Acoust Soc Am 2008;124(2):697. [25] Dalenbäck B-I. A new model for room acoustics prediction and auralization Doctoral

  • thesis. Gothenburg: Chalmers University; 1995.

Refrences

slide-25
SLIDE 25

List of references: [26] Hammershøi D. Binaural technique – a method of true 3-D sound reproduction. Doctoral thesis. Denmark: Aalborg University; 1995. [27] Savioja L. Modelling techniques for virtual acoustics Doctoral thesis. Finland: Helsinki University of Technology; 1999. [28] Lokki T. Physically-based auralization – design implementation and evaluation Doctoral

  • thesis. Finland: Helsinki University of Technology; 2002.

[29] Torres JCB. Efficient auralization system using wavelet transforms Doctoral thesis. Brazil: Federal University of Rio de Janeiro; 2004.

Refrences

slide-26
SLIDE 26

List of references: [30] Schröeder D. Integration of real-time room acoustical simulations in VR environments Doctoral thesis. Germany: RWTH Aachen University; 2004. [31] Thaden R. Auralization in building acoustics Doctoral thesis. Germany: RWTH Aachen University; 2005. [32] Naranjo JFL. Machine learning applied in the generation of binaural room impulse responses and in auralization in rooms Doctoral thesis. State University of Rio de Janeiro; 2014. [33] Taminato FO. Artificial neural networks applied to model head-related impulse responses to generate auralization Ph.D. Thesis. State University of Rio de Janeiro; 2018.

Refrences

slide-27
SLIDE 27

List of references: [34] Savioja L, Xiang N. Introduction to the special issue on room acoustic modeling and

  • auralization. J Acoust Soc Am 2019;145(4):2597–600.

[35] Brinkmann F, Aspöck L, Ackermann D, Lepa S, Vorländer M, Weinzierl S. A round robin

  • n room acoustical simulation and auralization. J Acoust Soc Am 2019;145(4):2746–60.

[36] Lindau A, Erbes V, Lepa S, Maempel HJ, Brinkmann F, Weinzierl S. A spatial audio quality inventory for virtual acoustic environments (SAQI). Acta Acustica United Acustica 2014;100(5):984–94. [37] Peng J. Feasibility of subjective speech intelligibility assessment based on auralization. Appl Acoust 2005;66:591–601.

Refrences

slide-28
SLIDE 28

List of references: [38] Hodgson M, York N, Yang W, Bliss M. Comparison of predicted, measured and auralized sound fields with respect to speech intelligibility in classrooms using CATT- Acoustic and ODEON. Acta Acustica United Acustica 2008;94(6):883–90. [39] Tenenbaum RA, Taminato FO, Melo VSG, Torres JCB. Auralization generated by modeling HRIRs with artificial neural networks and its validation using articulation tests. Appl Acoust 2018;130:260–9. [40] Haykin S. Neural networks and learning machines. 3rd ed. New Jersey: Prentice Hall; 2009. [41] Mulgrew B. Applying radial basis functions. IEEE Signal Process Mag 1996:50–65.

Refrences

slide-29
SLIDE 29

List of references: [42] Bishop C. Neural networks for pattern recognition. Oxford: Oxford University Press; 2005. [43] Broomhead DS, Lowe D. Multivariable functional interpolation and adaptive networks. Complex Syst 1988;2:321–55. [44] Moody J, Darken CJ. Fast learning in networks of locally-tuned processing units. Neural Comput 1989;1:281–94. [45] Mulgrew B. Applying radial basis functions. IEEE Signal Process Mag 1996;50–65. [46] Chen S, Muldrew B, Mclaughlin S, Adaptative bayesian feedback equalizer based on a radial basis function network. IEEE International Conference on Communications, Chicago, 3, pp. 1267–1271, 1992.

Refrences

slide-30
SLIDE 30

List of references: [47] Brinkmann F., Lindau A., Weinzierl S., Geissler G., van de Par S., Müller-Trapet M., Opdam R., Vorländer M. The FABIAN head-related transfer function data base. (2017), available at http://dx.doi.org/10.14279/depositonce-5718.2 (Last viewed September, 2018). [48] Kak S. Number theoretic Hilbert transform. Circ Syst Signal Process 2014;33:2539–48. [49] Blauert J. Spatial hearing. Cambridge: The MIT Press; 1997. [50] Willcox RR. Introduction to robust estimation and hypothesis testing. New York: Academic Press; 2005. [51] Proakis JG, Manolakis DG. Digital signal processing: principles, algorithms and

  • applications. 3rd ed. New Jersey: Prentice-Hall; 1996.

Refrences

slide-31
SLIDE 31

List of references: [52] Conte SD, Boor C. Elementary numerical analysis: an algorithmic approach. 3rd ed. New York: McGraw-Hill; 1980. [53] Xie B, Zhong X, He N. Typical data and cluster analysis on head-related transfer

  • functions. Appl Acoust 2015;94:1–13.

Refrences