SLIDE 1

K-shot Learning of Acoustic Context

NIPS-2017 ML4AUDIO workshop, 8-Dec 2017

Ivan Bocharov, Tjalling Tjalkens and Bert de Vries

Eindhoven University of Technology, the Netherlands Email bert.de.vries@tue.nl

SLIDE 2

Use Case / Problem Statement

SLIDE 3

ACOUSTIC MODEL SPECIFICATION – Define a generative probabilistic model for acoustic signals that contains scenes as latent states.

TRAINING
  • 1. "Representation training": unsupervised offline training on a large database of acoustic signals across many scenes
  • 2. Train new scenes: continue with supervised training on a small set of scene-labeled waveforms recorded online

CLASSIFICATION – Goal: assign future streaming acoustic data to the correct (or similar) scenes

Approach: probabilistic modeling
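The two-stage training workflow above can be sketched as a pipeline. This is a minimal illustration only: the function names are hypothetical and the stand-in "models" are simple summary statistics, whereas the real stages fit HSMM parameters.

```python
def pretrain(unlabeled_clips):
    """Stage 1 (unsupervised, offline): fit shared representation parameters
    on a large unlabeled database. Stand-in: the global mean of all samples."""
    flat = [x for clip in unlabeled_clips for x in clip]
    return sum(flat) / len(flat)

def k_shot_update(representation, labeled_clips):
    """Stage 2 (supervised, online): adapt per-scene models from a few labeled
    clips. Stand-in: each scene model is its mean offset from the shared part."""
    return {scene: sum(clip) / len(clip) - representation
            for scene, clip in labeled_clips.items()}

rep = pretrain([[0.0, 2.0], [1.0, 1.0]])            # offline stage
scene_models = k_shot_update(rep, {"office": [1.0, 1.0], "street": [3.0, 5.0]})
print(rep, scene_models)  # 1.0 {'office': 0.0, 'street': 3.0}
```

The point of the split is that stage 1 is expensive and done once, while stage 2 touches only the few per-scene parameters, which is what makes k-shot adaptation feasible.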

SLIDE 4

[Model diagram: (Mixture of) Hidden Semi-Markov Models (HSMM) – small samples, hierarchically structured, with duration modeling]

  • scenes ("classes"): $d = 1, \dots, D$
  • segments (s): durations $e_1, \dots, e_L$, grouping the observations as $y_{1:e_1},\ y_{e_1+1:e_1+e_2},\ \dots,\ y_{\sum_{j=1}^{L-1} e_j + 1 \,:\, \sum_{j=1}^{L} e_j}$
  • features: 60 MFCCs per 40 ms frame, 20 ms hop
  • parameters: $A_0, A_1, \dots, A_L$ and $\iota$
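The feature settings on the slide (40 ms frames with a 20 ms hop) fix how many 60-dimensional MFCC vectors a clip yields; a quick sketch of that arithmetic, using the 30 s example length from the data-preparation slide:

```python
def num_frames(clip_s: float, frame_s: float = 0.040, hop_s: float = 0.020) -> int:
    """Count full frames of length frame_s starting every hop_s in a clip_s-second clip."""
    if clip_s < frame_s:
        return 0
    # round() guards against floating-point error in the division
    return int(round((clip_s - frame_s) / hop_s)) + 1

frames = num_frames(30.0)  # one 30 s example
print(frames)              # 1499 frames, each a 60-dim MFCC vector
```

So a single k-shot example already provides on the order of 1500 feature vectors for the HSMM to segment.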

SLIDE 5

[Equations on slide: dynamics, parameters, class prior, generative model]

SLIDE 6
  • Collected by Tampere University of Technology
  • 15 acoustic scenes
  • ~40 min of audio per class

Data Preparation
  • Data set 1: draw one example (30 s) from each of 11 randomly chosen scenes
  • Data set 2: draw one example from each of the remaining 4 classes
  • Classify: test on the remaining examples of data set 2

Data set: TUT Acoustic Scenes 2016
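The split described above can be sketched as follows. The scene names are hypothetical placeholders, and drawing a 30 s example per scene is abstracted away; only the 11-vs-4 class split from the slide is modeled.

```python
import random

def split_scenes(scenes, n_set1=11, seed=0):
    """Randomly pick n_set1 scenes for data set 1; the rest form data set 2
    (here: 11 of the 15 TUT scenes vs. the remaining 4 classes)."""
    rng = random.Random(seed)  # fixed seed for a reproducible split
    set1 = rng.sample(scenes, n_set1)
    set2 = [s for s in scenes if s not in set1]
    return set1, set2

scenes = [f"scene_{i:02d}" for i in range(15)]  # 15 acoustic scenes (names illustrative)
set1, set2 = split_scenes(scenes)
print(len(set1), len(set2))  # 11 4
```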

SLIDE 7

[HSMM model diagram repeated from SLIDE 4: scenes $d = 1, \dots, D$, segment durations $e_1, \dots, e_L$, MFCC features, parameters $A_0, A_1, \dots, A_L, \iota$]

Step 1: Train Duration Models


SLIDE 9

Duration distributions (initialization: Pois(·))
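The duration models are initialized as Poisson distributions, i.e. the prior probability of a segment lasting $d$ frames is $p(d \mid \lambda) = \lambda^d e^{-\lambda} / d!$. A minimal sketch of that pmf (the rate of 5 frames is illustrative, not a value from the slides):

```python
import math

def poisson_pmf(d: int, lam: float) -> float:
    """P(duration = d) under Pois(lam): lam**d * exp(-lam) / d!"""
    return lam ** d * math.exp(-lam) / math.factorial(d)

# Probabilities over the first 20 durations for an illustrative rate of 5 frames:
probs = [poisson_pmf(d, 5.0) for d in range(20)]
print(sum(probs))  # close to 1: nearly all mass lies below 20 frames
```

Training then moves these initial duration distributions toward the empirical segment lengths of each scene, as the next slide shows.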

SLIDE 10

Duration distributions (after training)

SLIDE 11

[HSMM model diagram repeated from SLIDE 4: scenes $d = 1, \dots, D$, segment durations $e_1, \dots, e_L$, MFCC features, parameters $A_0, A_1, \dots, A_L, \iota$]

Step 2: One-shot Training


SLIDE 13

[HSMM model diagram repeated from SLIDE 4, with a "?" marking the unlabeled test waveform to be assigned to a scene]

Classification
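Classification assigns a test waveform to the scene whose trained model scores it highest. A hedged sketch of that decision rule, with stand-in scoring functions (a real system would evaluate each trained HSMM's log-likelihood here):

```python
def classify(features, log_likelihood_by_scene):
    """Pick the scene whose model assigns the test features the highest log-likelihood."""
    scores = {scene: ll(features) for scene, ll in log_likelihood_by_scene.items()}
    return max(scores, key=scores.get)

# Stand-in scorers: negative squared distance to a per-scene "prototype" value.
models = {
    "office": lambda x: -sum((xi - 0.0) ** 2 for xi in x),  # prefers values near 0
    "street": lambda x: -sum((xi - 1.0) ** 2 for xi in x),  # prefers values near 1
}
print(classify([0.9, 1.1, 1.0], models))  # street
```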

SLIDE 14

Results

SLIDE 15
  • Ongoing research on in-situ one-shot learning of a personalized acoustic scene classifier
  • Use case is hearing aid personalization, but also applicable to urban monitoring, elderly care, etc.
  • Generative modeling approach, inspired by the one-shot learning work of (a.o.) Brendan Lake et al. (2014) and Matthew Johnson et al. (2013)
  • An HSMM-based probabilistic classifier shows promising performance on a one-shot learning task compared to 1NN-DTW.
  • Specifically, learned priors for the segment duration model parameters help the classifier recognize new classes from a single example.
  • Future work includes more thorough analysis and exploration of competing models.

Summary and Future Plans

SLIDE 16

Thank you

  • Matthew Johnson et al. for the pyhsmm package (https://github.com/mattjj/pyhsmm)

Acknowledgements