K-shot Learning of Acoustic Context
NIPS-2017 ML4AUDIO workshop, 8-Dec 2017
Ivan Bocharov, Tjalling Tjalkens and Bert de Vries
Eindhoven University of Technology, the Netherlands Email bert.de.vries@tue.nl
K-shot Learning of Acoustic Context Ivan Bocharov, Tjalling - - PowerPoint PPT Presentation
K-shot Learning of Acoustic Context Ivan Bocharov, Tjalling Tjalkens and Bert de Vries Eindhoven University of Technology, the Netherlands Email bert.de.vries@tue.nl NIPS-2017 ML4AUDIO workshop, 8-Dec 2017 Use Case / Problem Statement
NIPS-2017 ML4AUDIO workshop, 8-Dec 2017
Eindhoven University of Technology, the Netherlands Email bert.de.vries@tue.nl
5
∫
scenes (“classes”) ∫
∫
∫
𝐿
𝑒𝑗
𝑦 𝑗=1
𝐿−1 𝑒𝑗+1
features
(60 MFCC per 40 ms 20 ms hop)
segments (s)
samples small, hierarchically structured, with duration modeling
University of Technology
Data Preparation
(30secs) from each of 11 randomly chosen scenes
from remaining (4) classes.
examples of data set 2
9
∫
scenes ∫
∫
∫
𝐿
𝑒𝑗
features (MFCC)
segments
samples
∫
scenes ∫
∫
∫
𝐿
𝑒𝑗
features (MFCC)
segments
samples
14
15
scenes ∫
∫
∫
𝐿
𝑒𝑗
features (MFCC)
segments
samples
scenes ∫
∫
∫
𝐿
𝑒𝑗
features (MFCC)
segments
samples
scenes ∫
∫
∫
𝐿
𝑒𝑗
features (MFCC)
segments
samples
21
acoustic scene classifier
urban monitoring, elderly care, etc.
performance on one-shot learning task compared to 1NN-DTW.
helps the classifier to recognize new classes from a single example.
competing models.
22