Missing-data methods with cepstral data Summary of work done at - PDF document

morris,bourlard@idiap.ch http://www.idiap.ch/ Missing-data methods with cepstral data ❇ Summary of work done at IDIAP ❇ Main achievements ● RESPITE meeting, 7-8 June 2002, Page 1

DUMA - Data Utility Maps from MLPs ❇ • Can’t use noise estimation to generate data utility maps for multi-condition models - noisy data may be “clean”. • Can’t assume that mismatching data are outliers. (see Fig.) clean clean SNR 15 SNR 15 SNR 5 SNR 5 −200 −100 0 −200 −100 0 log obs prob, multi condition data models log obs prob, clean data models Fig shows log prob histograms for clean, SNR 15 and SNR 5 dB N1 (subway) MFCC_E_D_A data, left for clean models, right for multicondition models. Probabilities increase with noise level, rather than decrease as might be expected. • Could the entropy of an MLP classifier trained on a small window about a data point tell us something about its utility? RESPITE meeting, 7-8 June 2002, Page 2

MLP state confusion characteristics four seven sil six three two one eight five oh sp nine zero four seven sil six three two one eight five oh sp nine zero 20 20 40 40 60 60 80 80 100 100 120 120 140 140 160 160 180 180 20 40 60 80 100 120 140 160 180 20 40 60 80 100 120 140 160 180 data clean, models clean data clean, models frame error rate 62.6% multicondition four seven sil six three two one eight five oh sp nine zero four seven sil six three two one eight five oh sp nine zero 20 20 40 40 60 60 80 80 100 100 120 120 140 140 160 160 180 180 20 40 60 80 100 120 140 160 180 20 40 60 80 100 120 140 160 180 data SNR 5, models clean data SNR 5, models frame error rate 88.7% multicondition For clean models, “eight” is attractor. For multicond. models ‘sil’ and ‘sp’ act as attractor noise models. RESPITE meeting, 7-8 June 2002, Page 3

Local MLP state confusion characteristics 10 60 40 5 20 0 20 40 60 80 100 120 140 1 30 20 0.5 10 20 40 60 80 100 120 140 1 30 20 0.5 10 0 20 40 60 80 100 120 140 1 30 20 0.5 10 0 20 40 60 80 100 120 140 1 30 20 0.5 10 0 20 40 60 80 100 120 140 Top = FBANK_D coeffs. Down from top are DU masks for clean, SNR 20, 10 & 0 dB. Utterance is MAH_139OA. Masks based on confidence-matrix corrected MLP output entropies. Max and median observed corrected entropy values mapped to 0 and 1. Left are conf. mats for the 6 subband MLPs. RESPITE meeting, 7-8 June 2002, Page 4

Missing-data methods with cepstral data When log spectral data have evidence pdf obs obs ( ) = ϕ i δ xi ( – ) + ( 1 – ϕ i ) unif 0 xi ( , ) u xi xi the evidence pdf for any linear function of this data can be obtained, and has the same form. 20 20 15 10 10 5 0 20 40 60 80 100 120 140 160 20 20 15 15 10 5 10 20 40 60 80 100 120 140 160 1 20 15 0.5 10 5 0 20 40 60 80 100 120 140 160 1 20 15 0.5 10 5 0 20 40 60 80 100 120 140 160 90 60 40 80 20 20 40 60 80 100 120 140 160 Top=clean fbank (power), 2=SNR0 fbank, 3=oracle MD mask, 4=simple MD mask, 5= multi-cepstral intervals for hard MD mask. Signal = FAK_3Z82A, noise = N1 (subway). RESPITE meeting, 7-8 June 2002, Page 5

CDPP - Clean Data PDF Propagation Intervals of uncertainty for cepstral coeffs are extremely wide everywhere except where almost whole spectral frame clean. Can reduce problem by appending subband cepstral features. Recognition still bad unless intervals somehow scaled down. Can obtain much tighter cepstral pdf by deriving clean speech log spec. energy pdf directly from noise spectral energy pdf. In this case the “oracle” would 100% restore clean data. General formula for pdf of function y=f(x) of rand variable x. If g(x) = f -1 (x) is monotonic, then ( ) = ( ( ) ) g ' y ( ) py y px g y For noise energy pdf ( ) the resulting evidence pdf for pn x clean log speech energy is x clean pn e x obs x clean   ( ) = – u x e e   This has strong squashing effect on noise pdf. • Results so far do not better clean cepstral baseline - except for 0.5% on clean speech (though at 98.83% acc, this is still a 60.7% decrease in WER, or 66.0% decrease in WIL). • May (or may not!) improve over “max assumption” for MD with spectral data. RESPITE meeting, 7-8 June 2002, Page 6

Summary of work done at IDIAP ❇ 1999 FCMB Full Combination Multi-Band IDCN Incomplete Data Classifier Network 2000 MLCW ML (etc.) Combination Weighting techniques FCMS Full-Combination Multi-Stream 2001 MPFC M A P Full Combination TRUD Theory for Recognition with Uncertain Data 2002 MSTK MultiStream ToolKit DUMA Data Utility MAps CDPP Clean data P DF Propagation RESPITE meeting, 7-8 June 2002, Page 7

1999 FCMB Full Combination MultiBand 2 d ∑ ; ( ) ≅ ( ) P q k x c i Θ i ( ) P q k x P c i x MLP = 1 i FE MLP Decoder hello? MLP FE Separate MLPs estimate state posteriors for every MLP combination of sub-bands Combination Expert Weights IDCN Incomplete Data Classifier Network = y j x ( ) p x r j ( ) , ( ) = ( ) input x z k x P s k x x 1 y 1 z 1 ... ... ... x i y j z k ... ... ... x n x y n y z n z FFT IDCN HMM hello? RESPITE meeting, 7-8 June 2002, Page 8

2000 MLCW ML (etc.) Combination Weighting techniques Band 1 Band 1 0.5 0.7 Band 2 Band 2 Band 3 Band 3 0.6 Band 4 Band 4 0.4 Weights Weights 0.5 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 1 5 10 15 20 25 1 5 10 15 20 25 Phonemes Phonemes Clean speech Noise in band 3 FCMS Full-Combination Multi-Stream Combine multiple complementary sources of speech information • short term spectrum (10 ms) • difference features (50 ms) • amplitude modulation spectrum (100-500 ms) • visual features (mouth shape) • different features at each scale (FFT, MFC, LPC, PLP) RESPITE meeting, 7-8 June 2002, Page 9

2001 MPFC M A P Full Combination For expert weights static, MAP FC weights give weight 1 to expert with highest MAP score for each utterance. Tests with static + diff ftrs showed strong % improvement MLP HMM decoder FE MLP HMM decoder max hello? FE MLP HMM decoder Priors HMM decoder TRUD Theory for Recognition with Uncertain Data = ( ) X argmaxQE P Q X Θ [ , ∼ ( ) ] QMAP s X ∫ d = arg QP Q Θ ( ) p X Q Θ ( , ) u X ( ) ( ) max X For soft missing-data with “max assumption” obs obs – – ( ) = ϕ i δ xi ( ) + ( 1 ϕ i ) unif 0 xi ( , ) u xi xi RESPITE meeting, 7-8 June 2002, Page 10

2002 MSTK MultiStream ToolKit MDDM Missing-Data with Duration Models DUMA Data Utility MAps from MLPs CDPP Clean data P DF Propagation Main achievements ❇ • Not useful: IDCN, MLCW • Maybe useful in future: TRUD, DUMA, CDPP • Useful: FCMB / FCMS, MPFC, MSTK • Integration of missing-data with multi-stream methods ● RESPITE meeting, 7-8 June 2002, Page 11

Missing-data methods with cepstral data Summary of work done at - PDF document

morris,bourlard@idiap.ch http://www.idiap.ch/ Missing-data methods with cepstral data Summary of work done at IDIAP Main achievements RESPITE meeting, 7-8 June 2002, Page 1 DUMA - Data Utility Maps from MLPs Cant

Missing Data and Imputation NINA ORWITZ OCTOBER 30 TH , 2017 Outline Types of missing data

Multiple Imputation for Missing Data in KLoSA Juwon Song Korea University and UCLA Contents 1.

Missing Values in SAS Magnus Mengelbier Director PhUSE 2011 1 Topics Introduction

Feature Extraction Combining Feature Extraction Combining Spectral Noise Reduction and Spectral

Cepstral analysis in speech processing From speech production model, we have: s[n] = (p[n]*g[n] +

Searching for and replacing missing values Nicholas Tierney Statistician DataCamp Dealing With

Bayesian Generalized linear mixed models with data missing not at random Overview: Two simple

Whats Missing? SOCI 101 November 29, 2011 SOCI 101 () Whats Missing? November 29, 2011

Advances in ML: Theory Meets Practice Julie Josse Review on Missing Values Methods with Demos

Missing data and data imputation with the Swiss Household Panel Andr Berchtold LIVES, LINES,

PS 406 Week 8 Section: Panel Methods and Missing Data D.J. Flynn May 21, 2014 D.J. Flynn

Dealing with missing values part 1 Applied Multivariate Statistics Spring 2012 Overview

Dealing with missing values part 1 Applied Multivariate Statistics Spring 2013 Overview

Performing and tracking imputation Nicholas Tierney Statistician DataCamp Dealing With Missing

Estimating Gaussian Mixture Models from Data with Missing Features by Daniel McMichael CSSIP

Methods for Handling Missing Data Joseph Hogan Brown University MDEpiNet Conference Workshop

Implied Volatility, Fundamental solutions, asymptotic analysis and symmetry methods, Linz, Ricam

Synthesis of Ranking Functions and Synthesis of Inductive Invariants and Synthesis of

Singular Perturbations in Stochastic Control and Hamilton-Jacobi-Bellman Equation Hicham Kouhkouh

Buffer Overflows and Defenses CS 419: Computer Security Lecture 10 and 11 Buffer Overflow a

Getting Unemployed Youth into Jobs: Lessons for New Zealand from Jobs: Lessons for New Zealand

CSI5180. MachineLearningfor BioinformaticsApplications Fundamentals of Machine Learning

Taming Reluctant Random Walks In The Positive Quadrant 2 , Marni Mishna 2 , and Yann

Session 5 Software and Operating Systems Security Sbastien Combfis Fall 2019 This work is

Sambuz

Useful Links

Newsletter

Mail Us