Latent Variable Models and Signal Separation Class 13. 11 Oct 2012 - PowerPoint PPT Presentation

11-755 Machine Learning for Signal Processing Latent Variable Models and Signal Separation Class 13. 11 Oct 2012 11-755 MLSP: Bhiksha Raj

Sound separation and enhancement  A common problem: Separate or enhance sounds  Speech from noise  Suppress “bleed” in music recordings  Separate music components..  A popular approach: Can be done with pots, pans, marbles and expectation maximization  Probabilistic latent component analysis  Tools are applicable to other forms of data as well.. 11-755 MLSP: Bhiksha Raj

Sounds – an example A sequence of notes  Chords from the same notes  A piece of music from the same (and a few additional) notes  3

Sounds – an example  A sequence of sounds  A proper speech utterance from the same sounds 4

Template Sounds Combine to Form a Signal  The individual component sounds “combine” to form the final complex sounds that we perceive  Notes form music  Phoneme-like structures combine in utterances  Sound in general is composed of such “building blocks” or themes  Which can be simple – e.g. notes, or complex, e.g. phonemes  Our definition of a building block: the entire structure occurs repeatedly in the process of forming the signal  Claim: Learning the building blocks enables us to manipulate sounds 5

The Mixture Multinomial 5 5 5 5 5 1 5 1 4 4 2 3 3 2 3 3 2 2 1 1 6 6 6 6  A person drawing balls from a pair of urns  Each ball has a number marked on it  You only hear the number drawn  No idea of which urn it came from  Estimate various facets of this process.. 11-755 MLSP: Bhiksha Raj

More complex: TWO pickers 6 4 1 5 3 2 2 2 … 1 1 3 4 2 1 6 5 5 5 5 5 1 5 1 4 4 2 3 3 2 3 3 2 2 1 1 6 6 6 6  Two different pickers are drawing balls from the same pots  After each draw they call out the number and replace the ball  They select the pots with different probabilities  From the numbers they call we must determine  Probabilities with which each of them select pots  The distribution of balls within the pots 11-755 MLSP: Bhiksha Raj

Solution 6 4 1 5 3 2 2 2 … 1 1 3 4 2 1 6 5 5 5 5 5 1 5 1 4 4 2 3 3 2 3 3 2 2 1 1 6 6 6 6  Analyze each of the callers separately  Compute the probability of selecting pots separately for each caller  But combine the counts of balls in the pots!! 11-755 MLSP: Bhiksha Raj

Recap with only one picker and two pots Called P(red|X) P(blue|X) Probability of Red urn:  6 .8 .2 P(1 | Red) = 1.71/7.31 = 0.234  4 .33 .67 P(2 | Red) = 0.56/7.31 = 0.077  5 .33 .67 P(3 | Red) = 0.66/7.31 = 0.090  1 .57 .43 P(4 | Red) = 1.32/7.31 = 0.181 2 .14 .86  P(5 | Red) = 0.66/7.31 = 0.090 3 .33 .67  4 .33 .67 P(6 | Red) = 2.40/7.31 = 0.328  5 .33 .67 Probability of Blue urn:  2 .14 .86 P(1 | Blue) = 1.29/11.69 = 0.122  2 .14 .86 P(2 | Blue) = 0.56/11.69 = 0.322 1 .57 .43  4 .33 .67 P(3 | Blue) = 0.66/11.69 = 0.125  3 .33 .67 P(4 | Blue) = 1.32/11.69 = 0.250  4 .33 .67 P(5 | Blue) = 0.66/11.69 = 0.125  6 .8 .2 P(6 | Blue) = 2.40/11.69 = 0.056  2 .14 .86 1 .57 .43  P(Z=Red) = 7.31/18 = 0.41 6 .8 .2  P(Z=Blue) = 10.69/18 = 0.59 7.31 10.69 11-755 MLSP: Bhiksha Raj 23

Two pickers  Probability of drawing a number X for the first picker:  P 1 (X) = P 1 (red)*P(X|red) + P 1 (blue)*P(X|blue)  Probability of drawing X for the second picker  P 2 (X) = P 2 (red)*P(X|red) + P 2 (blue)*P(X|blue)  Note: P(X|red) and P(X|blue) are the same for both pickers  The pots are the same, and the probability of drawing a ball marked with a particular number is the same for both  The probability of selecting a particular pot is different for both pickers  P 1 (X) and P 2 (X) are not related 11-755 MLSP: Bhiksha Raj

Two pickers 6 4 1 5 3 2 2 2 … 1 1 3 4 2 1 6 5 5 5 5 5 1 5 1 4 4 2 3 3 2 3 3 2 2 1 1 6 6 6 6  Probability of drawing a number X for the first picker: P 1 (X) = P 1 (red)*P(X|red) + P 1 (blue)*P(X|blue)   Probability of drawing X for the second picker P 2 (X) = P 2 (red)*P(X|red) + P 2 (blue)*P(X|blue)   Problem: Given the set of numbers called out by both pickers estimate P 1 (color) and P 2 (color) for both colors  P(X | red) and P(X | blue) for all values of X  11-755 MLSP: Bhiksha Raj

With TWO pickers Called P(red|X) P(blue|X) PICKER 2 6 .8 .2 Called P(red|X) P(blue|X) 4 .33 .67 4 .57 .43 5 .33 .67 4 .57 .43 1 .57 .43 3 .57 .43 2 .14 .86 2 .27 .73 3 .33 .67 1 .75 .25 4 .33 .67 6 .90 .10 5 .33 .67 5 .57 .43 2 .14 .86 2 .14 .86 4.20 2.80 1 .57 .43 4 .33 .67  Two tables 3 .33 .67 4 .33 .67  The probability of selecting 6 .8 .2 pots is independently 2 .14 .86 1 .57 .43 computed for the two 6 .8 .2 pickers 7.31 10.69 PICKER 1 11-755 MLSP: Bhiksha Raj

With TWO pickers Called P(red|X) P(blue|X) PICKER 2 6 .8 .2 Called P(red|X) P(blue|X) 4 .33 .67 4 .57 .43 5 .33 .67 4 .57 .43 1 .57 .43 3 .57 .43 2 .14 .86 2 .27 .73 3 .33 .67 1 .75 .25 4 .33 .67 6 .90 .10 5 .33 .67 5 .57 .43 2 .14 .86 2 .14 .86 4.20 2.80 1 .57 .43 4 .33 .67 P(RED | PICKER1) = 7.31 / 18 3 .33 .67 4 .33 .67 P(BLUE | PICKER1) = 10.69 / 18 6 .8 .2 2 .14 .86 1 .57 .43 6 .8 .2 P(RED | PICKER2) = 4.2 / 7 P(BLUE | PICKER2) = 2.8 / 7 7.31 10.69 PICKER 1 11-755 MLSP: Bhiksha Raj

With TWO pickers Called P(red|X) P(blue|X) Called P(red|X) P(blue|X) 6 .8 .2 4 .57 .43 4 .33 .67 4 .57 .43 5 .33 .67 3 .57 .43 1 .57 .43 2 .27 .73 2 .14 .86 1 .75 .25 3 .33 .67 6 .90 .10 4 .33 .67 5 .57 .43 5 .33 .67 2 .14 .86  To compute probabilities of 2 .14 .86 1 .57 .43 numbers combine the tables 4 .33 .67 3 .33 .67  Total count of Red: 11.51 4 .33 .67 6 .8 .2  Total count of Blue: 13.49 2 .14 .86 1 .57 .43 6 .8 .2 11-755 MLSP: Bhiksha Raj

With TWO pickers: The SECOND picker Called P(red|X) P(blue|X) Called P(red|X) P(blue|X) 6 .8 .2 4 .57 .43 4 .33 .67 4 .57 .43 5 .33 .67 3 .57 .43 1 .57 .43 2 .27 .73 2 .14 .86 1 .75 .25 3 .33 .67 6 .90 .10 4 .33 .67 5 .57 .43 5 .33 .67 2 .14 .86 2 .14 .86 Total count for “Red” : 11.51  1 .57 .43 Red:  4 .33 .67 Total count for 1: 2.46  3 .33 .67 Total count for 2: 0.83  4 .33 .67 Total count for 3: 1.23  6 .8 .2 Total count for 4: 2.46  2 .14 .86 Total count for 5: 1.23  1 .57 .43 Total count for 6: 3.30  6 .8 .2 P(6|RED) = 3.3 / 11.51 = 0.29  11-755 MLSP: Bhiksha Raj

In Squiggles  Given a sequence of observations O k,1 , O k,2 , .. from the k th picker  N k,X is the number of observations of color X drawn by the k th picker  Initialize P k (Z), P(X|Z) for pots Z and colors X  Iterate:  For each Color X, for each ( | ) ( ) P X Z P Z  k ( | ) pot Z and each observer k: P Z X  k ( ' ) ( | ' ) P Z P X Z k ' Z  Update probability of  ( | ) N P Z X numbers for the pots: , k X k  k ( | ) P X Z  ( ' | ) N P Z X , k X k  Update the mixture k Z ' weights: probability  ( | ) N P Z X of urn selection for each , k X k  X ( ) P Z  picker k ( ' | ) N P Z X , k X k ' Z X 11-755 MLSP: Bhiksha Raj

Signal Separation with the Urn model  What does the probability of drawing balls from Urns have to do with sounds?  Or Images?  We shall see.. 11-755 MLSP: Bhiksha Raj

The representation FREQ AMPL TIME TIME  We represent signals spectrographically  Sequence of magnitude spectral vectors estimated from (overlapping) segments of signal  Computed using the short-time Fourier transform  Note: Only retaining the magnitude of the STFT for operations  We will, need the phase later for conversion to a signal 11-755 MLSP: Bhiksha Raj

A Multinomial Model for Spectra  A generative model for one frame of a spectrogram  A magnitude spectral vector obtained from a DFT represents spectral magnitude against discrete frequencies  This may be viewed as a histogram of draws from a multinomial t FRAME HISTOGRAM f P t (f ) f The balls are FRAME marked with Power spectrum of frame t discrete frequency t indices from the DFT Probability distribution underlying the t-th spectral vector 11-755 MLSP: Bhiksha Raj

A more complex model  A “picker” has multiple urns  In each draw he first selects an urn, and then a ball from the urn  Overall probability of drawing f is a mixture multinomial Since several multinomials (urns) are combined   Two aspects – the probability with which he selects any urn, and the probability of frequencies with the urns HISTOGRAM multiple draws 11-755 MLSP: Bhiksha Raj

The Picker Generates a Spectrogram  The picker has a fixed set of Urns  Each urn has a different probability distribution over f  He draws the spectrum for the first frame  In which he selects urns according to some probability P 0 ( z )  Then draws the spectrum for the second frame  In which he selects urns according to some probability P 1 ( z )  And so on, until he has constructed the entire spectrogram 11-755 MLSP: Bhiksha Raj

Latent Variable Models and Signal Separation Class 13. 11 Oct 2012 - PowerPoint PPT Presentation

11-755 Machine Learning for Signal Processing Latent Variable Models and Signal Separation Class 13. 11 Oct 2012 11-755 MLSP: Bhiksha Raj Sound separation and enhancement A common problem: Separate or enhance sounds Speech from noise

1 Latent variable models In the next section we will discuss latent variable models for

Latent Variable Models CS3750 Xiaoting Li 1 Out utli line Latent Variable Models

Processing Latent Variable Models and Signal Separation Bhiksha Raj Class 13. 15 Oct 2013

Learning Overcomplete Latent Variable Models through Tensor Methods Anima Anandkumar UC Irvine

Part III: Latent Tree Models Le Song ICML 2012 Tutorial on Spectral Algorithms for Latent

Pengtao Xie Joint work with Yuntian Deng and Eric Xing Carnegie Mellon University 1 Latent

Latent Variable Models Stefano Ermon, Aditya Grover Stanford University Lecture 6 Stefano

Learning Latent Variable Models through Tensor Methods Anima Anandkumar U.C. Irvine Challenges

Separation energies A = 21 isobaric chain one-nucleon separation energies two-nucleon separation

Numberjack User Guide May 27, 2013 1 Variables Constructor for the class Variable : Constructor

Latent Variable models for GWAs Oliver Stegle Machine Learning and Computational Biology Research

Guaranteed Learning of Latent Variable Models through Tensor Methods Furong Huang University of

Discrete Latent Variable Models Stefano Ermon, Aditya Grover Stanford University Lecture 15

Outline Latent Variable Generative Models Cooperative Vector Quantizer Model Model

Maximum Reconstruction Estimation for Generative Latent-Variable Models Yong Cheng joint work

Tx Signal: 1000 Hz sine wave; Attenuation; Random noise with 0.5ms spike Tx Signal Noise Rx

Monaural speech separation using source-adapted models Ron Weiss, Dan Ellis { ronw,dpwe }

Mol2Net Diagnosis of the behavior of the African snail (Lissachatina fulica) by means of its

Computational prediction and experimental analysis of RNA structures Sonia Varriale 1 , Stefano

plasmid-mediated fosfomycin resistance gene identified in an Escherichia coli isolate Laurent

L13. Sound Localization delay September 16, 2011 = + r ( ) f

Score informed audio source separation using a parametric model of non-negative spectrogram

Outline Successful Multiparty Audio Communication over the Internet Introduction Problems

Indoor Sound Localization Fares Abawi Universitt Hamburg Fakultt fr Mathematik, Informatik