SLIDE 3 Robust Automatic Speech Recognition through on-line Semi Blind Source Extraction
Francesco Nesta, Marco Matassoni {nesta,matassoni}@fbk.eu
BSS vs BSE
) , ( ) ( ) , ( l k k l k s H x =
is a vector of N sources is a vector of M mixtures (i.e. mic numbers)
) , ( l k x ) , ( l k s
- If N=M, the source signals are as:
) , ( ) , ( ) ( ) , ( l k l k k l k s x W y ≈ = I H W =
−
) ( ) (
1 k
k
(up to order and scaling ambiguity)
k = frequency bin index l = frame index
Blind Source Separation (BSS)
Image at microphones of the target source Image at microphones of the sum of the interfering sources
) , ( ) , ( ) ( ) , ( ) , ( ) , ( l k l k s k l k l k l k
t t t
n h n s x + = + =
blind source extraction paradigm has been proposed to
those limitations[Takahashi, Saruwatari et all 2008].
Blind Source Extraction (BSE) In real-world N>M and may rapidly change over time!