A Classification Approach to Single Channel Source Separation
CS 6772 Project
Ron Weiss
ronw@ee.columbia.edu
A Classi↓cation Approach to Single Channel Source Separation – p. 1/8
A Classification Approach to Single Channel Source Separation CS - - PowerPoint PPT Presentation
A Classification Approach to Single Channel Source Separation CS 6772 Project Ron Weiss ronw@ee.columbia.edu A Classication Approach to Single Channel Source Separation p. 1/8 Single Channel Source Separation Speech Babble noise
Ron Weiss
ronw@ee.columbia.edu
A Classi↓cation Approach to Single Channel Source Separation – p. 1/8
Speech Time (seconds) Frequency (Hz) 1 2 3 1000 2000 3000 4000 Babble noise Time (seconds) 1 2 3 1000 2000 3000 4000 Mixture (10 dB SNR) Time (seconds) 1 2 3 1000 2000 3000 4000
A Classi↓cation Approach to Single Channel Source Separation – p. 2/8
Mixture Time (seconds) Frequency (Hz) 1 2 3 1000 2000 3000 4000 Mask − regions where speech energy dominates Time (seconds) Frequency (Hz) 1 2 3 1000 2000 3000 4000
A Classi↓cation Approach to Single Channel Source Separation – p. 3/8
A Classi↓cation Approach to Single Channel Source Separation – p. 4/8
A Classi↓cation Approach to Single Channel Source Separation – p. 5/8
P(zd|k) = P(rd)N(zd|µk,d, σk,d) + (1 − P(rd))
A Classi↓cation Approach to Single Channel Source Separation – p. 6/8
speech + factory2 noise − 0.88695 dB SNR 0.5 1 1.5 1000 2000 3000 4000 clean speech signal 0.5 1 1.5 1000 2000 3000 4000 RVM mask 0.5 1 1.5 1000 2000 3000 4000 A priori mask 0.5 1 1.5 1000 2000 3000 4000 Refiltering using RVM mask − 7.7788 dB SNR 0.5 1 1.5 1000 2000 3000 4000 GMM reconstruction − 8.4013 dB SNR 0.5 1 1.5 1000 2000 3000 4000
A Classi↓cation Approach to Single Channel Source Separation – p. 7/8
[1]
by missing data techniques. In WISP, pages 295–307, April 2001. [2]
recognition with missing and unreliable acoustic data. Speech Communication, 34:267–285, May 2001. [3]
robust speech recognition. Speech Communication, 43:275–296, 2004. [4]
[5]
In Proceedings of EuroSpeech, 2003. [6]
feature methods of robust speech recognition. In Proceedings of ICSLP, 2000. [7]
Muller, editors, Advances in Neural Information Processing Systems 12, pages 652–658. MIT Press, 2000.
A Classi↓cation Approach to Single Channel Source Separation – p. 8/8