multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
Crosstalk Analysis
Stuart N Wrigley Vincent Wan Guy J Brown Steve Renals
29 January 2003
Crosstalk Analysis Stuart N Wrigley Vincent Wan Guy J Brown Steve - - PowerPoint PPT Presentation
multimodal meeting manager - m4 Crosstalk Analysis Stuart N Wrigley Vincent Wan Guy J Brown Steve Renals 29 January 2003 Speech and Hearing Research Group, University of Sheffield, UK multimodal meeting manager - m4 Crosstalk Analysis
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
29 January 2003
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
S C SC N
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
# See LeBlanc and de Leon. Speech Separation by Kurtosis Maximization, IEEE ICASSP
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
0 10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 False Alarm probability (in %) Correct detection probability (in %) Single feature: mfcc speaker alone crosstalk alone speaker+crosstalk silence 0 10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 False Alarm probability (in %) Correct detection probability (in %) Single feature: max normalised XC speaker alone crosstalk alone speaker+crosstalk silence
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 False Alarm probability (in %) Correct detection probability (in %) MMROC using features: energy, kurtosis, fundamentalness, max XC, mean XC, max normalised XC, mean normalised XC, speaker alone crosstalk alone speaker+crosstalk silence
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 False Alarm probability (in %) Correct detection probability (in %) MMROC using features: kurtosis, fundamentalness, max XC, mean XC, max normalised XC, mean normalised XC, speaker alone crosstalk alone speaker+crosstalk silence
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 False Alarm probability (in %) Correct detection probability (in %) MMROC using features: kurtosis, fundamentalness, speaker alone crosstalk alone speaker+crosstalk silence
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK
Manual frame transcriptions of meeting bmr001 (portion) 500 1000 1500 2000 2500 3000 3500 4000 0.5 1 1.5 EHMM classication of same portion Time (16ms frames, 10ms shift) 500 1000 1500 2000 2500 3000 3500 4000 0.5 1 1.5
speaker alone speaker + crosstalk crosstalk alone silence / noise
multimodal meeting manager - m4 Speech and Hearing Research Group, University of Sheffield, UK