OPENING THE BLACK BOX FOR FORENSIC AUTOMATIC SPEAKER RECOGNITION
Anil Alexander and Finnian Kelly Oxford Wave Research Ltd
? ? ? ?
Oxford Wave Research Ltd WHO WE ARE Oxford Wave Research Ltd (OWR) - - PowerPoint PPT Presentation
? ? ? ? Anil Alexander OPENING THE BLACK BOX FOR FORENSIC and AUTOMATIC SPEAKER RECOGNITION Finnian Kelly Oxford Wave Research Ltd WHO WE ARE Oxford Wave Research Ltd (OWR) is an audio and speech R&D company based in Oxford, UK.
Anil Alexander and Finnian Kelly Oxford Wave Research Ltd
? ? ? ?
Oxford Wave Research Ltd (OWR) is an audio and speech R&D company based in Oxford, UK. We develop systems for: Automatic Speaker Recognition Speaker Diarization Audio Fingerprinting Our products are used by law enforcement the UK, US, Europe and the Middle East including the MET police, UK MoD, Netherlands Forensic Institute, German BKA etc.
Lay people Juries, judges and lawyers Even forensic experts!
Recent advances in Speaker Recognition involve a huge number of variables – training and evaluation data, feature modelling and parameter choices. A lot of the focus has been on incremental improvements on large datasets of the variability is designed or controlled. How does this sit in in the context of opening the black box in real forensic casework?
Balance Logic Robustness Transparency
Multiple data selection decisions before you even get started
LDA/PLDA UBM Training TV Matrix
How does this affect the Likelihood Ratios or the Strength of evidence?
and an understanding of the legal requirements in their area
straight-forward means of incorporating their knowledge into an automatic analysis.
performance for each case.
*LTF illustration from Catalina Manual
*
Bayesian Likelihood Ratios
unable to look, or indeed adapt the automatic system to their own requirements.
introduce new data at every step of the speaker recognition process.
configurations, and has the ability to train the system specifically for their problem domain.
VOCALISE Voice Comparison and Analysis of the Likelihood of Speech Evidence Flexible Features ‘Automatic’ spectral features ‘Traditional’ forensic phonetic parameters ‘User’- provided features Flexible Modeling State of the art ivector/PLDA ‘Classical’ – GMM/GMM-UBM The ‘Session’ Concept: