More is better: likelihood ratio-based forensic voice comparison with vocalic segmental cepstra frontends
DOI:
https://doi.org/10.1558/ijsll.v20i1.77Keywords:
forensic voice comparison, likelihood ratio, vowel spectra, cepstrumAbstract
The suitability of vowel cepstral spectra for forensic voice comparison is explored within a likelihood ratio-based framework, and non-technical explanations provided for some basic concepts of cepstral analysis and forensic voice comparison. Non-contemporaneous landline telephone recordings of 297 male Japanese speakers are compared using only two replicates per recording of each of their five read-out vowels. 14 cepstrally-mean-subtracted LPC cepstral coefficients modelling the spectral shape to 5 kHz are used as features. When evaluated intrinsically with kernel density multivariate likelihood ratios, all 297 same-speaker comparisons are correctly discriminated as coming from the same speaker, and only 173 of the 43,956 different-speaker comparisons (0.4%) are incorrectly evaluated as coming from the same speaker. The log-likelihood ratio cost for this comparison is very low at 0.013. Fusion with a speaker’s long-term spectral data marginally improves the different-speaker error rate to 0.27% and the log-likelihood ratio cost to 0.009. It is concluded that the approach warrants further examination.Published
2013-07-09
Issue
Section
Articles
How to Cite
Rose, P. (2013). More is better: likelihood ratio-based forensic voice comparison with vocalic segmental cepstra frontends. International Journal of Speech, Language and the Law, 20(1), 77-116. https://doi.org/10.1558/ijsll.v20i1.77