Forensic speaker identification based on spectral moments

Authors

  • R. Rodman North Carolina State University
  • D. McAllister North Carolina State University
  • D. Bitzer North Carolina State University
  • L. Cepeda North Carolina State University
  • P. Abbitt North Carolina State University

DOI:

https://doi.org/10.1558/sll.2002.9.1.22

Keywords:

speaker identification, spectral moments, isolexemic sequences, glottal pulse period

Abstract

A new method for doing text-independent speaker identification geared to forensic situations is presented. By analysing ‘isolexemic’ sequences, the method addresses the issues of very short criminal exemplars and the need for open-set identification. An algorithm is given that computes an average spectral shape of the speech to be analysed for each glottal pulse period. Each such spectrum is converted to a probability density function and the first moment (i.e. the mean) and the second moment about the mean (i.e. the variance) are computed. Sequences of moment values are used as the basis for extracting variables that discriminate among speakers. Ten variables are presented all of which have sufficiently high inter- to intraspeaker variation to be effective discriminators. A case study comprising a ten-speaker database, and ten unknown speakers, is presented. A discriminant analysis is performed and the statistical measurements that result suggest that the method is potentially effective. The report represents work in progress.

Author Biography

  • R. Rodman, North Carolina State University
    Voice I/O Group: Multimedia Laboratory Department of Computer Science North Carolina State University [email protected]

Downloads

Published

2002-03-06

Issue

Section

Articles

How to Cite

Rodman, R., McAllister, D., Bitzer, D., Cepeda, L., & Abbitt, P. (2002). Forensic speaker identification based on spectral moments. International Journal of Speech, Language and the Law, 9(1), 22-43. https://doi.org/10.1558/sll.2002.9.1.22