Impact of mobile phone usage on speech spectral features: some preliminary findings

Authors

  • Slobodan T Jovicic Telecommunications Department, School of Electrical Engineering, University of Belgrade, Belgrade, Serbia Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade, Serbia
  • Nikola Jovanović Telecommunications Department, School of Electrical Engineering, University of Belgrade, Belgrade
  • Miško Subotić Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade
  • Đorđe Grozdić Telecommunications Department, School of Electrical Engineering, University of Belgrade Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade

DOI:

https://doi.org/10.1558/ijsll.v22i1.17880

Keywords:

mobile phone usage, speech production, ltass, ltf, vowel formants

Abstract

The manner of using a mobile phone in voice communications can significantly affect the spectral characteristics of the speech signal. This article presents preliminary results of the analysis of the long-term average speech spectrum (LTASS), the long-term formant distribution (LTF) in voiced sounds, and vowel formants F1, F2 and F3 of six speakers in five modes of mobile phone usage. These modes are: normal holding of a mobile phone (NOR), with a bonbon (sweet) in the mouth (BON), with a cigarette between the lips (CIG), with the mobile phone between cheek and shoulder (SHO) and with the hand covering the mobile phone and mouth (HAN). The results show that each mode has an impact on spectral features and that the modes HAN and SHO have the greatest impact. The most striking results are the relative displacement of F1, which can reach 30% (e.g. vowel /a/ in HAN mode for males), formant F2, near 15% (vowel /i/ in SHO mode for males), and formant F3, about 5% (vowel /u/ in CIG mode for females). These findings suggest that forensic practitioners should exercise caution in interpreting formant measurements in speaker identification cases involving mobile phone transmission.

Author Biographies

  • Slobodan T Jovicic, Telecommunications Department, School of Electrical Engineering, University of Belgrade, Belgrade, Serbia Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade, Serbia
    Slobodan T. Jovičić PhD is professor at the Department of Telecommunications, School of Electrical Engineering, University of Belgrade, Serbia. He is a scientific adviser at the Life Activities Advancement Center and a forensic expert at the Serbian Ministry of Justice. He founded the Laboratory for Forensic Acoustics and Phonetics at the Life Activities Advancement Center, Belgrade. His research interests include speech communication, speech technology, speech disorders and forensic science.
  • Nikola Jovanović, Telecommunications Department, School of Electrical Engineering, University of Belgrade, Belgrade
    Nikola Jovanović received his MSc in telecommunications (speech communication) from the School of Electrical Engineering, University of Belgrade, Serbia. He now works as a researcher at the Military Technical Institute, Belgrade. His main research interests lie within the field of speech communication quality enhancement.
  • Miško Subotić, Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade
    Miško Subotić received his PhD in telecommunications (speech communication) from the School of Electrical Engineering, University of Belgrade, Serbia. He is now director of the Life Activities Advancement Center, an innovative center that covers interdisciplinary aspects in speech communication investigation. His research interests include speech production and perception, bioinformatics bases of speech, speech and hearing disorders, and cognitive speech and e-medicine applications.
  • Đorđe Grozdić, Telecommunications Department, School of Electrical Engineering, University of Belgrade Laboratory for Forensic Acoustics and Phonetics, Life Activities Advancement Center, Belgrade
    Đorđe Grozdić received his MSc in telecommunications (speech communication) from the School of Electrical Engineering, University of Belgrade, Serbia. He is now studying for his doctorate at the School of Electrical Engineering and works in the Laboratory for Forensic Acoustics and Phonetics at the Life Activities Advancement Center, Belgrade. His research interests lie within the field of speech perception, recognition and quality enhancement.

References

Boersma, P. and Weenink, D. (2011) Praat: Doing Phonetics by Computer (Version 5.2.26). Retrieved from http://praat.org/.


Byrne, C. and Foulkes, P. (2004) The ‘mobile phone effect’ on vowel formants. International Journal of Speech, Language and the Law 11(1): 83–102. http://dx.doi.org/10.1558/sll.2004.11.1.83


Fant, G. (1970) Acoustic Theory of Speech Production (2nd edn). The Hague: Mouton & Co.


Fecher, N. (2014) Effects of forensically-relevant facial concealment on acoustic and perceptual properties of consonants. Dissertation, Department of Language and Linguistic Science, University of York.


Flanagan, J. L. (1972) Speech Analysis, Synthesis and Perception (2nd edn). New York: Springer-Verlag. http://dx.doi.org/10.1007/978-3-662-01562-9


Guillemin, B. J. and Watson, C. (2008) Impact of the GSM mobile phone network on the speech signal: some preliminary findings. International Journal of Speech, Language and the Law 15(2): 193–218.


Ito, T., Takeda, K. and Itakura, F. (2005) Analysis and recognition of whispered speech. Speech Communication 45: 129–152. http://dx.doi.org/10.1016/j.specom.2003.10.005


Jovi?i?, S. T. (1998) Formant feature differences between whispered and voiced sustained vowels. Acustica – Acta Acustica 84: 739–743.


Jovi?i?, S. T., Kaši?, Z., ?or?evi?, M. and Rajkovi?, M. (2004) Serbian emotional speech database: design, processing and evaluation. Proceedings of the Conference SPECOM-2004, St Petersburg, Russia: 77–81.


Künzel, H. J. (2001) Beware of the ‘telephone effect’: the influence of telephone transmission on the measurement of formant frequencies. Forensic Linguistics 8(1): 80–99.


Künzel, H. J. (2002) Rejoinder to Francis Nolan’s ‘The “telephone effect” on formants’: a response. Forensic Linguistics 9(1): 83–86.


Lindblom, L. and Sundberg, J. (1971) Acoustical consequences of lip, tongue, jaw, and larynx movement. Journal of the Acoustical Society of America 50(4): 1166–1179. http://dx.doi.org/10.1121/1.1912750


Nolan, F. (2002) The ‘telephone effect’ on formants: a response. Forensic linguistics 9(1): 74–82. http://dx.doi.org/10.1558/sll.2002.9.1.74


Nolan, F. and Grigoras, C. (2005) A case for formant analysis in forensic speaker identification. International Journal of Speech Language and the Law 12(2): 143–173. http://dx.doi.org/10.1558/sll.2005.12.2.143


Rose, P. J. (2002) Forensic Speaker Identification. London: Taylor & Francis. http://dx.doi.org/10.1201/9780203166369


Shewmaker, M. B., Hapner, E. R., Gilman, M., Klein, A. M. and Johns, M. M. (2010) Analysis of voice change during cellular phone use: a blinded controlled study. Journal of Voice 24(3): 308–313. http://dx.doi.org/10.1016/j.jvoice.2008.09.002


Stevens, K. N., and Keyser, S. J. (2010) Quantal theory, enhancement and overlap. Journal of Phonetics 38(1): 10–19. http://dx.doi.org/10.1016/j.wocn.2008.10.004

Published

2015-07-08

Issue

Section

Articles

How to Cite

Jovicic, S. T., Jovanović, N., Subotić, M., & Grozdić, Đorđe. (2015). Impact of mobile phone usage on speech spectral features: some preliminary findings. International Journal of Speech, Language and the Law, 22(1), 111-126. https://doi.org/10.1558/ijsll.v22i1.17880