Some acoustic effects of speaking style on utterances for automatic speaker verification

  • Jana Dankovičová and Francis Nolan

Extract

This paper reports the results of an experiment on the effects of six speaking styles on some of the acoustic properties of speech. The experiment was part of an exploration of within-speaker variation in connection with automatic speaker verification (ASV), pursuing the hypothesis that the elicitation of style variation in the training phase of an ASV system (‘structured training’) would enhance the performance of the system. Swedish-speaking subjects produced a digit sequence at varying speaking rates and loudness levels, and also with simulated denasality (pinched nose) and under cognitive stress. Duration of vowels and consonants, and formant frequencies of vowels, were measured. A number of consistent patterns of variation emerged for duration and vowel quality and are reported here. The discussion explores the relation between the patterns observed and the success, or in the case of speech under stress the failure, of structured training in reducing the error rates in ASV.

