Duration threshold for identifying speech samples for different phonemes

Authors

  • Hendrik Husstedt German Institute of Hearing Aids, Lübeck, Germany
  • Simone Wollermann German Institute of Hearing Aids, Lübeck, Germany; Universität Lübeck, Lübeck, Germany
  • Daniel Bank German Institute of Hearing Aids, Lübeck, Germany; Universität Lübeck, Lübeck, Germany
  • Mario Schinnerl German Institute of Hearing Aids, Lübeck, Germany; Universität Lübeck, Lübeck, Germany
  • Marlitt Frenz German Institute of Hearing Aids, Lübeck, Germany
  • Jürgen Tchorz Technische Hochschule L¨ubeck, Lübeck, Germany

Keywords:

Duration threshold, identification, classification, speech

Abstract

The identification or classification of acoustic objects is important to decide in which way a sound needs to be interpreted and to rate its importance or relevance. In recent studies, it has been shown that the minimal duration of a sound, which is required for a correct identification, could be a useful audiological parameter, e.g. providing information about the hearing ability of a person. In this work, we want to investigate which cues are used by humans to classify a sound correctly as speech. For this purpose, the duration thresholds for the identification of speech samples starting with different phonemes are analyzed for elderly listeners with normal and impaired hearing. To this end, a two-alternative forced choice (2-AFC) method was used, where, as an alternative to speech, a noise signal with a matched frequency spectrum was presented. In contrast to previous studies, there were no frequency cues available and we found no correlation to the pure tone average (PTA) or speech understanding in noise. As one main conclusion, the results suggest that humans primarily exploit the temporal envelope (ENV) rather than the temporal fine structure (TFS) for the identification of short speech samples above hearing threshold and without frequency cues.

References

Bank, D., Schinnerl, M., Frenz, M., Gassenmeyer, F., and Husstedt, H. (2019). “Dura- tion Threshold for Identifying Sound Samples of Elderly Hearing Impaired,” The Student Conference of the BioMedTec Science Campus, Lu ̈beck, Mar., 2019

Budathoki , D., Tchorz, J., and O’Beirne , G. (2019). “Duration Thresholds for Identifying Different Sound Types,” 22. DGA Jahrestagung, Heidelberg, Germany, 2019.

Gray, G. W. (1942). “Phonemic Microtomy: The Minimal Duration of Perceptible Speech Sounds,” Speech Monogr., 9(1), 75-90. doi: 10.1080/03637754209390064

Gygi, B., Kidd, G. R., and Watson, C. S. (2004). “Spectral-temporal factors in the identification of environmental sounds,” J. Acoust. Soc. Am., 115(3), 1252-1265. doi: 10.1121/1.1635840

Hirsh, I. J., and Watson, C. S. (1996). “Auditory psychophysics and perception,” Ann. Rev. Psychol., 47(1), 461-484. DOI: 10.1146/annurev.psych.47.1.461

Husstedt, H., Bank, D., and Schinnerl, M. (2019). “Comparison of Two Procedures to Measure the Duration Threshold for Identifying Sound Samples,” 22. DGA Jahrestagung, Heidelberg, Germany, 2019.

Husstedt, H., Wollermann, S., and Tchorz, J. (2018). “Analysis of the Transition of the Automatic Selection of Hearing Aid Programs,” 45th Erlanger Kolloquium, Erlangen, Germany, 2018.

Kaernbach , C. (1991). “Simple adaptive testing with the weighted up-down method,” Percept. Psychophys., 49(3), 227-229. doi: 10.3758/BF03214307

Kollmeier, B., and Wesselkamp, M. (1997). “Development and evaluation of a Ger- man sentence test for objective and subjective speech intelligibility assessment,” J. Acoust. Soc. Am., 104(2), 2412-2421. doi: 10.1121/1.419624

Lorenzi, C., Gilbert, G., Carn, H., Garnier, S., and Moore, B. (2006). “Speech perception problems of the hearing impaired reflect inability to use temporal fine structure,” Proc. Natl. Acad. Sci. U. S. A., 103(49), 18866-18869. doi: 10.1073/pnas.0607364103

McDermott, J. H., and Simoncelli, E. P. (2011). “Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis,” Neuron, 71(5), 926-940. doi: 10.1016/j.neuron.2011.06.032

Moore, C. J. M. (1984). “Frequency selectivity and temporal resolution in nor- mal and hearing-impaired listeners,” Br. J. Audiol., 19(3), 189-201. doi: 10.3109/03005368509078973

Nasreddine, Z. S., Phillips, N. A., Be ́dirian, V., Charbonneau, S., Whitehead, V., Collin, I., Cummings, J. L., and Chertkow, H. (2005). “The Montreal Cognitive Assessment, MoCA: A Brief Screening Tool For Mild Cognitive Impairment,” J. Am. Geriatr. Soc., 53, 695-699. doi: 10.1111/j.1532-5415.2005.53221.x

Pietro, M., Laganaro, and M., Schnider, A. (2016). “Auditory agnosia,” in Neuropsychological Research: A Review, Edited by P. Marien and J. Abutalebi (Psychology Press), chap. 15.

Additional Files

Published

2020-03-26

How to Cite

Husstedt, H., Wollermann, S., Bank, D., Schinnerl, M., Frenz, M., & Tchorz, J. (2020). Duration threshold for identifying speech samples for different phonemes. Proceedings of the International Symposium on Auditory and Audiological Research, 7, 69–76. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2019-09

Issue

Section

2019/2. Learning from natural sounds