Extending a computational model of auditory processing towards speech intelligibility prediction

Authors

  • Helia Relaño-Iborra Hearing Systems, Department of Electrical Engineering, Technical University of Denmark, Kgs. Lyngby, Denmark
  • Johannes Zaar Hearing Systems, Department of Electrical Engineering, Technical University of Denmark, Kgs. Lyngby, Denmark
  • Torsten Dau Hearing Systems, Department of Electrical Engineering, Technical University of Denmark, Kgs. Lyngby, Denmark

Abstract

A speech intelligibility model is presented based on the computational auditory signal processing and perception model (CASP; Jepsen et al., 2008). CASP has previously been shown to successfully predict psychoacoustic data obtained in normal-hearing (NH) listeners in a wide range of listening conditions. Moreover, CASP can be parametrized to account for data from individual hearing-impaired listeners (Jepsen and Dau, 2011). In this study, the CASP model was investigated as a predictor of speech intelligibility measured in NH listeners in conditions of additive noise, phase jitter, spectral subtraction and ideal binary mask processing.

References

Chabot-Leclerc, A., Jørgensen, S., and Dau, T. (2014). “The role of auditory spectrotemporal modulation filtering and the decision metric for speech intelligibility prediction,” J. Acoust. Soc. Am., 135, 3502-3512.

Dau, T. (1996). Modeling Auditory Processing of Amplitude Modulation. Doctoral dissertation. Retrieved from Bibliotheks- und Informationssystem der Universit¨at Oldenburg.

Dau, T., P¨uschel, D., and Kohlrausch, A. (1996). “A quantitative model of the effective signal processing in the auditory system. I. Model structure,” J. Acoust. Soc. Am., 99, 3615-3622.

Elhilali, M., Chi, T., and Shamma, S.A. (2003). “A spectro-temporal modulation index (STMI) for assessment of speech intelligibility,” Speech Commun., 41, 331-348.

Holube, I., Fredelake, S., Vlaming, M., and Kollmeier, B. (2010).“Development and analysis of an International Speech Test Signal (ISTS),” Int. J. Audiol., 49, 891-903.

Jepsen, M.L., Ewert, S.D., and Dau, T. (2008). “A computational model of human auditory signal processing and perception,” J. Acoust. Soc. Am., 124, 422-438.

Jepsen, M.L., and Dau T. (2011) “Characterizing auditory processing and perception in individual listeners with sensorineural hearing loss,” J. Acoust. Soc. Am., 129, 262-281.

Jørgensen, S., and Dau, T. (2011). “Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing,” J. Acoust. Soc. Am., 130, 1475-1487.

Jørgensen, S., Ewert, S.D., and Dau, T. (2013). “A multi-resolution envelope-power based model for speech intelligibility,” J. Acoust. Soc. Am., 134, 436-446.

Kjems, U., Boldt, J.B., Pedersen, M.S., Lunner, T., and Wang, D.L. (2009).“Role of mask pattern in intelligibility of ideal binary-masked noisy speech,” J. Acoust. Soc. Am. , 126, 1415-1426.

Lopez-Poveda, E.A., and Meddis, R. (2001). “A human nonlinear cochlear filterbank,” J. Acoust. Soc. Am., 110, 3107-3118.

Nielsen, J.B., and Dau, T. (2009). “Development of a Danish speech intelligibility test,” Int. J. Audiol., 48, 729-741.

Rela˜no-Iborra, H., May, T., Zaar, J., Scheidiger, C., and Dau, T. (2016). “Predicting speech intelligibility based on a correlation metric in the envelope power spectrum domain,” J. Acoust. Soc. Am., 140, 2670-2679.

Scheidiger, C., Zaar, J., Swaminathan, J., and Dau, T. (2017). “Modeling speech intelligibility based on neural envelopes derived from auditory nerve spike trains”. Association for Research in Otolaryngology Mid-Winter Meeting, Baltimore.

Taal, C.H., Hendriks, R.C., Heusdens, R., and Jensen, J. (2011). “An algorithm for intelligibility prediction of time-frequency weighted noisy speech,” IEEE Trans. Audio Speech Lang. Process., 19, 2125-2136.

Wagener, K., Josvassen, J.L., and Ardenkjaer, R. (2003).“Design, optimization and evaluation of a Danish sentence test in noise,” Int. J. Audiol.,42, 10-17.

Zilany, M.S.A., and Bruce, I.C. (2007). “Predictions of speech intelligibility with a model of the normal and impaired auditory-periphery,” 3rd Int. IEEE/EMBS Conf. Neural Eng., 1-2, 481.

Additional Files

Published

2018-03-20

How to Cite

Relaño-Iborra, H., Zaar, J., & Dau, T. (2018). Extending a computational model of auditory processing towards speech intelligibility prediction. Proceedings of the International Symposium on Auditory and Audiological Research, 6, 319–326. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2017-39

Issue

Section

2017/5. Speech perception: Behavioral measures and modelling