Report on a binaural extension of the Speech Transmission Index method for nonlinear systems and narrowband interference

Authors

  • Anton Schlesinger, Acoustical Imaging and Sound Control, Delft University of Technology, 2600 GA Delft, The Netherlands
  • Juan-Pablo Ramirez, Quality and Usability Lab., Deutsche Telekom Laboratories, Berlin Institute of Technology, D-10587 Berlin, Germany
  • Jasper van Dorph Schuitman, Acoustical Imaging and Sound Control, Delft University of Technology, 2600 GA Delft, The Netherlands
  • Marinus M. Boone, Acoustical Imaging and Sound Control, Delft University of Technology, 2600 GA Delft, The Netherlands

Abstract

A speech-based and binaural version of the Speech Transmission Index (STI) is presented. An efficient envelope regression method of the STI (Goldsworthy and Greenberg, 2004) provides the basis for the proposed method and allows the effect of nonlinear distortion on speech intelligibility to be estimated. The speech-based STI method is extended by an auditory filterbank for the peripheral processing and by a binaural processing stage. By this means, the influence of narrowband to broadband interference and the binaural advantage on speech intelligibility can be predicted for normal-hearing and hearing-impaired listeners. The method has been developed primarily to assess nonlinear and binaural speech enhancement processors, e.g. algorithms of computational auditory scene analysis, but is generally suited to other applications of binaural listening. To illustrate the binaural advantage in speech intelligibility, the binaural STI was contrasted with the monaural STI in a room-acoustical simulation. The performance of the proposed STI method was evaluated against a subjective test. Initial results show that the proposed STI method operates appropriately. However, relative to subjective perception, the proposed STI underestimates the binaural benefit and overestimates the intelligibility of nonlinearly processed speech. For these conditions, further improvement of the objective measure is required.
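As a rough illustration of the speech-based approach summarized above, the following minimal sketch computes a per-band transmission index from a clean and a degraded intensity envelope and combines the bands into a single index. It is not the authors' implementation. The function names, the normalized-covariance form of the envelope regression, and the plain normalized band weighting are assumptions made for this example; the clipping of the apparent signal-to-noise ratio to ±15 dB follows the conventional STI procedure. The auditory filterbank and the binaural stage described in the abstract are not covered.

```python
import math


def band_transmission_index(clean_env, degraded_env):
    """Transmission index of one frequency band (hypothetical helper).

    Both arguments are intensity envelopes of equal length. The modulation
    metric is assumed to be the regression slope of the degraded envelope
    on the clean envelope, normalized by the ratio of the envelope means.
    """
    n = len(clean_env)
    mu_x = sum(clean_env) / n
    mu_y = sum(degraded_env) / n
    cov_xy = sum((x - mu_x) * (y - mu_y)
                 for x, y in zip(clean_env, degraded_env)) / n
    var_x = sum((x - mu_x) ** 2 for x in clean_env) / n

    m = (mu_x / mu_y) * (cov_xy / var_x)
    # Keep the metric strictly inside (0, 1) so the logarithm is defined.
    m = min(max(m, 1e-6), 1.0 - 1e-6)

    # Apparent signal-to-noise ratio, clipped to the conventional +/-15 dB.
    snr_db = 10.0 * math.log10(m / (1.0 - m))
    snr_db = min(max(snr_db, -15.0), 15.0)
    return (snr_db + 15.0) / 30.0


def sti(band_indices, band_weights):
    """Weighted mean of band transmission indices (assumed weighting)."""
    total = sum(band_weights)
    return sum(w * ti for w, ti in zip(band_weights, band_indices)) / total
```

With this sketch, an undistorted band yields a transmission index of 1, and adding a constant noise intensity equal to the mean speech intensity yields 0.5, which corresponds to an apparent signal-to-noise ratio of 0 dB.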

References

Beutelmann, R., and Brand, T. (2006). “Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners,” J. Acoust. Soc. Am. 120, 331-342.

Blauert, J. (1996). Spatial hearing (MIT Publication).

Bronkhorst, A. W. (2000). “The cocktail party phenomenon: a review of research on speech intelligibility in multiple-talker conditions,” Acustica 86, 117-128.

Elhilali, M., Chi, T., and Shamma, S. A. (2003). “A spectro-temporal modulation index (STMI) for assessment of speech intelligibility,” Speech Communication 41, 331-348.

Durlach, N. I. (1972). Binaural Signal Detection: Equalization and cancellation theory of binaural masking-level differences (Academic, New York), Vol. II, pp. 371-462.

Goldsworthy, R. L., and Greenberg, J. E. (2004). “Analysis of speech-based speech transmission index methods with implications for nonlinear operations,” J. Acoust. Soc. Am. 116, 3679-3689.

Holube, I., and Kollmeier, B. (1996). “Speech intelligibility prediction in hearing-impaired listeners based on a psychoacoustically motivated perception model,” J. Acoust. Soc. Am. 100, 1703-1716.

Houtgast, T., and Steeneken, H. J. M. (1973). “The modulation transfer function in room acoustics as a predictor of speech intelligibility,” Acustica 28, 66-73.

Jeffress, L. A. (1948). “A place theory of sound localization,” J. Comp. Physiol. Psychol. 41, 35-39.

Pavlovic, C. V. (1987). “Derivation of primary parameters and procedures for use in speech intelligibility predictions,” J. Acoust. Soc. Am. 82, 413-422.

Payton, K., and Shrestha, M. (2008). “Analysis of short-time speech transmission index algorithms,” Acoustics 08 Proceedings, Paris, France, 633-638.

Ramirez, J. P., Raake, A., and Reusch, D. (2009a). “Intelligibility assessment method for semantically unpredictable sentences in German,” in ISAAR 2009: Binaural Processing and Spatial Hearing, edited by J. Buchholz, J. C. Dalsgaard, T. Dau, and T. Poulsen (The Danavox Jubilee Foundation).

Ramirez, J. P., Raake, A., and Reusch, D. (2009b). “Intelligibility assessment method for semantically unpredictable sentences in German,” NAG-DAGA Proceedings, Rotterdam.

Steeneken, H. J. M., and Houtgast, T. (2002). “Basics of the STI measuring method,” in Past, present and future of the Speech Transmission Index, edited by S. J. van Wijngaarden (TNO Human Factors).

van Wijngaarden, S. J., and Drullman, R. (2008). “Binaural intelligibility prediction based on the speech transmission index,” J. Acoust. Soc. Am. 123, 4514-4523.

Wang, L. (2006). Computational Auditory Scene Analysis (John Wiley and Sons, Inc. Publication).

Published

2009-12-15

How to Cite

Schlesinger, A., Ramirez, J.-P., Schuitman, J. van D., & Boone, M. M. (2009). Report on a binaural extension of the Speech Transmission Index method for nonlinear systems and narrowband interference. Proceedings of the International Symposium on Auditory and Audiological Research, 2, 353–362. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2009-36

Section

2009/3. Speech processing and perception under adverse conditions