Constancy in the perception of speech when the level of room-reflections varies

Authors

  • Anthony Watkins, Department of Psychology, University of Reading, Reading RG6 6AL, United Kingdom
  • Simon Makin, Department of Psychology, University of Reading, Reading RG6 6AL, United Kingdom
  • Andrew Raimond, Department of Psychology, University of Reading, Reading RG6 6AL, United Kingdom

Abstract

A speech message played several metres from the listener in a room is usually heard to have much the same phonetic content as it does when played nearby, even though the different amounts of reflected sound make the temporal envelopes of these signals very different. To study this ‘constancy’ effect, listeners heard speech messages and speech-like sounds comprising 8 auditory-filter-shaped noise bands that had temporal envelopes corresponding to those in these filters when the speech message is played. The ‘contexts’ were “next you’ll get _ to click on”, into which a “sir” or “stir” test word was inserted. These test words were from an 11-step continuum, formed by amplitude modulation. Listeners identified the test words appropriately, even in the 8-band conditions where the speech had a ‘robotic’ quality. Constancy was assessed by comparing the influence of room reflections on the test word across conditions where the context had either the same level of room reflections (i.e. from the same, far distance), or where it had a much lower level (i.e. from nearby). Constancy effects were obtained with both the natural and the 8-band speech. Results are considered in terms of the degree of ‘matching’ between the context’s and test-word’s bands.
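
As an illustration of what a set of 8 auditory-filter-shaped bands might look like, the following minimal sketch spaces 8 band centre frequencies evenly on the ERB-number scale of Glasberg and Moore and reports the equivalent rectangular bandwidth at each centre. The abstract does not give the band frequencies used in the study, so the 100 Hz and 8000 Hz limits, the even ERB spacing, and all function names here are assumptions made only for this example.

```python
import math


def hz_to_erb_number(f_hz):
    """Convert frequency in Hz to ERB-number (Glasberg and Moore formula)."""
    return 21.4 * math.log10(4.37 * f_hz / 1000.0 + 1.0)


def erb_number_to_hz(erb_number):
    """Inverse of hz_to_erb_number: ERB-number back to frequency in Hz."""
    return (10.0 ** (erb_number / 21.4) - 1.0) * 1000.0 / 4.37


def erb_bandwidth_hz(f_hz):
    """Equivalent rectangular bandwidth in Hz of the auditory filter at f_hz."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)


def band_centres(n_bands=8, low_hz=100.0, high_hz=8000.0):
    """Centre frequencies evenly spaced on the ERB-number scale.

    low_hz and high_hz are illustrative assumptions, not values
    taken from the study.
    """
    lowest = hz_to_erb_number(low_hz)
    highest = hz_to_erb_number(high_hz)
    step = (highest - lowest) / (n_bands - 1)
    return [erb_number_to_hz(lowest + i * step) for i in range(n_bands)]


if __name__ == "__main__":
    for centre in band_centres():
        print(f"{centre:8.1f} Hz  ERB = {erb_bandwidth_hz(centre):6.1f} Hz")
```

In a noise-band version of the speech, each such band would carry noise whose temporal envelope follows the envelope of the speech in the corresponding filter; that synthesis step is not shown here.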

Published

2009-12-15

How to Cite

Watkins, A., Makin, S., & Raimond, A. (2009). Constancy in the perception of speech when the level of room-reflections varies. Proceedings of the International Symposium on Auditory and Audiological Research, 2, 371–380. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2009-38

Section

2009/3. Speech processing and perception under adverse conditions