Spectral and temporal processing in the human auditory system

Authors

  • Torsten Dau, Centre for Applied Hearing Research, Ørsted•DTU, Technical University of Denmark, DK-2800 Lyngby, Denmark
  • Morten L. Jepsen, Centre for Applied Hearing Research, Ørsted•DTU, Technical University of Denmark, DK-2800 Lyngby, Denmark
  • Stephan D. Ewert, Medical Physics, University of Oldenburg, D-26111 Oldenburg, Germany

Abstract

An auditory signal processing model is presented that simulates psychoacoustical data from a large variety of experimental conditions related to spectral and temporal masking. The model is based on the modulation filterbank model by Dau et al. [J. Acoust. Soc. Am. 102, 2892-2905 (1997)] but includes the dual-resonance non-linear (DRNL) filterbank suggested by Lopez-Poveda and Meddis [J. Acoust. Soc. Am. 110, 3107-3118 (2001)] to simulate the non-linear cochlear signal processing, as well as several other modifications at later processing stages motivated by other recent findings. The model was tested in conditions of tone-in-noise masking, intensity discrimination, spectral masking with tones and narrowband noises, forward masking with (on- and off-frequency) noise- and pure-tone maskers, and amplitude modulation detection using different noise carrier bandwidths. One of the key properties of the model is the combination of the fast-acting cochlear compression with the slower compression realized in the adaptation stage of the model. Both play a crucial role in the success of this model.
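The distinction between fast-acting and slower compression can be illustrated with a minimal sketch. It is not the model's implementation: it assumes a broken-stick nonlinearity of the kind used in the nonlinear path of DRNL-type filters for the instantaneous part, and a single divisive feedback loop for the slow part. The function names `broken_stick` and `adaptation_loop`, the parameters `a`, `b`, `c`, `tau`, `floor`, and all numerical values are hypothetical choices for illustration, and the filterbank stages are omitted entirely.

```python
import math


def broken_stick(x, a=1.0, b=1.0, c=0.25):
    """Instantaneous compression: linear gain a at low amplitudes,
    power law b*|x|**c above the knee. a, b, c are illustrative values."""
    mag = abs(x)
    return math.copysign(min(a * mag, b * mag ** c), x)


def adaptation_loop(signal, fs, tau=0.05, floor=1e-5):
    """Single divisive feedback loop: each sample is divided by a
    low-pass-filtered copy of the loop output (time constant tau, seconds)."""
    alpha = math.exp(-1.0 / (tau * fs))  # one-pole low-pass coefficient
    state = math.sqrt(floor)             # resting state for a floor-level input
    out = []
    for x in signal:
        y = max(x, floor) / state
        state = alpha * state + (1.0 - alpha) * y
        out.append(y)
    return out


fs = 1000                                # sampling rate in Hz (assumed)
step = [broken_stick(16.0)] * (2 * fs)   # 2 s of constant input after compression
response = adaptation_loop(step, fs)
print(response[0], response[-1])         # onset overshoot, then settled value
```

In this sketch the broken-stick stage compresses every sample immediately, whereas the feedback loop passes the onset of a step almost linearly and only gradually settles towards the square root of its input, so the second form of compression acts on a time scale set by `tau`.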

References

Breebaart, J., van de Par, S., and Kohlrausch, A. (2001a). “Binaural processing model based on contralateral inhibition. I. Model structure,” J. Acoust. Soc. Am. 110, 1074-1088.

Chi, T., Gao, Y., Guyton, M. C., Ru, P., and Shamma, S. (1999). “Spectro-temporal modulation transfer functions and speech intelligibility,” J. Acoust. Soc. Am. 106, 2719-2732.

Carlyon, R. P., and Shamma, S. (2003). “An account of monaural phase sensitivity,” J. Acoust. Soc. Am. 114, 333-348.

de Charms, R. C., Blake, D. T., and Merzenich, M. M. (1998). “Optimizing sound features for cortical neurons,” Science 280(5368), 1439-1443.

Dau, T., Kollmeier, B., and Kohlrausch, A. (1997). “Modeling auditory processing of amplitude modulation: I. Detection and masking with narrow band carrier,” J. Acoust. Soc. Am. 102, 2892-2905.

Dau, T., Püschel, D., and Kohlrausch, A. (1996). “A quantitative model of the effective signal processing in the auditory system. I. Model structure,” J. Acoust. Soc. Am. 99, 3615-3622.

Derleth, R. P., and Dau, T. (2000). “On the role of envelope fluctuation processing in spectral masking,” J. Acoust. Soc. Am. 108, 285-296.

Elhilali, M., Chi, T., and Shamma, S. (2003). “A spectro-temporal modulation index (STMI) for assessment of speech intelligibility,” Speech Commun. 41, 331-348.

Ewert, S. D., and Dau, T. (2000). “Characterizing frequency selectivity for envelope fluctuations,” J. Acoust. Soc. Am. 108, 1181-1196.

Ewert, S. D., and Dau, T. (2004). “External and internal limitations in amplitude-modulation processing,” J. Acoust. Soc. Am. 116, 478-490.

Ewert, S. D., Hau, O., and Dau, T. (2006). “Forward masking: temporal integration or adaptation?,” in Hearing – from basic research to applications, International Symposium on Hearing, edited by Birger Kollmeier et al., 165-174.

Hall, J. (1997). “Asymmetry of masking revisited: generalization of masker and probe bandwidth,” J. Acoust. Soc. Am. 101, 1023-1033.

Hansen, M., and Kollmeier, B. (1999). “Continuous assessment of time-varying speech quality,” J. Acoust. Soc. Am. 106, 2888-2899.

Jepsen, M. L., Ewert, S. D., and Dau, T. (2008). “A computational model of human auditory signal processing and perception,” J. Acoust. Soc. Am. (accepted).

Kohlrausch, A., Fassel, R., and Dau, T. (2000). “The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriers,” J. Acoust. Soc. Am. 108, 723-734.

Lopez-Poveda, E., and Meddis, R. (2001). “A human nonlinear cochlear filterbank,” J. Acoust. Soc. Am. 110, 3107-3118.

Meddis, R., O’Mard, L. P., and Lopez-Poveda, E. A. (2001). “A computational algorithm for computing nonlinear auditory frequency selectivity,” J. Acoust. Soc. Am. 109, 2852-2861.

Moore, B. C. J. (1995). Perceptual Consequences of Cochlear Damage. Oxford University Press, New York.

Moore, B. C. J., and Alcántara, J. I. (1998). “Masking patterns for sinusoidal and narrow-band noise maskers,” J. Acoust. Soc. Am. 104, 1023-1038.

Muller, M., Robertson, D., and Yates, G. K. (1991). “Rate-versus-level functions of primary auditory nerve fibres: evidence for square law behaviour of all fibre categories in the guinea pig,” Hearing Research 55, 50-56.

Nelson, D. A., and Swain, A. C. (1996). “Temporal resolution within the upper accessory excitation of a masker,” Acta Acustica 82, 328-334.

Oxenham, A. J., and Moore, B. C. J. (1994). “Modeling the additivity of nonsimultaneous masking,” Hearing Research 80, 105-118.

Oxenham, A. J., and Plack, C. J. (2000). “Effects of masker frequency and duration in forward masking: further evidence for the influence of peripheral nonlinearity,” Hearing Research 150, 258-266.

Oxenham, A. J. (2001). “Forward masking: Adaptation or integration?,” J. Acoust. Soc. Am. 109, 732-741.

Piechowiak, T., Ewert, S. D., and Dau, T. (2007). “Modeling comodulation masking release using an equalization-cancellation mechanism,” J. Acoust. Soc. Am. 121, 2111-2126.

Ru, P., and Shamma, S. A. (1997). “Representation of musical timbre in the auditory cortex,” J. of New Music Res. 26, 154-169.

Ruggero, M. A., Rich, N. C., Recio, A., Narayan, S. S., and Robles, L. (1997). “Basilar-membrane responses to tones at the base of the chinchilla cochlea,” J. Acoust. Soc. Am. 101, 2151-2163.

Schreiner, C. E., and Calhoun, B. (1995). “Spectral envelope coding in cat primary auditory cortex: Properties of ripple transfer functions,” J. Auditory Neuroscience 1, 39-61.

Thompson, E., and Dau, T. (2008). “Frequency selectivity in binaural processing of fluctuations in interaural level difference,” J. Acoust. Soc. Am. 123, 1017-1029.

Verhey, J. L., Dau, T., and Kollmeier, B. (1999). “Within-channel cues in comodulation masking release (CMR): Experiments and model predictions using a modulation-filterbank model,” J. Acoust. Soc. Am. 106, 2733-2745.

Published

2007-12-15

How to Cite

Dau, T., Jepsen, M. L., & Ewert, S. D. (2007). Spectral and temporal processing in the human auditory system. Proceedings of the International Symposium on Auditory and Audiological Research, 1, 21–32. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2007-03

Section

2007/1. Auditory signal processing and perception