Phoneme representation in primary auditory cortex

Authors

  • Shihab Shamma, Institute for Systems Research, Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA
  • Nima Mesgarani, Institute for Systems Research, Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA
  • Stephen David, Institute for Systems Research, Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA
  • Jonathan Fritz, Institute for Systems Research, Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA

Abstract

We examined the responses of neurons in primary auditory cortex (A1) to phonetically labeled speech stimuli. Sentences were taken from the TIMIT database and chosen to represent a diversity of male and female speakers. We presented these stimuli to awake ferrets while recording the activity of isolated A1 neurons. For analysis, we segmented the continuous speech samples into sequences of phonemes, which represent the smallest significant units of speech. We characterized the response properties of each neuron by its peristimulus time histogram (PSTH) response to each phoneme. Across a population of A1 neurons, we observed distinct patterns of phoneme selectivity that may provide a neural basis for low-level phoneme discrimination. We investigated how features of speech are encoded in A1 using a method for reconstructing the speech stimulus from the neural population responses. Stimuli were reconstructed using a linear spectro-temporal model that maps the population response to the stimulus spectrogram. We compared the accuracy of reconstruction across phonemes. One important factor in stimulus reconstruction is the presence of correlations in complex natural stimuli such as speech. Prior knowledge of regularities in the stimulus can benefit reconstruction in noise and when spectro-temporal coverage is limited. We studied the influence of prior knowledge of stimulus correlations, noise, and spectro-temporal coverage on reconstruction accuracy in neural data and in simulation.
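
As an illustration of the linear reconstruction idea described in the abstract, the following is a minimal sketch in Python with NumPy. It is not the authors' implementation: the abstract does not specify the estimator, so the ridge-regularized least-squares fit, the number of lags, and all function and variable names (lagged_design, fit_reconstruction, reconstruct, demo) are assumptions made for this example, and the data are synthetic rather than recorded from A1.

```python
import numpy as np


def lagged_design(responses, n_lags):
    """Build a design matrix of time-lagged population responses.

    responses: array of shape (n_time, n_neurons).
    Row t of the result holds the responses at times t, t+1, ..., t+n_lags-1,
    because a neural response follows the stimulus that evoked it.
    Entries past the end of the recording are left at zero.
    """
    n_time, n_neurons = responses.shape
    design = np.zeros((n_time, n_neurons * n_lags))
    for lag in range(n_lags):
        cols = slice(lag * n_neurons, (lag + 1) * n_neurons)
        design[: n_time - lag, cols] = responses[lag:]
    return design


def fit_reconstruction(responses, spectrogram, n_lags=10, ridge=1.0):
    """Fit a linear map from lagged responses to the stimulus spectrogram.

    Assumption: ridge-regularized least squares; `ridge` is a hypothetical
    regularization strength, not a value taken from the paper.
    Returns weights of shape (n_neurons * n_lags, n_freq).
    """
    X = lagged_design(responses, n_lags)
    gram = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(gram, X.T @ spectrogram)


def reconstruct(responses, weights, n_lags=10):
    """Apply fitted weights to responses to estimate the spectrogram."""
    return lagged_design(responses, n_lags) @ weights


def demo(seed=0):
    """Synthetic check: responses are a delayed, noisy linear mix of the stimulus."""
    rng = np.random.default_rng(seed)
    n_time, n_freq, n_neurons, delay = 2000, 8, 20, 2
    spectrogram = rng.standard_normal((n_time, n_freq))
    mixing = rng.standard_normal((n_freq, n_neurons))
    responses = np.zeros((n_time, n_neurons))
    responses[delay:] = spectrogram[:-delay] @ mixing
    responses += 0.1 * rng.standard_normal(responses.shape)

    # Fit on the first part of the data, evaluate on held-out samples.
    train, test = slice(0, 1500), slice(1500, n_time)
    weights = fit_reconstruction(responses[train], spectrogram[train])
    estimate = reconstruct(responses[test], weights)
    # Reconstruction accuracy as a correlation between estimate and stimulus.
    return np.corrcoef(estimate.ravel(), spectrogram[test].ravel())[0, 1]
```

Calling demo() returns the correlation between the held-out synthetic spectrogram and its reconstruction. Raising the noise level or reducing the number of simulated neurons in demo() is one simple way to explore, in simulation only, how noise and limited coverage degrade reconstruction accuracy.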

Published

2007-12-15

How to Cite

Shamma, S., Mesgarani, N., David, S., & Fritz, J. (2007). Phoneme representation in primary auditory cortex. Proceedings of the International Symposium on Auditory and Audiological Research, 1, 187–200. Retrieved from https://proceedings.isaar.eu/index.php/isaarproc/article/view/2007-18

Section

2007/2. Physiological correlates of auditory functions