From a public health perspective, understanding the processing of speech sounds is critical in effecting significant improvement in the lives of people with communication disorders. The neural code for speech over the range of sound and noise levels experienced daily is elusive due to strong nonlinearities of the inner ear and central auditory neurons. In collaboration with a phonetician, we are studying neural responses to acoustic parameters that are crucial to differentiating speech sounds. This proposal focuses on the neural coding of vowels in quiet and in noise. The rationale for focusing on vowels is their fundamental role in carrying information, especially in discourse, and their centrality in all know speech systems. We have developed a novel, testable hypothesis for the robust representation in the midbrain of two salient features of vowels: fundamental frequency (F0), or voice pitch, and formant frequencies, the spectral peaks that differentiate vowels. This hypothesis takes into account the facts that i) in addition to having a best frequency (BF), most midbrain neurons are tuned for periodicities in the range of voice pitch, and ii) the strength of the periodicities in te response of the periphery changes systematically depending upon the relation between BF and formant frequency. In particular, the rate fluctuations of auditory-nerve (AN) responses that are synchronized to the F0 of a vowel are weak for fibers tuned near formant frequencies and strong for fibers tuned between formants. This variation in the amplitude of low-frequency rate fluctuations across the AN is propagated to the midbrain, where neurons sensitive to modulation frequency have large rate changes depending on the relation between BF and vowel formant frequencies. The profile of rates across midbrain neurons encodes the formant frequencies of vowels and is robust across a wide range of sound levels and in the presence of noise. This code is appropriately vulnerable to changes in peripheral tuning, decreases in the strength of peripheral nonlinearities such as synchrony capture, and to changes in central inhibitory processing associated with aging. Our vowel-coding hypothesis will be tested by quantitatively relating behavioral thresholds for detection and discrimination of formants to physiological responses at the level of the midbrain. We will further develop our models for signal processing in the auditory midbrain to include a nonlinear feature of neural processing, mode-locking, that is observed in the midbrain. We hypothesize that mode-locking contributes to the representation of strongly periodic sounds, such as voiced speech, by boosting the response of neurons with band-pass modulation tuning to strongly modulated sounds. This work will lead to the development of improved signal-processing algorithms to assist the growing number of people who are afflicted with hearing loss. Because the representation proposed by our vowel-coding hypothesis is fundamentally different from classical models for neural representations of speech sounds, the signal-processing strategies to restore it will differ fundamentally from existing strategies.

Public Health Relevance

The public-health significance of the proposed work is that it will improve our understanding of how a fundamental speech sound, the vowel, is coded by neurons in the auditory system. Using behavioral and physiological techniques we will test a novel hypothesis for robust neural coding of vowels by the healthy auditory system in quiet and noisy conditions, and using computational models we will then test the impact of hearing loss and aging on this neural coding. This work will lead to novel strategies that will preserve and enhance the representation of speech sounds using assistive devices such as hearing aids and auditory prostheses.

National Institute of Health (NIH)
National Institute on Deafness and Other Communication Disorders (NIDCD)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-IFCN-B (03))
Program Officer
Miller, Roger
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Rochester
Biomedical Engineering
Schools of Engineering
United States
Zip Code
Carney, Laurel H; Zilany, Muhammad S A; Huang, Nicholas J et al. (2014) Suboptimal use of neural information in a mammalian auditory system. J Neurosci 34:1306-13
Carney, Laurel H; Sarkar, Srijata; Abrams, Kristina S et al. (2011) Sound-localization ability of the Mongolian gerbil (Meriones unguiculatus) in a task with a simplified response map. Hear Res 275:89-95
Wojtczak, Magdalena; Nelson, Paul C; Viemeister, Neal F et al. (2011) Forward masking in the amplitude-modulation domain for tone carriers: psychophysical results and physiological correlates. J Assoc Res Otolaryngol 12:361-73
Zilany, Muhammad S A; Carney, Laurel H (2010) Power-law dynamics in an auditory-nerve model can account for neural adaptation to sound-level statistics. J Neurosci 30:10380-90
Gai, Yan; Carney, Laurel H (2008) Statistical analyses of temporal information in auditory brainstem responses to tones in noise: correlation index and spike-distance metric. J Assoc Res Otolaryngol 9:373-87
Gai, Yan; Carney, Laurel H (2008) Influence of inhibitory inputs on rate and timing of responses in the anteroventral cochlear nucleus. J Neurophysiol 99:1077-95
Deshmukh, Om D; Espy-Wilson, Carol Y; Carney, Laurel H (2007) Speech enhancement using the modified phase-opponency model. J Acoust Soc Am 121:3886-98
Nelson, Paul C; Ewert, Stephan D; Carney, Laurel H et al. (2007) Comparison of level discrimination, increment detection, and comodulation masking release in the audio- and envelope-frequency domains. J Acoust Soc Am 121:2168-81
Calandruccio, Lauren; Doherty, Karen A; Carney, Laurel H et al. (2007) Perception of temporally processed speech by listeners with hearing impairment. Ear Hear 28:512-23
Nelson, Paul C; Carney, Laurel H (2007) Neural rate and timing cues for detection and discrimination of amplitude-modulated tones in the awake rabbit inferior colliculus. J Neurophysiol 97:522-39

Showing the most recent 10 out of 23 publications