From a public health perspective, understanding how speech sounds are processed is critical for improving the lives of people with communication disorders. The neural code for speech over the range of sound and noise levels experienced daily remains elusive because of strong nonlinearities in the inner ear and in central auditory neurons. In collaboration with a phonetician, we are studying neural responses to the acoustic parameters that are crucial for differentiating speech sounds. This proposal focuses on the neural coding of vowels in quiet and in noise. The rationale for focusing on vowels is their fundamental role in carrying information, especially in discourse, and their centrality in all known speech systems.

We have developed a novel, testable hypothesis for the robust representation in the midbrain of two salient features of vowels: fundamental frequency (F0), or voice pitch, and formant frequencies, the spectral peaks that differentiate vowels. This hypothesis takes into account two facts: i) in addition to having a best frequency (BF), most midbrain neurons are tuned for periodicities in the range of voice pitch, and ii) the strength of the periodicities in the response of the periphery changes systematically with the relation between BF and formant frequency. In particular, the rate fluctuations of auditory-nerve (AN) responses that are synchronized to the F0 of a vowel are weak for fibers tuned near formant frequencies and strong for fibers tuned between formants. This variation in the amplitude of low-frequency rate fluctuations across the AN is propagated to the midbrain, where neurons sensitive to modulation frequency exhibit large rate changes depending on the relation between BF and vowel formant frequencies. The profile of rates across midbrain neurons thus encodes the formant frequencies of vowels and is robust across a wide range of sound levels and in the presence of noise. This code is appropriately vulnerable to changes in peripheral tuning, to decreases in the strength of peripheral nonlinearities such as synchrony capture, and to changes in central inhibitory processing associated with aging.

Our vowel-coding hypothesis will be tested by quantitatively relating behavioral thresholds for detection and discrimination of formants to physiological responses at the level of the midbrain. We will further develop our models of signal processing in the auditory midbrain to include a nonlinear feature of neural processing, mode-locking, that is observed in the midbrain. We hypothesize that mode-locking contributes to the representation of strongly periodic sounds, such as voiced speech, by boosting the response of neurons with band-pass modulation tuning to strongly modulated sounds. This work will lead to improved signal-processing algorithms to assist the growing number of people with hearing loss. Because the representation proposed by our vowel-coding hypothesis is fundamentally different from classical models of the neural representation of speech sounds, the signal-processing strategies needed to restore it will differ fundamentally from existing strategies.
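To make the hypothesized mechanism concrete, the following is a minimal simulation sketch in Python (NumPy/SciPy). It is not the proposal's model: the linear filterbank, formant bandwidths, and all parameter values are illustrative assumptions. It synthesizes a vowel-like harmonic complex and shows that the F0-synchronized envelope fluctuation of a narrowband channel is weak when the channel's best frequency sits on a formant (one harmonic dominates, so the envelope is flat) and strong between formants (neighboring harmonics of similar amplitude beat at F0).

import numpy as np
from scipy.signal import butter, sosfiltfilt

fs = 20000                       # sample rate, Hz (illustrative)
t = np.arange(0, 0.5, 1 / fs)    # 500 ms of signal
f0 = 100.0                       # fundamental frequency (voice pitch), Hz
formants = [600.0, 1800.0]       # two illustrative formant frequencies, Hz

def formant_gain(f):
    """Weight a harmonic by simple resonance curves peaking at the formants."""
    bw = 90.0                    # illustrative formant bandwidth, Hz
    return sum(1.0 / (1.0 + ((f - fc) / bw) ** 2) for fc in formants)

# Vowel-like stimulus: a sum of F0 harmonics shaped by the formant envelope.
vowel = np.zeros_like(t)
for k in range(1, int(fs / 2 // f0)):
    vowel += formant_gain(k * f0) * np.sin(2 * np.pi * k * f0 * t)

def f0_fluctuation(bf):
    """Relative depth of the F0 component in one narrowband channel's envelope."""
    band = butter(4, [0.9 * bf, 1.1 * bf], btype="band", fs=fs, output="sos")
    chan = sosfiltfilt(band, vowel)          # crude stand-in for cochlear filtering
    env = np.maximum(chan, 0.0)              # half-wave rectification
    smooth = butter(4, 500.0, btype="low", fs=fs, output="sos")
    env = sosfiltfilt(smooth, env)           # smooth away the carrier, keep the envelope
    spec = np.abs(np.fft.rfft(env - env.mean())) / len(env)
    freqs = np.fft.rfftfreq(len(env), 1 / fs)
    near_f0 = (freqs >= f0 - 10) & (freqs <= f0 + 10)
    return spec[near_f0].max() / (env.mean() + 1e-12)

# On-formant BFs (600, 1800 Hz) yield weak F0 fluctuations; between-formant
# BFs (1200, 2400 Hz) yield strong ones. A midbrain neuron with band-pass
# modulation tuning near F0 would convert this contrast into a rate profile.
for bf in (600, 1200, 1800, 2400):
    print(f"BF = {bf:4d} Hz: relative F0 fluctuation = {f0_fluctuation(bf):.2f}")

In the real auditory nerve the on-/between-formant contrast is sharper than in this linear sketch, because cochlear nonlinearity lets the formant harmonic capture the fiber's response (the synchrony capture mentioned above).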
The public-health significance of the proposed work is that it will improve our understanding of how a fundamental speech sound, the vowel, is coded by neurons in the auditory system. Using behavioral and physiological techniques, we will test a novel hypothesis for robust neural coding of vowels by the healthy auditory system in quiet and in noise; using computational models, we will then assess the impact of hearing loss and aging on this coding. This work will lead to novel strategies for preserving and enhancing the representation of speech sounds by assistive devices such as hearing aids and auditory prostheses.