The overall goal of the proposed research is to investigate the internal spectral-temporal space within which speech perception takes place. A guiding premise of the proposed work is that many of the most important phenomena of speech perception can be explained in terms of general auditory processes. With this in mind, experiments are designed to exploit the mechanisms listeners use to maintain continuity of speech and other complex sounds under normal environmental conditions, which are frequently not optimal. Many studies have demonstrated that listeners are most likely to maintain continuity of a sound source that does not undergo extreme spectral transformation over too short a period of time, but little is known about what sorts of transformation are tolerable. Given that transformations that traverse relatively little space are more likely to be perceived as continuous, the proposed research is designed to map out a spectral-temporal space and to describe and model the principles governing perceptual continuity of speech. Along the way, a number of fundamental questions concerning the structure of phonetic inventories will be addressed. The Principal Investigator is co-author of a somewhat controversial theoretical position that holds that many of the most widespread phonetic regularities can be explained by efforts of language communities to arrange their phonetic inventories in a way that exploits auditory predispositions of listeners. This "auditory enhancement hypothesis" is put to a number of critical tests in attempts to explain near-universal tendencies to use certain types of vowels. It is argued that, for example, general tendencies of languages to have higher pitch for high vowels and more nasalization of low vowels can be explained in terms of making these vowels maximally distinctive auditorily.
By exploiting mechanisms that aid the listener in maintaining perceptual continuity, one can empirically assess the perceptual distance between vowel sounds that accord with these phonetic regularities and vowel sounds that do not. Finally, an extensive modeling effort is described that investigates whether an unsupervised neural network model can account for the behavior of human listeners in the proposed experiments.

Agency
National Institutes of Health (NIH)
Institute
National Institute on Deafness and Other Communication Disorders (NIDCD)
Type
First Independent Research Support & Transition (FIRST) Awards (R29)
Project #
5R29DC000719-04
Application #
3461701
Study Section
Hearing Research Study Section (HAR)
Project Start
1989-12-01
Project End
1994-11-30
Budget Start
1992-12-01
Budget End
1993-11-30
Support Year
4
Fiscal Year
1993
Total Cost
Indirect Cost
Name
University of Wisconsin Madison
Department
Type
Schools of Arts and Sciences
DUNS #
161202122
City
Madison
State
WI
Country
United States
Zip Code
53715
Kluender, K R; Lotto, A J; Holt, L L et al. (1998) Role of experience for language-specific functional mappings of vowel sounds. J Acoust Soc Am 104:3568-82
Lotto, A J; Kluender, K R (1998) General contrast effects in speech perception: effect of preceding liquid on stop consonant identification. Percept Psychophys 60:602-19
Lotto, A J; Holt, L L; Kluender, K R (1997) Effect of voice quality on perceived height of English vowels. Phonetica 54:76-93
Lotto, A J; Kluender, K R; Holt, L L (1997) Perceptual compensation for coarticulation by Japanese quail (Coturnix coturnix japonica). J Acoust Soc Am 102:1134-40
Lotto, A J; Kluender, K R; Green, K P (1996) Spectral discontinuities and the vowel length effect. Percept Psychophys 58:1005-14
Kluender, K R; Lotto, A J; Jenison, R L (1995) Perception of voicing for syllable-initial stops at different intensities: does synchrony capture signal voiceless stop consonants? J Acoust Soc Am 97:2552-67
Kluender, K R; Lotto, A J (1994) Effects of first formant onset frequency on [-voice] judgments result from auditory processes not specific to humans. J Acoust Soc Am 95:1044-52
Kluender, K R; Jenison, R L (1992) Effects of glide slope, noise intensity, and noise duration on the extrapolation of FM glides through noise. Percept Psychophys 51:231-8
Van Veen, B D; Jenison, R L (1991) Auditory space expansion via linear filtering. J Acoust Soc Am 90:231-40
Jenison, R L; Greenberg, S; Kluender, K R et al. (1991) A composite model of the auditory periphery for the processing of speech based on the filter response functions of single auditory-nerve fibers. J Acoust Soc Am 90:773-86