Spectral Temporal Factors in Perception of Fluent Speech

Kluender, Keith

Abstract

The overall goal of the proposed research is to investigate the internal spectral temporal space within which speech perception takes place. A guiding premise of the proposed work is the assumption that many of the most important phenomena of speech perception can be explained in terms of general auditory processes. With this in mind, experiments are designed to exploit mechanisms listeners use to maintain continuity of speech and other complex sounds under normal environmental conditions, which are frequently not optimal. Many studies have demonstrated that listeners are most likely to maintain continuity of a sound source that does not undergo extreme spectral transformation over too short a period of time, but little is known about what sorts of transformation are tolerable. Given that transformations that traverse relatively little space are more likely to be perceived as continuous, the proposed research is designed to map out a spectral temporal space and to describe and model the principles governing perceptual continuity of speech. Along the way, a number of fundamental questions will be addressed concerning the structure of phonetic inventories. The Principal Investigator is co-author of a somewhat controversial theoretical position that holds that many of the most widespread phonetic regularities can be explained by efforts of language communities to arrange their phonetic inventories in a way that exploits auditory predispositions of listeners. This "auditory enhancement hypothesis" is put to a number of critical tests in attempts to explain near-universal tendencies to use certain types of vowels. It is argued, for example, that general tendencies of languages to have higher pitch for high vowels and more nasalization of low vowels can be explained in terms of making these vowels maximally distinctive auditorily.
By exploiting mechanisms that aid the listener in maintaining perceptual continuity, one can empirically assess the perceptual distance between vowel sounds that are in accordance with these phonetic regularities and vowel sounds that are not. Finally, an extensive modeling effort is described that investigates whether an unsupervised neural network model can be developed that can account for the behavior of human listeners in the proposed experiments.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: First Independent Research Support & Transition (FIRST) Awards (R29)
Project #: 5R29DC000719-02
Application #: 3461699
Study Section: Hearing Research Study Section (HAR)

Project Start: 1989-12-01
Project End: 1994-11-30
Budget Start: 1990-12-01
Budget End: 1991-11-30
Support Year: 2
Fiscal Year: 1991

Institution

Name: University of Wisconsin Madison
Type: Schools of Arts and Sciences
DUNS #: 161202122

City: Madison
State: WI
Country: United States
Zip Code: 53715

Related projects


NIH 1994 R29 DC	Spectral-Temporal Factors in Perception of Fluent Speech Kluender, Keith R. / University of Wisconsin Madison
NIH 1993 R29 DC	Spectral Temporal Factors in Perception of Fluent Speech Kluender, Keith R. / University of Wisconsin Madison
NIH 1992 R29 DC	Spectral Temporal Factors in Perception of Fluent Speech Kluender, Keith R. / University of Wisconsin Madison
NIH 1991 R29 DC	Spectral Temporal Factors in Perception of Fluent Speech Kluender, Keith R. / University of Wisconsin Madison
NIH 1990 R29 DC	Spectral Temporal Factors in Perception of Fluent Speech Kluender, Keith R. / University of Wisconsin Madison

Publications

Kluender, K R; Lotto, A J; Holt, L L et al. (1998) Role of experience for language-specific functional mappings of vowel sounds. J Acoust Soc Am 104:3568-82

Lotto, A J; Kluender, K R (1998) General contrast effects in speech perception: effect of preceding liquid on stop consonant identification. Percept Psychophys 60:602-19

Lotto, A J; Holt, L L; Kluender, K R (1997) Effect of voice quality on perceived height of English vowels. Phonetica 54:76-93

Lotto, A J; Kluender, K R; Holt, L L (1997) Perceptual compensation for coarticulation by Japanese quail (Coturnix coturnix japonica). J Acoust Soc Am 102:1134-40

Lotto, A J; Kluender, K R; Green, K P (1996) Spectral discontinuities and the vowel length effect. Percept Psychophys 58:1005-14

Kluender, K R; Lotto, A J; Jenison, R L (1995) Perception of voicing for syllable-initial stops at different intensities: does synchrony capture signal voiceless stop consonants? J Acoust Soc Am 97:2552-67

Kluender, K R; Lotto, A J (1994) Effects of first formant onset frequency on [-voice] judgments result from auditory processes not specific to humans. J Acoust Soc Am 95:1044-52

Kluender, K R; Jenison, R L (1992) Effects of glide slope, noise intensity, and noise duration on the extrapolation of FM glides through noise. Percept Psychophys 51:231-8

Van Veen, B D; Jenison, R L (1991) Auditory space expansion via linear filtering. J Acoust Soc Am 90:231-40

Jenison, R L; Greenberg, S; Kluender, K R et al. (1991) A composite model of the auditory periphery for the processing of speech based on the filter response functions of single auditory-nerve fibers. J Acoust Soc Am 90:773-86
