An ultimate goal of speech research is to develop a common theoretical framework that will link together physiology, acoustics, and speech perception. Such a unified approach is essential in explaining how tissue movement finally results in the perception of speech sounds. The proposed research evaluates a major assumption underlying the linear source-filter theory of speech production: that the glottal source and vocal tract are independent, and thus can be effectively separated using linear techniques. Because this same assumption underlies conventional inverse filtering (which is based on linear source-filter theory), results will also serve to validate this technique for estimating the glottal volume velocity. Estimates of the voice source from inverse filtering will be compared to the volume velocity calculated in a more direct but invasive manner. Because the volume velocity cannot ethically be measured directly in living humans, direct estimates will be generated in excised human laryngeal and in vivo canine models, by using anemometry to measure the exit jet particle velocity at the glottis and then multiplying this result by glottal area measures derived from high-speed imaging. Studies will examine a great range of normal and pathological phonatory configurations, and comparisons to inverse filtered data will lead to modifications in techniques to improve filter validity when necessary. By comparing volume velocity estimates from laryngeal modeling to those indirectly derived from inverse filtering the acoustic signal, the validity of inverse filtering for estimating the volume velocity will be established. Such studies will also illuminate the relationship between glottal dynamics and vocal acoustics, and thus will ultimately contribute to an enhanced basic understanding of the biomechanics of voice production and the practical limits of linear source filter theory in general. The primary hypothesis of this research is that the assumptions of linear source-filter theory hold for specific, limited regions of phonation. The goal of this research is to determine the regions over which these assumptions do not hold, and to quantify the interactions between source and filter across a wide variety of vocal configurations.

Agency
National Institute of Health (NIH)
Institute
National Institute on Deafness and Other Communication Disorders (NIDCD)
Type
Research Project (R01)
Project #
5R01DC004688-03
Application #
6761020
Study Section
Special Emphasis Panel (ZRG1-BBBP-7 (01))
Program Officer
Shekim, Lana O
Project Start
2002-07-01
Project End
2007-06-30
Budget Start
2004-07-01
Budget End
2005-06-30
Support Year
3
Fiscal Year
2004
Total Cost
$339,905
Indirect Cost
Name
University of California Los Angeles
Department
Surgery
Type
Schools of Medicine
DUNS #
092530369
City
Los Angeles
State
CA
Country
United States
Zip Code
90095
Chhetri, Dinesh K; Berke, Gerald S; Lotfizadeh, Ali et al. (2009) Control of vocal fold cover stiffness by laryngeal muscles: a preliminary study. Laryngoscope 119:222-7
Zhang, Zhaoyan; Neubauer, Juergen; Berry, David A (2007) Physical mechanisms of phonation onset: a linear stability analysis of an aeroelastic continuum model of phonation. J Acoust Soc Am 122:2279-95
Berke, Gerald S; Neubauer, Juergen; Berry, David A et al. (2007) Ex vivo perfused larynx model of phonation: preliminary study. Ann Otol Rhinol Laryngol 116:866-70
Neubauer, Jurgen; Zhang, Zhaoyan; Miraghaie, Reza et al. (2007) Coherent structures of the near field flow in a self-oscillating physical model of the vocal folds. J Acoust Soc Am 121:1102-18
Zhang, Zhaoyan; Neubauer, Juergen; Berry, David A (2006) The influence of subglottal acoustics on laboratory models of phonation. J Acoust Soc Am 120:1558-69