Judgments of voice quality contribute greatly to patients' opinions of their voice, and also to a clinician's decision to initiate and continue treatment. Quality judgments are further used as a standard against which instrumental measures of voice are validated. Nevertheless, these """"""""subjective"""""""" measures of voice quality are not highly regarded as either clinical or research tools, because of problems with reliability and untested validity. We hypothesize that problems in voice quality measurement do not originate within the listener, but rather derive from the methods used to measure what listeners hear. The proposed research departs from these traditional rating scale methods by applying a speech synthesizer for pathological voice quality to study fundamental issues concerning reliability and validity of measures of voice quality. In this approach, listeners an asked to adjust synthesizer parameters to achieve a perceptual match to the original voice -- to """"""""rate"""""""" a voice by matching it. This method explicitly links the acoustic signal to perceived voice quality. The investigators hypothesize that this linkage across the speech chain increases the validity, reliability, and utility of both acoustic and perceptual representations, by providing listeners with an objective tool (a synthesizer) for quantifying what they hear. The proposed research focuses on expanding and refining our synthesizer for pathological voice quality, compares the validity and reliability of synthesis techniques to traditional rating scale techniques for evaluating voice quality, applies the synthesizer to examine the perceptual importance of a periodicity, noise and the shape of the voicing source pulse, and quantifies the different sources of error in voice quality measurement. The investigators hypothesize that the protocols described here will greatly increase both the reliability and validity of voice quality measurement, bringing us much closer to our goal to develop a standardized voice evaluation protocol.

Agency
National Institute of Health (NIH)
Institute
National Institute on Deafness and Other Communication Disorders (NIDCD)
Type
Research Project (R01)
Project #
2R01DC001797-09
Application #
6285740
Study Section
Special Emphasis Panel (ZRG1-BBBP-7 (01))
Program Officer
Shekim, Lana O
Project Start
1992-12-01
Project End
2005-11-30
Budget Start
2001-01-17
Budget End
2001-11-30
Support Year
9
Fiscal Year
2001
Total Cost
$439,527
Indirect Cost
Name
University of California Los Angeles
Department
Surgery
Type
Schools of Medicine
DUNS #
119132785
City
Los Angeles
State
CA
Country
United States
Zip Code
90095
Zhang, Zhaoyan (2018) Vocal instabilities in a three-dimensional body-cover phonation model. J Acoust Soc Am 144:1216
Park, Soo Jin; Yeung, Gary; Vesselinova, Neda et al. (2018) Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles. J Acoust Soc Am 144:375
Wu, Liang; Zhang, Zhaoyan (2017) A Computational Study of Vocal Fold Dehydration During Phonation. IEEE Trans Biomed Eng 64:2938-2948
Zhang, Zhaoyan (2017) Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am 142:2311
Gerratt, Bruce R; Kreiman, Jody; Garellek, Marc (2016) Comparing Measures of Voice Quality From Sustained Phonation and Continuous Speech. J Speech Lang Hear Res 59:994-1001
Signorello, Rosario; Zhang, Zhaoyan; Gerratt, Bruce et al. (2016) Impact of Vocal Tract Resonance on the Perception of Voice Quality Changes Caused by Varying Vocal Fold Stiffness. Acta Acust United Acust 102:209-213
Kreiman, Jody (2016) On Peer Review. J Speech Lang Hear Res 59:480-3
Garellek, Marc; Samlan, Robin; Gerratt, Bruce R et al. (2016) Modeling the voice source in terms of spectral slopes. J Acoust Soc Am 139:1404-10
Titze, Ingo R; Baken, Ronald J; Bozeman, Kenneth W et al. (2015) Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization. J Acoust Soc Am 137:3005-7
Kreiman, Jody; Garellek, Marc; Chen, Gang et al. (2015) Perceptual evaluation of voice source models. J Acoust Soc Am 138:1-10

Showing the most recent 10 out of 35 publications