Perception of pathological voice quality is centrally important in clinical voice evaluation, but adequately quantifying the sound of a person's voice remains problematic. Data from studies completed during the previous funding period indicate that many difficulties associated with current measures of voice quality derive from the way in which quality is defined and measured. We propose the development of a psychoacoustic model of overall voice quality as an alternative to traditional ratings and acoustic analysis protocols. This psychoacoustic model will specify a set of perceptually-important acoustic parameters that combine to replicate and thereby quantify the overall, integral quality of a voice. We will first determine the minimal set of acoustic parameters required to produce a synthetic copy of any voice, such that listeners judge that the synthetic copy matches the quality of the original voice. This set will constitute a preliminary psychoacoustic model of voice quality. We will then refine and validate this psychoacoustic model by synthesizing copies of natural voices using only these model parameters. To the extent that listeners judge that the natural and synthetic tokens match exactly, the psychoacoustic model will be considered valid. Mismatches will be analyzed to determine what parameters should be added to or subtracted from the model. We will assess the relationship between changes in acoustic values and changes in the extent to which a voice deviates from normal. This will provide an explanatory model specifying how acoustic parameters combine and interact perceptually to determine the location of any voice sample along a continuum from "better" to "worse." Finally, we will investigate the link between perceptually-important acoustic, spectral changes and the associated alterations in glottal configuration. Such knowledge could identify targets for remediation that have the highest likelihood of producing vocal improvement during treatment.

Public Health Relevance

Measurement of voice quality is a prime concern in management of patients with voice disorders, but problems of reliability and validity persist for existing procedures for gathering such measures. The proposed research will establish a valid, reliable, theoretically-motivated alternative to current systems for measuring voice quality. By using confirmatory methods to establish causal links between acoustic variables and perceived voice quality, the proposed studies will enhance our understanding of the relationship between a voice signal and the perceptual response it evokes, leading to a standardized, perceptually-validated, objective protocol for clinical use. Further, these measures will enable us to generate and test preliminary hypotheses regarding the changes in glottal configuration that cause perceptually-important changes in vocal acoustics, thus providing some of the first experimental evidence linking perception to production. By establishing links among physiology, acoustics, and perception, this research may significantly advance clinical practice.

National Institute of Health (NIH)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Shekim, Lana O
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California Los Angeles
Schools of Medicine
Los Angeles
United States
Zip Code
Garellek, Marc; Samlan, Robin; Gerratt, Bruce R et al. (2016) Modeling the voice source in terms of spectral slopes. J Acoust Soc Am 139:1404-10
Signorello, Rosario; Zhang, Zhaoyan; Gerratt, Bruce et al. (2016) Impact of Vocal Tract Resonance on the Perception of Voice Quality Changes Caused by Varying Vocal Fold Stiffness. Acta Acust United Acust 102:209-213
Kreiman, Jody (2016) On Peer Review. J Speech Lang Hear Res 59:480-3
Kreiman, Jody; Garellek, Marc; Chen, Gang et al. (2015) Perceptual evaluation of voice source models. J Acoust Soc Am 138:1-10
Titze, Ingo R; Baken, Ronald J; Bozeman, Kenneth W et al. (2015) Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization. J Acoust Soc Am 137:3005-7
Samlan, Robin A; Kreiman, Jody (2014) Perceptual consequences of changes in epilaryngeal area and shape. J Acoust Soc Am 136:2798-806
Chen, Gang; Kreiman, Jody; Alwan, Abeer (2014) The glottaltopogram: a method of analyzing high-speed images of the vocal folds. Comput Speech Lang 28:1156-1169
Kreiman, Jody; Gerratt, Bruce R; Garellek, Marc et al. (2014) Toward a unified theory of voice production and perception. Loquens 1:
Chen, Gang; Kreiman, Jody; Gerratt, Bruce R et al. (2013) Development of a glottal area index that integrates glottal gap size and open quotient. J Acoust Soc Am 133:1656-66
Zhang, Zhaoyan; Kreiman, Jody; Gerratt, Bruce R et al. (2013) Acoustic and perceptual effects of changes in body layer stiffness in symmetric and asymmetric vocal fold models. J Acoust Soc Am 133:453-62

Showing the most recent 10 out of 29 publications