Perception of pathological voice quality is centrally important in clinical voice evaluation, but adequately quantifying the sound of a person's voice remains problematic. Data from studies completed during the previous funding period indicate that many difficulties associated with current measures of voice quality derive from the way in which quality is defined and measured. We propose the development of a psychoacoustic model of overall voice quality as an alternative to traditional ratings and acoustic analysis protocols. This psychoacoustic model will specify a set of perceptually-important acoustic parameters that combine to replicate and thereby quantify the overall, integral quality of a voice. We will first determine the minimal set of acoustic parameters required to produce a synthetic copy of any voice, such that listeners judge that the synthetic copy matches the quality of the original voice. This set will constitute a preliminary psychoacoustic model of voice quality. We will then refine and validate this psychoacoustic model by synthesizing copies of natural voices using only these model parameters. To the extent that listeners judge that the natural and synthetic tokens match exactly, the psychoacoustic model will be considered valid. Mismatches will be analyzed to determine what parameters should be added to or subtracted from the model. We will assess the relationship between changes in acoustic values and changes in the extent to which a voice deviates from normal. This will provide an explanatory model specifying how acoustic parameters combine and interact perceptually to determine the location of any voice sample along a continuum from "better" to "worse." Finally, we will investigate the link between perceptually-important acoustic, spectral changes and the associated alterations in glottal configuration. Such knowledge could identify targets for remediation that have the highest likelihood of producing vocal improvement during treatment.

Public Health Relevance

Measurement of voice quality is a prime concern in management of patients with voice disorders, but problems of reliability and validity persist for existing procedures for gathering such measures. The proposed research will establish a valid, reliable, theoretically-motivated alternative to current systems for measuring voice quality. By using confirmatory methods to establish causal links between acoustic variables and perceived voice quality, the proposed studies will enhance our understanding of the relationship between a voice signal and the perceptual response it evokes, leading to a standardized, perceptually-validated, objective protocol for clinical use. Further, these measures will enable us to generate and test preliminary hypotheses regarding the changes in glottal configuration that cause perceptually-important changes in vocal acoustics, thus providing some of the first experimental evidence linking perception to production. By establishing links among physiology, acoustics, and perception, this research may significantly advance clinical practice.

National Institutes of Health (NIH)
National Institute on Deafness and Other Communication Disorders (NIDCD)
Research Project (R01)
Study Section: Special Emphasis Panel (ZRG1-BBBP-J (02))
Program Officer: Shekim, Lana O
University of California Los Angeles
Schools of Medicine
Los Angeles
United States
Samlan, Robin A; Kreiman, Jody (2014) Perceptual consequences of changes in epilaryngeal area and shape. J Acoust Soc Am 136:2798-806
Chen, Gang; Kreiman, Jody; Alwan, Abeer (2014) The glottaltopogram: a method of analyzing high-speed images of the vocal folds. Comput Speech Lang 28:1156-1169
Chen, Gang; Kreiman, Jody; Gerratt, Bruce R et al. (2013) Development of a glottal area index that integrates glottal gap size and open quotient. J Acoust Soc Am 133:1656-66
Zhang, Zhaoyan; Kreiman, Jody; Gerratt, Bruce R et al. (2013) Acoustic and perceptual effects of changes in body layer stiffness in symmetric and asymmetric vocal fold models. J Acoust Soc Am 133:453-62
Garellek, Marc; Keating, Patricia; Esposito, Christina M et al. (2013) Voice quality and tone identification in White Hmong. J Acoust Soc Am 133:1078-89
Sidtis, Diana; Kreiman, Jody (2012) In the beginning was the familiar voice: personally familiar voices in the evolutionary and contemporary biology of communication. Integr Psychol Behav Sci 46:146-59
Kreiman, Jody; Gerratt, Bruce R (2012) Perceptual interaction of the harmonic source and noise in voice. J Acoust Soc Am 131:492-500
Kreiman, Jody; Gerratt, Bruce R (2011) Comparing two methods for reducing variability in voice quality measurements. J Speech Lang Hear Res 54:803-12
Kreiman, Jody; Antonanzas-Barroso, Norma; Gerratt, Bruce R (2010) Integrated software for analysis and synthesis of voice quality. Behav Res Methods 42:1030-41
Kreiman, Jody; Gerratt, Bruce R; Khan, Sameer Ud Dowla (2010) Effects of native language on perception of voice quality. J Phon 38:588-593

Showing the most recent 10 out of 22 publications