Measurement of voice quality is important for making several decisions regarding the management of patients with voice disorders. It has been shown that subjective ratings of voice quality made in a routine clinical set-up are highly inconsistent, and may not be valid measures of voice quality. An alternative approach to measure voice quality is to develop acoustic measures that closely correspond with listeners' perception of voice quality. However, none of the measures developed thus far show a high and consistent correlation with listener's perception of the vocal quality under study. The poor and inconsistent correlation between acoustic measures and perceptual ratings of voice quality may arise from the non-linear and multidimensional relationship between the acoustic signal and its perceptual correlate. Non-linear relationships between a physical stimulus and its psychophysical correlate are seen in all human sensory systems. For example, in the auditory domain the relation between intensity and loudness is better explained by the power law and that between frequency and pitch is better mapped using the Mel or the Bark scale. Recent advances in psychoacoustics and auditory physiology has led to the development of several auditory models which are capable of accounting for such non-linear phenomena. Because these non-linear processes are inherent to all auditory perceptual tasks, these also affect how a specific vocal acoustic signal affects the perception of voice quality. This research will investigate the use of an auditory processing model as a signal processing front end to study voice quality. It is hypothesized that the output of such an auditory model will provide objective measures of voice quality that show a high and consistent correlation with perceptual ratings of voice quality. The proposed research seeks to (1) study the effects of specific changes in vocal acoustic signal on the perception of breathy voice quality, (2) use an auditory processing model to understand what changes in the auditory spectrum lead to the changes in perceived voice quality, and (3) develop objective measures to explain perceptual data in synthetic and natural vowels. ? ?

Agency
National Institute of Health (NIH)
Institute
National Institute on Deafness and Other Communication Disorders (NIDCD)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21DC006690-01
Application #
6767291
Study Section
Motor Function, Speech and Rehabilitation Study Section (MFSR)
Program Officer
Shekim, Lana O
Project Start
2004-05-01
Project End
2006-04-30
Budget Start
2004-05-01
Budget End
2005-04-30
Support Year
1
Fiscal Year
2004
Total Cost
$175,867
Indirect Cost
Name
University of Florida
Department
Other Health Professions
Type
Schools of Arts and Sciences
DUNS #
969663814
City
Gainesville
State
FL
Country
United States
Zip Code
32611
Shrivastav, Rahul; Camacho, Arturo; Patel, Sona et al. (2011) A model for the prediction of breathiness in vowels. J Acoust Soc Am 129:1605-15
Patel, Sona; Shrivastav, Rahul; Eddins, David A (2010) Perceptual distances of breathy voice quality: a comparison of psychophysical methods. J Voice 24:168-77
Shrivastav, Rahul; Camacho, Arturo (2010) A computational model to predict changes in breathiness resulting from variations in aspiration noise level. J Voice 24:395-405