For years the major focus in speech acoustics has been on the frequency range below approximately 5 kHz. Human speech and the human voice generate acoustical energy up to 20 kHz. Evidence is accruing that high-frequency energy (energy above 5 kHz) in speech and voice contributes to percepts of quality, localization, and intelligibility. The proposed research is intended to be an initial step in the long-range goal of characterizing high-frequency energy in speech, with particular regard for its perceptual role, its potential for modification during speech production, and its generation mechanism. In this study, a database of high-fidelity recordings of talkers will be used for broad acoustical analysis and general characterization of high-frequency energy, as well as specific characterization of phoneme category, speech intensity level, and mode of production by their high-frequency energy content. Directionality of radiation of high-frequency energy from the mouth will also be examined. The recordings will be used for perceptual experiments wherein listeners will be asked to discriminate between speech and voice samples that differ only in high-frequency energy content. Listeners will also be subjected to intelligibility-in- noise tasks with samples that have been modified only in high-frequency content. The combination of these experiments will reveal (1) the ability of human listeners to detect high-frequency energy modification, and (2) the phonetic value of high-frequency energy in speech.

Public Health Relevance

The relevance of this project to public health lies in its efforts to elucidate the effect on human communicative behavior when high-frequency energy in speech is lost or altered, which may be incurred by factors such as hearing loss, noisy environmental conditions, telephony, audio data compression (such as mp3 compression), electronic sound reinforcement, or sound recording and playback. Previous research has already shown that high-frequency energy affects speech intelligibility, word-learning in normal-hearing and hearing-impaired children, speech localization, and qualitative percepts of speech and voice (e.g. 'naturalness'). Thus, this project will provide particularly valuable insight regarding the need for representation of the high-frequency range in augmentative hearing devices, including hearing aids, cochlear implants, and auditory brainstem implants;the results of this project may also impact the evaluation and management of speech, voice, and language disorders, as well as the development of training techniques for the enhancement of speech and voice.

National Institute of Health (NIH)
National Institute on Deafness and Other Communication Disorders (NIDCD)
Predoctoral Individual National Research Service Award (F31)
Project #
Application #
Study Section
Communication Disorders Review Committee (CDRC)
Program Officer
Cyr, Janet
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Arizona
Schools of Arts and Sciences
United States
Zip Code
Bakst, Sarah; Johnson, Keith (2018) Modeling the effect of palate shape on the articulatory-acoustics mapping. J Acoust Soc Am 144:EL71
Vitela, A Davi; Monson, Brian B; Lotto, Andrew J (2015) Phoneme categorization relying solely on high-frequency energy. J Acoust Soc Am 137:EL65-70
Monson, Brian B; Lotto, Andrew J; Story, Brad H (2014) Gender and vocal production mode discrimination using the high frequencies for speech and singing. Front Psychol 5:1239
Monson, Brian B; Hunter, Eric J; Lotto, Andrew J et al. (2014) The perceptual significance of high-frequency energy in the human voice. Front Psychol 5:587
Monson, Brian B; Lotto, Andrew J; Story, Brad H (2014) Detection of high-frequency energy level changes in speech and singing. J Acoust Soc Am 135:400-6
Monson, Brian B; Hunter, Eric J; Story, Brad H (2012) Horizontal directivity of low- and high-frequency energy in speech and singing. J Acoust Soc Am 132:433-41
Monson, Brian B; Lotto, Andrew J; Story, Brad H (2012) Analysis of high-frequency energy in long-term average spectra of singing, speech, and voiceless fricatives. J Acoust Soc Am 132:1754-64