High-frequency energy in speech and voice

Monson, Brian

Abstract

For years the major focus in speech acoustics has been on the frequency range below approximately 5 kHz. Human speech and the human voice generate acoustical energy up to 20 kHz. Evidence is accruing that high-frequency energy (energy above 5 kHz) in speech and voice contributes to percepts of quality, localization, and intelligibility. The proposed research is intended to be an initial step in the long-range goal of characterizing high-frequency energy in speech, with particular regard for its perceptual role, its potential for modification during speech production, and its generation mechanism. In this study, a database of high-fidelity recordings of talkers will be used for broad acoustical analysis and general characterization of high-frequency energy, as well as specific characterization of phoneme category, speech intensity level, and mode of production by their high-frequency energy content. Directionality of radiation of high-frequency energy from the mouth will also be examined. The recordings will be used for perceptual experiments wherein listeners will be asked to discriminate between speech and voice samples that differ only in high-frequency energy content. Listeners will also be subjected to intelligibility-in- noise tasks with samples that have been modified only in high-frequency content. The combination of these experiments will reveal (1) the ability of human listeners to detect high-frequency energy modification, and (2) the phonetic value of high-frequency energy in speech.

Public Health Relevance

The relevance of this project to public health lies in its efforts to elucidate the effect on human communicative behavior when high-frequency energy in speech is lost or altered, which may be incurred by factors such as hearing loss, noisy environmental conditions, telephony, audio data compression (such as mp3 compression), electronic sound reinforcement, or sound recording and playback. Previous research has already shown that high-frequency energy affects speech intelligibility, word-learning in normal-hearing and hearing-impaired children, speech localization, and qualitative percepts of speech and voice (e.g. 'naturalness'). Thus, this project will provide particularly valuable insight regarding the need for representation of the high-frequency range in augmentative hearing devices, including hearing aids, cochlear implants, and auditory brainstem implants;the results of this project may also impact the evaluation and management of speech, voice, and language disorders, as well as the development of training techniques for the enhancement of speech and voice.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: Predoctoral Individual National Research Service Award (F31)
Project #: 5F31DC010533-02
Application #: 8016048
Study Section: Communication Disorders Review Committee (CDRC)
Program Officer: Cyr, Janet

Project Start: 2010-01-01
Project End: 2011-08-31
Budget Start: 2011-01-01
Budget End: 2011-08-31
Support Year: 2
Fiscal Year: 2011
Total Cost: $32,134
Indirect Cost

Institution

Name: University of Arizona
Department: Otolaryngology
Type: Schools of Arts and Sciences
DUNS #: 806345617

City: Tucson
State: AZ
Country: United States
Zip Code: 85721

Related projects


NIH 2011 F31 DC	High-frequency energy in speech and voice Monson, Brian Bruce / University of Arizona	$32,134
NIH 2010 F31 DC	High-frequency energy in speech and voice Monson, Brian Bruce / University of Arizona	$38,914

Publications

Bakst, Sarah; Johnson, Keith (2018) Modeling the effect of palate shape on the articulatory-acoustics mapping. J Acoust Soc Am 144:EL71

Vitela, A Davi; Monson, Brian B; Lotto, Andrew J (2015) Phoneme categorization relying solely on high-frequency energy. J Acoust Soc Am 137:EL65-70

Monson, Brian B; Lotto, Andrew J; Story, Brad H (2014) Gender and vocal production mode discrimination using the high frequencies for speech and singing. Front Psychol 5:1239

Monson, Brian B; Hunter, Eric J; Lotto, Andrew J et al. (2014) The perceptual significance of high-frequency energy in the human voice. Front Psychol 5:587

Monson, Brian B; Lotto, Andrew J; Story, Brad H (2014) Detection of high-frequency energy level changes in speech and singing. J Acoust Soc Am 135:400-6

Monson, Brian B; Hunter, Eric J; Story, Brad H (2012) Horizontal directivity of low- and high-frequency energy in speech and singing. J Acoust Soc Am 132:433-41

Monson, Brian B; Lotto, Andrew J; Story, Brad H (2012) Analysis of high-frequency energy in long-term average spectra of singing, speech, and voiceless fricatives. J Acoust Soc Am 132:1754-64

Comments

Be the first to comment on Brian Monson's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: