Expressive Speech Synthesis for Speech-Generating Devices

Hertz, Susan

Abstract

The expression of emotions through speech is an essential element of human communication, yet individuals with speech impairments who rely on speech-generating devices (SGDs) to talk are largely deprived of this communicative channel. No one should be forced to sound blas when in fact they?re elated or to sound apathetic when they intend to scold. While certain expressive speech modes are offered by a few currently available SGDs, the range is extremely limited, and the speech often falls short of conveying the desired emotion or intention. Further, the few speech modes provided are generally restricted to a single voice. The proposed project aims to remedy this unfortunate situation by enhancing the rule-based Synfonica text-to- speech system (S-TTS) to make it easy and cost-efficient to add a broad range of consistently recognizable expressive speech modes, all automatically available for all voices offered by the system. S-TTS currently produces speech for American English in a neutral reading mode. In the project, five new expressive conversational modes will be added?neutral, happiness, sadness, and two types of anger (known as hot anger and cold anger). Each mode will be available for four voices: an already existing adult male voice, an already existing adult female voice, and two new child voices (one male and one female) to be developed in the project. Users of S-TTS will be able to customize these voices to create new ones through special voice customization options to be added to the system outside of the project. While the system will be tested through standard types of perceptual tests, the expressive modes will also be evaluated in real-life situations. For this purpose, a simple augmentative and alternative communication (AAC) app that incorporates the S-TTS voices will be developed, and individuals who use SGDs will be recruited to use this app in their daily lives. These individuals will be asked to provide feedback on the effectiveness of the expressive modes for their interactions with others. In general, the project will be a major step toward enabling SGD users to speak in the manner they wish. They will at long last be empowered to express themselves through voices that can reflect their feelings and help them realize their intentions.

Public Health Relevance

Currently, individuals with speech impairments who rely on speech-generating devices for their verbal communication are deprived of the ability to realistically express their emotions through speech. The proposed project aims to address this problem by enhancing Synfonica?s text-to-speech system with expressive speech capabilities. With such an enhanced system, developers of speech-generating devices will be able to provide users of these devices with genuinely expressive voices.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: Small Business Innovation Research Grants (SBIR) - Phase II (R44)
Project #: 5R44DC014173-03
Application #: 9668128
Study Section: Special Emphasis Panel (ZRG1)
Program Officer: Shekim, Lana O

Project Start: 2015-04-01
Project End: 2021-02-28
Budget Start: 2019-03-01
Budget End: 2021-02-28
Support Year: 3
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: Synfonica, LLC
Department
Type
DUNS #: 144511263

City: Ithaca
State: NY
Country: United States
Zip Code: 14850

Related projects


NIH 2019 R44 DC	Expressive Speech Synthesis for Speech-Generating Devices Hertz, Susan R. / Synfonica, LLC
NIH 2018 R44 DC	Expressive Speech Synthesis for Speech-Generating Devices Hertz, Susan R. / Synfonica, LLC

Comments

Be the first to comment on Susan Hertz's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: