Articulatory recovery from speech acoustics

McGowan, Richard

Abstract

The project's goal is to build methods and algorithms for the recovery of articulatory movement from acoustic speech signals. In the immediate future this will be accomplished employing analysis-by-techniques for sonorant sounds of English, but with a view to extend this to obstruents and the speech of other languages. There are four main areas of research that will be conducted to reach these goals. The first area is to study the relationship between small changes in articulation and the resulting acoustics, which is feasible now that a large amount of simultaneously recorded articulatory movement and acoustic data are available. The second area of research is in the kinematics of articulatory movement, particularly that of the tongue. Flesh point data enables a data-driven approach to the modeling of tongue kinematics. A quantitative approach to the kinematics of line segments between flesh points, secant lines, is being pursued because it is a first approximation to a kinematics of tongue shape, which is more closely related to acoustics than the flesh points themselves. Because the method for articulatory recovery is analysis-by-synthesis, it is necessary to have an articulatory synthesizer for an internal speech production model. The third area of research is the construction of an articulatory synthesizer that includes knowledge gained from the second area of research. With progress already made in articulatory synthesis, it is possible to concentrate on the kinematic control of the synthesizer. This also will be done from a data-driven approach with piecewise polynomials fit to secant line kinematic trajectories derived from flesh point data. Finally, the fourth area of research is in the recovery algorithms themselves, which include methods for normalization between different vocal tracts and the articulatory synthesizer's vocal tract. Both veridical (actual space-time articulatory trajectories are reproduced) and non-veridical (categorical segmental properties are reproduced for perceptually accurate identification) articulatory will be tested. Veridical articulatory recovery from speech acoustics is useful for the laboratory and clinic when acoustic and partial articulatory information are available and the scientist or clinician wants to know more about articulation. Non-veridical articulatory recovery is important for models of speech and language learning.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: Research Project (R01)
Project #: 5R01DC001247-17
Application #: 7772343
Study Section: Motor Function, Speech and Rehabilitation Study Section (MFSR)
Program Officer: Shekim, Lana O

Project Start: 1991-09-01
Project End: 2012-02-28
Budget Start: 2010-03-01
Budget End: 2012-02-28
Support Year: 17
Fiscal Year: 2010
Total Cost: $194,579
Indirect Cost

Institution

Name: Cress, LLC
Department
Type
DUNS #: 020529983

City: Lexington
State: MA
Country: United States
Zip Code: 02420

Articulatory recovery from speech acoustics
McGowan, Richard S.
Cress, LLC, Lexington, MA, United States

Abstract

Funding Agency

Institution

Publications

Comments

Recent in Grantomics:

Recently viewed grants:

Recently added grants:

Abstract

Funding Agency

Institution

Related projects

Publications

Comments