The National Science Foundation will provide partial support for a conference, "From Sound to Sense: Fifty+ Years of Discoveries in Speech Communication," (http://rledev.mit.edu/soundtosense/default.cfm), held on June 11-13, 2004, at the Massachusetts Institute of Technology in Cambridge, MA. The conference has two goals. The first is to survey the current state of the field, by providing an historical overview of what has been accomplished during the last five decades of speech research, and by sampling some of the most interesting recent work in this active and exciting research topic. Twenty-one invited speakers will present talks in six different areas: Phonology and Phonetics; Speech Acoustics; Speech Perception; Planning and Production; Development, Pathologies and Remediation; and Speech Technology. The second goal is to promote cross-fertilization among the subdisciplines, to benefit future work in this field of study, which has its foundations in the physical, engineering, linguistic, cognitive and medical sciences. To this end the conference is organized in single sessions, rather than multiple parallel sessions, so that everyone will hear all presentations and participate in the discussions together. Many of the presentations will be archived as written papers at an MIT web site and distributed as a CD, and all of the sessions will be videotaped (if funds permit), offering a rich resource for both students and established investigators.

Many of the historical overviews will be presented by senior researchers such as Gunnar Fant, Kenneth Stevens, Peter Ladefoged, John Ohala, David Pisoni, Bjorn Lindblom, Victor Zue and Ray Kent, who have done pioneering work in their subdisciplines, including advances such as -the source-filter theory of speech acoustics, which models the way that changes in the size and shape of the vocal tract produce the sounds of a spoken utterance, -the quantal theory of acoustic-articulatory mapping, which identifies the acoustically stable articulations that provide some of the bases for sound categories in languages of the world, -models of human speech perception, production, development and pathologies, which relate models of the acoustics and articulation of speech to human language processing, and -algorithms for the automatic recognition and synthesis of speech, which not only provide aids for the handicapped, but also serve as tools for testing our models of human speech processing. The invited speakers who were responsible for these advances bring the valuable perspective of decades of thinking about and investigating speech to their historical overviews. The meeting thus provides an opportunity for students to hear first hand the insights of researchers who up to now may have been only names on classic papers, and to have personal contact with them.

Other speakers represent leading-edge current work, addressed to a broad range of questions. These include how individual languages make different use the special articulatory-acoustic mapping properties of the vocal tract, how the brain operates to process speech both as input and as output, and how automatic recognition and synthesis can take advantage of our increasing understanding of how speech is processed by humans, to create better tools for human-computer and human-human interactions by spoken language. A final summarizing address, followed by a discussion forum, will synthesize the ideas presented at the conference, with a focus on how future speech research can benefit from interaction among the represented sub-specialties.

Project Start
Project End
Budget Start
2004-08-01
Budget End
2008-06-30
Support Year
Fiscal Year
2004
Total Cost
$22,550
Indirect Cost
Name
Massachusetts Institute of Technology
Department
Type
DUNS #
City
Cambridge
State
MA
Country
United States
Zip Code
02139