The project proposes to advance the state of the art in automatic speech recognition by detecting emotional and highly accented speech and differences based on age and gender, and then optimizing the acoustic model for those conditions. It will apply long-term adaptation techniques to improve robustness, and will implement an innovative second-pass decoding technique to improve accuracy by using side information such as thesaurus terms and human-prepared summaries.

The techniques to be developed will dramatically improve the efficiency of professional catalogers, leveraging automatic segmentation to suggest topic boundaries in interviews, using domain-tuned classification algorithms to recommend thesaurus terms, and providing automated tools to support generation of event timelines. Volunteers will help assign metadata and provide transcripts. Efforts will be made to automate transferring capabilities developed originally for English to other languages. Access to multilingual materials will be done by combining knowledge-based and corpus-based techniques to extend existing thesauri to new languages and by supporting cross-language searching of manually prepared segment-level summaries and automatic speech recognition transcripts.

Each component will be evaluated and user studies done to measure the overall impact on support for cataloging, search and exploration. This will produce significant impact, both through improved access to our cultural heritage and through the application of the techniques to other important problems.

The collection of spoken material used in this project will be the 116,000 hours held by the Survivors of the Shoah Visual History Foundation, a set of already digitized video recordings of great historical importance.

Agency
National Science Foundation (NSF)
Institute
Division of Information and Intelligent Systems (IIS)
Application #
0122466
Program Officer
Ephraim P. Glinert
Project Start
Project End
Budget Start
2001-10-01
Budget End
2006-03-31
Support Year
Fiscal Year
2001
Total Cost
$7,674,664
Indirect Cost
Name
Survivors of the Shoah Visual History Foundation
Department
Type
DUNS #
City
Universal City
State
CA
Country
United States
Zip Code
91608