Proposal: CNS 0454130 PI: Nancy M. Ide Institution: Vassar College

This planning grant will support the PI in preparing a community resource proposal for further development of the American National Corpus (ANC) which is currently an 11 million word corpus of contemporary American English. The ultimate goal is to create a 100 million-word collection that will be an enabling data resource for research in linguistics, computational linguistics, natural language processing, information retrieval, machine translation and other areas. The PI will address the challenging problem of planning a resource that addresses the needs of these communities with different requirements for their research. The PI will convene an advisory board, organize a workshop to further define needs, and receive research community comments. Proof of concept developments will be conducted to demonstrate the use of annotation tools with different parsers and semantic taggers, and to augment existing analysis tools, and enhance the ANC web interface.

Agency
National Science Foundation (NSF)
Institute
Division of Computer and Network Systems (CNS)
Type
Standard Grant (Standard)
Application #
0454130
Program Officer
Stephen Mahaney
Project Start
Project End
Budget Start
2005-08-15
Budget End
2006-07-31
Support Year
Fiscal Year
2004
Total Cost
$49,941
Indirect Cost
Name
Vassar College
Department
Type
DUNS #
City
Poughkeepsie
State
NY
Country
United States
Zip Code
12604