This project, which acquires analytic platforms for research in combinatorial and graph sciences, addresses a critical problem: extracting information from massive data sets. The requested instrumentation, a large production system (Netezza Performance Server(s)), is to be housed at Howard University and Sandia National Laboratories and will serve scientific projects on DNA binding mechanisms, protein structure prediction, data mining for computational science at Sandia, multi-dimensional visualization at SUNY-Buffalo, and natural language processing at NMSU. The work responds to the data deluge that presently overwhelms the scientific community. The data-intensive computing resource, the National Terabyte Data Analysis Centers (NTDAC), provides the ability to analyze tera-scale data sets 10 to 100 times faster than current alternatives and at a lower cost. Over time, NTDAC is expected to house and analyze petabytes of raw data, to provide back-up and support capabilities, and to use parallel relational data warehouse appliance technology that is orders of magnitude faster and easier to use. The initial focus, the exploitation of massively parallel relational database appliance technology, enables solutions to problems in computational biology, chemistry, physics, text mining, bibliographic coupling, and natural language processing.

Broader Impact: The infrastructure benefits all areas of science through faster data analysis. The interrogation of massive data sets allows practitioners to deliver, and students to gain, new knowledge at a more rapid pace, and to create needed solutions. Moreover, the host institution is an HBCU, where minority students are likely to gain from the experience.

Agency: National Science Foundation (NSF)
Institute: Division of Computer and Network Systems (CNS)
Type: Standard Grant (Standard)
Application #: 0723060
Program Officer: Rita V. Rodriguez
Project Start:
Project End:
Budget Start: 2007-08-01
Budget End: 2009-07-31
Support Year:
Fiscal Year: 2007
Total Cost: $150,000
Indirect Cost:
Name: Howard University
Department:
Type:
DUNS #:
City: Washington
State: DC
Country: United States
Zip Code: 20059