This proposal aims to improve DNA sequencing software by combining basecalling, sequence assembly, and post-assembly analysis into a single integrated system. To test the performance of different software configurations, known regions of E. coli will be resequenced from the original clones using a LI-COR sequencing instrument. The data will be basecalled by neural-network-based pattern recognition and assembled with a variety of multiple alignment methods. Algorithmic solutions will be compared and evaluated against the goal of achieving an accurate final sequence with the minimum of editing by a human expert. Reducing the need for sequence editing offers the opportunity for significant cost savings in genome projects and other research involving DNA sequencing.

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #: 1R43GM051680-01
Application #: 2190370
Study Section: Special Emphasis Panel (ZRG7-SSS-2 (02))
Project Start: 1994-08-01
Project End: 1996-01-31
Budget Start: 1994-08-01
Budget End: 1996-01-31
Support Year: 1
Fiscal Year: 1994
Total Cost:
Indirect Cost:
Name: Dnastar, Inc.
Department:
Type:
DUNS #: 130194947
City: Madison
State: WI
Country: United States
Zip Code: 53705
Allex, C F; Baldwin, S F; Shavlik, J W et al. (1996) Improving the quality of automatic DNA sequence assembly using fluorescent trace-data classifications. Proc Int Conf Intell Syst Mol Biol 4:3-14