The Human Genome Project is moving into its final phase, in which the genome sequence will be determined in large-scale efforts in a number of laboratories. Current technology appears largely adequate to the task, but it will be essential to reduce as much as possible the need for skilled human labor, which remains a bottleneck to increased throughput, a potential source of uneven sequence quality, and an obstacle to more widespread participation by the community. A major aspect of sequencing that currently requires skilled labor is human review and manipulation of data, particularly editing (correction of errors in assembly and base calls), assessment of data quality, and decisions regarding data collection. The investigators' goal is to reduce, and eventually eliminate, all such human involvement in data processing while maintaining a high level of accuracy in the final sequence. They will do this by improving the accuracy of assembly and base-calling, and by developing objective criteria to estimate this accuracy, so as to delineate more precisely those regions of the sequence that may still require human review. In particular, they will develop base-specific error probabilities as a criterion to guide data collection and to measure the quality of the final sequence. These advances will be implemented in the base-calling and assembly programs phred and phrap, which are freely distributed to academic researchers and are already in use in a number of sequencing laboratories.
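
To illustrate how base-specific error probabilities could guide review and summarize sequence quality, the following is a minimal sketch and not the phred or phrap implementation. It assumes the log-scaled quality convention commonly associated with phred, Q = -10 log10(p), where p is the estimated probability that a base call is wrong. The function names and the review threshold are hypothetical and chosen only for this example.

```python
import math


def error_prob_to_quality(p):
    """Convert an error probability p (0 < p <= 1) to a log-scaled quality value.

    Assumes the convention Q = -10 * log10(p).
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("error probability must be in (0, 1]")
    return -10.0 * math.log10(p)


def quality_to_error_prob(q):
    """Invert the convention above: p = 10 ** (-Q / 10)."""
    return 10.0 ** (-q / 10.0)


def expected_errors(qualities):
    """Expected number of wrong base calls: the sum of per-base error probabilities."""
    return sum(quality_to_error_prob(q) for q in qualities)


def flag_for_review(qualities, max_error_prob=1e-4):
    """Return positions whose error probability exceeds a (hypothetical) threshold.

    These are the positions that might still need human review or more data.
    """
    return [
        i for i, q in enumerate(qualities)
        if quality_to_error_prob(q) > max_error_prob
    ]


if __name__ == "__main__":
    # Illustrative quality values only, not real sequencing data.
    quals = [50, 30, 45, 10]
    print("positions to review:", flag_for_review(quals))
    print("expected errors:", expected_errors(quals))
```

Under these assumptions, an objective threshold on the per-base error probability replaces a human judgment about which regions need attention, and the summed probabilities give an estimate of the number of errors in the finished sequence.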

Agency
National Institutes of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
2R01HG000774-05
Application #
2026820
Study Section
Genome Study Section (GNM)
Project Start
1992-09-29
Project End
2000-04-30
Budget Start
1997-05-14
Budget End
1998-04-30
Support Year
5
Fiscal Year
1997
Total Cost
Indirect Cost
Name
University of Washington
Department
Biochemistry
Type
Schools of Medicine
DUNS #
135646524
City
Seattle
State
WA
Country
United States
Zip Code
98195