The proposed Genome Sequencing Center will expand the sequencing capacity of the ongoing multiplex sequencing project at CRI. An additional focus will be the development of new technology and software in order to significantly increase throughput. Forty individuals will be involved in two projects and two cores. A summary of their goals follows. Project 1: Automated Multiplex Sequencing. The overall goal is to sequence 2.4, 4, 6.4, 11.3 and 20 Mb contiguous regions in the five consecutive years of the project. The increased throughput will be accomplished by automating both laboratory and computer procedures. The M. leprae and M. tuberculosis genomes will be completed first. The sequencing target will then shift to human chromosome 10q24-q26 and its mouse syntenic regions. Overall, 24 individuals will participate in this project. The projected cost in Year 5 is approximately 25 cents per base. Project 2: Infrared Detection and Automation. This project will focus on automated high throughput hybridization and infrared fluorescence detection. The use of infrared fluorescence will significantly increase throughput, simplify quality control, and reduce costs by eliminating the use of radioactive chemicals and films. By integrating automatic hybridization and detection, all manual processes in the hybridization/detection component of multiplex sequencing will be eliminated. Over the course of five years, six machines for automated hybridization and detection will be built, enabling raw data generation at a rate of more than 500 Mb per year. Overall, 7 individuals will participate in this project. Informatics Core. The overall, 5-year objective is to develop and implement software that is capable of sustaining a processing rate of over 1.5 Mb of raw sequence data and approximately 100 kb of finished sequence per day. The four areas to be addressed are: improving base calling routines, improving sequence assembly, automating contig editing, and developing tools to facilitate sequence analysis. Additional system administration and computing tasks include network and hardware upgrades, data backups, development of project management tools, and programming support for robotics automation. Overall, this core will involve 8 individuals.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Program Projects (P01)
Project #
5P01HG001106-02
Application #
2209405
Study Section
Genome Research Review Committee (GRRC)
Project Start
1994-08-22
Project End
1997-06-30
Budget Start
1995-07-01
Budget End
1996-06-30
Support Year
2
Fiscal Year
1995
Total Cost
Indirect Cost
Name
Oscient Pharmaceuticals Corporation
Department
Type
DUNS #
City
Waltham
State
MA
Country
United States
Zip Code
02453
Engelstein, M; Aldredge, T J; Madan, D et al. (1998) An efficient, automatable template preparation for high throughput sequencing. Microb Comp Genomics 3:237-41
Smith, D R; Richterich, P; Rubenfield, M et al. (1997) Multiplex sequencing of 1.5 Mb of the Mycobacterium leprae genome. Genome Res 7:802-19
Smith, D R (1996) Microbial pathogen genomes--new strategies for identifying therapeutics and vaccine targets. Trends Biotechnol 14:290-3