Identifying and understanding roles of single nucleotide polymorphisms (SNPs) will lead to accurate diagnosis of inherited disease states, determination of risk factors, and characterization of patients' metabolic profiles. Such technology promises to lead to prophylactic treatments to delay the onset or progression of disease, and prescriptions of the safest and most efficacious medications. Current DNA sequencing technology, however, is too slow and expensive for these tasks. Here, we propose to develop an ultrafast DNA sequencing system featuring sequencing-by-synthesis (SBS) on high-density oligonucleotide arrays, each with approximately one million primer features. The collaborative team involved in this project was responsible for some of the earliest published work on SBS, and recognize the fundamental challenge that any method based on this approach must address before tangible progress to a practical system can be made. That is, to identify combinations of appropriately modified nucleoside triphosphates that will be accepted, efficiently and with high fidelity, by suitably mutated DNA replicating enzymes. Consequently, this proposal features a strong synthetic chemistry component featuring two laboratories focused on the preparing nucleoside triphosphates with fluorescent, labile 3'-protecting groups. It also describes molecular biology to produce relatively large libraries of mutated polymerases. Even though the numbers of modified enzymes generated is high, the mutations will focus on key structural regions to maximize the chances of finding suitable systems. This molecular biology component is coupled with a combinatorial screen to rapidly identify suitable enzyme/modified dNTP pairs. Once suitable combinations are identified, then the SBS methodology will be implemented using high-density arrays that, uniquely, orientate oligonucleotides in the desired 5'-> 3' direction. This core technology fits into a broader, comprehensive research plan encompassing microfluidics for sample manipulation and delivery of the DNA to the SBS system, fluorescent imaging via our proprietary Pulse-Multiline Excitation (PME) system, computational methods for identifying an optimal tiling path and thermodynamic properties of oligonucleotides across whole chromosomes, and informatics to process and store the data generated. The overall goal is, by the end of year three, to complete sequencing of chromosomes 3, 12 & X, which cover approximately 0.5 gigabases and would lay the foundation for whole genome sequencing.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003573-02
Application #
6953265
Study Section
Special Emphasis Panel (ZHG1-HGR-N (O1))
Program Officer
Schloss, Jeffery
Project Start
2004-09-29
Project End
2007-08-31
Budget Start
2005-07-01
Budget End
2006-03-31
Support Year
2
Fiscal Year
2005
Total Cost
$549,596
Indirect Cost
Name
Baylor College of Medicine
Department
Genetics
Type
Schools of Medicine
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Stupi, Brian P; Li, Hong; Wang, Jinchun et al. (2012) Stereochemistry of benzylic carbon substitution coupled with ring modification of 2-nitrobenzyl groups as key determinants for fast-cleaving reversible terminators. Angew Chem Int Ed Engl 51:1724-7
Litosh, Vladislav A; Wu, Weidong; Stupi, Brian P et al. (2011) Improved nucleotide selectivity and termination of 3'-OH unblocked reversible terminators by molecular tuning of 2-nitrobenzyl alkylated HOMedU triphosphates. Nucleic Acids Res 39:e39
Metzker, Michael L (2010) Sequencing technologies - the next generation. Nat Rev Genet 11:31-46
Wu, Weidong; Stupi, Brian P; Litosh, Vladislav A et al. (2007) Termination of DNA synthesis by N6-alkylated, not 3'-O-alkylated, photocleavable 2'-deoxyadenosine triphosphates. Nucleic Acids Res 35:6339-49
Lu, Genliang; Burgess, Kevin (2006) A diversity oriented synthesis of 3'-O-modified nucleoside triphosphates for DNA 'sequencing by synthesis'. Bioorg Med Chem Lett 16:3902-5
Metzker, Michael L (2005) Emerging technologies in DNA sequencing. Genome Res 15:1767-76