Single-Molecule Electronic Nucleic Acid Sequencing-by-Synthesis Using Tagged Nucleotides and Nanopore Constructs With past NIH funding, we developed a single molecule nanopore-based sequencing by synthesis (SBS) strategy (Nanopore SBS) that accurately distinguishes the four DNA bases by electronically detecting 4 different polymer tags attached to the 5?-phosphate-modified nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primed template and polymerase, the polymer tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. While the Nanopore-SBS approach already produces good quality sequences, further optimization and development are needed to increase sequencing accuracy, while maintaining the ability of our nanopore-based single-molecule electronic system to produce long reads in real time. In this proposal, we will design and synthesize novel tagged nucleotides and construct nanopore-polymerase conjugates to control the sequencing reaction speed and increase single-molecule sequencing accuracy substantially, achieving desired polymerase catalytic rates and more efficient and consistent tag capture by the pores. We will use high ratios of unincorporable-to-incorporable tagged nucleotides to perform Nanopore-SBS. This will provide ample time to register currents due to the 4 unique tags on the unincorporable A, C, G and T nucleotides which display template-dependent binding to the polymerase ternary complex but are not incorporated into the growing DNA strand, followed by a new current level due to a 5th tag on the incorporable nucleotide which marks the transition to the extension step. This effectively eliminates insertion and deletion sequence artifacts, increases accuracy, and will be especially advantageous in DNA homopolymer repeat regions. This approach allows detection of a single nucleotide binding event multiple times (stutters) before the actual incorporation event, overcoming the inherent limitation of single molecule detection methods that only allow one chance for measurement. After optimizing the system with synthetic DNA templates, circular DNA libraries will be generated from viral and bacterial genomes to test this sequencing approach. With the improved tagged nucleotides, better regulated reaction kinetics, and newly designed polymerase-pore complexes, we will test the accuracy of our system on the nanopore arrays by sequencing these libraries at high coverage and comparing the results with other sequencing systems.

Public Health Relevance

High throughput next-generation DNA sequencing has penetrated all areas of biology and medicine, promising to transform preventive medicine and public health, but substantial improvements in accuracy, throughput, and costs of instrumentation and analysis are needed to make the technology truly accessible. To address these issues, we have developed a new Nanopore-based Sequencing-by-Synthesis platform, which allows real-time single-molecule electronic DNA sequencing. In this proposal, we will design and synthesize novel tagged nucleotides and construct nanopore- polymerase conjugates to control the speed of the sequencing reaction and increase the single-molecule sequencing accuracy substantially.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
1R01HG010514-01A1
Application #
9965302
Study Section
Special Emphasis Panel (ZHG1)
Program Officer
Smith, Michael
Project Start
2020-05-22
Project End
2023-03-31
Budget Start
2020-05-22
Budget End
2021-03-31
Support Year
1
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Columbia University (N.Y.)
Department
Engineering (All Types)
Type
Biomed Engr/Col Engr/Engr Sta
DUNS #
049179401
City
New York
State
NY
Country
United States
Zip Code
10027