Long-read sequencing has the potential to greatly simplify sequencing, helping to accelerate scientists? ability to perform de novo sequencing, haplotype phasing and transcriptomics. This project aims to develop a method to label DNA prior to next-generation sequencing, that maintains information about the proximity of fragments in the original strand, aiding in the downstream assembly of the sequencing data. To label the DNA, transposase will be loaded with specially-designed transposons containing barcode labels, and used to fragment the DNA, prior to next-generation sequencing.
The first aim of this project is to construct the transposomes used to label the DNA.
The second aim i s to use these transposomes to tagment and sequence a model DNA system to demonstrate read lengths of ~50 kb using next-generation sequencing on less than a picogram of DNA. If successful, the sequencing approach developed in this grant will simplify synthetic long read sequencing, making high-accuracy and inexpensive long read sequencing more accessible to the genetics community.

Public Health Relevance

Because the genetic instructions contained in DNA play a crucial role in the healthy functioning of the body, it is often important for researchers and clinicians to read this information using DNA sequencing devices. To do this, the DNA is usually broken into small ?words? that the sequencer can read, and then computer algorithms are used to guess the original order that those words appeared in the DNA, before it was broken apart. This project aims to develop a new way to molecularly label DNA fragments, to make it easier to determine the original DNA sequence.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43HG011217-01A1
Application #
10081920
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Smith, Michael
Project Start
2020-09-22
Project End
2021-08-31
Budget Start
2020-09-22
Budget End
2021-08-31
Support Year
1
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Esper Biosciences, Inc.
Department
Type
DUNS #
080091037
City
Ithaca
State
NY
Country
United States
Zip Code
14850