This project develops a platform-independent target enrichment technology for the preparation of large (50- 150 kb+) DNA fragments. These are magnetically selected from genomic regions of interest and used directly - without any intermediate amplification step - for next-generation sequencing (NGS). This will allow the direct detection of methylated base information on several of the newer NGS systems. Large DNA templates can currently be processed by several NGS platforms, and even more powerful systems are being developed. The ability to utilize large, contiguous DNA segments as sequencing templates greatly assists in detecting and characterizing structural variants, obtaining accurate sequence information across complex and GC-rich genomic loci, conducting de novo sequence assembly, phasing long-range haplotype information, and finding new or unexpected sequence elements, such as viral integration sites, inversions, duplications and other types of chromosomal rearrangements. Existing methods for genomic enrichment are not able to provide a comprehensive characterization of complex genomic loci in an economical and flexible manner because they cannot provide the large size fragments that are required to successfully span confounding sequence elements and resolve persistent challenges at important loci. We will develop an efficient long-read capture technology for small amounts of input DNA that resolves common sequencing and assembly problems that occur with most current amplification- or hybridization- based enrichment technologies. The benefits of this large-fragment enrichment as carried out with capture primers with a very small footprint (20-25 bases) will extend to all types of NGS platforms, including to those that rely on short read lengths. We believe that we can significantly improve the ability of researchers and clinicians to obtain complete and accurate characterizations of complex genomic variations and regions of interest that are important for disease diagnosis and prognosis by NGS. Anonymous, pre-existing samples for this project are provided by the NCI.

Public Health Relevance

Over the past decade, next generation sequencing (NGS) technology has transformed the field of genetics, enabling a better understanding of the genetic causes of disease. The application of this knowledge to companion diagnostics and companion therapeutics is now quickly transforming research and clinical practice. In addition, so-called targeted DNA sequencing of selected genomic loci is more cost effective than whole genome sequencing (WGS), facilitates higher sample throughput and optimized coverage, eliminates ambiguities and improves accuracy by greatly reducing the complexity of the DNA to be sequenced. Current methods for genomic targeting however remain a severely limiting factor for resolving complex genomic loci because they do not provide the large size fragments required to span confounding sequences. We will fill this need with a large-fragment enrichment method, RSE, which will permit direct long-read next- generation sequencing after enrichment, including the ability to detect methylation on several NGS systems. This combination will significantly improve the ability of researchers and clinicians to obtain complete and accurate characterizations of complex genomic variations and regions in disease.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43HG009580-01
Application #
9316222
Study Section
Special Emphasis Panel (ZHG1-HGR-N (J2))
Program Officer
Smith, Michael
Project Start
2017-04-12
Project End
2018-03-31
Budget Start
2017-04-12
Budget End
2018-03-31
Support Year
1
Fiscal Year
2017
Total Cost
$276,252
Indirect Cost
Name
Generation Biotech, LLC
Department
Type
Domestic for-Profits
DUNS #
034158787
City
Lawrenceville
State
NJ
Country
United States
Zip Code
08648