(derived from the original, changes underlined) Advances in synthetic biology have accelerated to the point where the synthesis of entire genomes is now possible. However, the technologies for these feats are painstaking, and the production of a new chromosome or genome requires multiple years of effort, working from small fragments to ever larger assemblies. The speed (and ultimately scale) of large fragment assembly would be greatly improved if it were possible to routinely amplify very long stretches of DNA (> 100 kb) in vitro. The methods developed in the execution of this proposal should also prove extremely useful for greatly improved reagents for molecular diagnostics for SARS-CoV-2. To that end, this proposal is focused on the further development of a novel directed evolution method known as Compartmentalized Self-Replication (CSR), in which polymerases expressed in cells in emulsions undergo thermal cycling to amplify their own genes, to generate long read DNA polymerases that should prove capable of generating PCR amplicons > 100 kb in length, with few errors. To achieve this goal, we propose to develop a novel library construction method that most efficiently brings together sequence and structural domains from a variety of DNA polymerase variants to form diverse chimeras (Aim 1.1), and to sieve these libraries using improvements to CSR that will allow us to select for extreme processivity in yeast (Aim 1.2) and efficient error- correction (Aim 1.3). Using the methods in Aim 1.2, we can produce polymerase variants that should be able to directly participate in RT-qPCR without sample preparation, including from samples inactivated with denaturants. The variants that result will be characterized for their ability to synthesize long amplicons in vitro (Aim 2.1), for their fidelity (Aim 2.2), and for their detailed kinetic properties (Aim 2.3). Finally, to better ensure the processivity of the resultant polymerase chimeras, we will append either DNA-binding domains (Aim 3.1) or clamps (Aim 3.2) that should lead to much better ability to grip DNA. Using the methods described in Aim 3.1, we can generate thermostable reverse transcriptases that should prove useful for the development of isothermal amplification assays that can be used at point-of-care, or in resource-poor settings. In addition to accelerating the ongoing revolution in genome synthesis, such long-read polymerases should also pave the way to new sequencing technologies, including for single molecule sequencing and for single cell sequencing. In the current crisis, polymerase engineering for particular functions, directed towards needs that the community has and that need to be resolved for forward motion on testing, is a critical component of a national plan.

Public Health Relevance

By developing a DNA polymerase that can copy and amplify DNA over very long (chromosome-sized) stretches, we will generate a tool for the research community, industry, and medicine that can be used to better create genomes from scratch, and read long stretches of DNA sequence information. The former application will abet the growing field of synthetic biology as it crafts new organisms, while the latter application should foment additional advances in Next Generation DNA sequencing. The methods developed can be immediately applied to developing better, and needed, reagents for assays for SARS-CoV-2.

Agency
National Institute of Health (NIH)
Institute
National Institute of Biomedical Imaging and Bioengineering (NIBIB)
Type
Research Project (R01)
Project #
3R01EB027202-01A1S1
Application #
10170542
Study Section
Program Officer
Rampulla, David
Project Start
2020-09-01
Project End
2021-08-31
Budget Start
2020-09-01
Budget End
2021-08-31
Support Year
1
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Texas Austin
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
170230239
City
Austin
State
TX
Country
United States
Zip Code
78759