The biosphere is dominated by over 10e30 bacteria and archaea, a number roughly equivalent to the number of seconds since the "Big Bang". Since bacteria are asexual, their rapid acquisition of new genes (for things like antibiotic resistance or ability to degrade specific chemicals) depends on the huge amount of gene exchange between species. Despite its critical importance, what controls the rate of this gene flow between bacteria is poorly understood. But it does seem clear that restriction-modification (RM) systems play a key role. RM systems produce two enzymes, a nuclease that cuts unprotected DNA (such as what enters from another cell or a bacterial virus), and a methyltransferase that protects the bacterium's own DNA from the nuclease. Many basic questions about the functioning and roles of these systems remain. Given the great importance of RM systems for understanding ongoing bacterial evolution, with implications for everything from plant diseases through biogeochemistry and global warming, it is critical that this deficiency in our understanding be addressed.

The purpose of this project is to elucidate the regulatory design of a C protein-controlled model RM system, and how it acts in a living bacterial cell. The C protein activates transcription of its own gene, and of the downstream nuclease gene, and as a result nuclease expression is delayed until C protein accumulates. The first aim is to determine the range of acceptable relative levels of a paired nuclease and methyltransferase. If the methyltransferase/nuclease ratio is too low, cells may die from DNA damage. If this ratio is too high, the entire population may be killed by virus (phage) that escaped the nuclease and became methylated. The second aim determines naturally occurring changes in relative nuclease and methyltransferase levels, under a range of growth conditions including stresses. The third aim is to characterize the time sensitive control system that controls production of the nuclease. Mathematical modeling has provided predictions that will be tested, to see if the system design is accurately understood.

Broader Impacts: An integral aspect of the project is to promote education and research training of scientists, ranging from high school science teachers through postdoctoral fellows. The teachers and undergraduates will each study one of the many already cloned but uncharacterized C protein orthologs. High school science teachers and undergraduate science majors will be recruited for summer research internships. The laboratory members will continue to host high school students for science fair projects and will do volunteer biology/career teaching at high schools. Additionally, at least one graduate student and one postdoctoral fellow will be encouraged to earn certificates in bioinformatics as part of their training. These individuals will be well prepared to contribute to the burgeoning field of bacterial genomics. The project will also enrich graduate level teaching and further promote the application to bacterial studies of mathematical modeling.

Project Report

Bacteria are the most abundant life forms on earth, and like us they have viruses (called "bacteriophages") that can infect and kill them. One of the key defenses bacteria employ against these viruses is restriction-modification (RM) systems. RM systems have other, nondefensive roles as well, that have not been as well characterized. As the name suggests, RM systems include two components: a restriction enzyme, that cuts new DNA entering a cell, and a modification enzyme, that protects the cell’s own DNA from the restriction enzyme. The restriction enzymes have been hugely important to the development of biotechnology, as the ability to cut DNA at specific points has allowed controlled assembly of different DNA segments. However, to successfully spread among different bacteria, the RM systems face a serious problem: they have to ensure that the new host cell’s DNA is fully protected before they start producing restriction enzyme, or they will kill their new host. The major project goal was to improve our understanding of how this timing system works. Such knowledge is important for better ability to a) understand and respond to gene exchange among bacteria (including such things as antibiotic resistance and factors causing plant diseases), and b) further improve the set of biotechnological tools waiting to be discovered among the still barely-explored and highly-diverse world of RM systems. This project focused on one family of RM systems. This family includes a third gene, in addition to the restriction and modification enzyme genes, that produces a small protein called "C" (for "controller") that can both turn on and turn off the restriction enzyme gene. Furthermore, it does this by turning its own gene on and off – the restriction enzyme gene is located just after the C gene, and is only expressed when the C gene is. However the positive feedback loop created when C protein turns on its own gene is like a microphone held in front of a speaker – the sound begins at a low level but feeds back into the microphone and gets increasingly loud with increasing rapidity. When the genes for one of these RM systems enters a new cell, there is no C protein, and very little production of C or restriction enzyme, though the modification enzyme is made immediately. Over time, the level of C (and restriction enzyme) begins to leap up. This simplified and basic model has been known for over a decade, but truly understanding its operation is a more complex problem. There were six major outcomes from this project: 1) While the C and restriction enzyme genes are transcriptionally coupled (same mRNA molecule), they are translationally independent (different ribosomes carry out the translation into protein). This was studied using a model RM system called PvuII, originally from a bacterium that causes urinary tract infections (Proteus vulgaris). 2) A critical factor in the control system is how many copies of the genes there are. This again used PvuII as the model system. This had not been demonstrated for any RM system before, and was only explored in this project due to the predictions of a mathematical model. Such modeling is a tool of increasing value. The result makes sense, because the genes for PvuII (and many other RM systems) reside on small circles of DNA called "plasmids". These enter a new cell in a single copy, and then often replicate until they reach a higher copy number. 3) Another feature predicted by mathematical modeling and confirmed in PvuII is called hysteretic bistability. Basically, when the level of C in the cell is low, so is the expression of C (and restriction enzyme), and when the level of C is high, the expression is higher. But there is a region of intermediate levels of C where the expression can be high or low, depending on the cell’s recent history. This sort of design makes it harder to leap prematurely to high levels of expression, or to drop down to low levels once the RM system is well established in its new host cell. 4) Attempts to find other RM systems related to PvuII, so possible differences in their control systems could be studied, led to the discovery of a new subfamily in which the C and restriction proteins have been fused together into one bifunctional protein. The implications of this fusion have not yet been determined experimentally, but may be profound for the timing mechanism. 5) In collaboration with another laboratory, significant insights have been gained into the recognition of DNA methylation in mammalian cells, where it plays a major regulatory role. DNA methylation is also what modification enzymes do in bacteria to protect DNA. 6) The project resulted in the science and research training of a postdoctoral fellow, a graduate student, two undergraduates, and three high school science teachers. Last Modified: 10/24/2013 Submitted by: Robert M Blumenthal The original outcomes report did not include three publications from 2013. They are: Liu Y, Zhang X, Blumenthal RM, Cheng X. 2013. A common mode of recognition for methylated CpG. Trends Biochemical Sci. 38: 177-183. PMC3608759. Williams K, Savageau MA, Blumenthal RM. 2013. A bistable hysteretic switch in an activator-repressor regulated restriction-modification system. Nucleic Acids Res. 41: 6045-6057. PMC3695507. [Featured article] Liang J, Blumenthal RM. 2013. Naturally-occurring, dually-functional fusions between restriction endonucleases and regulatory proteins. BMC Evol. Biol. 13: 218 (11pp).

Agency
National Science Foundation (NSF)
Institute
Division of Molecular and Cellular Biosciences (MCB)
Application #
0964728
Program Officer
Susannah Gal
Project Start
Project End
Budget Start
2010-03-01
Budget End
2013-08-31
Support Year
Fiscal Year
2009
Total Cost
$569,999
Indirect Cost
Name
University of Toledo Health Science Campus
Department
Type
DUNS #
City
Toledo
State
OH
Country
United States
Zip Code
43614