Organisms harvest organic fuel to produce cellular energy through the metabolic pathway known as oxidative phosphorylation. This critical process is carried out along a chain of interacting proteins that are encoded by two separate genomes, mitochondrial and nuclear, that must cooperate to maintain functional integrity. Levels of energy demand directly impact the molecular evolution of these metabolic genes. In birds, the high energy demand associated with flight has been correlated with molecular patterns that reflect conserved protein function. There exists a wide range of metabolic demands across animals, but the impact of such differences on protein evolution remains largely unexplored. Salamanders provide a unique opportunity to examine the impacts of metabolic demand on molecular evolution due to their extremely low metabolic requirements, which allow them to tolerate higher levels of protein variation than most other vertebrates. This study will analyze patterns of molecular evolution to determine what aspects of protein structure must be maintained to retain functional integrity of metabolic genes. This goal will be met by sequencing large portions of the highly variable mitochondrial and nuclear genomes of several salamander species. This study will yield a comprehensive analysis of the forces shaping the molecular evolution of the core proteins underlying aerobic metabolism. These results will provide an understanding of the molecular underpinnings of energy demand, which may aid in understanding the origin of genetic diseases associated with mitochondrial mutations. Two undergraduates and one graduate student will be trained in genomics and bioinformatics through this research.
Inside animal cells, a set of biochemical reactions takes place that transforms energy stored in the chemical bonds of food into energy stored in the chemical bonds of a molecule called ATP. ATP is the "currency" that cells use to pay for their chemical reactions. The biochemical pathways that transform food energy into "usable" ATP energy are collectively called cellular respiration. Cellular respiration is dependent on the interaction of a large set of proteins. Proteins are encoded by the DNA in an organism’s genome. However, the proteins that carry out some portions of cellular respiration are unusual; although most of them are encoded by the DNA in the nuclear genome (i.e. the chromosomes housed in the cell’s nucleus that reflect genetic contributions from both parents), some are encoded in a small genome that resides in a cellular compartment called the mitochondrion. The mitochondrial genome is tiny – in most animals, it contains only 13 genes, far fewer than the tens of thousands of genes present in the nuclear genome. The 13 proteins encoded by mitochondrial genes physically interact with many more proteins encoded by genes in the nuclear genome to carry out parts of cellular respiration. Because these mitochondrial-encoded and nuclear-encoded proteins interact, an evolutionary change in one genome that impacts a protein-coding gene can disrupt cellular respiration function by destroying the necessary physical interactions among mitochondrial and nuclear proteins. In animals, rates of evolution are substantially higher in the mitochondrial genome than in the nuclear genome, reflecting substantially higher rates of mutation. Because of this, evolution of the mitochondrial genome drives compensatory evolution of some nuclear genes to maintain the necessary physical interactions among proteins required for cellular respiration. Across animals, rates and patterns of evolution of the mitochondrial genome vary. We hypothesized that some of this variation might be caused by differences across animals in the amounts of ATP required to sustain cellular activity — i.e., the metabolic rate. Salamanders are unusual among vertebrates in having some of the lowest metabolic rates. Thus, they are an important model for studying how metabolic rate impacts mitochondrial genome evolution. We predicted that genes encoding cellular respiration proteins in the mitochondrial genome of salamanders would be subjected to less purifying selection — i.e. removal by natural selection of any mutations that change protein function — than mitochondrial genes in other species because the demand for ATP synthesis is less in salamanders. Our results are consistent with this prediction; salamander proteins have experienced more evolutionary changes that impact their protein sequence than have other vertebrate animals. We also predicted that coevolution between the mitochondrial genome and the nuclear genome in salamanders would be less extreme than in other vertebrates, again because of the lower demands on ATP synthesis machinery in salamanders. We have collected a DNA sequence dataset to test this hypothesis. We also found that salamander mitochondrial genomes have evolved in another unusual way — whole portions of the genome, including several genes, have been duplicated more than once during the group’s evolutionary history. Gene and genome duplication have occurred in many organisms, including portions of both the nuclear and the mitochondrial genomes. After an initial duplication event, the new "extra" copies of the duplicated genes have several possible fates: they can be deleted from the genome, they can persist in the genome and evolve a new function, they can persist in the genome as defunct copies, or they can persist in the genome and take over a subset of the original single-copy gene’s function. In salamanders, we identified a case where a portion of the mitochondrial genome was recently duplicated within a species; some populations now have the version of the genome with the duplicate, while other populations retain the version of the genome with the single copy. By comparing the genomes from these two populations, we were able to determine that deletions of the new extra copies occurred over relatively shallow evolutionary timescales, and that the extra copies of different genes were deleted at very different rates. In addition to these new insights into the ways in which genomes evolve, the funding provided by this grant also yielded training opportunities for graduate and undergraduate students in bioinformatics/computational biology. Through these experiences, these students worked through identifying and defining a research goal, establishing a set of analyses that would allow them to reach this goal, performing the analyses, examining and interpreting quantitative results, and writing and/or orally sharing such interpretations.