A major puzzle in biology is the origin of novel phenotypes. Innovative phenotypes are important to understanding medically important phenomena such as host-parasite interactions and the development of antibiotic resistance in pathogens. Classic models hold that mutational processes generate new phenotypes through gene duplication, domain shuffling, and other mechanisms that modify existing genes rather than making new ones. Random sequences may be prone to toxic aggregation rather than folding to a functional state: this is a major reason why "tinkering" mechanisms have been posited. However, recent discoveries, including our own, show that new genes can evolve de novo from non-coding sequences and that new portions of genes can also arise in this manner.
The aims of this proposal are: 1) to identify more such cases, 2) to investigate the roles of ordered structure and intrinsic structural disorder in de novo protein-coding innovation, and 3) to examine the structural properties of proteins, or portions of proteins, encoded by new coding sequence. Significant possible outcomes include 1) insights into selection processes that allow new genes to arise, 2) the discovery of novel proteins and an ability to compare their properties to those of highly evolved proteins, yielding insights into the protein folding code and protein design/engineering. The newness of this research area, as well as the incorporation of cutting-edge evolutionary theory and studies of de novo protein structure, make this work highly innovative. The PI team is interdisciplinary and complementary, including both an evolutionary biologist and an experimental structural biologist/biochemist.

Public Health Relevance

Newly evolved genes may be important in the ongoing evolutionary battle between humans and their pathogens, and in other evolutionary arms races important to psychiatric and reproductive disorders. This work will study factors that could make it easier for new genes to evolve, as well as differences and similarities between the structures of newly evolved genes and old genes.

National Institute of Health (NIH)
Research Project (R01)
Project #
Application #
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Janes, Daniel E
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Arizona
Schools of Arts and Sciences
United States
Zip Code
Masel, Joanna; Promislow, Daniel E L (2016) Answering evolutionary questions: A guide for mechanistic biologists. Bioessays 38:704-11
Teufel, Ashley I; Masel, Joanna; Liberles, David A (2015) What Fraction of Duplicates Observed in Recently Sequenced Genomes Is Segregating and Destined to Fail to Fix? Genome Biol Evol 7:2258-64
Andreatta, Matthew E; Levine, Joshua A; Foy, Scott G et al. (2015) The Recent De Novo Origin of Protein C-Termini. Genome Biol Evol 7:1686-701
Trotter, Meredith V; Weissman, Daniel B; Peterson, Grant I et al. (2014) Cryptic genetic variation can make "irreducible complexity" a common mode of adaptation in sexual populations. Evolution 68:3357-67
Masel, Joanna (2013) Q&A: Evolutionary capacitance. BMC Biol 11:103