The overarching goal of the proposed research is to study evolution of structure, Biophysical properties and function of proteins in the context of Darwinian evolution of their carrier organisms. In the past budget period we significantly advanced towards that goal by introducing rich, powerful yet tractable multiscale models of protein evolution which derive organismal fitness from genomic sequences via intuitive yet realistic Genotype- Phenotype relationships which relate biophysical and functional properties of proteins to important phenotypic traits such cell division. Here we will further develop this approach by adding more realism to the models including treatment of proteins in sequence-based multiscale model at atomic resolution, prototypical E.coli glycolytic pathway, effect of misfolding on fitness and effect of protein abundances. Specific problems that will be addressed include the effect of ecology of the population on key Biophysical properties of proteins such as distribution of their stabilities, abundances and strengths of protein-protein interactions in the proteome;reconstruction of ancestor sequences and molecular phylogenies from genomics data within physically realistic models, relationship between protein stability, abundance in cytoplasm, and rate of their evolution. Predictions and insights from multiscale models will be vigorously tested against high-throughput Bioinfiomatic analyses and experiment. Experimental studies proposed here include measurements of stabilities of statistically representative sets of proteins in several proteomes, of bacteria having various ecological parameters (e.g population sizes and/or mutation rates). Successful completion of these studies will advance our ability to extract functionally relevant signals from genomic sequences by putting their analysis on firm evolutionary and Biophysical ground.

Public Health Relevance

This research aims to establish the relation between ecological properties of bacterial populations and molecular properties of their proteomes. It will provide foundation to determine effective tools to fight adverse effect of infection and address such unmet medical needs as development of novel medicines and treatment protocols to overcome bacterial resistance.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Edmonds, Charles G
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Harvard University
Schools of Arts and Sciences
United States
Zip Code
Serohijos, Adrian W R; Shakhnovich, Eugene I (2014) Contribution of selection for protein folding stability in shaping the patterns of polymorphisms in coding regions. Mol Biol Evol 31:165-76
Bershtein, Shimon; Mu, Wanmeng; Serohijos, Adrian W R et al. (2013) Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. Mol Cell 49:133-44
Mannige, Ranjan V (2013) Two modes of protein sequence evolution and their compositional dependencies. Phys Rev E Stat Nonlin Soft Matter Phys 87:062714
Bershtein, Shimon; Mu, Wanmeng; Wu, Wanmeng et al. (2012) Soluble oligomerization provides a beneficial fitness effect on destabilizing mutations. Proc Natl Acad Sci U S A 109:4857-62
Heo, Muyoung; Shakhnovich, Eugene I (2010) Interplay between pleiotropy and secondary selection determines rise and fall of mutators in stress response. PLoS Comput Biol 6:e1000710
Zhang, Jingshan; Shakhnovich, Eugene I (2010) Optimality of mutation and selection in germinal centers. PLoS Comput Biol 6:e1000800
Donald, Jason E; Shakhnovich, Eugene I (2009) SDR: a database of predicted specificity-determining residues in proteins. Nucleic Acids Res 37:D191-4
Zhang, Yang; Hubner, Isaac A; Arakaki, Adrian K et al. (2006) On the origin and highly likely completeness of single-domain protein structures. Proc Natl Acad Sci U S A 103:2605-10