The overarching goal of the proposed research is to study evolution of structure, Biophysical properties and function of proteins in the context of Darwinian evolution of their carrier organisms. In the past budget period we significantly advanced towards that goal by introducing rich, powerful yet tractable multiscale models of protein evolution which derive organismal fitness from genomic sequences via intuitive yet realistic Genotype- Phenotype relationships which relate biophysical and functional properties of proteins to important phenotypic traits such cell division. Here we will further develop this approach by adding more realism to the models including treatment of proteins in sequence-based multiscale model at atomic resolution, prototypical E.coli glycolytic pathway, effect of misfolding on fitness and effect of protein abundances. Specific problems that will be addressed include the effect of ecology of the population on key Biophysical properties of proteins such as distribution of their stabilities, abundances and strengths of protein-protein interactions in the proteome;reconstruction of ancestor sequences and molecular phylogenies from genomics data within physically realistic models, relationship between protein stability, abundance in cytoplasm, and rate of their evolution. Predictions and insights from multiscale models will be vigorously tested against high-throughput Bioinfiomatic analyses and experiment. Experimental studies proposed here include measurements of stabilities of statistically representative sets of proteins in several proteomes, of bacteria having various ecological parameters (e.g population sizes and/or mutation rates). Successful completion of these studies will advance our ability to extract functionally relevant signals from genomic sequences by putting their analysis on firm evolutionary and Biophysical ground.

Public Health Relevance

This research aims to establish the relation between ecological properties of bacterial populations and molecular properties of their proteomes. It will provide foundation to determine effective tools to fight adverse effect of infection and address such unmet medical needs as development of novel medicines and treatment protocols to overcome bacterial resistance.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM068670-10
Application #
8500340
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Edmonds, Charles G
Project Start
2004-04-01
Project End
2016-02-29
Budget Start
2013-03-01
Budget End
2014-02-28
Support Year
10
Fiscal Year
2013
Total Cost
$545,059
Indirect Cost
$210,627
Name
Harvard University
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
082359691
City
Cambridge
State
MA
Country
United States
Zip Code
02138
Serohijos, Adrian W R; Shakhnovich, Eugene I (2014) Contribution of selection for protein folding stability in shaping the patterns of polymorphisms in coding regions. Mol Biol Evol 31:165-76
Bershtein, Shimon; Mu, Wanmeng; Serohijos, Adrian W R et al. (2013) Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. Mol Cell 49:133-44
Mannige, Ranjan V (2013) Two modes of protein sequence evolution and their compositional dependencies. Phys Rev E Stat Nonlin Soft Matter Phys 87:062714
Bershtein, Shimon; Mu, Wanmeng; Wu, Wanmeng et al. (2012) Soluble oligomerization provides a beneficial fitness effect on destabilizing mutations. Proc Natl Acad Sci U S A 109:4857-62
Heo, Muyoung; Shakhnovich, Eugene I (2010) Interplay between pleiotropy and secondary selection determines rise and fall of mutators in stress response. PLoS Comput Biol 6:e1000710
Zhang, Jingshan; Shakhnovich, Eugene I (2010) Optimality of mutation and selection in germinal centers. PLoS Comput Biol 6:e1000800
Donald, Jason E; Shakhnovich, Eugene I (2009) SDR: a database of predicted specificity-determining residues in proteins. Nucleic Acids Res 37:D191-4
Zhang, Yang; Hubner, Isaac A; Arakaki, Adrian K et al. (2006) On the origin and highly likely completeness of single-domain protein structures. Proc Natl Acad Sci U S A 103:2605-10