The goal of the research described in this proposal is to develop a new generation of algorithms for comparative modeling, with a specific goal to improve the quality of models based on very distant homologies. This is a necessary step to provide a link between protein sequences coming from genome projects and understanding functions of these proteins, which can only be provided by the detailed knowledge of their three- dimensional structure. Enzymatic reactions, recognition of substrates, interactions between proteins-they all happen on the molecular level and whether we want to just understand or to modify, inhibit or enhance them, we need to look at and understand biological systems on the level of their molecular three dimensional structure. Unfortunately, none of the currently available algorithms is able to make the models more similar to the actual structures that they are to the templates they are built from. Actually in most cases the modeling processes partly destroys the similarity. The research described here aims to change it by combining two approaches: Development of empirical rules of how protein structures change in response to change in sequence and applying them to modify the template structure. Development of new tools for evaluation of three dimensional models of proteins In short, the plan is to use he first approach to generate a number of possible variants of the structure of the protein being modeled, while using the second approach to choose the best possible one. These overall goals will be accomplished by systematic analysis of changes in structures in families of homologous proteins to develop empirical rules of how protein structures changes in response to changes in sequence. The database of structures of homologous proteins at various levels of sequence divergence will be built and each structure will be decomposed into a hierarchy of subsystems built from smaller elements. This will allow seeking simple rules describing changes in structure, such as identification of """"""""pivoting moves."""""""" With a set of rules like that it will be possible to modify the structure of a modeling template to make it more similar to the final structure of the modeling target. Many possibilities will be generated and a system of model evaluation algorithms will choose the bet model among the as many as a thousand possibilities. A final goal of this proposal is to automate the algorithms to the point that they could be implemented in a fully automatic way on a WEB server. Improved algorithms, will be made publicly available on the group fold prediction server. Existing databases of sold predictions will be continuously updated and extended to include various interesting protein families.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Program Projects (P01)
Project #
1P01GM063208-01A1
Application #
6589928
Study Section
Special Emphasis Panel (ZRG1)
Project Start
2002-04-01
Project End
2007-03-31
Budget Start
2002-04-01
Budget End
2003-03-31
Support Year
1
Fiscal Year
2002
Total Cost
Indirect Cost
Name
University of California San Diego
Department
Type
DUNS #
077758407
City
La Jolla
State
CA
Country
United States
Zip Code
92093
Porta-Pardo, Eduard; Godzik, Adam (2016) Mutation Drivers of Immunological Responses to Cancer. Cancer Immunol Res 4:789-98
Xie, Li; Ng, Clara; Ali, Thahmina et al. (2013) Multiscale modeling of the causal functional roles of nsSNPs in a genome-wide association study: application to hypoxia. BMC Genomics 14 Suppl 3:S9
Stegle, Oliver; Denby, Katherine J; Cooke, Emma J et al. (2010) A robust Bayesian two-sample test for detecting intervals of differential gene expression in microarray time series. J Comput Biol 17:355-67
Valas, Ruben E; Yang, Song; Bourne, Philip E (2009) Nothing about protein structure classification makes sense except in the light of evolution. Curr Opin Struct Biol 19:329-34
Podtelezhnikov, Alexei A; Wild, David L (2009) Reconstruction and stability of secondary structure elements in the context of protein structure prediction. Biophys J 96:4399-408
Veretnik, Stella; Wills, Christopher; Youkharibache, Philippe et al. (2009) Sm/Lsm genes provide a glimpse into the early evolution of the spliceosome. PLoS Comput Biol 5:e1000315
Briedis, Kristine M; Starr, Ayelet; Bourne, Philip E (2008) Analysis of the human kinome using methods including fold recognition reveals two novel kinases. PLoS One 3:e1597
Borgwardt, Karsten (2008) Predicting phenotypic effects of gene perturbations in C. elegans using an integrated network model. Bioessays 30:707-10
Xie, Lei; Bourne, Philip E (2008) Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments. Proc Natl Acad Sci U S A 105:5441-6
Chung, Jo-Lan; Beaver, John E; Scheeff, Eric D et al. (2007) Con-Struct Map: a comparative contact map analysis tool. Bioinformatics 23:2491-2

Showing the most recent 10 out of 55 publications