The goal of this project is to improve the accuracy of comparative modeling in both the 30-90% and the 10-30% sequence identity ranges. This will be accomplished by a multidisciplinary team of six investigators in biophysics, mathematics, statistics, and computer science. Based on new statistical analysis of homologous protein structure pairs using graphical models (Jordan) and non-parametric Bayesian methods (Jordan, Dunbrack), Tompa will devise a coarse sampling procedure, based on backtracking and branch-and-bound algorithms, designed to search the space of homologous structures from a starting model produced by the Baker or Dunbrack groups. Tseng and Baker will develop extensions of quasi-Newton optimization methods specifically tailored to Monte Carlo Minimization trajectories. These methods will take advantage of information gained from local optimizations carried out earlier in the trajectory in neighboring regions of the landscape. Given a large sample of locally minimized structures, Jordan will use response surface methodology and Gaussian processes to fit a surface to these local minima. A search on this surface will then identify promising low-energy regions of the space that can be searched further with fine sampling methods, including tabu search (Baker). Further optimizations with block-coordinate descent methods (Tseng) will also be implemented. Ponder will test his recently developed polarizable multipole force field, while developing this force field further with a generalized Born surface-area solvation model. Dunbrack will benchmark the accuracy of predicted structures at all stages of the project. Predicted side-chain conformations will be compared to deposited coordinates as well as to electron density calculated from the experimental structure factors (Dunbrack).
Finally, the methods developed in this proposal will be applied to proteins implicated in cancer development, including those in DNA repair, apoptosis, and cell-growth signaling, with a priority on targets for cancer therapeutics. New structures from three Protein Structure Initiative centers will be used both as prediction targets (before they are solved) and as templates for prediction of structures of important biological or clinical interest.
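
As an illustration only, the Monte Carlo Minimization idea named in the abstract (perturb the current structure, locally minimize it, then accept or reject the minimized result with a Metropolis test) can be sketched on a toy problem. Everything below is an assumption made for the example and is not taken from the project: the one-dimensional `energy` function stands in for a force field, and plain gradient descent with backtracking stands in for the quasi-Newton local optimizer the abstract proposes. The function names and parameter values are hypothetical.

```python
import math
import random


def energy(x):
    """Toy rugged 1-D landscape; a hypothetical stand-in for a force field."""
    return 0.1 * x * x + math.sin(3.0 * x)


def gradient(f, x, h=1e-6):
    """Central-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def local_minimize(f, x, step=0.1, tol=1e-6, max_iter=500):
    """Gradient descent with backtracking (simple stand-in for quasi-Newton)."""
    fx = f(x)
    for _ in range(max_iter):
        g = gradient(f, x)
        if abs(g) < tol:
            break
        t = step
        while t > 1e-12:
            x_new = x - t * g
            f_new = f(x_new)
            if f_new < fx:
                break  # found a step that lowers the energy
            t *= 0.5
        else:
            break  # no improving step found; treat x as converged
        x, fx = x_new, f_new
    return x, fx


def monte_carlo_minimization(f, x0, n_steps=200, kick=1.5,
                             temperature=0.5, seed=0):
    """Perturb, locally minimize, then Metropolis-accept on minimized energies."""
    rng = random.Random(seed)
    x_cur, e_cur = local_minimize(f, x0)
    x_best, e_best = x_cur, e_cur
    for _ in range(n_steps):
        # Random kick away from the current local minimum, then re-minimize.
        x_trial, e_trial = local_minimize(f, x_cur + rng.uniform(-kick, kick))
        delta = e_trial - e_cur
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            x_cur, e_cur = x_trial, e_trial
            if e_cur < e_best:
                x_best, e_best = x_cur, e_cur
    return x_best, e_best


if __name__ == "__main__":
    x_best, e_best = monte_carlo_minimization(energy, x0=4.0)
    print(f"lowest minimum found: x = {x_best:.4f}, E = {e_best:.4f}")
```

Because the Metropolis test compares minimized energies, the walk moves between local minima rather than between arbitrary points. The sketch discards each local optimization once it finishes, whereas the abstract proposes reusing information from earlier local optimizations along the trajectory.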

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Exploratory Grants (P20)
Project #: 5P20GM076222-02
Application #: 7216862
Study Section: Special Emphasis Panel (ZGM1-CBB-3 (HM))
Program Officer: Smith, Ward
Project Start: 2006-04-01
Project End: 2009-03-31
Budget Start: 2007-04-01
Budget End: 2008-03-31
Support Year: 2
Fiscal Year: 2007
Total Cost: $673,743
Indirect Cost:
Name: Research Institute of Fox Chase Cancer Center
Department:
Type:
DUNS #: 064367329
City: Philadelphia
State: PA
Country: United States
Zip Code: 19111

Rossi, Paolo; Shi, Lei; Liu, Gaohua et al. (2015) A hybrid NMR/SAXS-based approach for discriminating oligomeric protein interfaces using Rosetta. Proteins 83:309-17
Berkholz, Donald S; Driggers, Camden M; Shapovalov, Maxim V et al. (2012) Nonplanar peptide bonds in proteins are common and conserved but not biased toward active sites. Proc Natl Acad Sci U S A 109:449-53
Lange, Oliver F; Rossi, Paolo; Sgourakis, Nikolaos G et al. (2012) Determination of solution structures of proteins up to 40 kDa using CS-Rosetta with sparse NMR data from deuterated samples. Proc Natl Acad Sci U S A 109:10873-8
Yarov-Yarovoy, Vladimir; DeCaen, Paul G; Westenbroek, Ruth E et al. (2012) Structural basis for gating charge movement in the voltage sensor of a sodium channel. Proc Natl Acad Sci U S A 109:E93-102
North, Benjamin; Lehmann, Andreas; Dunbrack Jr, Roland L (2011) A new clustering of antibody CDR loop conformations. J Mol Biol 406:228-56
Shapovalov, Maxim V; Dunbrack Jr, Roland L (2011) A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 19:844-58
Raman, Srivatsan; Lange, Oliver F; Rossi, Paolo et al. (2010) NMR structure determination for larger proteins using backbone-only data. Science 327:1014-8
Das, Rhiju; Karanicolas, John; Baker, David (2010) Atomic accuracy in predicting and designing noncanonical RNA structure. Nat Methods 7:291-4
Blum, Ben; Jordan, Michael I; Baker, David (2010) Feature space resampling for protein conformational search. Proteins 78:1583-93
Ting, Daniel; Wang, Guoli; Shapovalov, Maxim et al. (2010) Neighbor-dependent Ramachandran probability distributions of amino acids developed from a hierarchical Dirichlet process model. PLoS Comput Biol 6:e1000763

Showing the most recent 10 out of 26 publications