The Protein Structure Initiative aims to provide an atomic level structure for essentially any gene sequence. Experimental progress is occurring at multiple locations, providing a wide distribution of structural classes. In response to RFA GM-07-003, we propose to take advantage of previous successes and significantly improve the accuracy of comparative modeling methods for protein structure prediction. Our focus largely is on improving the homology-based predictions that emerge after the initial threading stage, whether sequence identity is high (>30%) or low (<10%). For both classes, Dr. Xu's successful Raptor threading program can generate starting models, but they often have suboptimal stereochemistry. Drs. Sosnick and Freed's folding algorithm will refine and rescore structures using a newly developed, extremely accurate statistical potential and a novel move set which largely solves the conformational sampling problem near the bottom of the native well without resorting to expensive computational molecular mechanics methods. The just-developed move set allows significant optimization of the backbone dihedral angles according to observed ?,f frequencies in the PDB and to interactions including H-bonding, while maintaining the overall backbone trace to within ~1 ? RMSD. Large angular movements are allowed, and the refinement of loops will benefit from our extremely successful strategy used to model denatured states. Targets include large membrane channel proteins (>300 residues).
In aim 2, our Ramachandran basin prediction algorithm, which often can predict the native basins with greater than 85% accuracy, will help identify the correct template for threading of low identity sequences. Recurring local motifs found in folding simulations and the new statistical potential also will be utilized in template identification. Across nearly every biological discipline, detailed knowledge of protein structure is critical to understanding the biological function. High resolution structural information often is the starting point for describing a human disease at the molecular level, and mechanistic studies thereon. The major advance outlined in this proposal will greatly expand the database of protein structures and impact studies in nearly every field of health-related studies. The proposed research involves a new, synergistic collaboration and novel computational techniques as indicated in the RFA.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM081642-03
Application #
7663289
Study Section
Special Emphasis Panel (ZGM1-CBB-3 (HM))
Program Officer
Smith, Ward
Project Start
2007-08-01
Project End
2011-07-30
Budget Start
2009-08-01
Budget End
2011-07-30
Support Year
3
Fiscal Year
2009
Total Cost
$318,340
Indirect Cost
Name
University of Chicago
Department
Biochemistry
Type
Schools of Medicine
DUNS #
005421136
City
Chicago
State
IL
Country
United States
Zip Code
60637
Virtanen, J J; Sosnick, T R; Freed, K F (2014) Ionic strength independence of charge distributions in solvation of biomolecules. J Chem Phys 141:22D503
Adhikari, Aashish N; Peng, Jian; Wilde, Michael et al. (2012) Modeling large regions in proteins: applications to loops, termini, and folding. Protein Sci 21:107-21
Virtanen, Jouko Juhani; Makowski, Lee; Sosnick, Tobin R et al. (2011) Modeling the hydration layer around proteins: applications to small- and wide-angle x-ray scattering. Biophys J 101:2061-9
Haddadian, Esmael J; Gong, Haipeng; Jha, Abhishek K et al. (2011) Automated real-space refinement of protein structures using a realistic backbone move set. Biophys J 101:899-909
Peng, Jian; Xu, Jinbo (2010) Low-homology protein threading. Bioinformatics 26:i294-300
Zhao, Feng; Peng, Jian; Xu, Jinbo (2010) Fragment-free approach to protein folding using conditional neural fields. Bioinformatics 26:i310-7
Virtanen, Jouko J; Makowski, Lee; Sosnick, Tobin R et al. (2010) Modeling the hydration layer around proteins: HyPred. Biophys J 99:1611-9
Baird, Nathan J; Gong, Haipeng; Zaheer, Syed S et al. (2010) Extended structures in RNA folding intermediates are due to nonnative interactions rather than electrostatic repulsion. J Mol Biol 397:1298-306
Scharf, Louise; Li, Nan-Sheng; Hawk, Andrew J et al. (2010) The 2.5 Å structure of CD1c in complex with a mycobacterial lipid reveals an open groove ideally suited for diverse antigen presentation. Immunity 33:853-62
Zhao, Feng; Peng, Jian; Debartolo, Joe et al. (2010) A probabilistic and continuous model of protein conformational space for template-free modeling. J Comput Biol 17:783-98

Showing the most recent 10 out of 24 publications