The functions of proteins depend exquisitely on their structure, with details at the 0.1 ? scale influencing enzyme catalysis, disease-causing mutations, and drug recognition. For this reason, having detailed and accurate structures of proteins is a cornerstone of modern biomedical research, and the NIH funded the Protein Structure Initiative with the goal of obtaining models for every protein structure with an accuracy approaching that of a high-resolution crystal structure. Current technology for template-based modeling is powerful, but cannot yet deliver near-crystal-structure quality. Tests show that the best minimization routines still fall short of consistently producing protein models for close homologs that approach within ~1 ? rmsd of the 'native' structure as ultimately revealed by crystal structures. To help break through this 1 ? barrier, during the previous period of support we used ultrahigh-resolution structures to create a library of conformation- dependent ideal geometry functions for the protein backbone, and showed that its use improves the quality of protein crystal structures and holds promise to improve template-based model refinement. We also discovered that ultrahigh-resolution crystal structures are a rich source of details about protein structure that are not accurately attainable from structures in the ~1.5-2 ? resolution range and thus have not yet been fully accounted for in current energy functions. Here, our central hypothesis is that a major step forward in template-based modeling accuracy will come from identifying and explicitly taking into account detailed features of protein covalent geometry, conformation and non-covalent packing interactions that have not yet been characterized, and can now be gleaned from the study of highly accurate ultrahigh-resolution protein structures. The overall goal of our proposal is to mine such information so it can be used to improve the accuracy of predictive modeling. With many ultrahigh-resolution structures now available, the time is ripe to achieve this goal by pursuing three specific aims related to (1) extending the impact of the 'ideal geometry function' paradigm by creating, optimizing, and implementing conformation- dependent libraries accounting for peptide planarity, side chains, and cis-peptides, (2) mining ultrahigh- resolution crystal structures to glean information for next-generation empirical energy functions, and (3) analyzing ultrahigh-resolution protein structures solved in varying environments to produce a set of benchmark test cases and developing residue level assessment tools to use with these test cases to evaluate and hone template-based modeling refinement applications. This proposed work is low cost and low risk, and has a high likelihood of substantial impact as it provides basic information that can be widely incorporated into predictive and experimental modeling applications to improve their accuracy. It is also distinct from major efforts being invested into template-based modeling. Introducing this greater level of realism is a prerequisite to improving the refinement step of template-based modeling and achieving the goals of the Protein Structure Initiative.

Public Health Relevance

Proteins carry out the work that gets done inside of cells, so figuring out what they look like helps us understand things like how drugs work and how to design new drugs that will work even better. It is not practical to experimentally determine every protein structure, so having reliable ways to use computers to predict their structures is quite important. Current methods are not quite accurate enough, and the goal of this work is to look carefully at the best known protein structures to learn from their exact features how we can improve prediction technology to get the details right.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
7R01GM083136-08
Application #
8895978
Study Section
Biochemistry and Biophysics of Membranes Study Section (BBM)
Program Officer
Wu, Mary Ann
Project Start
2008-08-01
Project End
2016-07-31
Budget Start
2015-08-01
Budget End
2016-07-31
Support Year
8
Fiscal Year
2015
Total Cost
$206,832
Indirect Cost
$59,832
Name
Oregon State University
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
053599908
City
Corvallis
State
OR
Country
United States
Zip Code
97331
Brereton, Andrew E; Karplus, P Andrew (2018) Ensemblator v3: Robust atom-level comparative analyses and classification of protein structure ensembles. Protein Sci 27:41-50
Evangelidis, Thomas; Nerli, Santrupti; Nová?ek, Ji?í et al. (2018) Automated NMR resonance assignments and structure determination using a minimal set of 4D spectra. Nat Commun 9:384
Hollingsworth, Scott A; Lewis, Matthew C; Karplus, P Andrew (2016) Beyond basins: ?,? preferences of a residue depend heavily on the ?,? values of its neighbors. Protein Sci 25:1757-62
Sharaf, Naima G; Brereton, Andrew E; Byeon, In-Ja L et al. (2016) NMR structure of the HIV-1 reverse transcriptase thumb subdomain. J Biomol NMR 66:273-280
Moriarty, Nigel W; Tronrud, Dale E; Adams, Paul D et al. (2016) A new default restraint library for the protein backbone in Phenix: a conformation-dependent geometry goes mainstream. Acta Crystallogr D Struct Biol 72:176-9
Brereton, Andrew E; Karplus, P Andrew (2016) On the reliability of peptide nonplanarity seen in ultra-high resolution crystal structures. Protein Sci 25:926-32
Li, Wenlin; Kinch, Lisa N; Karplus, P Andrew et al. (2015) ChSeq: A database of chameleon sequences. Protein Sci 24:1075-86
Brereton, Andrew E; Karplus, P Andrew (2015) Native proteins trap high-energy transit conformations. Sci Adv 1:e1501188
Karplus, P Andrew; Diederichs, Kay (2015) Assessing and maximizing data quality in macromolecular crystallography. Curr Opin Struct Biol 34:60-8
Clark, Sarah A; Tronrud, Dale E; Karplus, P Andrew (2015) Residue-level global and local ensemble-ensemble comparisons of protein domains. Protein Sci 24:1528-42

Showing the most recent 10 out of 26 publications