Improving Modeling by Learning from Details of High Accuracy Protein Structures

Karplus, Paul

Abstract

The functions of proteins depend exquisitely on their structure, with details at the 0.1 ? scale influencing enzyme catalysis, disease-causing mutations, and drug recognition. For this reason, having detailed and accurate structures of proteins is a cornerstone of modern biomedical research, and the NIH funded the Protein Structure Initiative with the goal of obtaining models for every protein structure with an accuracy approaching that of a high-resolution crystal structure. Current technology for template-based modeling is powerful, but cannot yet deliver near-crystal-structure quality. Tests show that the best minimization routines still fall short of consistently producing protein models for close homologs that approach within ~1 ? rmsd of the 'native' structure as ultimately revealed by crystal structures. To help break through this 1 ? barrier, during the previous period of support we used ultrahigh-resolution structures to create a library of conformation- dependent ideal geometry functions for the protein backbone, and showed that its use improves the quality of protein crystal structures and holds promise to improve template-based model refinement. We also discovered that ultrahigh-resolution crystal structures are a rich source of details about protein structure that are not accurately attainable from structures in the ~1.5-2 ? resolution range and thus have not yet been fully accounted for in current energy functions. Here, our central hypothesis is that a major step forward in template-based modeling accuracy will come from identifying and explicitly taking into account detailed features of protein covalent geometry, conformation and non-covalent packing interactions that have not yet been characterized, and can now be gleaned from the study of highly accurate ultrahigh-resolution protein structures. The overall goal of our proposal is to mine such information so it can be used to improve the accuracy of predictive modeling. With many ultrahigh-resolution structures now available, the time is ripe to achieve this goal by pursuing three specific aims related to (1) extending the impact of the 'ideal geometry function' paradigm by creating, optimizing, and implementing conformation- dependent libraries accounting for peptide planarity, side chains, and cis-peptides, (2) mining ultrahigh- resolution crystal structures to glean information for next-generation empirical energy functions, and (3) analyzing ultrahigh-resolution protein structures solved in varying environments to produce a set of benchmark test cases and developing residue level assessment tools to use with these test cases to evaluate and hone template-based modeling refinement applications. This proposed work is low cost and low risk, and has a high likelihood of substantial impact as it provides basic information that can be widely incorporated into predictive and experimental modeling applications to improve their accuracy. It is also distinct from major efforts being invested into template-based modeling. Introducing this greater level of realism is a prerequisite to improving the refinement step of template-based modeling and achieving the goals of the Protein Structure Initiative.

Public Health Relevance

Proteins carry out the work that gets done inside of cells, so figuring out what they look like helps us understand things like how drugs work and how to design new drugs that will work even better. It is not practical to experimentally determine every protein structure, so having reliable ways to use computers to predict their structures is quite important. Current methods are not quite accurate enough, and the goal of this work is to look carefully at the best known protein structures to learn from their exact features how we can improve prediction technology to get the details right.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 7R01GM083136-08
Application #: 8895978
Study Section: Biochemistry and Biophysics of Membranes Study Section (BBM)
Program Officer: Wu, Mary Ann

Project Start: 2008-08-01
Project End: 2016-07-31
Budget Start: 2015-08-01
Budget End: 2016-07-31
Support Year: 8
Fiscal Year: 2015
Total Cost: $206,832
Indirect Cost: $59,832

Institution

Name: Oregon State University
Department: Biochemistry
Type: Schools of Arts and Sciences
DUNS #: 053599908

City: Corvallis
State: OR
Country: United States
Zip Code: 97331

Related projects


NIH 2015 R01 GM	Improving Modeling by Learning from Details of High Accuracy Protein Structures Karplus, Paul Andrew / Oregon State University	$206,832
NIH 2014 R01 GM	Improving Modeling by Learning from Details of High Accuracy Protein Structures Karplus, Paul Andrew / Oregon State University
NIH 2013 R01 GM	Improving Modeling by Learning from Details of High Accuracy Protein Structures Karplus, Paul Andrew / Oregon State University	$200,665
NIH 2012 R01 GM	Improving Modeling by Learning from Details of High Accuracy Protein Structures Karplus, Paul Andrew / Oregon State University	$208,438
NIH 2011 R01 GM	Empirical conformation-dependent covalent geometry variation in proteins Karplus, Paul Andrew / Oregon State University	$208,186
NIH 2010 R01 GM	Empirical conformation-dependent covalent geometry variation in proteins Karplus, Paul Andrew / Oregon State University	$210,764
NIH 2009 R01 GM	Empirical conformation-dependent covalent geometry variation in proteins Karplus, Paul Andrew / Oregon State University	$286,440
NIH 2008 R01 GM	Empirical conformation-dependent covalent geometry variation in proteins Karplus, Paul Andrew / Oregon State University	$213,756

Publications

Brereton, Andrew E; Karplus, P Andrew (2018) Ensemblator v3: Robust atom-level comparative analyses and classification of protein structure ensembles. Protein Sci 27:41-50

Evangelidis, Thomas; Nerli, Santrupti; Nová?ek, Ji?í et al. (2018) Automated NMR resonance assignments and structure determination using a minimal set of 4D spectra. Nat Commun 9:384

Hollingsworth, Scott A; Lewis, Matthew C; Karplus, P Andrew (2016) Beyond basins: ?,? preferences of a residue depend heavily on the ?,? values of its neighbors. Protein Sci 25:1757-62

Sharaf, Naima G; Brereton, Andrew E; Byeon, In-Ja L et al. (2016) NMR structure of the HIV-1 reverse transcriptase thumb subdomain. J Biomol NMR 66:273-280

Moriarty, Nigel W; Tronrud, Dale E; Adams, Paul D et al. (2016) A new default restraint library for the protein backbone in Phenix: a conformation-dependent geometry goes mainstream. Acta Crystallogr D Struct Biol 72:176-9

Brereton, Andrew E; Karplus, P Andrew (2016) On the reliability of peptide nonplanarity seen in ultra-high resolution crystal structures. Protein Sci 25:926-32

Li, Wenlin; Kinch, Lisa N; Karplus, P Andrew et al. (2015) ChSeq: A database of chameleon sequences. Protein Sci 24:1075-86

Brereton, Andrew E; Karplus, P Andrew (2015) Native proteins trap high-energy transit conformations. Sci Adv 1:e1501188

Karplus, P Andrew; Diederichs, Kay (2015) Assessing and maximizing data quality in macromolecular crystallography. Curr Opin Struct Biol 34:60-8

Clark, Sarah A; Tronrud, Dale E; Karplus, P Andrew (2015) Residue-level global and local ensemble-ensemble comparisons of protein domains. Protein Sci 24:1528-42

Showing the most recent 10 out of 26 publications

Comments

Be the first to comment on Paul Karplus's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: