Model Comparison in Structural Biology

Fraser, James

Abstract

X-ray crystallography has traditionally been used to generate three-dimensional structural models of biological molecules, which provide fundamental insights into biological mechanisms. The progress of refining a structural model is monitored using a powerful cross-validation statistic, R-free. However, recent advances in refinement techniques have created new classes of models that model conformational heterogeneity using ensembles or multiple conformations. There is currently a critical need to create new model selection criteria to evaluate different classes of models, as vastly different interpretations of biologically important motions can be drawn from these datasets. Bayesian model selection presents disciplined methods to determine the level of modeling detail appropriate for a given dataset. We will develop comparison techniques to rigorously trade off the quality of fit and parsimony of distinct model types. First, we will create synthetic X-ray diffraction datasets to be processed using standard data integration pipelines. Synthetic datasets afford us knowledge of the """"""""correct"""""""" answer and allow us to vary the input conformational heterogeneity and noise. After model refinement, we will use information criteria to evaluate the tradeoffs between model complexity and parsimony. Next, we will evaluate real datasets, focusing on the refinement of high-resolution enzyme and low-resolution membrane protein data sets. We will rigorously explore the effect of global parameter grid searches on the resulting models. Finally, we will implement and distribute software that automates model comparisons. This software will be integrated into leading structure refinement and integrative modeling suites. These statistical methods will provide a general and significant improvement to the inference of protein ensembles from diverse structural data. With our research program, we will provide the structural biology community with statistically rigorous, computationally tractabl model comparison techniques integrated into existing popular software suites, and evidence for their utility. These advances will enable the exploitation of conformational heterogeneity to identify new inhibitors using in silico docking and to guide engineering of new protein functions, while avoiding futile explorations of imprecise models caused by poor data quality.

Public Health Relevance

This proposal describes new methods for optimizing model selection in structural biology. Knowledge of the precision and accuracy of protein conformations is key to structure-based drug design, which is an important paradigm for developing new chemical entities for treating disease.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Exploratory/Developmental Grants (R21)
Project #: 1R21GM110580-01
Application #: 8681145
Study Section: Special Emphasis Panel (ZRG1-BCMB-A (02))
Program Officer: Flicker, Paula F

Project Start: 2014-04-01
Project End: 2016-03-31
Budget Start: 2014-04-01
Budget End: 2015-03-31
Support Year: 1
Fiscal Year: 2014
Total Cost: $190,878
Indirect Cost: $65,878

Institution

Name: University of California San Francisco
Department: Pharmacology
Type: Schools of Pharmacy
DUNS #: 094878337

City: San Francisco
State: CA
Country: United States
Zip Code: 94143

Related projects


NIH 2015 R21 GM	Model Comparison in Structural Biology Fraser, James Solomon / University of California San Francisco	$227,996
NIH 2014 R21 GM	Model Comparison in Structural Biology Fraser, James Solomon / University of California San Francisco	$190,878

Publications

Wall, Michael E; Wolff, Alexander M; Fraser, James S (2018) Bringing diffuse X-ray scattering into focus. Curr Opin Struct Biol 50:109-116

Biel, Justin T; Thompson, Michael C; Cunningham, Christian N et al. (2017) Flexibility and Design: Conformational Heterogeneity along the Evolutionary Trajectory of a Redesigned Ubiquitin. Structure 25:739-749.e3

Thomaston, Jessica L; Woldeyes, Rahel A; Nakane, Takanori et al. (2017) XFEL structures of the influenza M2 proton channel: Room temperature water networks and insights into proton conduction. Proc Natl Acad Sci U S A 114:13357-13362

Russi, Silvia; González, Ana; Kenner, Lillian R et al. (2017) Conformational variation of proteins at room temperature is not dominated by radiation damage. J Synchrotron Radiat 24:73-82

Wang, Ray Yu-Ruei; Song, Yifan; Barad, Benjamin A et al. (2016) Automated structure refinement of macromolecular assemblies from cryo-EM maps using Rosetta. Elife 5:

Van Benschoten, Andrew H; Liu, Lin; Gonzalez, Ana et al. (2016) Measuring and modeling diffuse scattering in protein X-ray crystallography. Proc Natl Acad Sci U S A 113:4069-74

Baxter, Elizabeth L; Aguila, Laura; Alonso-Mori, Roberto et al. (2016) High-density grids for efficient data collection from multiple crystals. Acta Crystallogr D Struct Biol 72:2-11

Meyer, Peter A; Socias, Stephanie; Key, Jason et al. (2016) Data publication with the structural biology data grid supports live analysis. Nat Commun 7:10882

Cimermancic, Peter; Weinkam, Patrick; Rettenmaier, T Justin et al. (2016) CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites. J Mol Biol 428:709-719

Barad, Benjamin A; Echols, Nathaniel; Wang, Ray Yu-Ruei et al. (2015) EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat Methods 12:943-6

Showing the most recent 10 out of 21 publications

Comments

Be the first to comment on James Fraser's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: