The proposal X-ray data analysis in the presence of structural variability aims to advance diffraction data analysis methods so that the variability between crystals and within crystals is optimally modeled during data processing in reciprocal space and during structural analysis in real space. The significance of the proposed work results from the importance of the technique, which generates uniquely-detailed information. X-ray structures are used to understand cellular processes at the atomic level directly, to explain and validate results obtain by other techniques, to generate hypotheses for detailed studies of cellular process, and to guide drug design studies - all of which are highly relevant to the NIH mission. Macromolecular crystals are frequently of limited size and crystal lattice order. Both may result in the need for combining data from multiple crystals for successful structure solution, with the limited order generating diffraction artifacts and correlating with non-isomorphism between different specimens. Non-isomorphism hinders the averaging of data sets from multiple crystals, because for successful averaging, data need to be very similar. The problems with averaging are compounded by incompleteness of the data in a single data set, radiation-induced changes in the crystal under investigation, and lack of statistical measures that would inform experimenters regarding whether or not the data analysis is progressing in the right direction. There are also technical challenges associated with averaging multiple data sets that result from the combinatorial complexity of data analysis when a large number of data sets need to be analyzed. Final difficulty appears when the analysis of the structural results obtained from multi-crystal experiments must separate the desired biological signals, e.g. the presence of a ligand or a specific dynamic behavior of the molecules, from the noise. Our proposal addresses these problems by developing and implementing innovative approaches.
In Aim 1, new approaches will be developed and implemented for averaging multiple, potentially incomplete data sets resulting from one or more crystals. Owing to our innovative approach to modeling the components of non-isomorphism, we expect that even quite non-isomorphous data sets can be used together to solve challenging structures.
In Aim 2, methods that will analyze the results of averaging data sets from multiple crystals in real space will be developed. The descriptors of averaging will be correlated with the outcomes of the structural analysis, so that the contributors to variability in real space can be quantified and interpreted. Finally, in Am 3, a web-based server will be developed in order to provide these methods to the structural biology community.

Public Health Relevance

This proposal aims to advance methods for diffraction data analysis from multiple samples, so that currently unattainable structural problems can be analyzed. Crystal structures, which provide uniquely detailed information that is used to understand cellular processes in health and disease and to design better and more effective drugs, are solved by analyzing X-ray diffraction data. The development of these methods will advance thousands of structural projects of which each has individual importance for the NIH mission.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM117080-01
Application #
9009602
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Smith, Ward
Project Start
2015-09-22
Project End
2019-08-31
Budget Start
2015-09-22
Budget End
2016-08-31
Support Year
1
Fiscal Year
2015
Total Cost
$350,040
Indirect Cost
$80,040
Name
University of Texas Sw Medical Center Dallas
Department
Physiology
Type
Schools of Medicine
DUNS #
800771545
City
Dallas
State
TX
Country
United States
Zip Code
75390
Porebski, Przemyslaw J; Sroka, Piotr; Zheng, Heping et al. (2018) Molstack-Interactive visualization tool for presentation, interpretation, and validation of macromolecules and electron density maps. Protein Sci 27:86-94
Handing, Katarzyna B; Niedzialkowska, Ewa; Shabalin, Ivan G et al. (2018) Characterizing metal-binding sites in proteins with X-ray crystallography. Nat Protoc 13:1062-1090
Kutner, Jan; Shabalin, Ivan G; Matelska, Dorota et al. (2018) Structural, Biochemical, and Evolutionary Characterizations of Glyoxylate/Hydroxypyruvate Reductases Show Their Division into Two Distinct Subfamilies. Biochemistry 57:963-977
Wlodawer, Alexander; Dauter, Zbigniew; Porebski, Przemyslaw J et al. (2018) Detect, correct, retract: How to manage incorrect structural models. FEBS J 285:444-466
Borek, Dominika; Bromberg, Raquel; Hattne, Johan et al. (2018) Real-space analysis of radiation-induced specific changes with independent component analysis. J Synchrotron Radiat 25:451-467
Shabalin, Ivan G; Porebski, Przemyslaw J; Minor, Wladek (2018) Refining the macromolecular model - achieving the best agreement with the data from X-ray diffraction experiment. Crystallogr Rev 24:236-262
Zheng, Heping; Porebski, Przemyslaw J; Grabowski, Marek et al. (2017) Databases, Repositories, and Other Data Resources in Structural Biology. Methods Mol Biol 1607:643-665
Zheng, Heping; Langner, Karol M; Shields, Gregory P et al. (2017) Data mining of iron(II) and iron(III) bond-valence parameters, and their relevance for macromolecular crystallography. Acta Crystallogr D Struct Biol 73:316-325
Minor, Wladek; Dauter, Zbigniew; Jaskolski, Mariusz (2016) The young person's guide to the PDB. Postepy Biochem 62:242-249
Kikuchi, Sotaro; Borek, Dominika M; Otwinowski, Zbyszek et al. (2016) Crystal structure of the cohesin loader Scc2 and insight into cohesinopathy. Proc Natl Acad Sci U S A 113:12444-12449

Showing the most recent 10 out of 18 publications