The majority of our knowledge of biological structure comes from crystals. With the structure of a macromolecule one can visualize how it works and how it interacts with other macromolecules; one can see life on an atomic scale. Among methods employed to reveal the details of molecular structure, none rivals single-crystal X-ray diffraction for its generality of application (85% of the contents of the Protein Data Bank), clarity of view, and lack of ambiguity in interpretation. Crystallization is a significant 'bottleneck' in structural biology. The Hauptman-Woodward Medical Research Institute (HWI) operates a mature High-Throughput crystallization-Screening (HTS) laboratory for the general biomedical community and Structural Genomics groups. Macromolecular samples are screened against 1,536 different chemical cocktails that encompass both an incomplete factorial sampling of chemical space and examples of commercially available screens. Images of all the crystallization experiments are recorded at weekly intervals for six weeks and then archived. To date we have built up a library of over 90 million time-resolved images of almost 16 million crystallization experiments comprising over 10,000 biological macromolecules, each combined with the different chemical cocktails. We hypothesize that by analyzing the outcomes in terms of chemical space and dynamics, a temporal phase diagram of solubility can be constructed. From this phase diagram, optimized crystallization conditions and the factors that drive that optimization can be predicted. In a multidisciplinary approach, we will use this data to develop an expert crystallization knowledge system.
Our aims are (1) to make the data archive readily and rapidly accessible; (2) to continuously acquire and update data as it becomes available; (3) to use the data to establish trends and guide crystallization; and (4) to develop new crystallization knowledge from the data. The cocktails used for crystallization screening chemically decrease the solubility of the macromolecules, driving the system to a state of supersaturation that can lead to crystallization. We will focus on structural genomics samples (~40% of our data), where complete information about the sample is available. We will incorporate an X-ray feedback mechanism to supplement the visual data for characterization of both crystal and precipitate. Initial studies show that analyzing the outcomes in terms of chemical space and dynamics does produce an empirical phase diagram of solubility over time. From these preliminary studies with a limited amount of this data, we have defined trajectories to traverse this space effectively, rationally guiding successful crystallization. Using screening data and historical trends, we will generate specific chemical advice, based upon statistical and probabilistic analysis of the whole dataset, describing how to crystallize and optimize individual samples. We will also identify trends in crystallization behavior as a function of the biochemistry. This approach will greatly improve the transfer of information from the crystallization-screening laboratory, immediately benefiting the almost 900 different laboratories that currently make use of the service. By incorporating commercially available screens, we can relate in-house data to screening results from other laboratories, expanding our analysis to develop crystallization and optimization strategies for samples beyond those we set up in the High-throughput crystallization-screening laboratory.
Our data analysis will improve the success rate of crystallization in general, and enable structural studies of a wider range of biologically and medically relevant macromolecules.

Public Health Relevance

The majority of our knowledge of biological structure comes from crystals (85% of the structures deposited in the Protein Data Bank). Crystal growth has repeatedly been identified as the rate-limiting step in macromolecular structure determination. By analyzing and using a unique data set of over 16 million crystallization experiments and 90 million images, we can improve the crystallization process by providing specific advice on optimization, and establish general predictive information for the biomedical structural biology community.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Edmonds, Charles G
Hauptman-Woodward Medical Research Institute
United States
Altan, Irem; Charbonneau, Patrick; Snell, Edward H (2016) Computational crystallization. Arch Biochem Biophys 602:12-20
Bruno, Andrew E; Soares, Alexei S; Owen, Robin L et al. (2016) The use of haptic interfaces and web services in crystallography: an application for a 'screen to beam' interface. J Appl Crystallogr 49:2082-2090
Rossi, Paolo; Shi, Lei; Liu, Gaohua et al. (2015) A hybrid NMR/SAXS-based approach for discriminating oligomeric protein interfaces using Rosetta. Proteins 83:309-17
Grant, Thomas D; Luft, Joseph R; Carter, Lester G et al. (2015) The accurate assessment of small-angle X-ray scattering data. Acta Crystallogr D Biol Crystallogr 71:45-56
Luft, Joseph R; Newman, Janet; Snell, Edward H (2014) Crystallization screening: the influence of history on current practice. Acta Crystallogr F Struct Biol Commun 70:835-53
Fusco, Diana; Barnum, Timothy J; Bruno, Andrew E et al. (2014) Statistical analysis of crystallization database links protein physico-chemical features with crystallization mechanisms. PLoS One 9:e101123
Calero, Guillermo; Cohen, Aina E; Luft, Joseph R et al. (2014) Identifying, studying and making good use of macromolecular crystals. Acta Crystallogr F Struct Biol Commun 70:993-1008
Bruno, Andrew E; Ruby, Amanda M; Luft, Joseph R et al. (2014) Comparing chemistry to outcome: the development of a chemical distance metric, coupled with clustering and hierarchal visualization applied to macromolecular crystallography. PLoS One 9:e100782
Stiegler, Amy L; Grant, Thomas D; Luft, Joseph R et al. (2013) Purification and SAXS analysis of the integrin linked kinase, PINCH, parvin (IPP) heterotrimeric complex. PLoS One 8:e55591
Grant, Thomas D; Luft, Joseph R; Wolfley, Jennifer R et al. (2013) The structure of yeast glutaminyl-tRNA synthetase and modeling of its interaction with tRNA. J Mol Biol 425:2480-93