Protein-protein interactions are integral to virtually all biological pathways. Many important interactions occur in weak, transient complexes that will not be amenable to direct experimental analysis, even when both proteins can be isolated and their structures determined. Thus, it is important to develop computational docking methods which, starting from the structures of component proteins, can determine the structure of their complexes. We have developed a multistage docking algorithm that provided the best results in the latest rounds of the CAPRI (Critical Assessment of Predicted Interactions) worldwide docking experiment. In addition, our docking server ClusPro was the best among automated servers. Although the CAPRI results demonstrate progress, a number of major problems remain unsolved. First, docking homology models is a challenge and all methods used in CAPRI performed poorly for such targets. Docking unbound structures is also difficult if binding is accompanied by substantial backbone conformational change. Second, it is not clear whether a model generated by docking represents a specific and stable complex. Third, the interface may include regions that are disordered in the separate proteins, challenging docking methods. We address these problems by pursuing three specific aims. First, we develop a novel algorithm for docking homology models and proteins with substantial backbone flexibility. The method is based on the hypothesis that the interface in complexes is sequentially and structurally more conserved than the rest of the proteins. Since such regions are frequently sufficient for recognition, identification and correct docking of the key segments can yield near-native docked structures. For homology models this implies that one can dock the regions that can be reliably modeled, and then expand the models by adding back the removed parts using the docked structures as constraints. The problem of docking "difficult cases" with substantial backbone conformational change can also be addressed by identifying and docking the structurally most conserved regions. Once clusters of the docked rigid fragments are obtained, the models are expanded by rebuilding the more flexible parts. Second, we use a two-step approach to examine the stability of protein complexes, first by removing small and hence unlikely clusters of low energy docked structures, and then by calculating dissociation rates by stochastic roadmap simulation. The method will be validated on a benchmark set that includes models of real protein complexes and decoys generated by docking non-interacting protein pairs. The approach will also be used to determine whether complex structures deposited to the PDB are biologically relevant. Third, we consider the problem of determining the structure of flexible loops and/or disordered regions when they become parts of a protein- protein interface. Rather than attempting to predict and to dock the most likely conformation of the flexible fragment, we build their bound structure directly into binding hot spots of the partner protein. Flexible peptide docking methods will be used to expand the docked fragments by adding further residues.

Public Health Relevance

Many biologically important interactions occur in weak, transient complexes that are not amenable to direct experimental analysis, and hence it is important to develop computational docking methods that, starting from the structures of unbound proteins, can determine the structure of their complexes. The goal of this proposal is to solve some of the most important outstanding problems of protein docking, i.e., docking homology models and flexible proteins, and predicting the stability of complexes.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Wehrle, Janna P
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Boston University
Engineering (All Types)
Schools of Engineering
United States
Zip Code
Bogorad, Andrew M; Xia, Bing; Sandor, Dana G et al. (2014) Insights into the architecture of the eIF2B?/?/? regulatory subcomplex. Biochemistry 53:3432-45
Bohnuud, Tanggis; Kozakov, Dima; Vajda, Sandor (2014) Evidence of conformational selection driving the formation of ligand binding sites in protein-protein interfaces. PLoS Comput Biol 10:e1003872
Mottarella, Scott E; Beglov, Dmitri; Beglova, Natalia et al. (2014) Docking server for the identification of heparin binding sites on proteins. J Chem Inf Model 54:2068-78
Kozakov, Dima; Li, Keyong; Hall, David R et al. (2014) Encounter complexes and dimensionality reduction in protein-protein association. Elife 3:e01370
Lensink, Marc F; Moal, Iain H; Bates, Paul A et al. (2014) Blind prediction of interfacial water positions in CAPRI. Proteins 82:620-32
Vajda, Sandor; Hall, David R; Kozakov, Dima (2013) Sampling and scoring: a marriage made in heaven. Proteins 81:1874-84
Golden, Mary S; Cote, Shaun M; Sayeg, Marianna et al. (2013) Comprehensive experimental and computational analysis of binding energy hot spots at the NF-ýýB essential modulator/IKKýý protein-protein interface. J Am Chem Soc 135:6242-56
Lavi, Assaf; Ngan, Chi Ho; Movshovitz-Attias, Dana et al. (2013) Detection of peptide-binding sites on protein surfaces: the first step toward the modeling and targeting of peptide-mediated interactions. Proteins 81:2096-105
Kozakov, Dima; Hall, David R; Chuang, Gwo-Yu et al. (2011) Structural conservation of druggable hot spots in protein-protein interfaces. Proc Natl Acad Sci U S A 108:13528-33
Kozakov, Dima; Hall, David R; Beglov, Dmitri et al. (2010) Achieving reliability and high accuracy in automated protein docking: ClusPro, PIPER, SDU, and stability analysis in CAPRI rounds 13-19. Proteins 78:3124-30

Showing the most recent 10 out of 38 publications