. Computational structure-based protein design is a transformative field with exciting prospects for advancing both basic science and translational medical research. My laboratory has developed new protein design algorithms and used them to predict MRSA resistance to new antibiotics; design a broadly neutralizing antibody VRC07-523LS against HIV with unprecedented breadth and potency that is now in clinical trials; design protein-peptide interactions to treat cystic fibrosis; perform antigenicity-guided structural design of HIV gp140 envelope protein (Env) trimer constructs to delineate mechanism and fix conformation; and design a new antigenic membrane-bound membrane proximal external region (MPER) trimer for examining immunogenic responses to the HIV viral coat protein gp41. Central to protein design methodology is the need to optimize the amino acid sequence, placement of side chains, and backbone conformations in protein structures. By developing advanced search and scoring algorithms for combinatorial optimization of protein and ligand structure and sequence, we showed that desired structure, affinity, and activity can be designed by (a) modeling improved molecular flexibility and (b) exploiting ensembles of structures for accurate predictions. Our suite of algorithms has mathematical guarantees on the solution quality (up to the accuracy of the input model, which includes the initial structures, molecular flexibility to be modeled, and an empirical molecular mechanics energy function). Specifically, our algorithms guarantee to compute the global minimum energy conformation (GMEC), a gap-free list of sequences and structures in order of predicted energy, and a provably-good approximation to the binding affinity by bounding partition functions over molecular ensembles. We propose to build on our foundation of protein design algorithms, called OSPREY, and apply them in areas of biochemical and pharmacological importance. We will (1) predict future resistance mutations in protein targets of novel drugs; (2) design inhibitors of protein:protein interactions to target today?s ?undruggable? proteins; and (3) use OSPREY to redesign and improve broadly neutralizing HIV antibodies. Improvements to our protein design algorithms will be implemented to improve accuracy and scope, and we will advance the state of the art in protein design by making algorithmic and modeling improvements to accomplish the Aims (1- 3) above, including: the modeling of more protein/ligand flexibility and improved energy functions during large- scale design; new combinatorial optimization and energy-fitting methods to accelerate the design search; and design of affinity and specificity using novel multi-state design algorithms that model thermodynamic molecular ensembles. We will test our design predictions prospectively, by making novel predicted mutant proteins and performing biochemical, biological, and structural studies. We will also validate our algorithms retrospectively, using existing structures and data. All software will be released open-source.

Public Health Relevance

Statement. We propose computational structure-based protein design algorithms that could revolutionize therapeutic treatment. Our algorithms will enable the design of proteins and other molecules to act on today's undruggable proteins and tomorrow's drug-resistant diseases. In the next grant period, we will develop novel protein design algorithms and software, and use them to (1) predict future resistance mutations to new drugs in pathogens responsible for deadly nosocomial and community-acquired infections: methicillin-resistant Staphylococcus aureus (MRSA), Escherichia coli (E. coli), vancomycin-resistant Enterococcus (VRE), and Candida glabrata; (2a) design inhibitors of protein:protein interactions (PPIs) that address the underlying genetic defect in cystic fibrosis patients and alleviate their symptoms; (2b) design a KRas PPI inhibitor to ameliorate mutations that potentiate pancreatic ductal adenocarcinoma (PDAC); and (3) design and improve broadly neutralizing antibodies against Human immunodeficiency virus (HIV).

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM078031-10
Application #
9691368
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Lyster, Peter
Project Start
2008-04-15
Project End
2022-04-30
Budget Start
2019-05-01
Budget End
2020-04-30
Support Year
10
Fiscal Year
2019
Total Cost
Indirect Cost
Name
Duke University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
044387793
City
Durham
State
NC
Country
United States
Zip Code
27705
Qi, Yang; Martin, Jeffrey W; Barb, Adam W et al. (2018) Continuous Interdomain Orientation Distributions Reveal Components of Binding Thermodynamics. J Mol Biol 430:3412-3426
Ojewole, Adegoke A; Jou, Jonathan D; Fowler, Vance G et al. (2018) BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces. J Comput Biol 25:726-739
Ojewole, Adegoke; Lowegard, Anna; Gainza, Pablo et al. (2017) OSPREY Predicts Resistance Mutations Using Positive and Negative Computational Protein Design. Methods Mol Biol 1529:291-306
Jain, Swati; Jou, Jonathan D; Georgiev, Ivelin S et al. (2017) A critical analysis of computational protein design with sparse residue interaction graphs. PLoS Comput Biol 13:e1005346
Hallen, Mark A; Jou, Jonathan D; Donald, Bruce R (2017) LUTE (Local Unpruned Tuple Expansion): Accurate Continuously Flexible Protein Design with General Energy Functions and Rigid Rotamer-Like Efficiency. J Comput Biol 24:536-546
Hallen, Mark A; Donald, Bruce R (2017) CATS (Coordinates of Atoms by Taylor Series): protein design with backbone flexibility in all locally feasible directions. Bioinformatics 33:i5-i12
Zhou, Yichao; Donald, Bruce R; Zeng, Jianyang (2017) Parallel Computational Protein Design. Methods Mol Biol 1529:265-277
Hallen, Mark A; Donald, Bruce R (2016) comets (Constrained Optimization of Multistate Energies by Tree Search): A Provable and Efficient Protein Design Algorithm to Optimize Binding Affinity and Specificity with Respect to Sequence. J Comput Biol 23:311-21
Pan, Yuchao; Dong, Yuxi; Zhou, Jingtian et al. (2016) cOSPREY: A Cloud-Based Distributed Algorithm for Large-Scale Computational Protein Design. J Comput Biol 23:737-49
Traoré, Seydou; Roberts, Kyle E; Allouche, David et al. (2016) Fast search algorithms for computational protein design. J Comput Chem 37:1048-58

Showing the most recent 10 out of 35 publications