Very large datasets and new models to predict and design protein interactions

Keating, Amy

Abstract

Specific protein-protein interactions are responsible for organizing the cell, for processing biological signals and information, and for the chemistry of life. Thus, understanding biological mechanism relies on understanding the interactions that occur between proteins. An important long-term goal is to develop methods for reliably predicting and rationally modifying protein-protein interactions. Such capabilities would provide insight into the molecular details of pathology and highlight opportunities for disease treatment. This proposal describes an integrated experimental/computational technology platform that will provide predictive models of protein interaction specificity. The experimental component involves constructing randomized libraries of proteins or peptides that will be sorted according to their affinities for binding a particular receptor. The identities and binding affinities for very large numbers of library members will be decoded using high-throughput sequencing methods. The data, consisting of up to 107 {sequence, affinity} pairs per sequencing run, will be used as input to computational machine learning methods. Models will be generated that capture the relationship between sequence and interactions, and the predictive power of these models will be tested experimentally. The work described in this proposal emphasizes technology development and application of the new platform to study two general types of protein complexes. First are interactions of short helical ligands with mid-sized globular proteins, here studied using anti-apoptotic Bcl-2 and Ca2+- binding EF-hand proteins. Second are interactions of short linear peptides with modular interaction domains, here PDZ and SH3 domains. These four protein families mediate an enormous number of important molecular recognition events in human cells, and the resulting models will provide valuable support to study of their biological functions. This work will also provide a stringent test of the capabilities of the proposed technology, which can then be applied to a much wider variety of molecular complexes, e.g., protein-protein, protein-small molecule and protein-nucleic acid assemblies. Given the paucity of high- throughput methods for accurately measuring protein-protein interactions, and the primitive capabilities of most computational models for predicting protein binding, the proposed technology platform has the potential to dramatically transform the study of protein interaction specificity.

Public Health Relevance

Specific protein-protein interactions underlie all biological processes. Knowledge of interactions that occur in healthy vs. diseased tissues, coupled with methods for inhibiting such interactions, would dramatically expand opportunities to treat human disease. This proposal describes a new technology for advancing the measurement, prediction and design of protein complexes.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM096466-04
Application #: 8538461
Study Section: Special Emphasis Panel (ZRG1-BCMB-A (51))
Program Officer: Wehrle, Janna P

Project Start: 2010-09-30
Project End: 2015-08-31
Budget Start: 2013-09-01
Budget End: 2014-08-31
Support Year: 4
Fiscal Year: 2013
Total Cost: $387,824
Indirect Cost: $156,287

Institution

Name: Massachusetts Institute of Technology
Department: Biology
Type: Schools of Arts and Sciences
DUNS #: 001425594

City: Cambridge
State: MA
Country: United States
Zip Code: 02139

Related projects


NIH 2014 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology
NIH 2013 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology	$387,824
NIH 2012 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology	$395,872
NIH 2012 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology	$88,280
NIH 2011 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology	$375,899
NIH 2010 R01 GM	Very large datasets and new models to predict and design protein interactions Keating, Amy E. / Massachusetts Institute of Technology	$411,867

Publications

Jenson, Justin M; Xue, Vincent; Stretz, Lindsey et al. (2018) Peptide design by optimization on a data-parameterized protein interaction landscape. Proc Natl Acad Sci U S A 115:E10342-E10351

Rodríguez-Martínez, José A; Reinke, Aaron W; Bhimsaria, Devesh et al. (2017) Combinatorial bZIP dimers display complex DNA-binding specificity landscapes. Elife 6:

Reich, Lothar Luther; Dutta, Sanjib; Keating, Amy E (2016) Generating High-Accuracy Peptide-Binding Data in High Throughput with Yeast Surface Display and SORTCERY. Methods Mol Biol 1414:233-47

Foight, Glenna Wink; Keating, Amy E (2016) Comparison of the peptide binding preferences of three closely related TRAF paralogs: TRAF2, TRAF3, and TRAF5. Protein Sci 25:1273-89

Reich, Lothar Luther; Dutta, Sanjib; Keating, Amy E (2015) SORTCERY-A High-Throughput Method to Affinity Rank Peptide Ligands. J Mol Biol 427:2135-50

Comments

Be the first to comment on Amy Keating's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: