Molecular interactions play a central role in all biological processes. Akin to the complete sequencing of genomes, complete descriptions of interactomes is a fundamental step towards a deeper understanding of biological processes, and has a vast potential to impact systems biology, genomics, molecular biology and therapeutics. Protein-protein interactions (PPIs) and protein-RNA interactions (PRIs) are of particular interest as they are critical in maintenance of cellular integrity, metabolism, transcription/translation, and cell-cell communication. Although high-throughput experimental PPI and PRI data is rapidly accumulating, building complete and confident datasets requires multiple replicates of expensive screens. This proposal aims to develop new methods that will significantly advance our efforts at structure-based approaches to better predict PPIs and RPIs and boost confidence in emerging high-throughput (HTP) data with the goal of comprehensive interactome mapping at lower cost. Taken together, these methods will vastly expand our understanding of macromolecular networks. We will continue to devise structure-based methods for protein-protein interaction prediction and branch out to methods for protein-RNA interaction prediction; this represents a major shift from the purely sequence-based approaches that most bioinformatics approaches utilize to predict We will also build computational frameworks for boosting confidence in HTP protein-protein and protein-RNA interaction datasets using structure-based approaches; these frameworks will provide a comprehensive assessment of in-house and public HTP data, with potential biomedical applications such as heat shock protein-kinase interactions related to development for cancer therapeutics, MAPK6's role in a cancer-related signaling network, and (long non-coding) RNA-protein binding roles in neurodegenerative disease. Finally, we will computationally screen for PPIs and PRIs at the genome scale and expand our Struct2Net webserver to disseminate tools based on our methods and results to the community. An increasing number of HTP interaction datasets are being determined, thus presenting new opportunities to leverage this data in conjunction with structural insights to map binding sites and to uncover the underlying molecular mechanisms of cellular functions. molecular interactions and will enhance coverage and accuracy of the complete interactome. Successful completion of these aims will result in computational methods that will significantly increase our confidence in high-throughput data on protein-protein and protein-RNA interactions and will reveal fundamental aspects of their functioning, as well as testable hypotheses for experimental investigations. All developed software will be made publicly available.

Public Health Relevance

Biological processes are carried out through thousands of interactions between various types of molecules (the Interactome) that play fundamental roles in all biomedical processes including the maintenance of cellular integrity, metabolism, transcription/translation, and cell-cell communication. Understanding these interaction networks on a large scale will empower both rational, targeted drug design and more intelligent disease management. In this project, we develop computational methods for structure-based prediction of protein-protein and protein- RNA interactions, and integrate these predictions with available high-throughput genomic data to predict the Interactomes of entire species' genomes.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM081871-08
Application #
8848384
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Krepkiy, Dmitriy
Project Start
2008-04-01
Project End
2017-05-31
Budget Start
2015-06-01
Budget End
2017-05-31
Support Year
8
Fiscal Year
2015
Total Cost
Indirect Cost
Name
Massachusetts Institute of Technology
Department
Type
Organized Research Units
DUNS #
001425594
City
Cambridge
State
MA
Country
United States
Zip Code
Cho, Hyunghoon; Berger, Bonnie; Peng, Jian (2018) Generalizable visualization of mega-scale single-cell data. Res Comput Mol Biol 10812:251-253
Liu, Yang; Palmedo, Perry; Ye, Qing et al. (2018) Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks. Cell Syst 6:65-74.e3
Ordovas-Montanes, Jose; Dwyer, Daniel F; Nyquist, Sarah K et al. (2018) Allergic inflammatory memory in human respiratory epithelial progenitor cells. Nature 560:649-654
Bepler, Tristan; Morin, Andrew; Noble, Alex J et al. (2018) Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs. Res Comput Mol Biol 10812:245-247
Hie, Brian; Cho, Hyunghoon; Berger, Bonnie (2018) Realizing private and practical pharmacological collaboration. Science 362:347-350
Cho, Hyunghoon; Berger, Bonnie; Peng, Jian (2018) Generalizable and Scalable Visualization of Single-Cell Data Using Neural Networks. Cell Syst 7:185-191.e4
Orenstein, Yaron; Ohler, Uwe; Berger, Bonnie (2018) Finding RNA structure in the unstructured RBPome. BMC Genomics 19:154
Khurana, Vikram; Peng, Jian; Chung, Chee Yeun et al. (2017) Genome-Scale Networks Link Neurodegenerative Disease Genes to ?-Synuclein through Specific Molecular Pathways. Cell Syst 4:157-170.e14
Orenstein, Yaron; Kim, Ryan; Fordyce, Polly et al. (2017) Joker de Bruijn: Sequence Libraries to Cover All k-mers Using Joker Characters. Res Comput Mol Biol 10229:389-390
Orenstein, Yaron; Puccinelli, Robert; Kim, Ryan et al. (2017) Optimized Sequence Library Design for Efficient In Vitro Interaction Mapping. Cell Syst 5:230-236.e5

Showing the most recent 10 out of 50 publications