The generation of protein-protein interaction network (interactome) models with increasing quality and sensitivity is a necessary, although not sufficient, aspect in the quest of generating predictive """"""""systems"""""""" models. The yeast S. cerevisiae is an excellent model to validate this concept. Two major approaches, yielding complementary high-quality (HQ) data are used to experimentally map interactomes: i) binary physical interaction mapping;and ii) and protein complex analysis. Both approaches are required to obtain a complete view of the interactome of an organism. We recently developed a conceptual framework to assess interactome models based on quantitative benchmarking of screening and validation assays against reference sets. The framework then combines specific measurements of completeness, assay sensitivity, sampling sensitivity and precision to estimate the overall sensitivity and quality of a network model. With this we have shown that early yeast interactome data cover ~10%, our second generation map ~20%, of the yeast binary interactome with HQ interactions. For increasingly standardized and improved quality control, we developed an experimental method to assign confidence scores to individual interactions. This method uses a """"""""tool-kit"""""""" of interaction assays each benchmarked against common reference sets. Experimental validation of every interaction in the tool-kit assays enables integration of benchmark and validation data to calculate individual confidence scores. Here we propose to continue the mapping efforts for the binary interactome of S. cerevisiae. Utilizing novel technologies we aim to extend the overall sensitivity to 50%, which is both technically feasible and can be expected to enable a profoundly improved understanding of the interactome network and how it mediates genotype-to-phenotype relationships. For quality control all interactions will be validated in multiple standardized binary interaction assays and a confidence score for each individual interaction will be calculated. We will thus generate and analyze a third generation binary interactome map of S. cerevisiae.
Our specific aims are: i) to expand the binary interactome map of S. cerevisiae from ~20 to ~50% sensitivity, ii) to validate experimentally all binary interactions found in Specific Aim 1 with a novel confidence scoring strategy based on a highly-controlled and benchmarked assay tool-kit. iii) To expand the global analysis of the S. cerevisiae binary interactome network using the new high- quality map obtained in Specific Aim 2.

Public Health Relevance

The human genome project has provided us with a catalogue of all human genes. Each of the ~20.000 human genes is the blueprint for a protein. The proteins in turns assemble into molecular machines, which execute most functions of a living organism. Inside of cells and organisms proteins are binding to each other. Such protein interactions'are underlying every biological process. Likewise, malfunction of protein interactions, e.g. due to genetic mutations, can lead to disruption of normal processes and diseases. We aim to map the interactions of each and every protein discovered by genome projects. The availability of such protein interaction maps will deepen our understanding of diseases and facilitate understanding of the biochemical consequences of mutations. Importantly, currently many genetic predispositions for several diseases can be identified for which no treatment is available - understanding the molecular consequences of such mutations will greatly facilitate the search for treatments.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Feingold, Elise A
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Dana-Farber Cancer Institute
United States
Zip Code
Khurana, Vikram; Peng, Jian; Chung, Chee Yeun et al. (2017) Genome-Scale Networks Link Neurodegenerative Disease Genes to ?-Synuclein through Specific Molecular Pathways. Cell Syst 4:157-170.e14
Zhong, Quan; Pevzner, Samuel J; Hao, Tong et al. (2016) An inter-species protein-protein interaction network across vast evolutionary distance. Mol Syst Biol 12:865
Munier, Sandie; Rolland, Thomas; Diot, C├ędric et al. (2013) Exploration of binary virus-host interactions using an infectious protein complementation assay. Mol Cell Proteomics 12:2845-55
Carvunis, Anne-Ruxandra; Rolland, Thomas; Wapinski, Ilan et al. (2012) Proto-genes and de novo gene birth. Nature 487:370-4