The analysis of protein complexes and interaction networks, and their dynamic behavior as a function of time and cell state, are of central importance in biological research. The recent technological advances have made affinity purification and mass spectrometry (AP/MS) a high-throughput and widely used technique. However, the development of computational tools for AP/MS data has lagged behind. While a number of approaches have being developed for topology-based analysis of interaction networks, these methods were optimized for very specific types of AP/MS data, and are not generally applicable in most experiments. Thus, this proposal addresses the critical mismatch that currently exists between the type of data being generated and the availability of appropriate computational tools for processing these data. To this end, we have recently demonstrated the great utility of label-free quantitative protein information such as spectral counts that can be extracted from AP/MS data. Building upon this work, we will develop a robust computational framework for significance analysis of individual protein-protein interactions in AP/MS studies via statistical modeling of quantitative profiles of bait and prey proteins across multiple purifications. The proposed method will allow combining and comparing protein interaction data across different laboratories and experimental platforms. Furthermore, this work will enable more accurate reconstruction of protein complexes from AP/MS data, as well as the analysis of changes in the networks as a function of the cell states or in response to an external perturbation. By integrating the interaction probabilities derived from AP/MS data with the higher level information such as functional genomics-based predictions, we will further improve the sensitivity of detecting protein interactions. As a result of this work, we will gain a better understanding of the sources of false positive protein interactions, which in turn will help in designing future experiments. In collaboration with biologists, we will apply our methods in several key areas of biological research linked through their significance for fundamental understanding of cell signaling. It will involve large-scale analysis of human protein kinases, phosphatases, and other signaling proteins and their interactions, including measuring dynamic changes in the interactome. We will also provide the proteomic community with a set of open source and freely available computational tools, as well as orthogonally validated reference datasets for benchmarking and further development of computational methods for AP/MS data.

Public Health Relevance

The proposed computational work will enable statistically robust and quantitative analysis of protein-protein interactions and protein complexes using affinity purification - mass spectrometry (AP/MS) approach. The bioinformatics methods will allow establishing a computational framework for quality assessment, analysis, modeling, and cross-laboratory comparison of AP/MS data. The tools and methods will be of great utility for both large collaborative interactome projects and small scale studies. All computational tools developed as a part of this proposal will be made freely available to the research community.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM094231-04
Application #
8535788
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Brazhnik, Paul
Project Start
2010-09-01
Project End
2014-08-31
Budget Start
2013-09-01
Budget End
2014-08-31
Support Year
4
Fiscal Year
2013
Total Cost
$377,586
Indirect Cost
$112,790
Name
University of Michigan Ann Arbor
Department
Pathology
Type
Schools of Medicine
DUNS #
073133571
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109
Meyer, Jesse G; Mukkamalla, Sushanth; Steen, Hanno et al. (2017) PIQED: automated identification and quantification of protein modifications from DIA-MS data. Nat Methods 14:646-647
Xu, Tao; Park, Sung-Soo; Giaimo, Benedetto Daniele et al. (2017) RBPJ/CBF1 interacts with L3MBTL3/MBT1 to promote repression of Notch signaling via histone demethylase KDM1A/LSD1. EMBO J 36:3232-3249
da Veiga Leprevost, Felipe; Grüning, Björn A; Alves Aflitos, Saulo et al. (2017) BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics 33:2580-2582
Kong, Andy T; Leprevost, Felipe V; Avtonomov, Dmitry M et al. (2017) MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat Methods 14:513-520
Perez-Riverol, Yasset; Bai, Mingze; da Veiga Leprevost, Felipe et al. (2017) Discovering and linking public omics data sets using the Omics Discovery Index. Nat Biotechnol 35:406-409
Rosenberger, George; Bludau, Isabell; Schmitt, Uwe et al. (2017) Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat Methods 14:921-927
Santos, Renata M; Nogueira, Fabio C S; Brasil, Aline A et al. (2017) Quantitative proteomic analysis of the Saccharomyces cerevisiae industrial strains CAT-1 and PE-2. J Proteomics 151:114-121
Rolland, Delphine C M; Basrur, Venkatesha; Jeon, Yoon-Kyung et al. (2017) Functional proteogenomics reveals biomarkers and therapeutic targets in lymphomas. Proc Natl Acad Sci U S A 114:6581-6586
Navarro, Pedro; Kuharev, Jörg; Gillet, Ludovic C et al. (2016) A multicenter study benchmarks software tools for label-free proteome quantification. Nat Biotechnol 34:1130-1136
Avtonomov, Dmitry M; Raskind, Alexander; Nesvizhskii, Alexey I (2016) BatMass: a Java Software Platform for LC-MS Data Visualization in Proteomics and Metabolomics. J Proteome Res 15:2500-9

Showing the most recent 10 out of 65 publications