This application seeks support for a center of excellence in computational mass spectrometry and a national and international resource in the broad area of proteomics. It proposes to enlarge the current research activities, to branch into previously unexplored areas of computational proteomics, and to support multiple collaborative efforts. The proposal addresses the computational bottleneck that affects the entire proteomics community and impairs interpretation of data in thousands of experimental labs around the world. The goal is to bring the modern algorithmic technologies to mass-spectrometry and to build a new generation of reliable open access software tools to support both new development in mass-spectrometry instrumentation and the emerging applications of mass-spectrometry. The proposal focuses on four directions: (i) enabling complex mass spectrometry searches, (ii) analyzing unknown proteomes without protein databases, (iii) analyzing altered proteomes, and (iv) constructing proteogenomic annotations and analyzing pathways. These directions cover both well-studied but still inadequately addressed problems (like search for mutations and post-translational modifications) and unexplored problems for which there are no computational tools currently available (like antibody sequencing or analyzing fusion proteins in cancer). These projects require two-way collaborative efforts on a wide range of topics involving biomedical and computational scientists from various institutions. While many collaborations have been already established at San Diego (UCSD and Burnham Institute), sixteen other US universities, hospitals and biotechnology companies, as well as foreign research institutions at Germany, Singapore, Spain, Sweden, and United Kingdom, we propose to further extend these collaborations by developing robust open access mass spectrometry software that will catalyze the exchanges between experimental and computational researchers in proteomics. The biomedical applications addressed in these collaborative projects include but are not limited to (i) discovery of cancer biomarkers, (ii) elucidation of changes in aged cataractous lens, (iii) understanding how bacteria adjust to antibiotics and other harsh conditions, (iv) addressing the need to constantly reformulate the influenza vaccine to make it efficient, and (v) sequencing of snake venoms that proved instrumental in design of blood clotting drugs. Educational activities in the area of computational proteomics will also be developed, including short courses, a seminar program, an annual conference, and concerted education of students and postdocs.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Biotechnology Resource Grants (P41)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-Q (40))
Program Officer
Sheeley, Douglas
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Biostatistics & Other Math Sci
Schools of Arts and Sciences
La Jolla
United States
Zip Code
Na, Seungjin; Payne, Samuel H; Bandeira, Nuno (2016) Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks. Mol Cell Proteomics 15:3501-3512
Bouslimani, Amina; Melnik, Alexey V; Xu, Zhenjiang et al. (2016) Lifestyle chemistries from phones for individual profiling. Proc Natl Acad Sci U S A 113:E7645-E7654
Kolmogorov, Mikhail; Liu, Xiaowen; Pevzner, Pavel A (2016) SpectroGene: A Tool for Proteogenomic Annotations Using Top-Down Spectra. J Proteome Res 15:144-51
Liu, Guomin; Knight, James D R; Zhang, Jian Ping et al. (2016) Data Independent Acquisition analysis in ProHits 4.0. J Proteomics 149:64-68
Mohimani, Hosein; Pevzner, Pavel A (2016) Dereplication, sequencing and identification of peptidic natural products: from genome mining to peptidogenomics to spectral networks. Nat Prod Rep 33:73-86
Guthals, Adrian; Boucher, Christina; Bandeira, Nuno (2015) The generating function approach for Peptide identification in spectral networks. J Comput Biol 22:353-66
Wang, Jian; Tucholska, Monika; Knight, James D R et al. (2015) MSPLIT-DIA: sensitive peptide identification for data-independent acquisition. Nat Methods 12:1106-8
Safonova, Yana; Bonissone, Stefano; Kurpilyansky, Eugene et al. (2015) IgRepertoireConstructor: a novel algorithm for antibody repertoire construction and immunoproteogenomics analysis. Bioinformatics 31:i53-61
Bouslimani, Amina; Porto, Carla; Rath, Christopher M et al. (2015) Molecular cartography of the human skin surface in 3D. Proc Natl Acad Sci U S A 112:E2120-9
Woo, Sunghee; Cha, Seong Won; Bonissone, Stefano et al. (2015) Advanced Proteogenomic Analysis Reveals Multiple Peptide Mutations and Complex Immunoglobulin Peptides in Colon Cancer. J Proteome Res 14:3555-67

Showing the most recent 10 out of 43 publications