This application seeks support for a center of excellence in computational mass spectrometry and a national and international resource in the broad area of proteomics. It proposes to enlarge the current research activities, to branch into previously unexplored areas of computational proteomics, and to support multiple collaborative efforts. The proposal addresses the computational bottleneck that affects the entire proteomics community and impairs interpretation of data in thousands of experimental labs around the world. The goal is to bring the modern algorithmic technologies to mass-spectrometry and to build a new generation of reliable open access software tools to support both new development in mass-spectrometry instrumentation and the emerging applications of mass-spectrometry. The proposal focuses on four directions: (i) enabling complex mass spectrometry searches, (ii) analyzing unknown proteomes without protein databases, (iii) analyzing altered proteomes, and (iv) constructing proteogenomic annotations and analyzing pathways. These directions cover both well-studied but still inadequately addressed problems (like search for mutations and post-translational modifications) and unexplored problems for which there are no computational tools currently available (like antibody sequencing or analyzing fusion proteins in cancer). These projects require two-way collaborative efforts on a wide range of topics involving biomedical and computational scientists from various institutions. While many collaborations have been already established at San Diego (UCSD and Burnham Institute), sixteen other US universities, hospitals and biotechnology companies, as well as foreign research institutions at Germany, Singapore, Spain, Sweden, and United Kingdom, we propose to further extend these collaborations by developing robust open access mass spectrometry software that will catalyze the exchanges between experimental and computational researchers in proteomics. The biomedical applications addressed in these collaborative projects include but are not limited to (i) discovery of cancer biomarkers, (ii) elucidation of changes in aged cataractous lens, (iii) understanding how bacteria adjust to antibiotics and other harsh conditions, (iv) addressing the need to constantly reformulate the influenza vaccine to make it efficient, and (v) sequencing of snake venoms that proved instrumental in design of blood clotting drugs. Educational activities in the area of computational proteomics will also be developed, including short courses, a seminar program, an annual conference, and concerted education of students and postdocs.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Biotechnology Resource Grants (P41)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-Q (40))
Program Officer
Sheeley, Douglas
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Biostatistics & Other Math Sci
Schools of Arts and Sciences
La Jolla
United States
Zip Code
Mohimani, Hosein; Kersten, Roland D; Liu, Wei-Ting et al. (2014) Automated genome mining of ribosomal peptide natural products. ACS Chem Biol 9:1545-51
Wang, Jian; Anania, Veronica G; Knott, Jeff et al. (2014) A turn-key approach for large-scale identification of complex posttranslational modifications. J Proteome Res 13:1190-9
Meyer, Jesse G; Kim, Sangtae; Maltby, David A et al. (2014) Expanding proteome coverage with orthogonal-specificity ?-lytic proteases. Mol Cell Proteomics 13:823-35
Wang, Mingxun; Bandeira, Nuno (2013) Spectral library generating function for assessing spectrum-spectrum match significance. J Proteome Res 12:3944-51