Mass spectrometry is based on fragmenting biological molecules into smaller pieces, and using the fragment masses as a fingerprint for identifying and quantifying bio-molecules. It is the dominant technology for studying active molecules in healthy and diseased tissue, and identifying protein targets and natural products for novel therapeutics. When the initial proposal Center for Computational Mass Spectrometry (CCMS) was submitted in 2007, the lack of adequate computational tools for analyzing mass spectrometry data was the the key bottleneck. With great success in enabling applications of new experimental techniques such as FTMS, ETD, HCD, top-down mass spectrometry, and many others, the mandate of CCMS continues to be the development of next generation computational technologies and to apply them to open experimental. In this proposal, we will capitalize on our recent results in diverse subfields of computational proteomics and will further branch into previously unexplored MS applications. We will focus specifically on bridging proteomics and genomics technologies using 6 technology research and development platforms. Specifically, we will (a) apply proteogenomics approach for the discovery of abberant cancer genes and analyzing antibody repertoires;(b) sequence natural antibiotics;(c) collate spectral data through spectral archives and networks;(d) develop universal tools for peptide identification;(e) develop tools for top-down proteomics;and, (f) analyzing multiplexed spectra. The technology platforms are driven by a multitude of col- laborative biomedical studies where the use of CCMS developed tools is essential for their success. These studies include (a) unraveling the combinatorial histone code in human diseases;(b) a proteogenomics approach to studies of oral microbiome and polybacterial infections;(c) detecting inter-species chemical in- teractions;(d) developing a systems approach towards the therapeutic modulation of the acetylome;(e) developing tools for monoclonal and polyclonal antibody sequencing;(f) development of breast cancer vac- cines;(g) clinical cancer proteogenomics;(h) discovery of lantibiotics;(i) discovering proteomic biomarkers for drug toxicity in cancer patients;and, (j) identifying protein-protein interactions and post-translational mod- ifications in cataractous lens. These projects require three-way collaborative efforts on a wide range of topics involving biomedical scientists, mass spectrometrists, and computational scientists from various institutions. CCMS will also train students and practicing scientists from all over the world in computational proteomics, and educate the proteomics community about modern computational mass spectrometry to encourage its wide adoption.

Public Health Relevance

The proposal involves the development of computational technologies for analyzing mass spectrometry data that will directly impact biomedical projects, including therapuetic antibody sequencing using a combina- tion of DNA sequencing and proteomics, analyzing antibody repertoires to analyze complex immunological responses, decoding the histone code and histone isoforms that are relevant to cancer, finding aberrant proteins that represent hallmarks of cancer abnormalities, and many others.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Biotechnology Resource Grants (P41)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-N (40))
Program Officer
Sheeley, Douglas
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Biostatistics & Other Math Sci
Schools of Arts and Sciences
La Jolla
United States
Zip Code
Mohimani, Hosein; Kersten, Roland D; Liu, Wei-Ting et al. (2014) Automated genome mining of ribosomal peptide natural products. ACS Chem Biol 9:1545-51
Wang, Jian; Anania, Veronica G; Knott, Jeff et al. (2014) A turn-key approach for large-scale identification of complex posttranslational modifications. J Proteome Res 13:1190-9
Meyer, Jesse G; Kim, Sangtae; Maltby, David A et al. (2014) Expanding proteome coverage with orthogonal-specificity ?-lytic proteases. Mol Cell Proteomics 13:823-35
Wang, Mingxun; Bandeira, Nuno (2013) Spectral library generating function for assessing spectrum-spectrum match significance. J Proteome Res 12:3944-51