Software for Mass Spectral Data Conversion / Automatic Proteomic Analysis

Costello, Catherine

Abstract

This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. Primary support for the subproject and the subproject's principal investigator may have been provided by other sources, including other NIH sources. The Total Cost listed for the subproject likely represents the estimated amount of Center infrastructure utilized by the subproject, not direct funding provided by the NCRR grant to the subproject or subproject staff. With a multitude of different MS instrumentation and data analysis software platforms available for MS and proteomics, it becomes difficult to manipulate and manage various data sets. We are creating new software to process protein and peptide MS and MS/MS data, locate and assign post-translational modifications (PTMs) and to help place the results into biological context. The BUPID program was developed in C under Linux and made accessible to the main program through a CGI based web interface. The shell data conversion program was written to implement a user friendly GUI interface which may be operated in an unattended batch processing mode. Testing of the program was performed on existing MALDI-TOF MS, MALDI-FT MS and LC MS/MS data sets obtained in house. The program allowed the conversion of large volumes of data obtained on different instruments to the formats of several commercially and publicly available search engines. Files were then submitted for protein identification to the search engines with the search settings specified by the user. Our implementation of the mzXML format introduced by the Institute for Systems Biology afforded the benefits of a common data format for summation of results obtained on different MS platforms, comparative analysis of MS methodology and archiving of data. We added capabilities for interpretation of top-down tandem mass spectra of proteins (BUPID-top down) and for linking of database assignments to functionality of proteins (STRAP). These results were presented as posters at ASMS and other scientific meetings;the STRAP manuscript was published in early 2010. A further dedition (STRAP-PTM) is now being developed to assign and map PTMs. The search algorithm Boston University Protein Identifier (BUPID) provides a robust and accurate statistical model for protein identification using MS data. The algorithm offers a number of important features: 1. Using log-likelihood ratio as scoring function, the algorithm can best distinguish correctly assigned peptides from incorrect assignments. 2. Matching peaks with a background-dependent threshold offers more flexibility and accuracy than the traditional mass window. 3. The statistical model provides similar or better results with comparison to conventional database search engines. We use log-likelihood ratio to calculate the probability that a protein is present in the sample. The model distinguishes two hypotheses (1) H0: That a set of peaks in the spectrum is generated by the random background;and (2) HA: That the same set of peaks is generated by peptides corresponding to a specific protein. A peak is included in the set if the probability that it is produced by the protein is more significant than that it is otherwise produced by the random background. Final results are ranked by the E-value of their probability score using the sequence information of the protein. We have compared the performance of the BUPID server and several other public web-based database search engines. Recent efforts have focused on interpretation of top-down tandem MS data from the LTQ-Orbitrap MS and ICR-FTMS instruments. A manuscript that includes the use of BUPID Top-down was recently published in the Int. J. Mass Spectrom. A paper describing STRAP was also published and a manuscript on STRAP-PTM is now under revision.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Center for Research Resources (NCRR)
Type: Biotechnology Resource Grants (P41)
Project #: 5P41RR010888-15
Application #: 8365501
Study Section: Special Emphasis Panel (ZRG1-BCMB-H (40))

Project Start: 2011-06-01
Project End: 2012-08-09
Budget Start: 2011-06-01
Budget End: 2012-08-31
Support Year: 15
Fiscal Year: 2011
Total Cost: $4,616
Indirect Cost

Institution

Name: Boston University
Department: Biochemistry
Type: Schools of Medicine
DUNS #: 604483045

City: Boston
State: MA
Country: United States
Zip Code: 02118

Related projects

Publications

Lu, Yanyan; Jiang, Yan; Prokaeva, Tatiana et al. (2017) Oxidative Post-Translational Modifications of an Amyloidogenic Immunoglobulin Light Chain Protein. Int J Mass Spectrom 416:71-79

Sethi, Manveen K; Zaia, Joseph (2017) Extracellular matrix proteomics in schizophrenia and Alzheimer's disease. Anal Bioanal Chem 409:379-394

Hu, Han; Khatri, Kshitij; Zaia, Joseph (2017) Algorithms and design strategies towards automated glycoproteomics analysis. Mass Spectrom Rev 36:475-498

Ji, Yuhuan; Bachschmid, Markus M; Costello, Catherine E et al. (2016) S- to N-Palmitoyl Transfer During Proteomic Sample Preparation. J Am Soc Mass Spectrom 27:677-85

Hu, Han; Khatri, Kshitij; Klein, Joshua et al. (2016) A review of methods for interpretation of glycopeptide tandem mass spectral data. Glycoconj J 33:285-96

Pu, Yi; Ridgeway, Mark E; Glaskin, Rebecca S et al. (2016) Separation and Identification of Isomeric Glycans by Selected Accumulation-Trapped Ion Mobility Spectrometry-Electron Activated Dissociation Tandem Mass Spectrometry. Anal Chem 88:3440-3

Wang, Yun Hwa Walter; Meyer, Rosana D; Bondzie, Philip A et al. (2016) IGPR-1 Is Required for Endothelial Cell-Cell Adhesion and Barrier Function. J Mol Biol 428:5019-5033

Srinivasan, Srimathi; Chitalia, Vipul; Meyer, Rosana D et al. (2015) Hypoxia-induced expression of phosducin-like 3 regulates expression of VEGFR-2 and promotes angiogenesis. Angiogenesis 18:449-62

Yu, Xiang; Sargaeva, Nadezda P; Thompson, Christopher J et al. (2015) In-Source Decay Characterization of Isoaspartate and ?-Peptides. Int J Mass Spectrom 390:101-109

Steinhorn, Benjamin S; Loscalzo, Joseph; Michel, Thomas (2015) Nitroglycerin and Nitric Oxide--A Rondo of Themes in Cardiovascular Therapeutics. N Engl J Med 373:277-80

Showing the most recent 10 out of 253 publications

Comments

Be the first to comment on Catherine Costello's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: