My colleagues and I are developing methods for matching peptide mass spectra to entries in a protein sequence database. Our goals are to derive easily-interpreted tests of statistical significance, investigate the utility of restricting the search database by species of origin, and provide open source code that can be further tested and improved by other investigators. Work to date had focused on development of an objective measure of spectral noisiness and its relationship to success in protein identification using exisiting commercial matching algorithms. We have found, surprisingly, that spectral noise is a rather minor factor, with limitations in the matching algorithms being more often responsible for identification failure. We have also developed a preliminary version of OMSSA, an Open Mass Spectrometry Serch Algorithm. Tests have shown that the search hueristics used in OMSSA are as fast as those in commercial algorithms, and furthermore that OMSSA's significance test statistics offer somewhat improved signal-to-noise properties.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM101703-01
Application #
6828684
Study Section
(CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
2003
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Xu, Ming; Geer, Lewis Y; Bryant, Stephen H et al. (2005) Assessing data quality of peptide mass spectra obtained by quadrupole ion trap mass spectrometry. J Proteome Res 4:300-5
Geer, Lewis Y; Markey, Sanford P; Kowalak, Jeffrey A et al. (2004) Open mass spectrometry search algorithm. J Proteome Res 3:958-64