Methods for Assessing Confidence Level of Protein ID by Mass Spectrometry

Eriksson, Jan

Abstract

Mass spectrometry (MS) in combination with database searching is a popular and presumably accurate method for identifying proteins from species with known genomes. Proteins separated on a gel, digested with, e.g., trypsin, and extracted from the gel yield specific peptides, which can subsequently be analyzed by MS. The distribution of tryptic peptide masses, a so-called tryptic peptide map, is a protein fingerprint and can be compared with the sequence information stored in a database. Various scoring methods have been developed to find the protein candidate with the highest degree of similarity to the experimentally obtained peptide map. Due to imperfections in the separation and extraction, contamination during processing, etc., the tryptic peptide map is typically incomplete with respect to the protein identified and also contains a background of tryptic peptide masses from one or several other proteins. This means that a protein identification may not always be accurate and unambiguous. In light of some apparent problems in distinguishing good protein-identification results from more uncertain ones, we are currently developing methods for determining the quality of protein identification by mass spectrometry. The approach is to perform protein identifications on hypothetical sets of tryptic peptide mass data generated by a computer. This gives us complete control over the quality of the data and allows us to vary the data as well as the search parameters in many different ways. Protein cont... identification based on realistic but random hypothetical data sets are particularly useful. Independently of the scoring method used in the identification, repeated use of random data sets can generate the probability density function for protein identification by chance.
Knowing this function under the conditions of a particular experiment, such as the size and mass accuracy of a peptide map, one can test the hypothesis that the identification score is an observation from a random distribution. This in turn allows a confidence level to be assigned to the identification. Furthermore, statistical analyses of protein identification with hypothetical data will allow us to determine the quality of currently employed scoring methods. Improved insight into what features characterize a good scoring method will guide future efforts to refine such methods for protein identification.
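The idea of estimating the chance-identification score distribution from repeated random data sets can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not the project's actual method: the score is a simple count of matching masses, the "database" is a toy list of random mass lists, random peptide maps are drawn uniformly between 800 and 3000 Da (a real study would use more realistic mass distributions), and all function names are hypothetical.

```python
import bisect
import random


def count_matches(peptide_map, protein_masses, tol):
    """Count query masses within +/- tol Da of any mass in the sorted list protein_masses."""
    hits = 0
    for m in peptide_map:
        i = bisect.bisect_left(protein_masses, m - tol)
        if i < len(protein_masses) and protein_masses[i] <= m + tol:
            hits += 1
    return hits


def best_score(peptide_map, database, tol):
    """Score of the top-ranked protein candidate (toy score: number of matched masses)."""
    return max(count_matches(peptide_map, masses, tol) for masses in database)


def random_map(rng, n_peptides, lo=800.0, hi=3000.0):
    """Hypothetical random peptide map; uniform masses are a simplifying assumption."""
    return [rng.uniform(lo, hi) for _ in range(n_peptides)]


def null_scores(database, n_peptides, tol, n_trials, rng):
    """Top scores obtained by searching the database with purely random peptide maps."""
    return [
        best_score(random_map(rng, n_peptides), database, tol)
        for _ in range(n_trials)
    ]


def empirical_p_value(observed, null):
    """Fraction of random searches scoring at least as high as the observed score.

    The +1 terms keep the estimate from ever being exactly zero.
    """
    exceed = sum(1 for s in null if s >= observed)
    return (exceed + 1) / (len(null) + 1)


rng = random.Random(0)

# Toy database: 200 "proteins", each a sorted list of 30 peptide masses.
database = [sorted(random_map(rng, 30)) for _ in range(200)]

# Score distribution for identification by chance under the experiment's
# conditions (map size of 10 masses, mass tolerance of 0.5 Da).
null = null_scores(database, n_peptides=10, tol=0.5, n_trials=500, rng=rng)

# A simulated "real" map: 8 peptides from one protein plus 2 background masses.
true_map = rng.sample(database[0], 8) + random_map(rng, 2)
observed = best_score(true_map, database, tol=0.5)
p = empirical_p_value(observed, null)
print(f"observed score = {observed}, empirical p-value = {p:.4f}")
```

A small p-value indicates that the observed score is unlikely to be a draw from the chance distribution. Because the simulation only calls `best_score`, the same procedure applies to any scoring method substituted for the toy match count, in line with the abstract's point that the approach is independent of the scoring method.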

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Center for Research Resources (NCRR)
Type: Biotechnology Resource Grants (P41)
Project #: 5P41RR000862-26
Application #: 6118330

Project Start: 1998-12-10
Project End: 1999-11-30
Budget Start: 1998-10-01
Budget End: 1999-09-30
Support Year: 26
Fiscal Year: 1999

Institution

Name: Rockefeller University
DUNS #: 071037113

City: New York
State: NY
Country: United States
Zip Code: 10065

Publications

Manning, Lois R; Popowicz, Anthony M; Padovan, Julio C et al. (2017) Gel filtration of dilute human embryonic hemoglobins reveals basis for their increased oxygen binding. Anal Biochem 519:38-41

Boice, Michael; Salloum, Darin; Mourcin, Frederic et al. (2016) Loss of the HVEM Tumor Suppressor in Lymphoma and Restoration by Modified CAR-T Cells. Cell 167:405-418.e13

Chait, Brian T; Cadene, Martine; Olinares, Paul Dominic et al. (2016) Revealing Higher Order Protein Structure Using Mass Spectrometry. J Am Soc Mass Spectrom 27:952-65

Krutchinsky, Andrew N; Padovan, Júlio C; Cohen, Herbert et al. (2015) Maximizing ion transmission from atmospheric pressure into the vacuum of mass spectrometers with a novel electrospray interface. J Am Soc Mass Spectrom 26:649-58

Mast, Fred D; Rachubinski, Richard A; Aitchison, John D (2015) Signaling dynamics and peroxisomes. Curr Opin Cell Biol 35:131-6

Krutchinsky, Andrew N; Padovan, Júlio C; Cohen, Herbert et al. (2015) Optimizing electrospray interfaces using slowly diverging conical duct (ConDuct) electrodes. J Am Soc Mass Spectrom 26:659-67

Oricchio, Elisa; Papapetrou, Eirini P; Lafaille, Fabien et al. (2014) A cell engineering strategy to enhance the safety of stem cell therapies. Cell Rep 8:1677-1685

Zhong, Yu; Morris, Deanna H; Jin, Lin et al. (2014) Nrbf2 protein suppresses autophagy by modulating Atg14L protein-containing Beclin 1-Vps34 complex architecture and reducing intracellular phosphatidylinositol-3 phosphate levels. J Biol Chem 289:26021-37

Xue, John Z; Woo, Eileen M; Postow, Lisa et al. (2013) Chromatin-bound Xenopus Dppa2 shapes the nucleus by locally inhibiting microtubule assembly. Dev Cell 27:47-59

Indiani, Chiara; O'Donnell, Mike (2013) A proposal: Source of single strand DNA that elicits the SOS response. Front Biosci (Landmark Ed) 18:312-23

Showing the most recent 10 out of 67 publications