Mass spectrometry (MS) based proteomics has emerged as a key technology in the search for disease- associated biomarkers. State-of-the-art instruments can identify thousands of proteins in a single sample by 'shotgun'proteomic analysis, where protein mixtures are proteolyzed into peptides, separated by one or more chromatographic steps, and analyzed by peptide dissociation using tandem mass spectrometry (MS/MS). The goal of this approach is to create new technologies for the accurate detection of proteins within complex samples. Achieving this target is currently limited by the major problem of inferring the peptide sequence from MS/MS spectra by sequence database searching: spectra are compared to """"""""model spectra"""""""" generated from database sequences. Current algorithms suffer from poor accuracy and discrimination due to the use of simple models for predicting spectra, which ignores the rich information contained in the relative intensities of peaks in a typical MS/MS. Consequently, there is a vital need for more accurate models to predict MS/MS spectrum intensities from peptide sequences. In this proposal, we will develop a new and innovative kinetic model for predicting peptide fragmentation MS/MS spectra, and use the model to develop MS/MS identification algorithms with high discrimatory power. Spectra simulated by the kinetic model will then be used to design selected reaction monitoring (SRM) assays, which have become a critically important technique for measuring targeted sets of proteins in human biomarker studies. This will solve a bottleneck for widespread adoption of SRM methods for biomarker discovery, which is currently hindered by the slow process of identifying and optimizing SRM transitions for the assays. The following specific aims are (1) Develop an optimized kinetic model of gas-phase peptide fragmentation which predicts MS/MS spectra for any peptide sequence. Model parameters will be fit using the Levenberg- Marquardt algorithm, a robust method for non-linear least squares. (2) Extend the model to predict MS/MS fragmentation of phosphopeptides. The approaches developed in this aim can be extended to other disease- relevant post-translational modifications which profoundly alter peptide fragmentation and interfere with MS/MS identification. (3) Develop a route to successful implementation of spectrum-to-spectrum matching algorithms, an entirely new approach for large scale identification of proteins, in which MS/MS are searched directly against libraries of predicted spectra, simulated using our prototype kinetic model. We use predicted spectra to bypass the need for sequence databases, and spectrum-to-sequence strategies altogether. (4) Develop an algorithm for de novo prediction of selected reaction monitoring (SRM) assays for highly multiplexed quantitative measurement of proteins in complex mixtures.

Public Health Relevance

Mass spectrometry-based proteomics has emerged as a key technology in the search for useful protein biomarkers, and holds many promises for early detection of disease, prediction of drug efficacy and resistance, and targeted molecular therapies. The field is currently limited by the major problem of inferring the peptide sequence from a fragmentation mass spectrum - until this problem is solved, many potential applications of proteomics to human health will not be achieved. We will develop a kinetic model to predict peptide fragmentation spectra for any peptide sequence;a method that will enable comprehensive protein profiling in human biofluids, and the rapid design of selected reaction monitoring (SRM) assays, which have become a critically important technique for measuring targeted sets of proteins in human biomarker studies.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA155453-03
Application #
8504800
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Ossandon, Miguel
Project Start
2011-09-01
Project End
2016-07-31
Budget Start
2013-08-01
Budget End
2014-07-31
Support Year
3
Fiscal Year
2013
Total Cost
$295,501
Indirect Cost
$100,451
Name
University of Colorado at Boulder
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
007431505
City
Boulder
State
CO
Country
United States
Zip Code
80309
Brown, Robert; Stuart, Scott A; Stuart, Scott S et al. (2015) Large-Scale Examination of Factors Influencing Phosphopeptide Neutral Loss during Collision Induced Dissociation. J Am Soc Mass Spectrom 26:1128-42
Long, Jun; Tokhunts, Robert; Old, William M et al. (2015) Identification of a family of fatty-acid-speciated sonic hedgehog proteins, whose members display differential biological properties. Cell Rep 10:1280-1287
Yen, Chia-Yu; Houel, Stephane; Ahn, Natalie G et al. (2011) Spectrum-to-spectrum searching using a proteome-wide spectral library. Mol Cell Proteomics 10:M111.007666
Houel, Stephane; Abernathy, Robert; Renganathan, Kutralanathan et al. (2010) Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies. J Proteome Res 9:4152-60