The critical protein actors in biological systems are the intact proteoforms, namely the different forms of proteins produced from the genome in a variety of splice forms and adorned with a myriad of post-translational modifications that dramatically affect their function. Lamentably, today's bottom-up proteomic technologies, which identify and quantify peptides derived from proteins, rather than the proteins themselves, fail to deliver crucial protein-level information to biologists. We address this problem here, by developing a new paradigm for proteomic analyses, which uses intact mass and lysine count to identify and quantify the proteoforms present in a biological sample. Our strategy integrates state-of-the-art proteomic and genomic technologies. We couple a new generation of high resolution mass spectrometers and a novel isotopic tagging strategy to yield for each proteoform its accurate mass, its number of lysines, and its relative abundance. Parallel RNA-Seq analyses on the same sample will provide transcriptomic data to construct a tightly focused sample-specific proteoform database. Comparison of the mass spectrometric data with the proteoform database will reveal the proteoform identities present in the sample, and the isotopic tagging strategy will provide relative abundance. This information will comprise a revolutionary new tool for biology.

Public Health Relevance

The goal of this proposal is to provide scientists with new tools to identify and quantify the protein variations, at the whole protein level, expressed in eac individual sample under study. This information will facilitate improved understanding of the functional relevance of genetic variations and lead to advances in the diagnosis and treatment of complex diseases.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM114292-03
Application #
9206176
Study Section
Enabling Bioanalytical and Imaging Technologies Study Section (EBIT)
Program Officer
Edmonds, Charles G
Project Start
2015-05-01
Project End
2019-01-31
Budget Start
2017-02-01
Budget End
2018-01-31
Support Year
3
Fiscal Year
2017
Total Cost
$372,913
Indirect Cost
$125,438
Name
University of Wisconsin Madison
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
161202122
City
Madison
State
WI
Country
United States
Zip Code
53715
Zaidan, Nur Zafirah; Walker, Kolin J; Brown, Jaime E et al. (2018) Compartmentalization of HP1 Proteins in Pluripotency Acquisition and Maintenance. Stem Cell Reports 10:627-641
LeDuc, Richard D; Schwämmle, Veit; Shortreed, Michael R et al. (2018) ProForma: A Standard Proteoform Notation. J Proteome Res 17:1321-1325
Schaffer, Leah V; Rensvold, Jarred W; Shortreed, Michael R et al. (2018) Identification and Quantification of Murine Mitochondrial Proteoforms Using an Integrated Top-Down and Intact-Mass Strategy. J Proteome Res 17:3526-3536
Chen, Xiangsong; Lu, Li; Qian, Shuiming et al. (2018) Canonical and Noncanonical Actions of Arabidopsis Histone Deacetylases in Ribosomal RNA Processing. Plant Cell 30:134-152
Schaffer, Leah V; Shortreed, Michael R; Cesnik, Anthony J et al. (2018) Expanding Proteoform Identifications in Top-Down Proteomic Analyses by Constructing Proteoform Families. Anal Chem 90:1325-1333
Smith, Lloyd M; Kelleher, Neil L (2018) Proteoforms as the next proteomics currency. Science 359:1106-1107
Cesnik, Anthony J; Shortreed, Michael R; Schaffer, Leah V et al. (2018) Proteoform Suite: Software for Constructing, Quantifying, and Visualizing Proteoform Families. J Proteome Res 17:568-578
Li, Qiyao; Shortreed, Michael R; Wenger, Craig D et al. (2017) Global Post-Translational Modification Discovery. J Proteome Res 16:1383-1390
Dai, Yunxiang; Shortreed, Michael R; Scalf, Mark et al. (2017) Elucidating Escherichia coli Proteoform Families Using Intact-Mass Proteomics and a Global PTM Discovery Database. J Proteome Res 16:4156-4165
Shortreed, Michael R; Frey, Brian L; Scalf, Mark et al. (2016) Elucidating Proteoform Families from Proteoform Intact-Mass and Lysine-Count Measurements. J Proteome Res 15:1213-21

Showing the most recent 10 out of 12 publications