In the post-genomic era, proteomics is the next frontier allowing an in-depth understanding of the function of cellular systems in human diseases and the development of personalized treatments. Unlike the genome, the proteome is dynamic and highly complex due to alternative splicing and post-translational modifications (PTMs). The emerging top-down proteomics is the most powerful method to comprehensively characterize proteoforms that arise from genetic variations and PTMs, which is critical for understanding disease mechanisms and identifying new therapeutic targets. However, data analysis tools for top-down proteomics remain under- developed. With the recent rapid growth of the top-down proteomics community, there is an urgent need to develop a comprehensive analysis platform for top-down proteomics. Herein, we aim to develop MASH Explorer, a comprehensive, universal, and user-friendly software environment for top-down proteomics, which includes both an offline downloadable software package (App) and an online version for data processing and analysis (Portal).
The specific aims are: 1) to develop MASH Explorer App, a downloadable software package for top- down proteomics that can process data from various vendor formats and incorporate multiple algorithms for deconvolution and database search with user-friendly graphical interfaces; a novel deconvolution algorithm for high-accuracy data analysis and a novel search algorithm for identification of large proteins will also be developed to address the deficiency in the currently available top-down proteomcis software tools; 2) to develop novel algorithms for the automated identification and quantification of proteoform families with enhanced identification of low-abundance proteoforms; 3) to develop MASH Explorer Portal, a web-based extension of MASH Explorer App for processing and sharing top-down proteomics data. The MASH Explorer Portal features my MASH, a cloud-based personalized space for online data processing, as well as MASH public, a public database for long-term data storage and sharing to facilitate the collaboration among various groups and the entire proteomics community. The successful completion of this project will create a powerful software tool, MASH Explorer, for top-down proteomics. The MASH Explorer software will be freely available to all users. Improving the accessibility of non-proprietary free software solutions will significantly bolster the growth of the top-down proteomics community. Hence, we envision it will play integral roles in advancing the burgeoning field of top-down proteomics and realize its full potential for biomedical research.

Public Health Relevance

Top-down proteomics is the most powerful method to comprehensively characterize proteoforms that arise from genetic variations and post-translational modifications, which is critical for understanding disease mechanisms and identifying new therapeutic targets. This proposal seeks to develop a comprehensive and user- friendly software environment for top-down proteomics. The successful completion of this project will benefit not only the proteomic researchers, but also biochemists and biologists alike who are interested in studying the roles of protein modifications in human diseases.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM125085-03
Application #
9904714
Study Section
Enabling Bioanalytical and Imaging Technologies Study Section (EBIT)
Program Officer
Krepkiy, Dmitriy
Project Start
2018-06-01
Project End
2022-03-31
Budget Start
2020-04-01
Budget End
2021-03-31
Support Year
3
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Wisconsin Madison
Department
Anatomy/Cell Biology
Type
Schools of Medicine
DUNS #
161202122
City
Madison
State
WI
Country
United States
Zip Code
53715
Cai, Wenxuan; Hite, Zachary L; Lyu, Beini et al. (2018) Temperature-sensitive sarcomeric protein post-translational modifications revealed by top-down proteomics. J Mol Cell Cardiol 122:11-22