Current mass spectrometers have benefited from a technological revolution over recent years, with great increases to speed and resolution as well as significant improvements in fragmentation capabilities for biomolecule analyses. Proteomics results that were previously unthinkable only a few years ago are now routine and being performed by hundreds of laboratories across the world. However, despite these advancements and successes, the software systems guiding mass spectrometry data acquisition have not seen technology improvements on the same scale. As such, these new mass spectrometers have not yet reached their full potential, particularly for intact protein characterization where analytes are more varied than peptides and acquisition parameters are tougher to generalize. A software named AutoProt is being developed to automate the targeted characterization of proteoforms from a proteoform family. AutoProt controls the acquisition of mass spectrometry data, initiating a data-driven fragmentation routine when proteoforms-of-interest from a proteoform family are detected in survey scans. As each fragmentation data scan is collected, AutoProt matches experimental data to theoretical proteoform fragments and uses the corresponding results to make an informed decision on how to further characterize the proteoform. By using instant feedback, fragmentation settings are customized specifically and in real-time for each proteoform being analyzed, leading to better characterization results in less overall time. In Phase I, AutoProt will be further developed to handle the characterization of multiple proteoforms such as modified and truncated forms from the same proteoform family in the same acquisition run, enabling a wider assessment of the proteoforms present in a sample (e.g., all the proteoforms from the same UniProt gene accession after immunoprecipitation). A database will be integrated into AutoProt to store run data and allow sample characterization to be resumed for additional acquisition runs. Furthermore, the characterization routine will be made more robust in this grant through the collection of a large amount of fragmentation data on many proteoforms and then the distillation of that data into a general fragmentation model. AutoProt will then be able to broadly characterize proteoforms with different properties (e.g., molecular weight and fragmentation propensities). Lastly, AutoProt will be outfitted with report generation to automatically output experimental report files compatible with the proteoform data visualization tool, TDCollider. With these features in place, AutoProt will be a first-of-its-kind automated characterization software with first-class proteoform characterization abilities and acquisition-to-report capabilities, eliminating time- consuming set up of acquisition and processing for proteoform characterization data.

Public Health Relevance

This novel mass spectrometry acquisition software enables detailed analyses of disease-related proteoforms and extensive characterization of therapeutic proteins. The software will make top down mass spectrometry and the important proteoform information that it can uncover more accessible for a higher number of researchers to benefit from complete proteoform characterization data.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Krepkiy, Dmitriy
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Proteinaceous, Inc.
United States
Zip Code