Software for automated interpretation of heparan sulfate tandem mass spectra the expression of heparan sulfate (HS) is required for embryonic development and the functioning of every physiological system. Present in intracellular granules, on cell surfaces and in extracellular matrices, HS binds to growth factor families and their receptors. These binding interactions serve to modulate cellular responses to growth factor stimuli in a cell-type and developmental state specific manner. The challenge to exploitation of HS structures as drugs lies in the nature of their biosynthesis and in their chemical properties. HS chains are assembled in the endoplasmic reticulum and Golgi apparatus by a series of enzymes acting in a non-template driven manner. Chains are first polymerized and then subject to modification events that produce mature chains with a regulated domain structure overlaid by substantial heterogeneity. As the HS chain biosynthesis proceeds, the number of biosynthetic enzymatic isoforms increases. These enzymes, including 6O- sulfotransferases and 3O-sulfotransferases, are expressed in a tissue and cell-type specific manner and are believed to modify substrates with isoform specificity. The result is HS chains that have phenotype-specific structure and protein binding functions. Despite the availability of mouse mutants for many of the biosynthetic enzymes, progress in HS biomedicine has suffered from the lack of widely adopted sequencing methods. Over the past few years, however, electron activated dissociation methods (ExD) have been developed in mass spectrometry laboratories. These methods, including electron detachment dissociation (EDD) and negative electron detachment dissociation (NETD) demonstrate feasibility of instrumental sequencing of HS saccharides. The advantage to these methods is that they require no or minimal derivatization, are compatible with high throughput, and provide rich structural information of the HS saccharides. The tandem mass spectra are highly complex, however, and tailor-made bioinformatics methods are necessary to convert raw data into sequences. We have demonstrated feasibility of an algorithm (HS-SEQ) for sequencing HS saccharide from ExD tandem mass spectra. We now propose to develop HS-SEQ so that it can provide better performance and full features supporting automated HS analysis, and be easily used by the wider biomedical community. The availability of mass spectrometers with NETD or EDD capability will grow rapidly over the next few years. We will develop a pipeline for data processing that includes all steps necessary to go from raw data to sequence information. This pipeline will be designed for use by biomedical scientists familiar with HS biochemistry and/or proteomics methods. The HS- SEQ pipeline will run as a web service and be available in source code and binary installer form under a Creative Commons license

Public Health Relevance

Despite the fact that heparan sulfate (HS) is necessary for all aspects of human physiology, understanding of relevant biological mechanisms suffers from the lack of widely disseminated sequencing methods. Recently developed HS sequencing methods remain the purview of experts due to lack of software for analysis of the data. We propose to develop a software pipeline tailored to the needs of non-experts for analysis of raw tandem mass spectral data on HS saccharides.

Agency
National Institute of Health (NIH)
Institute
National Heart, Lung, and Blood Institute (NHLBI)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21HL131554-01
Application #
8984998
Study Section
Special Emphasis Panel (ZRG1-OBT-L (50))
Program Officer
Sarkar, Rita
Project Start
2015-09-15
Project End
2017-06-30
Budget Start
2015-09-15
Budget End
2016-06-30
Support Year
1
Fiscal Year
2015
Total Cost
$329,703
Indirect Cost
$123,578
Name
Boston University
Department
Biochemistry
Type
Schools of Medicine
DUNS #
604483045
City
Boston
State
MA
Country
United States
Zip Code
02118
Wu, Jiandong; Wei, Juan; Hogan, John D et al. (2018) Negative Electron Transfer Dissociation Sequencing of 3-O-Sulfation-Containing Heparan Sulfate Oligosaccharides. J Am Soc Mass Spectrom 29:1262-1272
Hogan, John D; Klein, Joshua A; Wu, Jiandong et al. (2018) Software for Peak Finding and Elemental Composition Assignment for Glycosaminoglycan Tandem Mass Spectra. Mol Cell Proteomics 17:1448-1456
Zaia, Joseph; Khatri, Kshitij; Klein, Joshua et al. (2016) Complete Molecular Weight Profiling of Low-Molecular Weight Heparins Using Size Exclusion Chromatography-Ion Suppressor-High-Resolution Mass Spectrometry. Anal Chem 88:10654-10660
Huang, Yu; Mao, Yang; Zong, Chengli et al. (2015) Discovery of a heparan sulfate 3-O-sulfation specific peeling reaction. Anal Chem 87:592-600
Mao, Yang; Huang, Yu; Buczek-Thomas, Jo Ann et al. (2014) A liquid chromatography-mass spectrometry-based approach to characterize the substrate specificity of mammalian heparanase. J Biol Chem 289:34141-51
Huang, Yu; Mao, Yang; Buczek-Thomas, Jo Ann et al. (2014) Oligosaccharide substrate preferences of human extracellular sulfatase Sulf2 using liquid chromatography-mass spectrometry based glycomics approaches. PLoS One 9:e105143
Hu, Han; Huang, Yu; Mao, Yang et al. (2014) A computational framework for heparan sulfate sequencing using high-resolution tandem mass spectra. Mol Cell Proteomics 13:2490-502