Early detection of cancer improves patient survival. Characterizing the association of peptides and glycans with cancer is one of the most promising strategies to discover early-diagnosis cancer biomarkers. This study evaluates peptide and glycan expression profiles in the progression of chronic liver disease (CLD) to hepatocellular carcinoma (HCC) by using the liquid chromatography-mass spectrometry (LC-MS) technology. The goal is to find and validate peptide and glycan biomarkers for detection of HCC at a treatable stage in a high-risk population of patients with CLD. Label-free LC-MS quantification allows comparison of peptides and glycans with good throughput which allows us to compare a large population of patients. However, such quantification is not addressed adequately in the instrument-specific software packages. In particular, alignment and normalization of LC-MS data present a significant challenge in label-free quantification and comparison of biomolecules. This challenge coupled with biological variability and disease heterogeneity in human populations has restricted recent advances in LC-MS-based biomarker discovery studies. This project brings together experts in bioinformatics, biostatistics, biochemistry, and mass spectrometry to develop a suite of novel analytical tools for LC-MS-based label-free quantification and comparison of peptides and glycans in serum and plasma. Specifically, a novel Bayesian hierarchical model will be investigated for simultaneous alignment and normalization of LC-MS data and for identification of patient subgroups. The Bayesian framework involves fixed and random effects to account for subpopulation homogeneous behavior (fixed systematic changes), while allowing for modeling heterogeneity within a group (random effects). A spike-in study will be conducted to obtain replicate LC-MS runs with known peptide and glycan concentrations. The data will be utilized to develop and optimize the proposed Bayesian framework and to compare its performance with other existing solutions. The optimized framework and a machine learning-based feature selection method will be applied to identify an integrated set of peptide and glycan candidate biomarkers for early detection of HCC. LC-MS analysis of integrated peptides and glycans in both serum and plasma of patients with HCC is to our knowledge unprecedented. Blood samples from patients with HCC and CLD controls in Egypt and United States will be used. The biomarkers will be validated using isotope dilution mass spectrometric assays.

Public Health Relevance

This project will lead to the development of a suite of novel open source analytical tools for label-free quantification of peptides and glycans in serum and plasma using liquid chromatography-mass spectrometry (LC-MS) technologies. The availability of such tools will assist the research community in advancing the promising LC-MS-based biomarker discovery research. The proposed tools will be utilized to find and validate early-diagnosis biomarkers of hepatocellular carcinoma (HCC). Defining clinically applicable biomarkers that detect early-stage HCC in a high-risk population of patients with chronic liver disease has potentially far-reaching consequences for disease management and patient health. This project is important because most HCC patients are diagnosed at a late stage, where the treatment options are limited. There is a pressing need to identify biomarkers of HCC that could be used for early detection and more accurate classification of disease. In addition to screening high-risk populations for early signs of disease, the resulting biomarkers could be used to design and test improved treatment strategies.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
1R01CA143420-01A1
Application #
7899433
Study Section
Cancer Biomarkers Study Section (CBSS)
Program Officer
Rinaudo, Jo Ann S
Project Start
2010-09-01
Project End
2015-04-30
Budget Start
2010-09-01
Budget End
2011-04-30
Support Year
1
Fiscal Year
2010
Total Cost
$286,661
Indirect Cost
Name
Georgetown University
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
049515844
City
Washington
State
DC
Country
United States
Zip Code
20057
Di Poto, Cristina; He, Shisi; Varghese, Rency S et al. (2018) Identification of race-associated metabolite biomarkers for hepatocellular carcinoma in patients with liver cirrhosis and hepatitis C virus infection. PLoS One 13:e0192748
Di Poto, Cristina; Ferrarini, Alessia; Zhao, Yi et al. (2017) Metabolomic Characterization of Hepatocellular Carcinoma in Patients with Liver Cirrhosis for Biomarker Discovery. Cancer Epidemiol Biomarkers Prev 26:675-683
Zuo, Yiming; Cui, Yi; Yu, Guoqiang et al. (2017) Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO. BMC Bioinformatics 18:99
Varghese, Rency S; Zuo, Yiming; Zhao, Yi et al. (2017) Protein network construction using reverse phase protein array data. Methods 124:89-99
Ressom, Habtom W; Di Poto, Cristina; Ferrarini, Alessia et al. (2016) Multi-omic approaches for characterization of hepatocellular carcinoma. Conf Proc IEEE Eng Med Biol Soc 2016:3437-3440
Zuo, Yiming; Cui, Yi; Di Poto, Cristina et al. (2016) INDEED: Integrated differential expression and differential network analysis of omic data for biomarker discovery. Methods 111:12-20
Tsai, Tsung-Heng; Wang, Minkun; Ressom, Habtom W (2016) Preprocessing and Analysis of LC-MS-Based Proteomic Data. Methods Mol Biol 1362:63-76
Wang, Minkun; Yu, Guoqiang; Ressom, Habtom W (2016) Integrative Analysis of Proteomic, Glycomic, and Metabolomic Data for Biomarker Discovery. IEEE J Biomed Health Inform 20:1225-1231
Tsai, Tsung-Heng; Song, Ehwang; Zhu, Rui et al. (2015) LC-MS/MS-based serum proteomics for identification of candidate biomarkers for hepatocellular carcinoma. Proteomics 15:2369-81
Ranjbar, Mohammad R Nezami; Tadesse, Mahlet G; Wang, Yue et al. (2015) Bayesian Normalization Model for Label-Free Quantitative Analysis by LC-MS. IEEE/ACM Trans Comput Biol Bioinform 12:914-27

Showing the most recent 10 out of 22 publications