Algorithms for Literature-Guided Multi-Platform Identification of Cancer Subtypes

Chung, Dongjun; Kelemen, Linda

Abstract

The development of accurate and robust statistical methods for cancer subtype identification using high-throughput genomic data is of critical importance to public health, because these methods will inform molecularly based tumor classification and the understanding of shared pathogeneses, which in turn offers opportunities for overlapping treatments across cancer subtypes. Despite tremendous efforts to develop statistical methods for analyzing high-throughput genomic data profiled on multiple platforms, robust and interpretable identification of cancer subtypes and driver molecular features from these massive, complex, and heterogeneous datasets remains a challenging task. The impact of an improved understanding of tumor classification and driver molecular features can be dramatic, as this knowledge can be used to develop more effective prevention and intervention strategies that reduce the burden on patients suffering from cancer. The goal of this proposal is to develop a statistical method and software to improve identification of cancer subtypes and driver molecular features by integrating genomic data profiled on multiple platforms with biomedical literature and existing pathway databases. Using pathway information in cancer subtype identification will improve the robustness with which cancer subtypes and driver molecular features are identified. Biomedical literature, in turn, will compensate for incomplete pathway annotations in existing databases. It will also provide a common knowledge base for integrating information from diverse pathway databases, because biomedical literature provides comprehensive information about the relationships among genes. We will test these hypotheses in three specific aims.
In Specific Aim 1, we will develop a novel statistical method and software to improve the pathway knowledge by integrating PubMed literature with existing pathway databases.
In Specific Aim 2, we will develop a novel statistical method and software to improve robustness and interpretability in identification of cancer subtypes and driver molecular features using pathway knowledge. In addition, this method will allow investigation of driver molecular features at multiple levels, including pathway clusters, pathways, and genes.
In Specific Aim 3, we will apply the statistical methods developed in Specific Aims 1 and 2 to novel genomic data for mucinous ovarian cancer to promote understanding of the subtypes and driver molecular features of this under-studied disease.

Public Health Relevance

In this project, we will develop novel statistical methods and software to improve the robustness and interpretability of cancer subtype identification by integrating high-throughput genomic data profiled on multiple platforms with biomedical literature and existing pathway databases. These methods will improve molecularly based tumor classification and the understanding of shared pathogeneses, which can potentially be useful for the development of overlapping treatments across cancer subtypes. Applying these methods to mucinous ovarian cancer studies will promote understanding of this under-studied cancer subtype, which can be useful for developing more effective prevention and intervention strategies to reduce the burden of this disease.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Exploratory/Developmental Grants (R21)
Project #: 1R21CA209848-01
Application #: 9185145
Study Section: Special Emphasis Panel (ZCA1)
Program Officer: Miller, David J

Project Start: 2016-08-01
Project End: 2018-07-31
Budget Start: 2016-08-01
Budget End: 2017-07-31
Support Year: 1
Fiscal Year: 2016

Institution

Name: Medical University of South Carolina
Department: Public Health & Prev Medicine
Type: Schools of Medicine
DUNS #: 183710748

City: Charleston
State: SC
Country: United States
Zip Code: 29403

Related projects


NIH 2017 R21 CA	Algorithms for Literature-Guided Multi-Platform Identification of Cancer Subtypes Chung, Dongjun; Kelemen, Linda E. / Medical University of South Carolina
NIH 2016 R21 CA	Algorithms for Literature-Guided Multi-Platform Identification of Cancer Subtypes Chung, Dongjun; Kelemen, Linda E. / Medical University of South Carolina

Publications

Pandey, Janardan P; Namboodiri, Aryan M; Wolf, Bethany et al. (2018) Endogenous antibody responses to mucin 1 in a large multiethnic cohort of patients with breast cancer and healthy controls: Role of immunoglobulin and Fcγ receptor genes. Immunobiology 223:178-182

Lin, Ching Ying; Kwon, Hyunwoo; Rangel Rivera, Guillermo O et al. (2018) Sex Differences in Using Systemic Inflammatory Markers to Prognosticate Patients with Head and Neck Squamous Cell Carcinoma. Cancer Epidemiol Biomarkers Prev 27:1176-1185

Renaud, Ludivine; Silveira, Willian A da; Hazard, E Starr et al. (2017) The Plasticizer Bisphenol A Perturbs the Hepatic Epigenome: A Systems Level Analysis of the miRNome. Genes (Basel) 8:

Chung, Dongjun; Kim, Hang J; Zhao, Hongyu (2017) graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture. PLoS Comput Biol 13:e1005388

Chung, Dongjun; Lawson, Andrew; Zheng, W Jim (2017) A statistical framework for biomedical literature mining. Stat Med 36:3461-3474

Davis-Turak, Jeremy; Courtney, Sean M; Hazard, E Starr et al. (2017) Genomics pipelines and data integration: challenges and opportunities in the research setting. Expert Rev Mol Diagn 17:225-237

Wei, Wei; Ramos, Paula S; Hunt, Kelly J et al. (2016) GPA-MDS: A Visualization Approach to Investigate Genetic Architecture among Phenotypes Using GWAS Results. Int J Genomics 2016:6589843
