The development of accurate and robust statistical methods for cancer subtype identification using high-throughput genomic data is of critical importance to public health, because these methods will inform molecularly based tumor classification and reveal shared pathogeneses, which in turn offer opportunities for overlapping treatments across cancer subtypes. Despite tremendous efforts to develop statistical methods for analyzing high-throughput genomic data profiled on multiple platforms, robust and interpretable identification of cancer subtypes and driver molecular features from these massive, complex, and heterogeneous datasets remains a challenging task. The impact of an improved understanding of tumor classification and driver molecular features can be dramatic, as this knowledge can be used to develop more effective prevention and intervention strategies that reduce the burden on patients suffering from cancer. The goal of this proposal is to develop a statistical method and software to improve identification of cancer subtypes and driver molecular features by integrating genomic data profiled on multiple platforms with biomedical literature and existing pathway databases. We hypothesize that using pathway information will make the identification of cancer subtypes and driver molecular features more robust, and that biomedical literature will compensate for incomplete pathway annotations in existing databases. Because biomedical literature provides comprehensive information about the relationships among genes, it will also provide a common knowledge base for integrating information from diverse pathway databases. We will test these hypotheses in three specific aims.
In Specific Aim 1, we will develop a novel statistical method and software to improve pathway knowledge by integrating PubMed literature with existing pathway databases.
In Specific Aim 2, we will develop a novel statistical method and software that use pathway knowledge to improve the robustness and interpretability of the identification of cancer subtypes and driver molecular features. In addition, this method will allow investigation of driver molecular features at multiple levels, including pathway clusters, pathways, and genes.
In Specific Aim 3, we will apply the statistical methods developed in Specific Aims 1 and 2 to novel genomic data for mucinous ovarian cancer to promote understanding of the subtypes and driver molecular features of this under-studied disease.

Public Health Relevance

In this project, we will develop novel statistical methods and software to improve the robustness and interpretability of cancer subtype identification by integrating high-throughput genomic data profiled on multiple platforms with biomedical literature and existing pathway databases. These methods will improve molecularly based tumor classification and the understanding of shared pathogeneses, which may be useful for developing overlapping treatments across cancer subtypes. Applying these methods to mucinous ovarian cancer studies will promote understanding of this under-studied cancer subtype, which may be useful for developing more effective prevention and intervention strategies to reduce the burden of this disease.

Agency: National Institutes of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Exploratory/Developmental Grants (R21)
Project #: 5R21CA209848-02
Application #: 9321971
Study Section: Special Emphasis Panel (ZCA1)
Program Officer: Miller, David J
Project Start: 2016-08-01
Project End: 2019-07-31
Budget Start: 2017-08-01
Budget End: 2019-07-31
Support Year: 2
Fiscal Year: 2017
Total Cost:
Indirect Cost:
Name: Medical University of South Carolina
Department: Public Health & Prev Medicine
Type: Schools of Medicine
DUNS #: 183710748
City: Charleston
State: SC
Country: United States
Zip Code: 29403
Pandey, Janardan P; Namboodiri, Aryan M; Wolf, Bethany et al. (2018) Endogenous antibody responses to mucin 1 in a large multiethnic cohort of patients with breast cancer and healthy controls: Role of immunoglobulin and Fcγ receptor genes. Immunobiology 223:178-182
Lin, Ching Ying; Kwon, Hyunwoo; Rangel Rivera, Guillermo O et al. (2018) Sex Differences in Using Systemic Inflammatory Markers to Prognosticate Patients with Head and Neck Squamous Cell Carcinoma. Cancer Epidemiol Biomarkers Prev 27:1176-1185
Renaud, Ludivine; Silveira, Willian A da; Hazard, E Starr et al. (2017) The Plasticizer Bisphenol A Perturbs the Hepatic Epigenome: A Systems Level Analysis of the miRNome. Genes (Basel) 8:
Chung, Dongjun; Kim, Hang J; Zhao, Hongyu (2017) graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture. PLoS Comput Biol 13:e1005388
Chung, Dongjun; Lawson, Andrew; Zheng, W Jim (2017) A statistical framework for biomedical literature mining. Stat Med 36:3461-3474
Davis-Turak, Jeremy; Courtney, Sean M; Hazard, E Starr et al. (2017) Genomics pipelines and data integration: challenges and opportunities in the research setting. Expert Rev Mol Diagn 17:225-237
Wei, Wei; Ramos, Paula S; Hunt, Kelly J et al. (2016) GPA-MDS: A Visualization Approach to Investigate Genetic Architecture among Phenotypes Using GWAS Results. Int J Genomics 2016:6589843