Bayesian models for cancer prognosis by integrating diverse types of data

Ji, Yuan

Abstract

A new strategy in cancer prognosis is to base the decision on integrated information from different sources, including the traditional clinical and demographic information of patients, such as age, grade, and tumor size, etc, and the recently emerged genetic information like expression of gene or protein markers. Implementation of such a strategy requires efficient quantitative models that integrate the clinical measurements and genetic measurements together for prognosis. The long-range goal of this application is to improve risk predication, treatment selection, and subtype classification in cancer prevention, diagnosis, and prognosis. The short-term objective is to improve prediction of treatment response for cancer patients by developing innovative statistical models that integrate three different types of data, including two subtypes of informatics data, namely protein pathway data and high-throughput protein expression data, and a third type, which is the standard clinical and demographic data. We will accomplish the objective of this application by pursuing the following five specific aims: 1) Develop Bayesian parametric models that integrate a known genetic pathway with high-throughput protein expression measurements. 2) Develop Bayesian nonparametric model that integrate multiple genetic pathways with protein expression measurements. 3) Develop Bayesian classification procedures based on the Bayesian models proposed in previous two aims. 4) Integrate clinical and demographic measurements into the Bayesian models and apply the Bayesian classification procedures using a comprehensive data set that contains protein expression measurements and clinical measurements for more than 500 patients with leukemia. 5) Validate statistical findings by performing biological experiments, which will be done by our collaborating biologists. The proposed research is expected to provide quantitative prognostic tools for oncologists based on integrated information. The impact of the proposed research will be significant because models developed in this application can be applied to various cancer types and thus potentially improve the prognosis for patients with different types of cancer.

Public Health Relevance

Integrating the protein expression data, the protein pathway data, and the clinical data is expected to significantly improve medical decision making such as treatment selection. The improved decisions are expected to improve the overall patient care. For example, by accurately predicting that certain treatment will not be effective for a cancer patient, this patient will no longer waste time trying out the treatment and will have a better chance finding some other more effective therapies.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Research Project (R01)
Project #: 5R01CA132897-02
Application #: 7686852
Study Section: Special Emphasis Panel (ZRG1-HOP-T (02))
Program Officer: Dunn, Michelle C

Project Start: 2008-09-15
Project End: 2013-07-31
Budget Start: 2009-08-01
Budget End: 2010-07-31
Support Year: 2
Fiscal Year: 2009
Total Cost: $198,660
Indirect Cost

Institution

Name: University of Texas MD Anderson Cancer Center
Department: Biostatistics & Other Math Sci
Type: Other Domestic Higher Education
DUNS #: 800772139

City: Houston
State: TX
Country: United States
Zip Code: 77030

Related projects


NIH 2019 R01 CA	Bayesian Inference for Tumor Heterogeneity with Next-Generation Sequencing Data Ji, Yuan; Mueller, Peter / University of Chicago
NIH 2018 R01 CA	Bayesian Inference for Tumor Heterogeneity with Next-Generation Sequencing Data Ji, Yuan; Mueller, Peter / Northshore University Healthsystem
NIH 2017 R01 CA	Bayesian Inference for Tumor Heterogeneity with Next-Generation Sequencing Data Ji, Yuan; Mueller, Peter / Northshore University Healthsystem
NIH 2016 R01 CA	Bayesian Inference for Tumor Heterogeneity with Next-Generation Sequencing Data Ji, Yuan; Mueller, Peter / Northshore University Healthsystem
NIH 2015 R01 CA	Bayesian Inference for Tumor Heterogeneity with Next-Generation Sequencing Data Ji, Yuan; Mueller, Peter / Northshore University Healthsystem
NIH 2012 R01 CA	Bayesian models for cancer prognosis by integrating diverse types of data Ji, Yuan / Northshore University Healthsystem	$192,146
NIH 2011 R01 CA	Bayesian models for cancer prognosis by integrating diverse types of data Ji, Yuan / University of Texas MD Anderson Cancer Center	$192,700
NIH 2010 R01 CA	Bayesian models for cancer prognosis by integrating diverse types of data Ji, Yuan / University of Texas MD Anderson Cancer Center	$198,660
NIH 2009 R01 CA	Bayesian models for cancer prognosis by integrating diverse types of data Ji, Yuan / University of Texas MD Anderson Cancer Center	$198,660
NIH 2008 R01 CA	Bayesian models for cancer prognosis by integrating diverse types of data Ji, Yuan / University of Texas MD Anderson Cancer Center	$198,660

Publications

Ni, Yang; Müller, Peter; Wei, Lin et al. (2018) Bayesian graphical models for computational network biology. BMC Bioinformatics 19:63

Xu, Yanxun; Müller, Peter; Tsimberidou, Apostolia M et al. (2018) A nonparametric Bayesian basket trial design. Biom J :

Wei, Lin; Jin, Zhilin; Yang, Shengjie et al. (2018) TCGA-assembler 2: software pipeline for retrieval and processing of TCGA/CPTAC data. Bioinformatics 34:1615-1617

Müller, Peter; Xu, Yanxun; Thall, Peter F (2017) Clinical Trial Design as a Decision Problem. Appl Stoch Models Bus Ind 33:296-301

Narayanan, Jaishree; Dobrin, Sofia; Choi, Janet et al. (2017) Structured clinical documentation in the electronic medical record to improve quality and to support practice-based research in epilepsy. Epilepsia 58:68-76

Morita, Satoshi; Müller, Peter (2017) Bayesian population finding with biomarkers in a randomized clinical trial. Biometrics 73:1355-1365

Zuanetti, Daiane Aparecida; Müller, Peter; Zhu, Yitan et al. (2017) Clustering distributions with the marginalized nested Dirichlet process. Biometrics :

Shpak, Max; Ni, Yang; Lu, Jie et al. (2017) Variance in estimated pairwise genetic distance under high versus low coverage sequencing: The contribution of linkage disequilibrium. Theor Popul Biol 117:51-63

Manching, Heather; Sengupta, Subhajit; Hopper, Keith R et al. (2017) Phased Genotyping-by-Sequencing Enhances Analysis of Genetic Diversity and Reveals Divergent Copy Number Variants in Maize. G3 (Bethesda) 7:2161-2170

Sengupta, Subhajit; Gulukota, Kamalakar; Zhu, Yitan et al. (2016) Ultra-fast local-haplotype variant calling using paired-end DNA-sequencing data reveals somatic mosaicism in tumor and normal blood samples. Nucleic Acids Res 44:e25

Showing the most recent 10 out of 58 publications

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: