The primary objective of this proposal is to develop adaptive and exible statistical models for analyses of multivariate, functional and spatial data from high-throughput biomedical studies. These studies raise computational, modeling, and inferential challenges with respect to high-dimensionality as well as structured dependency induced by the various aspects of the processes generating the data. Our work is motivated by, and will be applied to, data from a variety of high- throughput cancer-related studies that were conducted by our biomedical collaborators, in genomics, epigenomics and transcriptomics;although our methods are generally applicable to other contexts. The short-term objective of this research is to develop novel statistical methods and computational tools for statistical and probabilistic modeling of such high-throughput data with particular emphasis on integrative methods to combine information within and across dierent assays as well as clinical data to answer important biological questions. Our long-term goal is to improve risk prediction and treatment selection in cancer prevention, diagnosis and prognosis. We will accomplish the objective of this application by pursuing the following ve specic aims (1) develop new methodology for Bayesian adaptive generalized functional linear mixed models, allowing for local and nonlinear association structures between scalar responses and functional predictors (2) develop hierarchical Bayesian joint models for integrating diverse types of multivariate and functional data. (3) develop Bayesian spatial-functional process models for spatially indexed high-dimensional functional data, methods for data requiring a broader class of within-function and between-function covariance structures using exible families of covariance functions. (4) develop multivariate Bayesian spatial-functional models for joint modeling of multiple spatially indexed functional data. (5) develop ecient, user-friendly and freely available software for the proposed methods.

Public Health Relevance

This project will have significant impact on integrative analysis of various types of genetic data, as well as clinical data. This will results in a better understanding of the underlying biological mechanisms of cancer - leading to better prevention and treatment strategies and improve cancer patient care.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA160736-02
Application #
8323898
Study Section
Special Emphasis Panel (ZRG1-HDM-G (02))
Program Officer
Dunn, Michelle C
Project Start
2011-08-23
Project End
2015-06-30
Budget Start
2012-07-01
Budget End
2013-06-30
Support Year
2
Fiscal Year
2012
Total Cost
$327,850
Indirect Cost
$120,350
Name
University of Texas MD Anderson Cancer Center
Department
Biostatistics & Other Math Sci
Type
Other Domestic Higher Education
DUNS #
800772139
City
Houston
State
TX
Country
United States
Zip Code
77030
Lee, Michael S; McGuffey, Elizabeth J; Morris, Jeffrey S et al. (2016) Association of CpG island methylator phenotype and EREG/AREG methylation and expression in colorectal cancer. Br J Cancer 114:1352-61
Azadeh, Shabnam; Hobbs, Brian P; Ma, Liangsuo et al. (2016) Integrative Bayesian analysis of neuroimaging-genetic data with application to cocaine dependence. Neuroimage 125:813-24
Saha, Abhijoy; Banerjee, Sayantan; Kurtek, Sebastian et al. (2016) DEMARCATE: Density-based magnetic resonance image clustering for assessing tumor heterogeneity in cancer. Neuroimage Clin 12:132-43
Ni, Yang; Stingo, Francesco C; Baladandayuthapani, Veerabhadran (2015) Bayesian nonlinear model selection for gene regulatory networks. Biometrics 71:585-95
Ha, Min Jin; Baladandayuthapani, Veerabhadran; Do, Kim-Anh (2015) DINGO: differential network analysis in genomics. Bioinformatics 31:3413-20
Meyer, Mark J; Coull, Brent A; Versace, Francesco et al. (2015) Bayesian function-on-function regression for multilevel functional data. Biometrics 71:563-74
Ha, Min Jin; Baladandayuthapani, Veerabhadran; Do, Kim-Anh (2015) Prognostic gene signature identification using causal structure learning: applications in kidney cancer. Cancer Inform 14:23-35
Gregory, Karl Bruce; Carroll, Raymond J; Baladandayuthapani, Veerabhadran et al. (2015) A Two-Sample Test for Equality of Means in High Dimension. J Am Stat Assoc 110:837-849
Ni, Yang; Stingo, Francesco C; Baladandayuthapani, Veerabhadran (2014) Integrative bayesian network analysis of genomic data. Cancer Inform 13:39-48
Gregory, Karl B; Momin, Amin A; Coombes, Kevin R et al. (2014) Latent Feature Decompositions for Integrative Analysis of Multi-Platform Genomic Data. IEEE/ACM Trans Comput Biol Bioinform 11:984-94

Showing the most recent 10 out of 18 publications