Survival Bump Hunting for Finding Informative Subgroups in High Dimensional Data.

Rao, J; Dazard, Jean-Eudes

Abstract

Subgroup discovery based on high dimensional genomic data can potentially provide novel insights into a disease process. Typically this has been done with various forms of cluster analysis (both supervised and unsupervised). Extreme subgroups are defined as those which are homogeneous in nature but which present extreme valued outcomes. Of particular interest in this project is to develop methodology to identify such subgroups which are extreme with respect to survival outcomes (e.g. those individuals that do unusually well on a cancer treatment and can be delineated based on high dimensional genomic predictors). If such subgroups are real and are uncovered, implications would include improved understanding of the disease etiology, discovery of new biomarkers with potential therapeutic targets, and allow early and personalized therapeutic interventions. Statistically, thi problem can be framed within a sparse survival bump hunting framework. We have brought together a team of biostatisticians who have pioneered the first sparse bump hunting models for continuous responses, as well as two internationally recognized laboratories as collaborators, who work on multi-platform genomic profiling for pediatric medulloblastoma and non-small cell lung cancer respectively. We thus propose the following specific aims: 1) To develop new models for sparse bump hunting that allow survival outcomes with both continuous and nominal predictors (e.g. gene expression and SNPs).;2) To develop a sparse survival bump hunting approach that will allow us to integrate SNP and gene expression profile data by three different approaches - sparse coaching, bump phenotyping and sparse mediation analysis;3) To develop detailed theory for asymptotic performance of these sparse survival bump hunting models;theory for a new fence-based methodology for studying model validation;and to empirically study and compare the performance in detailed simulations as well as on the datasets provided by our collaborator laboratories;4) To develop a Java-based user-friendly interface and a command line end-user CRAN package in the R language that will implement all of our methodologies and its extensions.

Public Health Relevance

One of the questions of interest is to uncover hidden subgroups of individuals with differential survival (say in response to treatment), and characterize the genomic determinants that define these groups. In this work, we will develop a new methodology that is designed to find extreme subgroups of patients within a population. Specific to this research are methods on how to focus the search on genes who relate most strongly to extreme survival and how to integrate various kinds of genomic profiles using three different strategies that are meant to improve subgroup finding and also glean more biological insights.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Research Project (R01)
Project #: 5R01CA160593-02
Application #: 8624667
Study Section: Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer: Ossandon, Miguel

Project Start: 2013-03-01
Project End: 2017-02-28
Budget Start: 2014-03-01
Budget End: 2015-02-28
Support Year: 2
Fiscal Year: 2014
Total Cost: $232,190
Indirect Cost: $45,438

Institution

Name: University of Miami School of Medicine
Department: Public Health & Prev Medicine
Type: Schools of Medicine
DUNS #: 052780918

City: Coral Gables
State: FL
Country: United States
Zip Code: 33146

Related projects


NIH 2016 R01 CA	Survival Bump Hunting for Finding Informative Subgroups in High Dimensional Data. Rao, J Sunil; Dazard, Jean-Eudes J. / University of Miami School of Medicine
NIH 2015 R01 CA	Survival Bump Hunting for Finding Informative Subgroups in High Dimensional Data. Rao, J Sunil; Dazard, Jean-Eudes J. / University of Miami School of Medicine	$260,014
NIH 2014 R01 CA	Survival Bump Hunting for Finding Informative Subgroups in High Dimensional Data. Rao, J Sunil; Dazard, Jean-Eudes J. / University of Miami School of Medicine	$232,190
NIH 2013 R01 CA	Survival Bump Hunting for Finding Informative Subgroups in High Dimensional Data. Rao, J Sunil; Dazard, Jean-Eudes J. / University of Miami School of Medicine	$268,926

Publications

Rao, J Sunil; Liu, Hongmei (2017) Discordancy Partitioning for Validating Potentially Inconsistent Pharmacogenomic Studies. Sci Rep 7:15169

Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael et al. (2016) Cross-validation and Peeling Strategies for Survival Bump Hunting using Recursive Peeling Methods. Stat Anal Data Min 9:12-42

Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael et al. (2015) R package PRIMsrc: Bump Hunting by Patient Rule Induction Method for Survival, Regression and Classification. Proc Am Stat Assoc 2015:650-664

Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael et al. (2014) Cross-Validation of Survival Bump Hunting by Recursive Peeling Methods. Proc Am Stat Assoc 2014:3366-3380

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: