and Relevance The primary goal of this project is to develop a novel, integrated approach for the analysis of high-throughput cancer genomic data. We plan to develop new variable selection methods for 1) class discovery, that is we propose to determine subgroups of the specified cancer to better understand the underlying cancer biology and 2) predictive gene signatures, that is we propose to determine a subset of genes which are predictive for patients'clinical phenotypes, including survival and response to therapy. Specifically, we will develop a new method for variable selection in clustering. Clustering plays a critical role in the analysis of genomic cancer data. For example, based on the gene expression profiles, important cluster distinctions can be found among a set of tissue samples, which may reflect categories of diseases, mutation status, or different responses to a given therapy. Second, we will develop a new penalized-likelihood method for variable selection in regression which utilizes group information to select groups of correlated genes that share the same biological pathway. The developed methodology will be useful for identifying important gene signatures that may lead to more effective personalized treatment in any health studies where survival time or response to therapy is of interest.

Public Health Relevance

Our project aims to develop a new class of variable selection methods for analyzing high-throughput cancer genomic data. Compared to existing methods, the proposed methods will lead to more powerful methods of class discovery for identifying cancer sub-types and more accurate prediction of patients'survival and response to therapy. The developed methodology will be useful for identifying important gene signatures that may lead to more effective personalized treatment in any health studies where survival time or response to therapy is of interest.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Exploratory/Developmental Grants (R21)
Project #
5R21CA157219-03
Application #
8401160
Study Section
Epidemiology of Cancer Study Section (EPIC)
Program Officer
Dunn, Michelle C
Project Start
2011-08-11
Project End
2014-01-31
Budget Start
2012-08-01
Budget End
2014-01-31
Support Year
3
Fiscal Year
2012
Total Cost
$116,267
Indirect Cost
$21,841
Name
University of Michigan Ann Arbor
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
073133571
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109
Lin, Huazhen; Fei, Zhe; Li, Yi (2016) A semiparametrically efficient estimator of the time-varying effects for survival data with time-dependent treatment. Scand Stat Theory Appl 43:649-663
Li, Yi; Dicker, Lee; Zhao, Sihai Dave (2014) The Dantzig Selector for Censored Linear Regression Models. Stat Sin 24:251-2568
Eng, Kevin H; Hanlon, Bret M (2014) Discrete mixture modeling to address genetic heterogeneity in time-to-event regression. Bioinformatics 30:1690-7
Xu, Peirong; Zhu, Lixing; Li, Yi (2014) Ultrahigh dimensional time course feature selection. Biometrics 70:356-65
Zhou, Ling; Lin, Huazhen; Song, Xinyuan et al. (2014) Selection of latent variables for multiple mixed-outcome models. Scand Stat Theory Appl 41:1064-1082