Modeling and analysis of correlated data is a recurring challenge in health sciences research. These data arise routinely in clinical trials and epidemiological studies of cancer and other diseases, where correlation may be due to the longitudinal nature of data collection on each subject, clustering of observations from the same center or family, or geographical orientation. Much statistical research has been devoted to mixed effects models, which take account of correlation through incorporation of random effects. A prevailing concern is that standard assumptions may be unrealistic or too restrictive to represent the data, which may lead to unreliable inferences on scientific questions of interest and to important features of the data to be obscured. Recent interest has focused on relaxing these assumptions by either (i) allowing more flexible functional dependence of response variables on covariates or (ii) allowing more flexible representation of the distribution of the random effects than provided by the usual normal distribution. The objective of the research in this proposal is to develop flexible mixed effects models that address (i) and (ii), providing the data analyst tools for correlated data for which standard assumptions do not apply. We will develop semiparametric generalized linear models, which allow more flexible representation of the random effects; semiparametric generalized additive models, which further allow flexible covariate dependence; semiparametric frailty models, which allow flexible models for multivariate time-to-event data; and flexible joint models for longitudinal data and primary outcomes, where the goal is to explore the association between a longitudinal measure, e.g. prostate specific antigen, and a response of interest, e.g. cancer risk. Maximum likelihood inference will be developed, and the utility of the methods will be evaluated by simulation study and application to several data sets in cancer and other disease research. A key goal is to provide public-domain, documented general software and examples to the research community.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
1R01CA085848-01
Application #
6090223
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Erickson, Burdette (BUD) W
Project Start
2000-05-01
Project End
2003-04-30
Budget Start
2000-05-01
Budget End
2001-04-30
Support Year
1
Fiscal Year
2000
Total Cost
$165,375
Indirect Cost
Name
North Carolina State University Raleigh
Department
Biostatistics & Other Math Sci
Type
Schools of Earth Sciences/Natur
DUNS #
City
Raleigh
State
NC
Country
United States
Zip Code
27695
White, Kyle R; Stefanski, Leonard A; Wu, Yichao (2017) Variable Selection in Kernel Regression Using Measurement Error Selection Likelihoods. J Am Stat Assoc 112:1587-1597
Linn, Kristin A; Laber, Eric B; Stefanski, Leonard A (2017) Interactive Q-learning for Quantiles. J Am Stat Assoc 112:638-649
Vock, David M; Durheim, Michael T; Tsuang, Wayne M et al. (2017) Survival Benefit of Lung Transplantation in the Modern Era of Lung Allocation. Ann Am Thorac Soc 14:172-181
Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2016) Properties of Estimators in Exponential Family Settings with Observation-based Stopping Rules. J Biom Biostat 7:
Zhang, Daowen; Sun, Jie Lena; Pieper, Karen (2016) Bivariate Mixed Effects Analysis of Clustered Data with Large Cluster Sizes. Stat Biosci 8:220-233
Chen, Jinsong; Liu, Lei; Shih, Ya-Chen T et al. (2016) A flexible model for correlated medical costs, with application to medical expenditure panel survey data. Stat Med 35:883-94
Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2015) Estimation After a Group Sequential Trial. Stat Biosci 7:187-205
Zhang, Yichi; Laber, Eric B; Tsiatis, Anastasios et al. (2015) Using decision lists to construct interpretable and parsimonious treatment regimes. Biometrics 71:895-904
Bernhardt, Paul W; Wang, Huixia J; Zhang, Daowen (2015) Statistical Methods for Generalized Linear Models with Covariates Subject to Detection Limits. Stat Biosci 7:68-89
(2015) Response to reader reaction. Biometrics 71:267-273

Showing the most recent 10 out of 88 publications