Statistical methods for analysis of longitudinal, clustered, and time-to-event data and for making causal inference from observational data are central to health sciences research in cancer, HIV, cardiovascular disease (CVD), and a host of other areas. The objective of the projects in this application is to develop new procedures to address existing and emerging challenges in these contexts, motivated by issues arising the Principal Investigator's collaborations. Linear and generalized linear mixed effects models are popular among practitioners for analysis of longitudinal and other clustered data, but there is little work on variable selection in this context. In the first project, we propose a unified, practically accessible framework that simultaneously addresses parameter estimation and variable selection. However, there may be settings where parametric such mixed models are not adequate to represent outcome-time/covariate relationship. Semi- and nonparametric mixed effects models are popular, but, again there is little in the literature on variable selection. We will also develop unified practical procedures for these important classes of models. Mixed effects and measurement error models often invoke parametric assumptions on latent random quantities such as random effects and true, error-prone covariates; normality is a standard such assumption. In the second project, we propose new, accessible, practical methods for evaluating and handling departures from such assumptions. Inference on causal treatment effects from observational data is a fundamental goal of epidemiologic and outcomes research, but, despite the critical importance of variable selection for regressions models in this context, a paucity of work in the literature on formal strategies for such variable selection. In the third project, we will develop and study systematically a formal strategy based on methods particularly well-suited to this objective, culminating in concrete guidance for practitioners. Methods for analysis of censored survival data are traditionally non- or semiparametric. We have demonstrated in the previous project period that, under mild """"""""smoothness"""""""" assumptions, computationally convenient methods handling arbitrary censoring patterns are possible. In the fourth project, we will adapt this approach to further challenges with an eye toward a unified, accessible framework. Relevance: The research proposed in this application will provide public health researchers new tools to learn about relationships among subject characteristics, such as physiologic, demographic, and genetic attributes, and disease outcomes and to determine those with the strongest associations. New methods to be developed will also help researchers learn about the effects of treatments from data that do not come from randomized clinical trials. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA085848-09
Application #
7477059
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Dunn, Michelle C
Project Start
2000-05-01
Project End
2011-07-31
Budget Start
2008-08-01
Budget End
2009-07-31
Support Year
9
Fiscal Year
2008
Total Cost
$219,603
Indirect Cost
Name
North Carolina State University Raleigh
Department
Biostatistics & Other Math Sci
Type
Schools of Earth Sciences/Natur
DUNS #
042092122
City
Raleigh
State
NC
Country
United States
Zip Code
27695
White, Kyle R; Stefanski, Leonard A; Wu, Yichao (2017) Variable Selection in Kernel Regression Using Measurement Error Selection Likelihoods. J Am Stat Assoc 112:1587-1597
Linn, Kristin A; Laber, Eric B; Stefanski, Leonard A (2017) Interactive Q-learning for Quantiles. J Am Stat Assoc 112:638-649
Vock, David M; Durheim, Michael T; Tsuang, Wayne M et al. (2017) Survival Benefit of Lung Transplantation in the Modern Era of Lung Allocation. Ann Am Thorac Soc 14:172-181
Chen, Jinsong; Liu, Lei; Shih, Ya-Chen T et al. (2016) A flexible model for correlated medical costs, with application to medical expenditure panel survey data. Stat Med 35:883-94
Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2016) Properties of Estimators in Exponential Family Settings with Observation-based Stopping Rules. J Biom Biostat 7:
Zhang, Daowen; Sun, Jie Lena; Pieper, Karen (2016) Bivariate Mixed Effects Analysis of Clustered Data with Large Cluster Sizes. Stat Biosci 8:220-233
Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2015) Estimation After a Group Sequential Trial. Stat Biosci 7:187-205
Zhang, Yichi; Laber, Eric B; Tsiatis, Anastasios et al. (2015) Using decision lists to construct interpretable and parsimonious treatment regimes. Biometrics 71:895-904
Bernhardt, Paul W; Wang, Huixia J; Zhang, Daowen (2015) Statistical Methods for Generalized Linear Models with Covariates Subject to Detection Limits. Stat Biosci 7:68-89
(2015) Response to reader reaction. Biometrics 71:267-273

Showing the most recent 10 out of 88 publications