Flexible Statistical Methods for Biomedical Data

Davidian, Marie

Abstract

Statistical methods for analysis of longitudinal, clustered, and time-to-event data and for making causal inference from observational data are central to health sciences research in cancer, HIV, cardiovascular disease (CVD), and a host of other areas. The objective of the projects in this application is to develop new procedures to address existing and emerging challenges in these contexts, motivated by issues arising the Principal Investigator's collaborations. Linear and generalized linear mixed effects models are popular among practitioners for analysis of longitudinal and other clustered data, but there is little work on variable selection in this context. In the first project, we propose a unified, practically accessible framework that simultaneously addresses parameter estimation and variable selection. However, there may be settings where parametric such mixed models are not adequate to represent outcome-time/covariate relationship. Semi- and nonparametric mixed effects models are popular, but, again there is little in the literature on variable selection. We will also develop unified practical procedures for these important classes of models. Mixed effects and measurement error models often invoke parametric assumptions on latent random quantities such as random effects and true, error-prone covariates; normality is a standard such assumption. In the second project, we propose new, accessible, practical methods for evaluating and handling departures from such assumptions. Inference on causal treatment effects from observational data is a fundamental goal of epidemiologic and outcomes research, but, despite the critical importance of variable selection for regressions models in this context, a paucity of work in the literature on formal strategies for such variable selection. In the third project, we will develop and study systematically a formal strategy based on methods particularly well-suited to this objective, culminating in concrete guidance for practitioners. Methods for analysis of censored survival data are traditionally non- or semiparametric. We have demonstrated in the previous project period that, under mild """"""""smoothness"""""""" assumptions, computationally convenient methods handling arbitrary censoring patterns are possible. In the fourth project, we will adapt this approach to further challenges with an eye toward a unified, accessible framework. Relevance: The research proposed in this application will provide public health researchers new tools to learn about relationships among subject characteristics, such as physiologic, demographic, and genetic attributes, and disease outcomes and to determine those with the strongest associations. New methods to be developed will also help researchers learn about the effects of treatments from data that do not come from randomized clinical trials. ? ? ?

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Research Project (R01)
Project #: 5R01CA085848-09
Application #: 7477059
Study Section: Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer: Dunn, Michelle C

Project Start: 2000-05-01
Project End: 2011-07-31
Budget Start: 2008-08-01
Budget End: 2009-07-31
Support Year: 9
Fiscal Year: 2008
Total Cost: $219,603
Indirect Cost

Institution

Name: North Carolina State University Raleigh
Department: Biostatistics & Other Math Sci
Type: Schools of Earth Sciences/Natur
DUNS #: 042092122

City: Raleigh
State: NC
Country: United States
Zip Code: 27695

Related projects

Publications

White, Kyle R; Stefanski, Leonard A; Wu, Yichao (2017) Variable Selection in Kernel Regression Using Measurement Error Selection Likelihoods. J Am Stat Assoc 112:1587-1597

Linn, Kristin A; Laber, Eric B; Stefanski, Leonard A (2017) Interactive Q-learning for Quantiles. J Am Stat Assoc 112:638-649

Vock, David M; Durheim, Michael T; Tsuang, Wayne M et al. (2017) Survival Benefit of Lung Transplantation in the Modern Era of Lung Allocation. Ann Am Thorac Soc 14:172-181

Chen, Jinsong; Liu, Lei; Shih, Ya-Chen T et al. (2016) A flexible model for correlated medical costs, with application to medical expenditure panel survey data. Stat Med 35:883-94

Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2016) Properties of Estimators in Exponential Family Settings with Observation-based Stopping Rules. J Biom Biostat 7:

Zhang, Daowen; Sun, Jie Lena; Pieper, Karen (2016) Bivariate Mixed Effects Analysis of Clustered Data with Large Cluster Sizes. Stat Biosci 8:220-233

Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel et al. (2015) Estimation After a Group Sequential Trial. Stat Biosci 7:187-205

Zhang, Yichi; Laber, Eric B; Tsiatis, Anastasios et al. (2015) Using decision lists to construct interpretable and parsimonious treatment regimes. Biometrics 71:895-904

Bernhardt, Paul W; Wang, Huixia J; Zhang, Daowen (2015) Statistical Methods for Generalized Linear Models with Covariates Subject to Detection Limits. Stat Biosci 7:68-89

(2015) Response to reader reaction. Biometrics 71:267-273

Showing the most recent 10 out of 88 publications

Comments

Be the first to comment on Marie Davidian's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: