This project will examine new methodology for making inference about the regression parameters in the presence of missing covariate data for two commonly used classes of regression models. In particular, we examine the class of generalized linear models for general types of response data and the Cox model for survival data. The methodology addresses problems occurring frequently in clinical investigations for chronic disease, including cancer and AIDS. The specific objectives of the project are to: 1) develop and study classical and Bayesian methods of inference for the class of generalized linear models (GLM's) in the presence of missing covariate data. In particular, we will i) examine methods for estimating the regression parameters when the missing covariates are either categorical or continuous and the missing data mechanism is ignorable. Also, parametric models for the covariate distribution will be examined. The methods of estimation will focus on the Monte Carlo version of the EM algorithm (Wei and Tanner, 1990) and other related iterative algorithms. The Gibbs sampler (Gelfand and Smith, 1990) along with the adaptive rejection algorithm of Gilks and Wild (1992) will be used to sample from the conditional distribution of the missing covariates given the observed data. ii) examine estimating the regression parameters when the missing covariates are either categorical or continuous and the missing data mechanism is nonignorable. Models for the missing data mechanism will be studied. iii) develop and study Bayesian methods of inference in the presence of missing covariate data when the missing covariates are either categorical or continuous and the missing data mechanism is ignorable. Parametric prior distributions for the regression coefficients are proposed. Properties of the posterior distributions of the regression coefficients will be studied. The methodology will be implemented using Markov Chain Monte Carlo methods similar to those of Tanner and Wong (1987). iv) investigate Bayesian methods when the covariates are either categorical or continuous and the missing data mechanism is nonignorable. Multinomial models for the missing data mechanism will be studied. Dirichlet prior distributions for the multinomial parameters will be investigated. 2) develop and study classical and Bayesian methods of inference for the Cox model for survival outcomes in the presence of missing covariates. Specifically, we will i) develop and study estimation methods for the Cox model for survival outcomes in the presence of missing covariates. Methods for estimating the regression parameters when the missing covariates are either categorical or continuous will be studied. The methods of estimation will focus on an EM type algorithm similar to that of Wei and Tanner (1990). ii) study estimation of the regression parameters when the missing covariates are either categorical or continuous and the missing data mechanisms nonignorable. Models for the missing data mechanism will be studied. Bayesian methods similar to those of 1-iii) and -iv) will be investigated. Computational techniques using the Monte Carlo methods described in 1-iii) will be implemented.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
2R01CA074015-04A1
Application #
6326240
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Erickson, Burdette (BUD) W
Project Start
1997-09-01
Project End
2004-03-31
Budget Start
2001-05-10
Budget End
2002-03-31
Support Year
4
Fiscal Year
2001
Total Cost
$183,883
Indirect Cost
Name
Dana-Farber Cancer Institute
Department
Type
DUNS #
149617367
City
Boston
State
MA
Country
United States
Zip Code
02215
Ankerst, Donna P; Goros, Martin; Tomlins, Scott A et al. (2018) Incorporation of Urinary Prostate Cancer Antigen 3 and TMPRSS2:ERG into Prostate Cancer Prevention Trial Risk Calculator. Eur Urol Focus :
Rao, Shangbang; Ibrahim, Joseph G; Cheng, Jian et al. (2016) SR-HARDI: Spatially Regularizing High Angular Resolution Diffusion Imaging. J Comput Graph Stat 25:1195-1211
Joeng, Hee-Koung; Chen, Ming-Hui; Kang, Sangwook (2016) Proportional exponentiated link transformed hazards (ELTH) models for discrete time survival data with application. Lifetime Data Anal 22:38-62
Fraser, Raphael André; Lipsitz, Stuart R; Sinha, Debajyoti et al. (2016) Approximate median regression for complex survey data with skewed response. Biometrics 72:1336-1347
Lipsitz, Stuart R; Fitzmaurice, Garrett M; Arriaga, Alex et al. (2015) Using the jackknife for estimation in log link Bernoulli regression models. Stat Med 34:444-53
M'lan, Cyr Emile; Chen, Ming-Hui (2015) Objective Bayesian Inference for Bilateral Data. Bayesian Anal 10:139-170
Guo, Ruixin; Ahn, Mihye; Zhu, Hongtu et al. (2015) Spatially Weighted Principal Component Analysis for Imaging Classification. J Comput Graph Stat 24:274-296
Gao, Qibing; Ahn, Mihye; Zhu, Hongtu (2015) Cook's Distance Measures for Varying Coefficient Models with Functional Responses. Technometrics 57:268-280
Sun, Qiang; Zhu, Hongtu; Liu, Yufeng et al. (2015) SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression. J Am Stat Assoc 110:289-302
de Castro, Mário; Chen, Ming-Hui; Zhang, Yuanye (2015) Bayesian path specific frailty models for multi-state survival data with applications. Biometrics 71:760-71

Showing the most recent 10 out of 112 publications