Missing data and measurement errors are common problems in statistical data analysis. We are interested in experimental and observational studies where there exist missing data and measurement errors problems. Examples include health surveys containing non-responders or missing items, surrogate marker data with measurement errors, etc. The applications could be longitudinal clinical trials, multilevel community studies and health surveys. The incomplete data could be the non-ignorable missing response used in a model or as predictors, i.e. missing response, missing covariate, and covariate measurement errors. The most complicated scenario is the combination of such difficulties, i.e. the missing response with covariate measurement errors. The results from this project include innovative statistical methods, case studies, tools, solutions, and publications. These resources will be incorporated in our Longit Informatics Center for sharing and illustration. The Longit Informatics Center is an online data analysis environment. Subscribers can access many statistical packages and dynamic graphics for data analysis. In this project, the ultimate results will be two statistical packages added to Longit: 1) MiMe: statistical methods for missing data and measurement errors, and 2) Laso: joint modeling methods for longitudinal and survival outcomes in the study of surrogate marker for clinical event time. These packages include innovative statistical methods, sensitivity analysis and graphical methods. There is no commercial software to deal with complicated case as Laso.

Public Health Relevance

This project aims to develop statistical methods and tools for analyzing incomplete data with missing data and measurement errors.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43GM100573-01
Application #
8252746
Study Section
Special Emphasis Panel (ZRG1-HDM-R (11))
Program Officer
Marcus, Stephen
Project Start
2012-06-01
Project End
2013-02-28
Budget Start
2012-06-01
Budget End
2013-02-28
Support Year
1
Fiscal Year
2012
Total Cost
$198,601
Indirect Cost
Name
Data Numerica Institute, Inc.
Department
Type
DUNS #
003849838
City
Bellevue
State
WA
Country
United States
Zip Code
98006
Huang, Yijian; Wang, Ching-Yun (2018) Cox regression with dependent error in covariates. Biometrics 74:118-126
Yu, Hsiang; Cheng, Yu-Jen; Wang, Ching-Yun (2018) Methods for multivariate recurrent event data with measurement error and informative censoring. Biometrics 74:966-976