Current guidelines for diabetes care recommend individualized treatment plans for complex patients since tight control of glycosylated hemoglobin (A1c) may not be appropriate. However, little evidence exists to support the patient-centered decisions. Electronic health records (EHRs) provide an important source for clinical evidence on improving diabetes care, but suffer from usability deficiencies. Particularly the lab measures and vital signs have intermittent missing values where the irregular visit patterns may be informative about the patients' underlying medication status. Patient characteristics are also incomplete due to linkage error.
We aim to impute the missing values in EHRs and improve the data quality to strengthen the evidence base for diabetes guidelines. The proposed work is motivated by ongoing clinical research to examine the role of patient complexity in the relationship between tight A1c control and the risk of adverse events, using a pre-existing EHR dataset of 8,304 patients with diabetes cared by the UW Health during 2003-2011. We propose Bayesian latent profile models under multiple imputation to account for the potentially non-ignorable visiting process, facilitate modeling a large number of EHR variables of mixed types and develop scalable computation algorithms. Specifically, first we build latent profiles by jointly modeling A1c values, patient characteristics and health outcomes. Second, we generalize the latent profiles by multiple pattern indices and combine the trajectories of multiple lab measures and vital signs with intermittent missing values, as well as accounting for incomplete patient sociodemographics. Third, we release open source computation software and disseminate new clinical findings to the healthcare delivery system. The investigation results will advance statistical methodology development for missing data in longitudinal studies, increase the compatibility of available patient medical records and strengthen the evidence base to support existing diabetes guidelines.

Public Health Relevance

The project aims to properly handle missing data in electronic health records and facilitate comparative effectiveness research on the tight glycosylated hemoglobin control effect for complex patients with diabetes. The proposed Bayesian latent profile approaches generalize multiple imputation for informative visiting process and offer flexibility, scalability and open source computation access. The new findings will advance statistical methodology development for missing data in longitudinal studies, increase the quality of electronic health records and strengthen the evidence base for diabetes guidelines to improve the quality of healthcare.

Agency
National Institute of Health (NIH)
Institute
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21DK110688-01
Application #
9169147
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Savage, Peter J
Project Start
2016-08-01
Project End
2018-07-31
Budget Start
2016-08-01
Budget End
2017-07-31
Support Year
1
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of Wisconsin Madison
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
161202122
City
Madison
State
WI
Country
United States
Zip Code
53715