The overall goal of this proposal is to develop appropriate rank based tests for clustered data when the cluster size is potentially informative and apply the resulting methods for various marginal comparisons (e.g., average condition of teeth before and after treatment) using existing dental database resources, specifically as obtained from the Piedmont 65 + Dental Study and Iowa Fluoride Study. Informative cluster size arises when the number of units in a cluster is non-constant/random and in correlation with the outcome of interest. In the context of dental data, all teeth belonging to an individual will form a cluster. Since tooth loss (in adult) is correlated with two of the diseases we are planning to study, namely, periodontal disease and dental caries, we have potentially informative cluster sizes in the Piedmont data sets. It is a methodological challenge to adapt a classical rank test to such situations. As for example, the two sample Wilcoxon rank sum test has difficulty maintaining the correct size/significance level under informative clustering even if it is adjusted for cluster dependence through appropriate variance estimate. This proposal has a goal of developing proper classes of rank based tests (and related R estimators) and studying their statistical properties for three classical problems adapted to marginal inference under cluster dependence with informative cluster size. These are the so called one sample location problem (Aim 1), the regression problem (Aim 2) and the association problem (Aim 3). In each of these problems, we will obtain a class of test statistics using general score functions that maintain proper asymptotic size under the informative cluster size scenario. We will also study the properties of the related R estimates of marginal parameters. Multivariate extensions of the first two problems will also be considered (Aim 4). Another signification component of the proposed research will be to extend these procedures to handle missing data where the missingness mechanism can be modeled using observable covariates (Aim 5). Finally, when the cluster size is not informative, as in the case of Iowa Study which comprises of children only, we will be able to increase the power of our tests by incorporating cluster specific weights in the construction of our test statistics (Aim 6).

Public Health Relevance

The proposed research will lead to novel methodological and theoretical development in nonparametric/rank tests and estimators for clustered data that will have direct impact on the analyses of a dental data. The results from the proposed research have the potential to transform the way clustered data are handled in practice. Dental researchers and practitioners will be more aware of the informative cluster size issue and employ robust methods such as the ones developed here that accounts for the non-ignorability of the cluster size. -cluster exchangeability remains an issue.

Agency
National Institute of Health (NIH)
Institute
National Institute of Dental & Craniofacial Research (NIDCR)
Type
Small Research Grants (R03)
Project #
1R03DE020839-01A1
Application #
8046185
Study Section
Special Emphasis Panel (ZDE1-LK (25))
Program Officer
Harris, Emily L
Project Start
2011-09-01
Project End
2013-08-31
Budget Start
2011-09-01
Budget End
2012-08-31
Support Year
1
Fiscal Year
2011
Total Cost
$165,120
Indirect Cost
Name
University of Louisville
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
057588857
City
Louisville
State
KY
Country
United States
Zip Code
40292
Lorenz, Douglas J; Levy, Steven; Datta, Somnath (2018) Inferring marginal association with paired and unpaired clustered data. Stat Methods Med Res 27:1806-1817
Choo-Wosoba, Hyoyoung; Gaskins, Jeremy; Levy, Steven et al. (2018) A Bayesian approach for analyzing zero-inflated clustered count data with dispersion. Stat Med 37:801-812
Lan, Ling; Bandyopadhyay, Dipankar; Datta, Somnath (2017) Non-parametric regression in clustered multistate current status data with informative cluster size. Stat Neerl 71:31-57
Nevalainen, Jaakko; Oja, Hannu; Datta, Somnath (2017) Tests for informative cluster size using a novel balanced bootstrap scheme. Stat Med 36:2630-2640
Dutta, Sandipan; Datta, Somnath (2016) A rank-sum test for clustered data when the number of subjects in a group within a cluster is informative. Biometrics 72:432-40
Bible, Joe; Beck, James D; Datta, Somnath (2016) Cluster adjusted regression for displaced subject data (CARDS): Marginal inference under potentially informative temporal cluster size profiles. Biometrics 72:441-51
Choo-Wosoba, Hyoyoung; Levy, Steven M; Datta, Somnath (2016) Marginal regression models for clustered count data based on zero-inflated Conway-Maxwell-Poisson distribution with applications. Biometrics 72:606-18
Kong, Maiying; Xu, Sheng; Levy, Steven M et al. (2015) GEE type inference for clustered zero-inflated negative binomial regression with application to dental caries. Comput Stat Data Anal 85:54-66
Nevalainen, Jaakko; Datta, Somnath; Oja, Hannu (2014) Inference on the marginal distribution of clustered data with informative cluster size. Stat Pap (Berl) 55:71-92
Datta, Somnath; Beck, James D (2014) Robust estimation of marginal regression parameters in clustered data. Stat Modelling 14:489-501

Showing the most recent 10 out of 12 publications