Rank tests for clustered data with potentially informative cluster size: Novel st

Datta, Somnath

Abstract

The overall goal of this proposal is to develop appropriate rank-based tests for clustered data when the cluster size is potentially informative, and to apply the resulting methods to various marginal comparisons (e.g., average condition of teeth before and after treatment) using existing dental database resources, specifically those obtained from the Piedmont 65+ Dental Study and the Iowa Fluoride Study. Informative cluster size arises when the number of units in a cluster is random rather than constant and is correlated with the outcome of interest. In the context of dental data, all teeth belonging to an individual form a cluster. Since tooth loss (in adults) is correlated with two of the diseases we are planning to study, namely periodontal disease and dental caries, we have potentially informative cluster sizes in the Piedmont data sets. It is a methodological challenge to adapt a classical rank test to such situations. For example, the two-sample Wilcoxon rank sum test has difficulty maintaining the correct size (significance level) under informative clustering, even when it is adjusted for cluster dependence through an appropriate variance estimate. This proposal aims to develop proper classes of rank-based tests (and related R estimators) and to study their statistical properties for three classical problems adapted to marginal inference under cluster dependence with informative cluster size. These are the so-called one-sample location problem (Aim 1), the regression problem (Aim 2) and the association problem (Aim 3). In each of these problems, we will obtain a class of test statistics, based on general score functions, that maintain proper asymptotic size under the informative cluster size scenario. We will also study the properties of the related R estimates of marginal parameters. Multivariate extensions of the first two problems will also be considered (Aim 4).
Another significant component of the proposed research will be to extend these procedures to handle missing data where the missingness mechanism can be modeled using observable covariates (Aim 5). Finally, when the cluster size is not informative, as in the case of the Iowa study, which comprises children only, we will be able to increase the power of our tests by incorporating cluster-specific weights in the construction of our test statistics (Aim 6).
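The notion of informative cluster size described above can be illustrated with a small sketch. The data below are hypothetical (three patients whose per-tooth disease scores rise as the number of remaining teeth falls), and the two summaries shown are only one simple way to see the issue; they are not the rank tests proposed in the abstract. Weighting each tooth by the inverse of its cluster size is an assumption introduced here for illustration.

```python
from statistics import mean

# Hypothetical toy data: each key is a patient (cluster) and each value lists
# per-tooth disease scores. Patients with fewer remaining teeth score higher,
# so cluster size is correlated with the outcome.
clusters = {
    "patient_a": [1.0, 1.0, 1.0, 1.0],
    "patient_b": [2.0, 2.0, 2.0],
    "patient_c": [4.0],
}


def pooled_mean(data):
    """Average over all teeth, ignoring which patient each tooth belongs to."""
    return mean(score for scores in data.values() for score in scores)


def cluster_weighted_mean(data):
    """Average of per-patient means, i.e. each tooth weighted by 1 / cluster size."""
    return mean(mean(scores) for scores in data.values())


if __name__ == "__main__":
    print("pooled mean over teeth:  ", pooled_mean(clusters))
    print("mean of patient means:   ", cluster_weighted_mean(clusters))
```

With these made-up numbers the pooled per-tooth mean is 1.75, while the mean of patient means is about 2.33. The pooled summary is pulled toward patients who retain more (and healthier) teeth, which is the kind of distortion that makes unadjusted unit-level procedures unreliable when cluster size is informative.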

Public Health Relevance

The proposed research will lead to novel methodological and theoretical developments in nonparametric/rank tests and estimators for clustered data that will have a direct impact on the analysis of dental data. The results from the proposed research have the potential to transform the way clustered data are handled in practice. Dental researchers and practitioners will become more aware of the informative cluster size issue and will be able to employ robust methods, such as the ones developed here, that account for the non-ignorability of the cluster size. -cluster exchangeability remains an issue.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Institute of Dental & Craniofacial Research (NIDCR)
Type: Small Research Grants (R03)
Project #: 5R03DE020839-02
Application #: 8321444
Study Section: Special Emphasis Panel (ZDE1-LK (25))
Program Officer: Harris, Emily L

Project Start: 2011-09-01
Project End: 2014-08-31
Budget Start: 2012-09-01
Budget End: 2014-08-31
Support Year: 2
Fiscal Year: 2012
Total Cost: $148,537
Indirect Cost: $34,488

Institution

Name: University of Louisville
Department: Biostatistics & Other Math Sci
Type: Schools of Public Health
DUNS #: 057588857

City: Louisville
State: KY
Country: United States
Zip Code: 40292

Related projects


NIH 2012 R03 DE	Rank tests for clustered data with potentially informative cluster size: Novel st Datta, Somnath / University of Louisville	$148,537
NIH 2011 R03 DE	Rank tests for clustered data with potentially informative cluster size: Novel st Datta, Somnath / University of Louisville	$165,120

Publications

Lorenz, Douglas J; Levy, Steven; Datta, Somnath (2018) Inferring marginal association with paired and unpaired clustered data. Stat Methods Med Res 27:1806-1817

Choo-Wosoba, Hyoyoung; Gaskins, Jeremy; Levy, Steven et al. (2018) A Bayesian approach for analyzing zero-inflated clustered count data with dispersion. Stat Med 37:801-812

Lan, Ling; Bandyopadhyay, Dipankar; Datta, Somnath (2017) Non-parametric regression in clustered multistate current status data with informative cluster size. Stat Neerl 71:31-57

Nevalainen, Jaakko; Oja, Hannu; Datta, Somnath (2017) Tests for informative cluster size using a novel balanced bootstrap scheme. Stat Med 36:2630-2640

Dutta, Sandipan; Datta, Somnath (2016) A rank-sum test for clustered data when the number of subjects in a group within a cluster is informative. Biometrics 72:432-440

Bible, Joe; Beck, James D; Datta, Somnath (2016) Cluster adjusted regression for displaced subject data (CARDS): Marginal inference under potentially informative temporal cluster size profiles. Biometrics 72:441-451

Choo-Wosoba, Hyoyoung; Levy, Steven M; Datta, Somnath (2016) Marginal regression models for clustered count data based on zero-inflated Conway-Maxwell-Poisson distribution with applications. Biometrics 72:606-618

Kong, Maiying; Xu, Sheng; Levy, Steven M et al. (2015) GEE type inference for clustered zero-inflated negative binomial regression with application to dental caries. Comput Stat Data Anal 85:54-66

Nevalainen, Jaakko; Datta, Somnath; Oja, Hannu (2014) Inference on the marginal distribution of clustered data with informative cluster size. Stat Pap (Berl) 55:71-92

Datta, Somnath; Beck, James D (2014) Robust estimation of marginal regression parameters in clustered data. Stat Modelling 14:489-501

Showing the most recent 10 out of 12 publications
