Complex dependent data involving cluster sampling, longitudinal designs and hierarchical sampling schemes arise frequently in epidemiologic studies of aging and chronic diseases. Such data allow investigators to estimate important effects of covariates on response in an efficient manner. For example, longitudinal data are essential to assess changes in health status over time and determinants of those changes; cluster designs arise naturally in studies involving groups such as families or as the only feasible way to gather large probability samples. Generalized linear mixed models and marginal methods such as generalized estimating equation approaches provide effective analyses of complex dependent data but give rise to additional estimation/inferential/interpretational problems that this proposal will address. Generalized linear mixed models typically involve intractable integrals and popular methods for avoiding this integration yield highly biased estimates of covariate effects and variance components. The generalized estimating equations approach offers several alternative methods for confidence interval construction and variance estimation but few studies have examined or compared the performance of these methods and no guidelines exist to help data analysts choose appropriate and efficient methods or to understand why different methods yield different results. Case-control family designs should allow investigators to more efficiently estimate the associations of interest in the case-control sample, to estimate associations controlled for family characteristics and propensities and to measure familial aggregation (within-family dependence) of the response. However, there has been little investigation of statistical methods for such data. This research will develop and evaluate statistical methods to analyze complex dependent data by developing and evaluating methods for fitting generalized linear mixed models; developing guidelines for the choice of appropriate and efficient confidence interval construction and variance estimation for marginal models; and developing and evaluating methods to analyze case-control family data. This research extends our previous work and addresses many of the issues raised by the 1996 Nantucket conference on the state of the art of methods for longitudinal data analysis and the 1999 NSF-CBMS Regional Conference on generalized linear mixed models. We will produce illustrative, comparative analyses of data from several longitudinal and clustered studies of chronic disease. The comparisons of alternative approaches will identify which are the best for specific applications as well as potentially identify new methods. The results of this research will provide clear guidelines as to the advantages and disadvantages of alternative approaches so that biomedical investigators can effectively construct and use longitudinal and cluster study designs, perform improved inference and avoid inappropriate analyses or incorrect interpretations.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA082370-03
Application #
6514075
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Feuer, Eric J
Project Start
2000-06-09
Project End
2005-04-30
Budget Start
2002-05-01
Budget End
2005-04-30
Support Year
3
Fiscal Year
2002
Total Cost
$165,938
Indirect Cost
Name
University of California San Francisco
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
073133571
City
San Francisco
State
CA
Country
United States
Zip Code
94143
McCulloch, Charles E; Neuhaus, John M; Olin, Rebecca L (2016) Biased and unbiased estimation in longitudinal studies with informative visit processes. Biometrics 72:1315-1324
Zeng, Lily; Josephson, S Andrew; Fukuda, Keiko A et al. (2015) A Prospective Comparison of Informant-based and Performance-based Dementia Screening Tools to Predict In-Hospital Delirium. Alzheimer Dis Assoc Disord 29:312-6
Neuhaus, John M; Scott, Alastair J; Wild, Christopher J et al. (2014) Likelihood-based analysis of longitudinal data from outcome-related sampling designs. Biometrics 70:44-52
Levine, Deborah A; Haan, Mary N; Langa, Kenneth M et al. (2013) Impact of gender and blood pressure on poststroke cognitive decline among older Latinos. J Stroke Cerebrovasc Dis 22:1038-45
Saberi, Parya; Johnson, Mallory O; McCulloch, Charles E et al. (2011) Medication adherence: tailoring the analysis to the data. AIDS Behav 15:1447-53
McCulloch, Charles E; Neuhaus, John M (2011) Prediction of random effects in linear and generalized linear models under model misspecification. Biometrics 67:270-9
Neuhaus, John M; McCulloch, Charles E (2011) The effect of misspecification of random effects distributions in clustered data settings with outcome-dependent sampling. Can J Stat 39:488-497
Neuhaus, John M; McCulloch, Charles E; Boylan, Ross (2010) A Note on Type II Error Under Random Effects Misspecification in Generalized Linear Mixed Models. Biometrics :
Pa, Judy; Boxer, Adam; Chao, Linda L et al. (2009) Clinical-neuroimaging characteristics of dysexecutive mild cognitive impairment. Ann Neurol 65:414-23
Boxer, Adam L; Lipton, Anne M; Womack, Kyle et al. (2009) An open-label study of memantine treatment in 3 subtypes of frontotemporal lobar degeneration. Alzheimer Dis Assoc Disord 23:211-7