The use of complex sampling, e.g. stratified multistage cluster sampling, in population based case-control studies is becoming more common. In addition to the cost- and time-effectiveness, the use of complex sample designs can also obtain representative samples from the study population and avoid the biased selection of controls and/or cases. To realize the full potential of these advances, however, there are at least two complexities introduced by the complex sampling, including 1) differential selection probabilities and 2) the intra-cluster correlations. As a result, the sample distribution can be different from the underlying population distribution from which the sample is selected. In this project, we will develop statistical methods accounting for the two complexities for the estimation of effects from genes, environmental factors and their interactions on the risk of complex diseases. Specifically, attracted by the efficiency advantage of the retrospective method, we will explore the assumptions of HWE and GE independence, and develop an efficient estimator suitable for the case-control study with a complex sample design. In practice, many case-control studies apply frequency matched designs, where controls are selected in numbers proportional to the number of cases within matching strata during the complex sampling. We will further incorporate the frequency-matching design into our proposed estimators. The proposed methods will be evaluated using simulations as well as two population-based case-control studies with complex sample designs. A unified software package will be developed.

Public Health Relevance

This project proposes innovative statistical methods for the analysis of data from population-based case-control studies when controls are sampled with a complex sample design. The results of this project will contribute to the understanding of the interplay of the genetic susceptibility and environmental risk factors, and provide an important resource for designing future population-based case-control studies.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project--Cooperative Agreements (U01)
Project #
5U01CA159424-02
Application #
8339451
Study Section
Special Emphasis Panel (ZCA1-SRLB-D (J1))
Program Officer
Dunn, Michelle C
Project Start
2011-09-27
Project End
2013-08-21
Budget Start
2012-09-01
Budget End
2013-08-21
Support Year
2
Fiscal Year
2012
Total Cost
$6,897
Indirect Cost
$2,360
Name
University of Texas Arlington
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
064234610
City
Arlington
State
TX
Country
United States
Zip Code
76019