Linkage disequilibrium is the nonrandom association of alleles at linked loci, which can be used as a gene mapping technique to refine the location of a disease susceptibility locus. Designing linkage disequilibrium studies have received very little attention in recent statistics literature, particularly when the cost of the study and the available sample size are limited. Here we address these issues by proposing two two-stage designs. An optimal design is one which maximizes the power or the chance of identifying the true disease susceptibility locus. The first design will be evaluated when the total cost of the study is the only constraint, for a given number of markers. The goal will be to determine the optimal allocation of sample size in a two-stage process. We will consider a second design when the constraints are on the total cost and the available sample size. The goal of this design will be to determine the number of markers to be genotyped in a two-stage process, and the intermarker spacing which will provide an optimal design. Finally, we will derive the critical value of a test statistics under these two designs. We will first obtain an approximation to the power function and the critical value under the two proposed designs. We will use Monte Carlo simulations to investigate how these values are affected by the choice of total number of markers, intermarker distance, and the total sample size. With increased attention on whole genome studies, the designs proposed here will permit researchers to allocate their limited resources in a manner whereby the power to identify the disease susceptibility locus, if any, is maximized.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM060457-02
Application #
6343095
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Eckstrand, Irene A
Project Start
2000-01-01
Project End
2002-12-31
Budget Start
2001-01-01
Budget End
2001-12-31
Support Year
2
Fiscal Year
2001
Total Cost
$136,001
Indirect Cost
Name
Sloan-Kettering Institute for Cancer Research
Department
Type
DUNS #
064931884
City
New York
State
NY
Country
United States
Zip Code
10065
Baum, A E; Akula, N; Cabanero, M et al. (2008) A genome-wide association study implicates diacylglycerol kinase eta (DGKH) and several other genes in the etiology of bipolar disorder. Mol Psychiatry 13:197-207
Sen, Saunak; Satagopan, Jaya M; Broman, Karl W et al. (2007) R/qtlDesign: inbred line cross experimental design. Mamm Genome 18:87-93
Satagopan, Jaya M; Sen, Saunak; Churchill, Gary A (2007) Sequential quantitative trait locus mapping in experimental crosses. Stat Appl Genet Mol Biol 6:Article12
Sen, Saunak; Satagopan, Jaya M; Churchill, Gary A (2005) Quantitative trait locus study design from an information perspective. Genetics 170:447-64
Satagopan, J M; Ben-Porat, L; Berwick, M et al. (2004) A note on competing risks in survival data analysis. Br J Cancer 91:1229-35
Satagopan, Jaya M; Venkatraman, E S; Begg, Colin B (2004) Two-stage designs for gene-disease association studies with sample size constraints. Biometrics 60:589-97
Satagopan, Jaya M; Elston, Robert C (2003) Optimal two-stage genotyping in population-based association studies. Genet Epidemiol 25:149-57
Satagopan, Jaya M; Panageas, Katherine S (2003) A statistical perspective on gene expression data analysis. Stat Med 22:481-99
Satagopan, Jaya M; Verbel, David A; Venkatraman, E S et al. (2002) Two-stage designs for gene-disease association studies. Biometrics 58:163-70
Satagopan, Jaya M; Boyd, Jeff; Kauff, Noah D et al. (2002) Ovarian cancer risk in Ashkenazi Jewish carriers of BRCA1 and BRCA2 mutations. Clin Cancer Res 8:3776-81