The number of research efforts seeking to find genetic variants that predispose to human disease via genetic association studies has grown significantly since the completion of both the Human Genome Project and the International HapMap Project. In this research we consider alternate sample design methodologies for genetic association studies, with the goal of maximizing statistical power for testing genotype-phenotype association. Maximizing statistical power will allow researchers to more quickly and efficiently identify genetic variants predisposing individuals to complex human diseases. We will start by evaluating the cost-effectiveness of gathering duplicate genotype data. Duplicate genotype data is collected by twice genotyping some portion of individuals in a study using a method that may make classification errors (e.g. Single Nucleotide Polymorphisms (SNPs)). Current recommendations are for genetic association studies to duplicate genotype 5-10% of the individuals in the study. Recently, methods were proposed to include duplicate genotype data into genetic tests of association. However, no effort was made to evaluate whether or not gathering duplicate genotype data is cost-effective. We will evaluate the cost-effectiveness of gathering duplicate genotype data by examining power of sample designs which gather duplicate (or higher replicate) genotype data versus those that don't, on a fixed budget. In a similar manner we will consider the cost-effectiveness of obtaining conditional duplicate genotype data. Conditional duplicate genotype data is obtained by duplicate genotyping some individuals but at different rates, dependent upon the first observed genotype. We will also evaluate conditional double sampling, whereby fractions of individuals are sequenced (a near perfect method of genotyping) at rates dependent on the observed SNP genotype. We will synthesize these design recommendations with recommendations for the cost-effective implementation of double sampling. Double sampling involves sequencing a random fraction of individuals. Additionally, we will consider the cost- effectiveness of using classification methods which create informative missing data and demonstrate how informative missing data can be utilized in related tests of association. All design recommendations will be integrated into freely available web-tools so that researchers can quickly assess the cost-effectiveness of these alternative design strategies for their study. Research conclusions will be developed mathematically, confirmed via computer simulation and demonstrated on data from actual genetic association studies. Additionally, all research will be conducted with the active involvement of undergraduate research students. The number of research efforts seeking to find genetic variants that predispose to human disease via genetic association studies has grown significantly since the completion of both the Human Genome Project and the International HapMap Project. In this research we consider alternate sample design methodologies for genetic association studies, with the goal of maximizing statistical power for testing genotype-phenotype association. Maximizing statistical power will allow researchers to more quickly and efficiently identify genetic variants predisposing individuals to complex human diseases.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Academic Research Enhancement Awards (AREA) (R15)
Project #
3R15HG004543-01S1
Application #
7841342
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Ramos, Erin
Project Start
2009-06-01
Project End
2011-05-31
Budget Start
2009-06-01
Budget End
2011-05-31
Support Year
1
Fiscal Year
2009
Total Cost
$19,325
Indirect Cost
Name
Hope College
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
050947084
City
Holland
State
MI
Country
United States
Zip Code
49422
Beck, Andrew; Luedtke, Alexander; Liu, Keli et al. (2017) A POWERFUL METHOD FOR INCLUDING GENOTYPE UNCERTAINTY IN TESTS OF HARDY-WEINBERG EQUILIBRIUM. Pac Symp Biocomput 22:368-379
Aslibekyan, Stella; Almeida, Marcio; Tintle, Nathan (2014) Pathway analysis approaches for rare and common variants: insights from Genetic Analysis Workshop 18. Genet Epidemiol 38 Suppl 1:S86-91
Blue, Elizabeth M; Sun, Lei; Tintle, Nathan L et al. (2014) Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond. Genet Epidemiol 38 Suppl 1:S21-8
Mayer-Jochimsen, Morgan; Fast, Shannon; Tintle, Nathan L (2013) Assessing the impact of differential genotyping errors on rare variant tests of association. PLoS One 8:e56626
Petersen, Ashley; Alvarez, Carolina; DeClaire, Scott et al. (2013) Assessing methods for assigning SNPs to genes in gene-based tests of association using common variants. PLoS One 8:e62161
Liu, Keli; Luedtke, Alexander; Tintle, Nathan (2013) Optimal methods for using posterior probabilities in association testing. Hum Hered 75:2-11
Liu, Keli; Fast, Shannon; Zawistowski, Matthew et al. (2013) A geometric framework for evaluating rare variant tests of association. Genet Epidemiol 37:345-57
Bekmetjev, Airat; VanBruggen, Dirk; McLellan, Brian et al. (2012) The cost-effectiveness of reclassification sampling for prevalence estimation. PLoS One 7:e32058
Luedtke, Alexander; Powers, Scott; Petersen, Ashley et al. (2011) Evaluating methods for the analysis of rare variants in sequence data. BMC Proc 5 Suppl 9:S119
Tintle, Nathan; Aschard, Hugues; Hu, Inchi et al. (2011) Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17. Genet Epidemiol 35 Suppl 1:S56-60

Showing the most recent 10 out of 19 publications