The HapMap project will generate an unprecedented volume of human genotype data that will allow a wealth of inferences about human variation extending well beyond the original goals of the project. We propose a series of investigations that center around the following four aims. First, the HapMap SNPs did not arise from complete resequencing of the samples, but rather the data are from assays of previously discovered SNPs. This imposes an ascertainment bias that may impact many aspects of subsequent analysis, including spurious appearance of associations, underestimating of LD, and potentially serious underestimation of population structure. We plan to investigate the consequences of uncorrected ascertainment bias and the means for correcting this bias. Second, if one imagines that SNPs arise in the population with a range of selection coefficients, then properties of the distribution of selection coefficients can be estimated with increasing power as the number of SNPs increases. Given the magnitude of the HapMap project, it will provide an unprecedented ability to infer the role of natural selection in shaping human variation. Third, while the physical map of the human genome is essentially complete, the genetic map remains considerably lower in resolution because of the nature of sampling meiotic recombination events from limited pedigrees. Even so, the genetic map was greatly improved recently, opening the opportunity for a much more detailed analysis of the relation between local genetic recombination rate, physical separation of markers, linkage disequilibrium, frequency spectrum, and other population genetic attributes of SNPs. Observation of levels of LD that are not commensurate with the local rate of recombination may be a sign of recent natural selection. This investigation entails extensive simulation, and devising of MCMC approaches to parameter estimation and hypothesis testing, centering on the null hypothesis that variability among genomic regions in local population genetic attributes is explicable by the local recombination rate. Finally, the central purpose of HapMap is to provide a step toward the ultimate goal of identifying allelic variation associated with risk of complex disease. The dense LD map provided by HapMap will enable us to simulate disease associations so as to thoroughly quantify the power of whole-genome LD association inference. These tests will help identify the attributes of local LD that best predict the power of association tests, and will serve as additional guidance to identify regions requiring more dense SNP coverage.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003229-02
Application #
6895089
Study Section
Mammalian Genetics Study Section (MGN)
Program Officer
Brooks, Lisa
Project Start
2004-05-21
Project End
2007-04-30
Budget Start
2005-05-01
Budget End
2006-04-30
Support Year
2
Fiscal Year
2005
Total Cost
$286,848
Indirect Cost
Name
Cornell University
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
872612445
City
Ithaca
State
NY
Country
United States
Zip Code
14850
Racimo, Fernando; Marnetto, Davide; Huerta-Sánchez, Emilia (2017) Signatures of Archaic Adaptive Introgression in Present-Day Human Populations. Mol Biol Evol 34:296-317
Racimo, Fernando; Gokhman, David; Fumagalli, Matteo et al. (2017) Archaic Adaptive Introgression in TBX15/WARS2. Mol Biol Evol 34:509-524
Henn, Brenna M; Botigué, Laura R; Peischl, Stephan et al. (2016) Distance from sub-Saharan Africa predicts mutational load in diverse human genomes. Proc Natl Acad Sci U S A 113:E440-9
Poznik, G David; Xue, Yali; Mendez, Fernando L et al. (2016) Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences. Nat Genet 48:593-9
Slavney, Andrea; Arbiza, Leonardo; Clark, Andrew G et al. (2016) Strong Constraint on Human Genes Escaping X-Inactivation Is Modulated by their Expression Level and Breadth in Both Sexes. Mol Biol Evol 33:384-93
Racimo, Fernando; Sankararaman, Sriram; Nielsen, Rasmus et al. (2015) Evidence for archaic adaptive introgression in humans. Nat Rev Genet 16:359-71
Hunter-Zinck, Haley; Clark, Andrew G (2015) Aberrant Time to Most Recent Common Ancestor as a Signature of Natural Selection. Mol Biol Evol 32:2784-97
Ma, Li; Keinan, Alon; Clark, Andrew G (2015) Biological knowledge-driven analysis of epistasis in human GWAS with application to lipid traits. Methods Mol Biol 1253:35-45
Rohlfs, Rori V; Aguiar, Vitor R C; Lohmueller, Kirk E et al. (2015) Fitting the Balding-Nichols model to forensic databases. Forensic Sci Int Genet 19:86-91
Henn, Brenna M; Botigué, Laura R; Bustamante, Carlos D et al. (2015) Estimating the mutation load in human genomes. Nat Rev Genet 16:333-43

Showing the most recent 10 out of 103 publications