Population genetics models play a crucial role in many aspects of modern disease gene mapping studies, from identification of population stratification to haplotype phasing. Though these studies to date have largely been per- formed in populations of European descent, in the coming years they will expand to a large number of diverse populations. This proposal outlines novel statistical models for describing allele frequencies in multiple populations, with applications relevant to the next generation of disease mapping studies. First, I describe a novel model for inferring population history from genome-wide allele frequency data. The history of a population is an important determinant of the amount of genetic variation and extent of linkage disequilibrium in the population, but current methods for inferring history are either limited to a small number of populations or do not allow for gene flow be- tween populations. A proposed approach overcomes these problems, and will allow for efficient modeling of allele frequencies in a large number of diverse populations. Second, I describe an approach for local ancestry inference in populations with arbitrarily complex admixture histories. Many populations involved in disease mapping studies are the result of mixtures between multiple populations (e.g. Latinos throughout the Americas) or between populations without close modern equivalents (e.g. many populations in India). Though local ancestry inference in these populations is important for localizing association signals, most current methods are not designed for these situations. I propose a model that explicitly accounts for the relationship of ancestral populations to modern ones, and will allow for efficient local ancestry inference in populations with complex demographic histories. Finally, I describe a method for detecting subtle changes in allele frequency due to natural selection. It is likely that natural selection in humans acts to cause small shifts in allele frequency at many loci;however, most methods to detect selection rely on rapid fixation of strongly selected alleles. I propose a method, based on explicitly modeling the demographic relationship between populations, to overcome this obstacle.

Public Health Relevance

Population genetics models are crucial for many aspects of disease gene mapping studies. This proposal de- scribes novel models for describing ancestry in diverse human populations, particularly in admixed populations like African-Americans and Latinos in the United States. The technology developed will be applicable in disease gene mapping studies in diverse populations.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Postdoctoral Individual National Research Service Award (F32)
Project #
5F32GM103098-02
Application #
8542505
Study Section
Special Emphasis Panel (ZRG1-F08-Q (20))
Program Officer
Reddy, Michael K
Project Start
2012-09-01
Project End
2015-08-31
Budget Start
2013-09-01
Budget End
2014-08-31
Support Year
2
Fiscal Year
2013
Total Cost
$49,214
Indirect Cost
Name
Harvard University
Department
Genetics
Type
Schools of Medicine
DUNS #
047006379
City
Boston
State
MA
Country
United States
Zip Code
02115
Pickrell, Joseph K; Patterson, Nick; Loh, Po-Ru et al. (2014) Ancient west Eurasian ancestry in southern and eastern Africa. Proc Natl Acad Sci U S A 111:2632-7
Pickrell, Joseph K; Reich, David (2014) Toward a new history and geography of human genes informed by ancient DNA. Trends Genet 30:377-89
Pickrell, Joseph K (2014) Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am J Hum Genet 94:559-73
Loh, Po-Ru; Lipson, Mark; Patterson, Nick et al. (2013) Inferring admixture histories of human populations using linkage disequilibrium. Genetics 193:1233-54
Pickrell, Joseph K; Patterson, Nick; Barbieri, Chiara et al. (2012) The genetic prehistory of southern Africa. Nat Commun 3:1143
Pickrell, Joseph K; Pritchard, Jonathan K (2012) Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet 8:e1002967