The proposed research will address a great challenge of modern human genetics: understanding the recent evolutionary history of our species. This is important not only as key basic knowledge but also for many medical genetic questions, such as when and where disease-contributing mutations arose. The backdrop for such medical questions is the demographic history of human populations, which includes population sizes and their changes, population founding events, migrations, and admixture. Two new methods for estimating the demographic history of a species will be developed. Both have been designed to address three major challenges that limit current methods. First, methods intended to study the history of gene exchange must include recombination in the model, because recombination and gene exchange interact strongly in shaping patterns of genetic variation. Second, methods must be able to handle large, complex demographic models of human populations, especially histories with gene exchange. Third, methods must be able to handle very large data sets quickly and without introducing biases. The first new method will provide estimates of the allele frequency spectrum (AFS) for pairs of linked single-nucleotide polymorphisms (SNPs). Because it includes recombination between pairs of SNPs, the two-SNP AFS is far larger and holds much more information on gene exchange than the conventional single-SNP AFS. The second new method will be a genealogy sampling approach that uses a greatly reduced representation of genealogies (gene trees). This method does not give up any information, yet unlike most current genealogy samplers it can be used for models with recombination and for large data sets. The third goal is to apply these methods to the study of human demographic history among African populations.
African populations have ancient and complex genetic histories that include all of the major demographic processes considered here. Both methods will be applied to a large, high-quality population genomic data set that currently includes 33 genomes from 7 populations, including three hunter-gatherer populations. The goal is an accurate and rich multi-population portrait of human history in Africa.
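To make the contrast between the single-SNP and two-SNP AFS concrete, the following is a minimal Python sketch that tabulates both from phased haplotypes. It is an illustration only, not the estimator proposed here: the function names, the 0/1 (ancestral/derived) haplotype encoding, and the `max_dist` cutoff for deciding which pairs count as linked are all assumptions made for this example.

```python
from collections import Counter
from itertools import combinations


def single_snp_afs(haplotypes):
    """Count sites by derived-allele count; entry k holds the number of
    sites at which exactly k of the n haplotypes carry the derived allele."""
    n = len(haplotypes)
    n_sites = len(haplotypes[0])
    afs = [0] * (n + 1)
    for s in range(n_sites):
        afs[sum(h[s] for h in haplotypes)] += 1
    return afs


def two_snp_afs(haplotypes, positions, max_dist):
    """Tabulate pairs of sites by their two-site haplotype counts.

    Each pair within max_dist (hypothetical cutoff, in base pairs) is
    keyed by (n11, n10, n01): the number of haplotypes that are derived
    at both sites, at the first site only, and at the second site only.
    """
    spectrum = Counter()
    for i, j in combinations(range(len(positions)), 2):
        if abs(positions[j] - positions[i]) > max_dist:
            continue  # treat distant pairs as unlinked and skip them
        n11 = sum(1 for h in haplotypes if h[i] == 1 and h[j] == 1)
        n10 = sum(1 for h in haplotypes if h[i] == 1 and h[j] == 0)
        n01 = sum(1 for h in haplotypes if h[i] == 0 and h[j] == 1)
        spectrum[(n11, n10, n01)] += 1
    return spectrum


# Toy data (made up for illustration): 4 haplotypes, 3 sites.
haps = [
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
    [0, 1, 1],
]
pos = [100, 250, 9000]

one_site = single_snp_afs(haps)
two_site = two_snp_afs(haps, pos, max_dist=10000)
```

In this toy data all three sites have the same derived-allele count, so the single-SNP AFS cannot tell them apart, while the two-SNP table separates the pair whose derived alleles never share a haplotype from the pairs whose derived alleles sometimes do. The key has three indices rather than one, which is why the two-SNP spectrum has many more cells for the same sample size.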

Public Health Relevance

Identifying and understanding the genetic contribution to human diseases depends in part upon having a backdrop of detailed knowledge of the demographic history of our species. The proposed research will provide new methods capable of revealing the demographic history of human populations and will apply these methods to population genomic data from African populations.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Special Emphasis Panel (ZRG1-GGG-H (02))
Program Officer
Eckstrand, Irene A
Temple University
Schools of Arts and Sciences
United States
Schrider, Daniel R; Kern, Andrew D (2016) S/HIC: Robust Identification of Soft and Hard Sweeps Using Machine Learning. PLoS Genet 12:e1005928
Sethuraman, Arun; Hey, Jody (2016) IMa2p--parallel MCMC and inference of ancient demography under the Isolation with migration (IM) model. Mol Ecol Resour 16:206-15
Schrider, Daniel R; Shanku, Alexander G; Kern, Andrew D (2016) Effects of Linked Selective Sweeps on Demographic Inference and Model Selection. Genetics :
Chen, Hua; Hey, Jody; Slatkin, Montgomery (2015) A hidden Markov model for investigating recent positive selection through haplotype structure. Theor Popul Biol 99:18-30
Hey, Jody; Chung, Yujin; Sethuraman, Arun (2015) On the occurrence of false positives in tests of migration under an isolation-with-migration model. Mol Ecol 24:5078-83
Chen, Hua; Hey, Jody; Chen, Kun (2015) Inferring Very Recent Population Growth Rate from Population-Scale Sequencing Data: Using a Large-Sample Coalescent Estimator. Mol Biol Evol 32:2996-3011
Schrider, Daniel R; Mendes, Fábio K; Hahn, Matthew W et al. (2015) Soft shoulders ahead: spurious signatures of soft and partial selective sweeps result from linked hard sweeps. Genetics 200:267-84
Sousa, Vitor C; Carneiro, Miguel; Ferrand, Nuno et al. (2013) Identifying loci under selection against gene flow in isolation-with-migration models. Genetics 194:211-33
Loh, Yong-Hwee E; Bezault, Etienne; Muenzel, Frauke M et al. (2013) Origins of shared genetic variation in African cichlids. Mol Biol Evol 30:906-17
Wang, Yong; Nielsen, Rasmus (2012) Estimating population divergence time and phylogeny from single-nucleotide polymorphisms data with outgroup ascertainment bias. Mol Ecol 21:974-86
