Inferring the demographic history of a population is an important task in population genetics. Although several methods are available for this task, how to take advantage of large sample size of whole genome sequence data and provide accurate estimation of demographic history remains an open question. We propose several approaches to overcome the shortcomings of existing methods and specifically improve their accuracy and scalability for large sample size and whole genome sequence data. The resulting methods will be applied to the whole genome sequences of the genotype-rich human populations such as the TOPMED European American cohorts (~30,000 individuals) and Icelander whole genome sequence data (2,636 individuals), and provide good estimation of the demographic histories. Finally, a software package will be developed to incorporate the new methods and assist other researchers to easily apply the method to their own data.

Public Health Relevance

We will develop methods for accurately inferring the demographic history of a population using large whole genome sequence data, and apply it to multiple phenotype-rich human populations. Successful completion of these aims will enable population geneticists to obtain a better understanding of prehistoric demographic events of a population, and further disentangle the effects of demographic events and natural selection on human genome.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG009524-05
Application #
9933971
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Brooks, Lisa
Project Start
2018-08-30
Project End
2021-05-31
Budget Start
2020-06-01
Budget End
2021-05-31
Support Year
5
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of South Florida
Department
Type
DUNS #
069687242
City
Tampa
State
FL
Country
United States
Zip Code
33617