Population genetics is a crucial tool for facilitating medical genetics and disease gene mapping studies. This proposal outlines two novel population genetic techniques that each facilitate disease gene mapping in different ways. First, it describes a probabilistic technique for detecting the proportion of homozygosity an individual is likely to have based on SNP array data. This is useful for prioritizing disease case sample individuals for whole genome sequencing and subsequent homozygosity mapping to identify recessive disease genes. Such a tool is useful for large outbred populations such as European Americans for which homozygous regions are likely to be short and therefore cannot be unambiguously detected using existing techniques for SNP array data. I will apply this homozygosity prioritization scheme to a dataset of individuals with autism to identify individuals that will be most informative to resequence and will perform homozygosity mapping on the sequence data to locate recessive disease genes.
The second aim i s to develop methods to permit combined admixture mapping and genome-wide association (GWA) of Latinos, a population for which current admixture mapping methods fail. Latino admixture mapping is challenging because suitable reference haplotypes for their Native American ancestry are lacking, and because existing techniques cannot model their complex three-way admixture. I will explore three methods for performing admixture mapping of Latinos, including using linear combinations of collections of Native American reference haplotypes and utilizing the information about Native American variation present in the Latinos themselves. To show that my approach works in practice, I will apply this tool to study the genetics of type 2 diabetes, a disease with higher prevalence among Latinos.

Public Health Relevance

Insights from population genetics and statistical methods grounded in medical genetics have been very important in facilitating medical genetics and disease gene mapping studies. This pro- posal describes novel population genetic techniques with direct application to disease gene mapping, both in populations that have experienced founder events and in U.S. Latino populations that are admixed. The technology that I develop will be widely applicable to many disease gene mapping studies, but I will focus as proof-of-principle on two diseases: autism and type 2 diabetes.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Postdoctoral Individual National Research Service Award (F32)
Project #
5F32HG005944-03
Application #
8325692
Study Section
Special Emphasis Panel (ZRG1-F08-E (20))
Program Officer
Junkins, Heather
Project Start
2010-07-01
Project End
2013-06-30
Budget Start
2012-07-01
Budget End
2013-06-30
Support Year
3
Fiscal Year
2012
Total Cost
$52,190
Indirect Cost
Name
Harvard University
Department
Genetics
Type
Schools of Medicine
DUNS #
047006379
City
Boston
State
MA
Country
United States
Zip Code
02115
Mercader, Josep M; Liao, Rachel G; Bell, Avery D et al. (2017) A Loss-of-Function Splice Acceptor Variant in IGF2 Is Protective for Type 2 Diabetes. Diabetes 66:2903-2914
Williams, Amy L; Genovese, Giulio; Dyer, Thomas et al. (2015) Non-crossover gene conversions show strong GC bias and unexpected clustering in humans. Elife 4:
SIGMA Type 2 Diabetes Consortium; Williams, Amy L; Jacobs, Suzanne B R et al. (2014) Sequence variants in SLC16A11 are a common risk factor for type 2 diabetes in Mexico. Nature 506:97-101
Williams, Amy L; Patterson, Nick; Glessner, Joseph et al. (2012) Phasing of many thousands of genotyped samples. Am J Hum Genet 91:238-51