) How new mutations affect an individual's fitness lies at the heart of many questions in evolutionary genetics and is highly relevant for elucidating the genetic basis of complex traits. Current evidence suggests that while many mutations are deleterious, selection coefficients for particular mutations come from a distribution of se- lective effects (DSE). Although the DSE has been subjected to intense study over the past decade, many open questions about this distribution and the importance of deleterious mutations in evolution remain unanswered. The widespread use of next-generation sequencing has provided a plethora of new polymorphism datasets that are potentially informative about the nature of fitness effects of mutations. However, existing computational approaches make simplifying assumptions and lack statistical power to extract the full potential of these data. This project will develop and apply new computational approaches to integrate these emerging polymorphism datasets from multiple species together in a rigorous statistical framework to estimate fundamental parameters relating to natural selection across genomes. We will develop novel computational methods that use very large samples of human genomes to better estimates selective effects. First, we propose to leverage runs of homo- zygosity within an individual to co-estimates dominance coefficients and the DSE. We will develop a second set of methods that utilizes transmission patters within large numbers of pedigrees to estimate selective ef- fects. Third, previous studies of the DSE have assumed all sites are independent of each other and have ig- nored epistasis. We will develop a composite likelihood approach to estimate the degree of epistasis between many pairs of SNPs. We will then apply these methods to the large cohorts of human genomes that are being sequenced to obtain the most detailed estimates of the DSE, dominance coefficients, and nature of epistasis from human populations. Next, these new computational tools will be combined with data from a variety of dif- ferent species to address longstanding questions concerning selective effects. We will investigate the evolution of the DSE itself by testing whether it has changed in response to recent shifts in the environment. As an ex- ample, we will test whether the change in selective pressure associated with dog domestication has led to dif- ferences in the DSE between dogs and wolves. This will be accomplished by generating exome sequencing data from of dogs and wolves. Finally, we will investigate why different species appear to show disparate amounts of amino acid adaptation. This work will resolve a current paradox in population genetics and will con- tribute to understanding the different modes of adaptation across different species. In sum, successful comple- tion of this research will provide the most comprehensive estimates of dominance, the DSE, and the amount of epistasis in natural populations to date. These estimates will dramatically improve the interpretation of genetic variation in future evolutionary and medical genetic studies.

Public Health Relevance

A major challenge in identifying genetic variants contributing risk to complex disease is distinguishing genetic variants that disrupt biological function from those that do not. Approaches to distinguish these types of variants routinely take advantage of evolutionary information, either to prioritize variants that have been subjected to natural selection or to calibrate models of background genetic variation in the genome. This project is relevant to public health because it will greatly increase our knowledge of deleterious mutations and, by doing so, will increase the utility of evolutionary information for identifying biologically relevant and pathogenic variants.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Unknown (R35)
Project #
5R35GM119856-02
Application #
9340237
Study Section
Special Emphasis Panel (ZRG1-CB-E (50)R)
Program Officer
Janes, Daniel E
Project Start
2016-09-01
Project End
2021-05-31
Budget Start
2017-06-01
Budget End
2018-05-31
Support Year
2
Fiscal Year
2017
Total Cost
$304,159
Indirect Cost
$101,901
Name
University of California Los Angeles
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
092530369
City
Los Angeles
State
CA
Country
United States
Zip Code
90095
Mooney, Jazlyn A; Huber, Christian D; Service, Susan et al. (2018) Understanding the Hidden Complexity of Latin American Population Isolates. Am J Hum Genet 103:707-726
Robinson, Jacqueline A; Brown, Caitlin; Kim, Bernard Y et al. (2018) Purging of Strongly Deleterious Mutations Explains Long-Term Persistence and Absence of Inbreeding Depression in Island Foxes. Curr Biol 28:3487-3494.e4
Huber, Christian D; Durvasula, Arun; Hancock, Angela M et al. (2018) Gene expression drives the evolution of dominance. Nat Commun 9:2750
Kim, Bernard Y; Huber, Christian D; Lohmueller, Kirk E (2018) Deleterious variation shapes the genomic landscape of introgression. PLoS Genet 14:e1007741
Huber, Christian D; Kim, Bernard Y; Marsden, Clare D et al. (2017) Determining the factors driving selective effects of new nonsynonymous mutations. Proc Natl Acad Sci U S A 114:4465-4470
Kim, Bernard Y; Huber, Christian D; Lohmueller, Kirk E (2017) Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples. Genetics 206:345-361
Pedersen, Casper-Emil T; Lohmueller, Kirk E; Grarup, Niels et al. (2017) The Effect of an Extreme and Prolonged Population Bottleneck on Patterns of Deleterious Variation: Insights from the Greenlandic Inuit. Genetics 205:787-801
Beichman, Annabel C; Phung, Tanya N; Lohmueller, Kirk E (2017) Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories. G3 (Bethesda) 7:3605-3620