The human genome is arguably one of the most well-annotated and best-assembled mammalian genomes; nevertheless, important gaps remain in our understanding of its sequence organization, function, and evolution. Our genome is particularly enriched for complex segmental duplications, which harbor rapidly evolving genes and predispose our species' genome to non-allelic homologous recombination and disease. The long-term objective of our research has been to develop computational and experimental methods to understand the organization, diversity, and disease impact of segmental duplications. The goal of this competing renewal is to begin to understand the function and evolution of the duplicated genes themselves. We propose to focus here on 13 human- and great ape-specific gene families that have expanded within the last 15 million years of ape evolution. There are three aims: (1) Understand the genetic diversity and structure of these recent duplications by generating high-quality reference sequences using clone-based resources and long-read sequencing technologies; (2) Reconstruct gene family history by comparative sequencing of loci in great apes and exploring changes in gene structure, rates of substitution, and expression; and (3) Develop a robust genotyping assay based on molecular inversion probes to assess the genetic variation of these genes at the population level and the effect of forces such as non-allelic gene conversion. We hypothesize that segmental duplications have played an important role in human neurocognitive adaptation and that patterns of copy number polymorphisms and substitution will differ significantly between functional and nonfunctional paralogs. This research has the additional benefit that it will add new sequence to the human genome, identify missing genes, and provide us with the ability to systematically explore genetic variation of regions frequently overlooked as part of disease-association studies.

Public Health Relevance

This proposal focuses on the genetic characterization of ~120 genes from 13 gene families within complex regions of duplication that have been difficult to sequence and assemble. The work will improve the quality of the genome, provide a fundamental understanding of how new genes arise, and develop a novel approach to rapidly assess genetic variation of these genes as they relate to human disease and evolutionary adaptation.

National Institutes of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Brooks, Lisa
University of Washington
Schools of Medicine
United States
Hehir-Kwa, Jayne Y; Marschall, Tobias; Kloosterman, Wigard P et al. (2016) A high-quality human reference panel reveals the complexity and distribution of genomic structural variants. Nat Commun 7:12989
Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H et al. (2016) Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility. Nature 536:205-9
Gordon, David; Huddleston, John; Chaisson, Mark J P et al. (2016) Long-read sequence assembly of the gorilla genome. Science 352:aae0344
Shi, Lingling; Guo, Yunfei; Dong, Chengliang et al. (2016) Long-read sequencing and de novo assembly of a Chinese genome. Nat Commun 7:12065
Huddleston, John; Eichler, Evan E (2016) An Incomplete Understanding of Human Genetic Variation. Genetics 202:1251-4
Mohajeri, Kiana; Cantsilieris, Stuart; Huddleston, John et al. (2016) Interchromosomal core duplicons drive both evolutionary instability and disease susceptibility of the Chromosome 8p23.1 region. Genome Res 26:1453-1467
Dennis, Megan Y; Eichler, Evan E (2016) Human adaptation and evolution by segmental duplication. Curr Opin Genet Dev 41:44-52
Watson, C T; Steinberg, K M; Graves, T A et al. (2015) Sequencing of the human IG light chain loci from a hydatidiform mole BAC library reveals locus-specific signatures of genetic diversity. Genes Immun 16:24-34
1000 Genomes Project Consortium; Auton, Adam; Brooks, Lisa D et al. (2015) A global reference for human genetic variation. Nature 526:68-74
Xue, Yali; Prado-Martinez, Javier; Sudmant, Peter H et al. (2015) Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding. Science 348:242-5

Showing the most recent 10 out of 72 publications