Exploring regions of extreme diversity in the human genome

Meltz Steinberg, Karyn

Abstract

Sequences with less than 99.5% identity in the human genome, called regions of extreme diversity, are an important source of genetic variation that often contribute to human disease. These regions often overlap segmental duplications and are refractory to short read next-generation sequencing technologies that will fail to accurately map and align sequence reads. It is therefore necessary to exploit alternative sequencing methods such as long molecule sequencing. A sequenced fosmid clone provides a long, contiguous stretch of sequence from a single haplotype making fosmid clone libraries unique and powerful resources for detecting extreme genetic variation. The broad objective of this proposal is to use fosmid clone libraries from 16 diverse human genomes to characterize the sequence of regions of extreme nucleotide variation. I hypothesize that highly complex, divergent loci may represent uncharacterized duplications, mutational hotspots, or ancient haplotypes that have been maintained in human populations. Because highly divergent regions sometimes overlap structural variants, another objective is to characterize the sequences underlying common, recurrent structural variation in duplicated regions. These intractable regions often map to divergent SNP haplotypes or have variable breakpoints that traditional methods such as arrayCGH are unable to accurately genotype. Utilizing fosmid clone-derived end-sequence data, I have identified 385 loci greater than 100kb where four or more clones map to the region but the identity between the sample and reference is less than 99.5% as well as 208 loci of recurrent structural mutation. I will analyze sequence data from clones that map to these loci to examine the nucleotide diversity underlying these regions within a population genetic framework. Finally, to assess the worldwide distribution and population frequencies of this variation I will develop genotyping assays to test in a diverse panel of ethnic groups. The comprehensive annotation of these complex loci will serve as a benchmark for many next generation sequencing efforts such as the 1000 Genomes Project, and the experiments proposed here will enhance our understanding of human population genetics, evolutionary history and disease susceptibility.

Public Health Relevance

The study of the population genetics of complex regions of the human genome, where there are many differences between individuals, can provide insight into the evolutionary history of complex traits such as human disease. The knowledge gained here will enhance our understanding of the human genome and potentially influence further genomic and medical studies.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Postdoctoral Individual National Research Service Award (F32)
Project #: 5F32GM097807-02
Application #: 8370578
Study Section: Special Emphasis Panel (ZRG1-F08-E (20))
Program Officer: Reddy, Michael K

Project Start: 2011-09-16
Project End: 2012-11-15
Budget Start: 2012-09-16
Budget End: 2012-11-15
Support Year: 2
Fiscal Year: 2012
Total Cost: $11,315
Indirect Cost

Institution

Name: University of Washington
Department: Genetics
Type: Schools of Medicine
DUNS #: 605799469

City: Seattle
State: WA
Country: United States
Zip Code: 98195

Related projects


NIH 2012 F32 GM	Exploring regions of extreme diversity in the human genome Meltz Steinberg, Karyn Naomi / University of Washington	$11,315
NIH 2011 F32 GM	Exploring regions of extreme diversity in the human genome Meltz Steinberg, Karyn Naomi / University of Washington	$48,398

Publications

Watson, Corey T; Steinberg, Karyn Meltz; Graves, Tina A et al. (2015) Sequencing of the human IG light chain loci from a hydatidiform mole BAC library reveals locus-specific signatures of genetic diversity. Genes Immun 16:24-34

Watson, Corey T; Steinberg, Karyn M; Huddleston, John et al. (2013) Complete haplotype sequence of the human immunoglobulin heavy-chain variable, diversity, and joining genes and characterization of allelic and copy-number variation. Am J Hum Genet 92:530-46

Itsara, Andy; Vissers, Lisenka E L M; Steinberg, Karyn Meltz et al. (2012) Resolving the breakpoints of the 17q21.31 microdeletion syndrome with next-generation sequencing. Am J Hum Genet 90:599-613

Steinberg, Karyn Meltz; Antonacci, Francesca; Sudmant, Peter H et al. (2012) Structural diversity and African origin of the 17q21.31 inversion polymorphism. Nat Genet 44:872-80

Comments

Be the first to comment on Karyn Meltz Steinberg's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: