Abstract: Mammalian genomes have a complex physical structure shaped by myriad duplications, deletions and rearrangements, and this structure varies considerably among the populations and individuals of a species. These """"""""structural variations"""""""" are of special importance to our understanding of evolution and disease because single mutational events can affect large phenotypic changes, and because mutation rates vary dramatically among different genomic loci. We are only in the very early stages of understanding how structurally plastic genomes truly are, and why they are this way. Massively parallel paired-end DNA sequencing now offers the opportunity, in theory, to reconstruct the architecture of entire genomes on a routine basis. However, the practical utility of these methods remains limited by the significant computational challenges posed by proper data interpretation, and by cost. Over the past year we have developed novel experimental and computational tools, and we are now close to our initial goal of being able to comprehensively map structural variation in mammalian genomes, at reasonable cost and with modest computing power. We propose to apply these tools to examine structural variation in three especially revealing contexts: among diverse mouse strains with shared genealogical origins, among related mouse colonies separated by ~2,000 generations of breeding, and among single cells from diverse somatic lineages of the body and brain. In each case we will systematically identify and characterize """"""""hotspot"""""""" loci that mutate at elevated rates. These studies will yield an unbiased evaluation of the extent and origin of structural variation in mammalian genomes, and will enable us pursue our final goal: to develop a high-throughput platform for identifying factors that affect structural mutation rates. This work has immediate relevance to medicine considering that structural genomic variation has emerged as a major cause of both inherited and spontaneous human disease. Public Health Relevance: Structural variation is a ubiquitous feature of mammalian genomes, but little is known about the underlying process through which these duplications, deletions and rearrangements of DNA arise. This question is of great relevance to public health because spontaneous structural mutations in the germline contribute to a number of spontaneous human diseases, including autism and schizophrenia, and because mutations arising in somatic cells can lead to acquired diseases such as cancer. We will use powerful new DNA sequencing technologies and novel computational methods to investigate this process.

Agency
National Institute of Health (NIH)
Institute
Office of The Director, National Institutes of Health (OD)
Type
NIH Director’s New Innovator Awards (DP2)
Project #
1DP2OD006493-01
Application #
7852159
Study Section
Special Emphasis Panel (ZGM1-NDIA-O (02))
Program Officer
Basavappa, Ravi
Project Start
2009-09-30
Project End
2013-09-30
Budget Start
2009-09-30
Budget End
2013-09-30
Support Year
1
Fiscal Year
2009
Total Cost
$2,310,000
Indirect Cost
Name
University of Virginia
Department
Biochemistry
Type
Schools of Medicine
DUNS #
065391526
City
Charlottesville
State
VA
Country
United States
Zip Code
22904
Pfister, Katherine; Pipka, Justyna L; Chiang, Colby et al. (2018) Identification of Drivers of Aneuploidy in Breast Tumors. Cell Rep 23:2758-2769
Hazen, Jennifer L; Faust, Gregory G; Rodriguez, Alberto R et al. (2016) The Complete Genome Sequences, Unique Mutational Spectra, and Developmental Potency of Adult Neurons Revealed by Cloning. Neuron 89:1223-1236
Chiang, Colby; Layer, Ryan M; Faust, Gregory G et al. (2015) SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat Methods 12:966-8
Faust, Gregory G; Hall, Ira M (2014) SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 30:2503-5
Layer, Ryan M; Chiang, Colby; Quinlan, Aaron R et al. (2014) LUMPY: a probabilistic framework for structural variant discovery. Genome Biol 15:R84
McConnell, Michael J; Lindberg, Michael R; Brennand, Kristen J et al. (2013) Mosaic copy number variation in human neurons. Science 342:632-7
Malhotra, Ankit; Lindberg, Michael; Faust, Gregory G et al. (2013) Breakpoint profiling of 64 cancer genomes reveals numerous complex rearrangements spawned by homology-independent mechanisms. Genome Res 23:762-76
Mell, Joshua Chang; Hall, Ira M; Redfield, Rosemary J (2012) Defining the DNA uptake specificity of naturally competent Haemophilus influenzae cells. Nucleic Acids Res 40:8536-49
Faust, Gregory G; Hall, Ira M (2012) YAHA: fast and flexible long-read alignment with optimal breakpoint detection. Bioinformatics 28:2417-24
Quinlan, Aaron R; Hall, Ira M (2012) Characterizing complex structural variation in germline and somatic genomes. Trends Genet 28:43-53

Showing the most recent 10 out of 13 publications