The repetitive DNA structure of the human genome The recent completion of the human genome project gives today's scientists the privileged opportunity to provide, for the first and only time, a detailed comprehensive description of the structure of the human DNA sequence. Large parts of our genome remain relatively understudied, especially the repetitive DNA fractions, which account for greater than 45% of our total DNA sequence. Many important genomic turnover mechanisms contribute to the large accumulation of repetitive DNA in our genome, including genomic duplication, transposition, unequal crossing over, and gene conversion, which have a huge impact on the structure of our genome over the course of evolution. Therefore, we propose to undertake the first genome- wide survey and analysis of three distinct aspects of human repetitive DNA, by developing novel computer algorithms, genome analysis tools, and rigourous experimental approaches. 1) We propose to identify and characterize the complete catalogue of human inverted DNA repeats, which have been associated with many important genome functions such as DNA replication, meiotic crossover, and gene conversion. Our results have shown that the human X chromosome contains a preponderance of large highly homologous inverted repeats that contain testes genes. 2) We propose to investigate novel classes of tandemly repeated """"""""satellite"""""""" DNA that contain human transposons. These are organized in multiple large arrays primarily in the pericentromeric regions of chromosomes, where rapid chromosome evolution takes place. We have identified and characterized a large family of tandem repeats composed almost entirely of rearranged MaLR LTR transposons, found on 8 different human chromosomes in arrays as large as 70kb. 3) We propose to perform a genome-wide analysis of human transposable elements (TE's) by analyzing the large number of nested transposon clusters where newer TE's have transposed into older TE's. We have developed unique methodology and computer algorithms that can locate and index all such transposon clusters in the human genome, and can derive a relative chronological order of human TEs over the course of evolution. This represents a completely novel method of studying molecular evolution that is not dependent on the assumption of a constant mutation rate (molecular clock). The studies proposed in this application will facilitate both computational and biological approaches to genomics and provide a unique analysis of a large and relatively neglected portion of our DNA sequence.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM072084-01A1
Application #
7030500
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Anderson, Richard A
Project Start
2006-05-01
Project End
2010-04-30
Budget Start
2006-05-01
Budget End
2007-04-30
Support Year
1
Fiscal Year
2006
Total Cost
$330,621
Indirect Cost
Name
Mount Sinai School of Medicine
Department
Genetics
Type
Schools of Medicine
DUNS #
078861598
City
New York
State
NY
Country
United States
Zip Code
10029
Burnside, R D; Ibrahim, J; Flora, C et al. (2011) Interstitial deletion of proximal 8q including part of the centromere from unbalanced segregation of a paternal deletion/marker karyotype with neocentromere formation at 8p22. Cytogenet Genome Res 132:227-32
Strawbridge, Eva M; Benson, Gary; Gelfand, Yevgeniy et al. (2010) The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome. Curr Genet 56:321-40
Scott, Stuart A; Cohen, Ninette; Brandt, Tracy et al. (2010) Large inverted repeats within Xp11.2 are present at the breakpoints of isodicentric X chromosomes in Turner syndrome. Hum Mol Genet 19:3383-93
Abrusan, Gyorgy; Giordano, Joti; Warburton, Peter E (2008) Analysis of transposon interruptions suggests selection for L1 elements on the X chromosome. PLoS Genet 4:e1000172
Warburton, Peter E; Hasson, Dan; Guillem, Flavia et al. (2008) Analysis of the largest tandemly repeated DNA families in the human genome. BMC Genomics 9:533
Giordano, Joti; Ge, Yongchao; Gelfand, Yevgeniy et al. (2007) Evolutionary history of mammalian transposons determined by genome-wide defragmentation. PLoS Comput Biol 3:e137