(Taken from project abstract) Repetitive DNA represents over 50% of eukaryotic genomes and contains a tremendous wealth of information about genomic structure and evolution. Its knowledge is of crucial importance for understanding complex relationships between genetic blueprints and their biological functions. It is also of practical significance for genome mapping and sequencing projects. Inevitably, researchers studying growing numbers of DNA sequences in the coming years will have to analyze increasing numbers of repetitive elements integrated within them in ever growing detail. There are at least three major aspects of such studies: (I) automatic annotation of repetitive DNA using reference collections of sequences of known repeats and specialized computer software; (ii) evaluation of particular repeats based on the existing biological information, and (iii) other specialized studies involving sequence context and properties of the repeats. Such studies will require electronically available and constantly updated database containing reference collections of repeats, description of the reference repeats organized in a condensed form, and specialized collections allowing detailed studies of available homologous repeats. The first database of repetitive DNA known as """"""""Repbase"""""""" has been released by our group in 1992. Here we propose further steps necessary to improve and enhance this resource. Our specific goals include: 1. Develop and maintain reference collections of repetitive elements from all sequenced eukaryotic species. 2. Initiate a peer-reviewed electronic encyclopedia containing, updated key information on repetitive DNA. 3. Develop and maintain secondary collections of repetitive DNA.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Biotechnology Resource Grants (P41)
Project #
5P41LM006252-03
Application #
2771702
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Florance, Valerie
Project Start
1996-09-30
Project End
1999-11-30
Budget Start
1998-09-01
Budget End
1999-11-30
Support Year
3
Fiscal Year
1998
Total Cost
Indirect Cost
Name
Genetic Information Research Institute
Department
Type
DUNS #
City
Mountain View
State
CA
Country
United States
Zip Code
94043
Hubley, Robert; Finn, Robert D; Clements, Jody et al. (2016) The Dfam database of repetitive DNA families. Nucleic Acids Res 44:D81-9
Kojima, Kenji K (2015) A New Class of SINEs with snRNA Gene-Derived Heads. Genome Biol Evol 7:1702-12
Kojima, Kenji K; Jurka, Jerzy (2015) Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia. PLoS One 10:e0140084
Kojima, Kenji K; Jurka, Jerzy (2013) A superfamily of DNA transposons targeting multicopy small RNA genes. PLoS One 8:e68260
Wheeler, Travis J; Clements, Jody; Eddy, Sean R et al. (2013) Dfam: a database of repetitive DNA based on profile hidden Markov models. Nucleic Acids Res 41:D70-82
Jurka, Jerzy; Bao, Weidong; Kojima, Kenji K et al. (2012) Distinct groups of repetitive families preserved in mammals correspond to different periods of regulatory innovations in vertebrates. Biol Direct 7:36
Groenen, Martien A M; Archibald, Alan L; Uenishi, Hirohide et al. (2012) Analyses of pig genomes provide insight into porcine demography and evolution. Nature 491:393-8
Lehnert, Stefan; Kapitonov, Vladimir; Thilakarathne, Pushpike J et al. (2011) Modeling the asymmetric evolution of a mouse and rat-specific microRNA gene cluster intron 10 of the Sfmbt2 gene. BMC Genomics 12:257
Jurka, Jerzy; Bao, Weidong; Kojima, Kenji K (2011) Families of transposable elements, population structure and the origin of species. Biol Direct 6:44
Kojima, Kenji K; Kapitonov, Vladimir V; Jurka, Jerzy (2011) Recent expansion of a new Ingi-related clade of Vingi non-LTR retrotransposons in hedgehogs. Mol Biol Evol 28:17-20

Showing the most recent 10 out of 46 publications