Repbase Update (www.girinst.org) is a database of repetitive elements currently representing over 3,200 families and subfamilies of transposable elements (TEs) from eukaryotic species. Each family is identifiable by a unique name, and its annotation includes biologically meaningful information such as species of origin, systematic classification, keywords, references to the scientific literature or names of contributors, and brief commentaries. Most of the 1,799 repetitive families added to Repbase Update (RU) during the last cycle are either unreported anywhere else or have been thoroughly revised in terms of their consensus sequences and biological classification. Our approach is based on computer-assisted reconstruction and analysis of publicly available DNA sequence data. Additional contributions come via the peer-reviewed electronic journal Repbase Reports. RU has been routinely used by public institutions, genome sequencing consortia, and private companies throughout the world for gene discovery, polymorphism studies, sequence assembly and probe design. RU content has been partially described in original peer-reviewed articles, reviews and book chapters. The database has become a unique resource for individual research projects of biological and medical importance and for the creation of secondary databases. During the next five years RU is expected to double in size. The new information will be extracted, reconstructed, analyzed, annotated, classified, indexed and made available to researchers. We will also add an undetermined number of unannotated repetitive families. We propose the following specific aims to meet the challenge:
(1) Continue detection, reconstruction, annotation and electronic distribution of reference sequences for repetitive families from all sequenced eukaryotic species.
(2) Continue systematic analysis and classification of repetitive families and prepare comprehensive dictionaries cross-referencing their nomenclature and biological classification.
(3) Facilitate external submissions to RU and provide collaborative support to smaller databases compiled by different research groups.
(4) Upgrade the CENSOR program for annotation and analysis of repetitive DNA and continue generating maps of repetitive elements for selected genomes.
(5) Pursue development of automated de novo identification and analysis of transposable elements.
(6) Organize small workshops devoted to training and standardization of repeat nomenclature.

Agency
National Institutes of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Biotechnology Resource Grants (P41)
Project #
5P41LM006252-11
Application #
7391784
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Florance, Valerie
Project Start
1996-09-30
Project End
2009-04-30
Budget Start
2008-05-01
Budget End
2009-04-30
Support Year
11
Fiscal Year
2008
Total Cost
$265,639
Indirect Cost
Name
Genetic Information Research Institute
Department
Type
DUNS #
946494259
City
Mountain View
State
CA
Country
United States
Zip Code
94043
Hubley, Robert; Finn, Robert D; Clements, Jody et al. (2016) The Dfam database of repetitive DNA families. Nucleic Acids Res 44:D81-9
Kojima, Kenji K (2015) A New Class of SINEs with snRNA Gene-Derived Heads. Genome Biol Evol 7:1702-12
Kojima, Kenji K; Jurka, Jerzy (2015) Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia. PLoS One 10:e0140084
Kojima, Kenji K; Jurka, Jerzy (2013) A superfamily of DNA transposons targeting multicopy small RNA genes. PLoS One 8:e68260
Wheeler, Travis J; Clements, Jody; Eddy, Sean R et al. (2013) Dfam: a database of repetitive DNA based on profile hidden Markov models. Nucleic Acids Res 41:D70-82
Jurka, Jerzy; Bao, Weidong; Kojima, Kenji K et al. (2012) Distinct groups of repetitive families preserved in mammals correspond to different periods of regulatory innovations in vertebrates. Biol Direct 7:36
Groenen, Martien A M; Archibald, Alan L; Uenishi, Hirohide et al. (2012) Analyses of pig genomes provide insight into porcine demography and evolution. Nature 491:393-8
Lehnert, Stefan; Kapitonov, Vladimir; Thilakarathne, Pushpike J et al. (2011) Modeling the asymmetric evolution of a mouse and rat-specific microRNA gene cluster intron 10 of the Sfmbt2 gene. BMC Genomics 12:257
Jurka, Jerzy; Bao, Weidong; Kojima, Kenji K (2011) Families of transposable elements, population structure and the origin of species. Biol Direct 6:44
Kojima, Kenji K; Kapitonov, Vladimir V; Jurka, Jerzy (2011) Recent expansion of a new Ingi-related clade of Vingi non-LTR retrotransposons in hedgehogs. Mol Biol Evol 28:17-20

Showing the most recent 10 out of 46 publications