In the last few years, the rapid accumulation of genome sequences and protein structures has been paralleled by major advances in sequence database search methods. The powerful Position-Specific Iterated BLAST (PSI-BLAST) method developed at the NCBI formed the basis of our work on protein motif analysis. A new mode of PSI-BLAST application was developed and implemented as an automatic procedure: an exhaustive database search that repeats PSI-BLAST iterations to convergence using newly identified protein family members. Another new procedure, IMPALA, reverses the PSI-BLAST approach and allows one to search a library of protein family profiles with an individual protein sequence as the query. These methods were applied to the systematic analysis of several classes of protein domains. It was shown that a number of signaling domains previously considered to be specifically eukaryotic are detectable in archaea and/or bacteria. By combining domain detection with cross-genome comparison, these domains were classified into ancestral and horizontally transferred ones. The evolutionary histories of the protein domains that make up the repair systems and programmed cell death systems were investigated in detail. In addition, the DNA-binding domains encoded in archaeal genomes were studied thoroughly, demonstrating that the repertoire of such domains in archaea resembles that in bacteria but not that in eukaryotes. A number of previously undetected domains and protein families were discovered, including the ACT domain, a multipurpose ligand-binding module involved in allosteric regulation of a variety of enzymes, and a superfamily of predicted proteases from bacteria, archaea and eukaryotes that are homologous to animal transglutaminases.
Protein sequence motifs, iterative database search, fold recognition, multiple alignment
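The exhaustive search mode described in the abstract can be pictured as a closure over search hits: each newly found family member is itself used as a query until no new members turn up. The sketch below is only an illustration of that idea, not the project's implementation. The `search` callable is a stand-in for a PSI-BLAST run iterated to convergence, and the names `exhaustive_family_search`, `toy_search` and `TOY_HITS` are hypothetical.

```python
from collections import deque
from typing import Callable, Iterable, Set


def exhaustive_family_search(
    seed: str, search: Callable[[str], Iterable[str]]
) -> Set[str]:
    """Collect a family by re-running `search` from every new member.

    `search` is an assumed placeholder for one converged PSI-BLAST run:
    it takes a query identifier and returns identifiers of its hits.
    """
    family = {seed}
    queue = deque([seed])
    while queue:
        query = queue.popleft()
        for hit in search(query):
            if hit not in family:
                # A new member: record it and schedule it as a future query.
                family.add(hit)
                queue.append(hit)
    return family


# Toy hit table standing in for a sequence database (hypothetical data).
TOY_HITS = {
    "A": ["B"],
    "B": ["A", "C"],
    "C": ["B"],
    "D": ["E"],
}


def toy_search(query: str) -> Iterable[str]:
    """Look up the precomputed hits for a query; unknown queries hit nothing."""
    return TOY_HITS.get(query, [])


if __name__ == "__main__":
    print(sorted(exhaustive_family_search("A", toy_search)))
```

In this toy table, "C" is not a direct hit of "A" but is reached through "B", which is the effect the repeated searches are meant to capture. A real run would also need inclusion thresholds and checks against false positives, which this sketch leaves out.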

Agency: National Institutes of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Intramural Research (Z01)
Project #: 1Z01LM000061-06
Application #: 6290486
Study Section: Special Emphasis Panel (CBB)
Project Start:
Project End:
Budget Start:
Budget End:
Support Year: 6
Fiscal Year: 1999
Total Cost:
Indirect Cost:

Name: National Library of Medicine
Department:
Type:
DUNS #:
City:
State:
Country: United States
Zip Code:
Ng, C Leong; Waterman, David G; Koonin, Eugene V et al. (2009) Conformational flexibility and molecular interactions of an archaeal homologue of the Shwachman-Bodian-Diamond syndrome protein. BMC Struct Biol 9:32
Yutin, Natalya; Wolf, Maxim Y; Wolf, Yuri I et al. (2009) The origins of phagocytosis and eukaryogenesis. Biol Direct 4:9
Wolf, Yuri I; Novichkov, Pavel S; Karev, Georgy P et al. (2009) Inaugural Article: The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages. Proc Natl Acad Sci U S A 106:7273-80
Koonin, Eugene V; Aravind, L (2009) Comparative genomics, evolution and origins of the nuclear envelope and nuclear pore complex. Cell Cycle 8:1984-5
Makarova, Kira S; Wolf, Yuri I; Koonin, Eugene V (2009) Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes. Biol Direct 4:19
Makarova, Kira S; Wolf, Yuri I; van der Oost, John et al. (2009) Prokaryotic homologs of Argonaute proteins are predicted to function as key components of a novel system of defense against mobile genetic elements. Biol Direct 4:29
Galperin, Michael Y (2008) Telling bacteria: do not LytTR. Structure 16:657-9
Hou, Shaobin; Makarova, Kira S; Saw, Jimmy H W et al. (2008) Complete genome sequence of the extremely acidophilic methanotroph isolate V4, Methylacidiphilum infernorum, a representative of the bacterial phylum Verrucomicrobia. Biol Direct 3:26
Basu, Malay Kumar; Carmel, Liran; Rogozin, Igor B et al. (2008) Evolution of protein domain promiscuity in eukaryotes. Genome Res 18:449-61
Elkins, James G; Podar, Mircea; Graham, David E et al. (2008) A korarchaeal genome reveals insights into the evolution of the Archaea. Proc Natl Acad Sci U S A 105:8102-7

Showing the most recent 10 out of 50 publications