The rapid accumulation of genome sequences and protein structures during the last decade has been paralleled by major advances in sequence database search methods. The powerful Position-Specific Iterating BLAST (PSI-BLAST) method developed at the NCBI formed the basis of our work on protein motif analysis. In addition, Hidden Markov Models (HMM) and protein structure comparison methods were applied. During the last year, we made further progress in detailed analysis of the classification, evolution, and functions of several classes of proteins. Specifically, we studied in detail the protein domains that are involved in eukaryotic RNA interference mechanisms and showed that the protein machinery of eukaryotic RNAi was pieced together from ancestral archaeal, bacterial and phage proteins that are involved in DNA repair and RNA processing. We also used computational methods to identify a novel prokaryotic toxin-antitoxin systems that are predicted to function via RNA binding or cleavage and other, diverse mechanisms. We explored the evolution of the eukaryotic phagocytosis system and demonstrated the presence of actin-related domains in a distinct set of crenarchaeota and korarchaeota, suggesting that the archaeal ancestor of eukaryotes might have already possessed a primitive form of branched cytoskeleton that facilitated engulfment of other prokaryotes. We further investigated the evolution and genomic context of prokaryotic homologs of the eukaryotic argonaute protein (the key component of the RNAi system) and showed that the genes encoding these proteins tend to be located within """"""""genomic islands"""""""" that also contain many other genes for various defense systems. These findings suggest that the prokaryotic homologs of argonaute are components of a novel defense system that also includes a variety of other putative nucleases that we identifies using sensitive methods of domain analysis. We additionally contributed to experimental and structural analysis of a variety of domains previously identified in our computational studies, in particular, the key components of the prokaryotic CRSIPR-associated acqauired immunity system.

Project Start
Project End
Budget Start
Budget End
Support Year
16
Fiscal Year
2009
Total Cost
$829,277
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
Zip Code
Krupovic, Mart; Cvirkaite-Krupovic, Virginija; Iranzo, Jaime et al. (2018) Viruses of archaea: Structural, functional, environmental and evolutionary genomics. Virus Res 244:181-193
Yutin, Natalya; Makarova, Kira S; Gussow, Ayal B et al. (2018) Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut. Nat Microbiol 3:38-46
He, Fei; Bhoobalan-Chitty, Yuvaraj; Van, Lan B et al. (2018) Anti-CRISPR proteins encoded by archaeal lytic viruses inhibit subtype I-D immunity. Nat Microbiol 3:461-469
Shmakov, Sergey A; Makarova, Kira S; Wolf, Yuri I et al. (2018) Systematic prediction of genes functionally linked to CRISPR-Cas systems by gene neighborhood analysis. Proc Natl Acad Sci U S A 115:E5307-E5316
Pushkarev, Alina; Inoue, Keiichi; Larom, Shirley et al. (2018) A distinct abundant group of microbial rhodopsins discovered using functional metagenomics. Nature 558:595-599
Yutin, Natalya; Bäckström, Disa; Ettema, Thijs J G et al. (2018) Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis. Virol J 15:67
Ferrer, Manuel; Sorokin, Dimitry Y; Wolf, Yuri I et al. (2018) Proteomic Analysis of Methanonatronarchaeum thermophilum AMET1, a Representative of a Putative New Class of Euryarchaeota, ""Methanonatronarchaeia"". Genes (Basel) 9:
Koonin, Eugene V; Makarova, Kira S (2018) Discovery of Oligonucleotide Signaling Mediated by CRISPR-Associated Polymerases Solves Two Puzzles but Leaves an Enigma. ACS Chem Biol 13:309-312
Galperin, Michael Y; Makarova, Kira S; Wolf, Yuri I et al. (2018) Phyletic Distribution and Lineage-Specific Domain Architectures of Archaeal Two-Component Signal Transduction Systems. J Bacteriol 200:
Smargon, Aaron A; Cox, David B T; Pyzocha, Neena K et al. (2017) Cas13b Is a Type VI-B CRISPR-Associated RNA-Guided RNase Differentially Regulated by Accessory Proteins Csx27 and Csx28. Mol Cell 65:618-630.e7

Showing the most recent 10 out of 117 publications