1) Evolutionary classification of P-loop NTPases As a part of my ongoing research on the P-loop NTPases, which constitute the largest set of monophyletic protein domains in most proteomes, we completed the evolutionary classification of the AAA+ ATPases and the P-loop kinases. These studies helped us to identify the early diversification events in the history of these proteins. We provide evidence that the P-loop NTPases first differentiated into two classes, namely the KG class (Kinase, GTPase class) and the ASCE class (Additional strand conserved glutamate) that includes the AAA+, RecA-like, PilT/VirD4-like and ABC ATPases. These classes had several representatives that could be traced back to the last universal common ancestor of all life forms suggesting that they had undergone a vast radiation even before that stage. Hence they provide insights into some of the earliest aspects of protein evolution. 2) Analysis of the novel protease families A multi-pronged strategy including extensive sequence searches, structural modeling, and analysis of contextual information extracted from domain architectures, genetic screens, and large-scale protein-protein interaction analyses was employed to predict previously undetected components of the eukaryotic ubiquitin (Ub) signaling system. Two novel groups of proteins that are likely to function as de-ubiquitinating and de-SUMOylating peptidases (DUBs) were identified. The first group of putative DUBs, designated PPPDE superfamily (after Permuted Papain fold Peptidases of DsRNA viruses and Eukaryotes), consists of predicted thiol peptidases with a circularly permuted papain-like fold. In addition to eukaryotic proteins, the PPPDE superfamily includes predicted proteases from several groups of double-stranded RNA viruses and one single-stranded DNA virus. The apparent recruitment of DUBs for viral polyprotein processing seems to represent a common theme in evolution of viruses. The second group of putative DUBs identified in this study is the WLM (Wss1p-like metalloproteases) family of the Zincin-like superfamily of Zn-dependent peptidases, which are linked to the Ub-system by virtue of fusions with the UB-binding PUG (PUB), Ub-like, and Little Finger domains. More specifically, genetic evidence implicates the WLM family in de-SUMOylation. If validated experimentally, the WLM family proteins will represent the first case of a Zincin-like metalloprotease involvement in Ub-signaling. 3) Evolution of the nuclear membrane and nuclear pore complex The presence of a distinct nucleus, the compartment for confining the genome, transcription and RNA maturation, is a central (and eponymous) feature that distinguishes eukaryotes from prokaryotes. Structural integrity of the nucleus is maintained by the nuclear envelope (NE). A crucial element of this structure is the nuclear pore complex (NPC), a macromolecular machine with over 90 protein components, which mediates nucleo-cytoplasmic communication. Given the indispensability of these structures for nuclear function, the natural history of the nucleus can only be understood in terms of the origin and subsequent evolution of NE and NPC components. We investigated the provenance of the conserved domains found in these perinuclear proteins and reconstructed a parsimonious scenario for NE and NPC evolution by means of comparative-genomic analysis of their components from the available sequences of 28 sequenced eukaryotic genomes. We show that the NE and NPC proteins were tinkered together from diverse domains, which evolved from prokaryotic precursors at different points in eukaryotic evolution, divergence from pre-existing eukaryotic paralogs performing other functions, and de novo. It is shown that several central components of the NPC, in particular, the RanGDP import factor NTF2, the HEH domain of Src1p-Man1, and, probably, also the key domains of karyopherins and nucleoporins, the HEAT/ARM and WD40 repeats, have a bacterial, most likely, endosymbiotic origin. The specialized immunoglobulin (Ig) domain in the globular tail of the animal lamins, and the Ig domains in the nuclear membrane protein GP210 are shown to be related to distinct prokaryotic families of Ig domains. This suggests that independent, late horizontal gene transfer events from bacterial sources might have contributed to the evolution of perinuclear proteins in some of the major eukaryotic lineages. Snurportin 1, one of the highly conserved karyopherins, contains a cap-binding domain which is shown to be an inactive paralog of the guanylyl transferase domain of the mRNA-capping enzyme, exemplifying recruitment of paralogs of pre-exsiting proteins for perinuclear functions. We infer an autogenous scenario of nuclear evolution in which the nucleus emerged in the primitive eukaryotic ancestor (the ?prekaryote?) as part of cell compartmentalization triggered by archaeo-bacterial symbiosis. A pivotal event in this process was the radiation of Ras-superfamily GTPases yielding Ran, the key regulator of nuclear transport. A primitive NPC with approximately 20 proteins and a Src1p-Man1-like membrane protein with a DNA-tethering HEH domain are inferred to have been integral perinuclear components in the las common ancestor of modern eukaryotes.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM092504-01
Application #
6988475
Study Section
(CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
2004
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Balaji, S; Babu, M Madan; Iyer, Lakshminarayan M et al. (2005) Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains. Nucleic Acids Res 33:3994-4006
Aravind, L; Anantharaman, Vivek; Balaji, Santhanam et al. (2005) The many faces of the helix-turn-helix domain: transcription regulation and beyond. FEMS Microbiol Rev 29:231-62
Tasneem, Asba; Iyer, Lakshminarayan M; Jakobsson, Eric et al. (2005) Identification of the prokaryotic ligand-gated ion channels and their implications for the mechanisms and origins of animal Cys-loop ion channels. Genome Biol 6:R4
Iyer, Lakshminarayan M; Leipe, Detlef D; Koonin, Eugene V et al. (2004) Evolutionary history and higher order classification of AAA+ ATPases. J Struct Biol 146:11-31
Abrahamsen, Mitchell S; Templeton, Thomas J; Enomoto, Shinichiro et al. (2004) Complete genome sequence of the apicomplexan, Cryptosporidium parvum. Science 304:441-5
Iyer, Lakshminarayan M; Koonin, Eugene V; Aravind, L (2004) Evolution of bacterial RNA polymerase: implications for large-scale bacterial phylogeny, domain accretion, and horizontal gene transfer. Gene 335:73-88
Zhang, Hong; Christoforou, Andrea; Aravind, L et al. (2004) The C. elegans Polycomb gene SOP-2 encodes an RNA binding protein. Mol Cell 14:841-7
Wertz, Ingrid E; O'Rourke, Karen M; Zhou, Honglin et al. (2004) De-ubiquitination and ubiquitin ligase domains of A20 downregulate NF-kappaB signalling. Nature 430:694-9
D'Angelo, Anna; Garzia, Livia; Andre, Alessandra et al. (2004) Prune cAMP phosphodiesterase binds nm23-H1 and promotes cancer metastasis. Cancer Cell 5:137-49
Pradel, Gabriele; Hayton, Karen; Aravind, L et al. (2004) A multidomain adhesion protein family expressed in Plasmodium falciparum is essential for transmission to the mosquito. J Exp Med 199:1533-44

Showing the most recent 10 out of 30 publications