A DNA topoisomerase IB in Thaumarchaeota testifies for the presence of this enzyme in the last common ancestor of Archaea and Eucarya
© Brochier-Armanet et al; licensee BioMed Central Ltd. 2008
Received: 25 November 2008
Accepted: 23 December 2008
Published: 23 December 2008
DNA topoisomerase IB (TopoIB) was thought for a long time to be a eukaryotic specific enzyme. A shorter version was then found in viruses and later on in several bacteria, but not in archaea. Here, we show that a eukaryotic-like TopoIB is present in the recently sequenced genomes of two archaea of the newly proposed phylum Thaumarchaeota. Phylogenetic analyses suggest that a TopoIB was present in the last common ancestor of Archaea and Eucarya. This finding indicates that the last common ancestor of Archaea and Eucarya may have harboured a DNA genome.
This article was reviewed by Eugene Koonin and Anthony Poole
DNA topoisomerases are ubiquitous enzymes that control DNA topology and solve topological conflicts arising during DNA replication, transcription, and recombination [1–3] (For a recent review on DNA topoisomerases see also ). Based on their mechanisms of action, DNA topoisomerases belong to two classes, type I (Topo I) and type II (Topo II): Topo I change the number of DNA topological links by introducing transient single-stranded breaks in the DNA molecule, whereas Topo II introduce transient double-stranded breaks. According to phylogenetic criteria, both Topo II and Topo I classes regroup several families of unrelated (i.e. non homologous) proteins: Topo IIA and IIB on one hand, and Topo IA (that also includes the so-called Topo III of eukaryotes and bacteria), IB and IC on the other hand [5, 6]. This indicates that enzymes with either Topo I or Topo II activity originated multiple times independently in the course of evolution. For instance, Topo IIA and IIB share a homologous ATP binding subunit, but their DNA cleavage-religation subunits are non homologous and are structurally unrelated [2, 7]. Regarding Topo I enzymes, Topo IA, which form a transient covalent link in 5' of the DNA break during the reaction of topoisomerization, share a Toprim domain with Topo II, some nucleases and primases , whereas Topo IB, which form a transient covalent link in 3' of the DNA break, are distantly related to tyrosine recombinases [2, 9]. Although Topo IC forms a 3' DNA link similarly to Topo IB, it harbors a novel unique fold, and is unrelated to Topo IB and tyrosine recombinases . The three different Topo I families show very distinctive distributions in the living world: Topo IA are present in currently available complete genomes of organisms from the three domains of life , whereas Topo IC appears so far specific to one particular species, the archaeon Methanopyrus kandleri . Finally, Topo IB is present in eukaryotes, in poxviruses, in the mimivirus, and in some bacteria [6, 10, 11].
Up to now, Topo IB have never been observed in Archaea, in sharp contrast to members of the Topo IA family which are present in one or more copies in all archaeal genomes  (Additional files 2 and 3). Surprisingly, we recently noticed that a Topo IB coding gene was identified in the genome of the archaeon Cenarchaeum symbiosum [23, 24], but that a Topo IA coding gene was absent . Phylogenetic analyses of the archaeal domain based on concatenation of ribosomal proteins and comparative genome analysis have recently led us to propose that C. symbiosum and its relatives, formerly included in the phylum Crenarchaeota, should be considered as members of a separate and possibly ancient phylum, that we proposed to name Thaumarchaeota . We predicted that the absence of a Topo IA and the presence of a Topo IB might be a distinctive feature of all thaumarchaeota members. As expected, we have detected an archaeal Topo IB homologue (YP_001582656), misannotated as an 2-alkenal reductase, in the recently sequenced genome of a second thaumarchaeon Nitrosopumilus maritimus , which also lacks a Topo IA homologue. Both thaumarchaeal Topo IB display a domain organisation that is very similar to that of their eukaryotic homologues, since these harbour both the N-terminal Topoisom_I_N and the C-terminal Topoisom_I domain (Figure 1A and Additional files 1). The main difference between the eukaryotic and the archaeal Topo IB is that the former possess a long and highly variable extension upstream of the Topoisom_I_N domain that is absent in the archaeal sequences (Figure 1A and Additional files 1). Two hypotheses can be proposed to explain the presence of a Topo IB coding gene in Thaumarchaeota. One is that this gene was acquired by the last common ancestor of Thaumarchaeota via a horizontal gene transfer (HGT) (blue arrow, Figure 1B-a). In that case, the donor would have been a eukaryote since both the thaumarchaeal and the eukaryotic Topo IB harbour a similar domain organisation. Alternatively, a Topo IB coding gene might have been present in the last common ancestor of Archaea and Eucarya and was then lost in all archaea, except in the lineage leading to Thaumarchaeota (Figures 1Bb-d). To distinguish between these two hypotheses on the origin of thaumarchaeal Topo IB, we have performed an in-depth phylogenetic analysis of Topo IB homologues.
Topo IB have been for long thought to be absent in Archaea. Our finding now extends the presence of Topo IB homologues in members of all three domains of life. This may thus suggest that this enzyme was already present in the Last Universal Common Ancestor (LUCA). However, Topo IB homologues are either absent or scarcely distributed in complete genomes from most main bacterial phyla (Additional files 3). Moreover, the bacterial part of the Topo IB tree is not congruent with the bacterial specie tree (i.e. the monophyly of main bacterial groups is not recovered, Figure 2), suggesting that the history of Topo IB in Bacteria was dominated by lateral gene transfers. It was previously suggested that the viral-like Topo IB found in Bacteria was originally introduced from a DNA virus . Our new and more detailed phylogenetic analysis, as well as the similarity of the domain organisation of viral and bacterial Topo IB, confirms the close relationship between these sequences and their probable common ancestry, although the direction of transfer is yet unclear.
The likely presence of both a Topo IA and Topo IB in the last common archaeal ancestor ( and this study, respectively), suggests that this ancestor was possibly more "complex" than modern archaea (if complexity is defined in terms of number of genes and/or redundancy of cellular processes). This idea was already proposed by Lecompte et al. who highlighted a streamlining in the evolution of archaeal ribosomes . This is consistent with the recent observation that several proteins common to Archaea and Eukaryotes are missing in either Crenarchaeota, Euryarchaeota or Thaumarchaeota  and may indicate a possible tendency of evolution by streamlining of some central molecular processes in the archaeal domain. Finally, one of us has recently proposed that a transition from RNA genomes to DNA genomes occurred independently in each of the three life domains by the contribution of three different DNA viruses to three complex RNA cells . The idea of different DNA viruses at the origin of Archaea and Eucarya sought to explain the existence of several critical differences in their DNA replication systems, including the ancestral presence of a Topo IB exclusively in Eucarya. Our finding that the last common ancestor of Archaea and Eucarya probably contained a Topo IB weakens this argument, and is more in favour of a DNA genome for this ancestor.
Review of Brochier-Armanet, gribaldo, and Forterre
'A DNA topoisomerase IB in Thaumarchaeota testifies for the presence of this enzyme in the last common ancestor of Archaea and Eukaryotes"
This is a very straightforward study of the Topo IB of Thaumarchaeota (formerly, mesophilic Crenarchaeota).demonstrating that the archaeal TopoIB clusters with the eukaryotic orthologs, at the base of the eukaryotic subtree. Combined with the fact that the archaeal and eukaryotic Topo IB proteins have similar domain organizations, these findings clearly demonstrate their monophyly.
1) I think, however, this is where the certainty stops. Indeed, I do not believe that the scenario with horizontal transfer of the eukaryotic Topo IB gene into the common ancestor of Thaumarchaeota can be considered rigorously falsified because it is hardly possible to rule out a dramatic acceleration of evolution after the transfer, resulting in the observed tree topology. PHyml is relatively robust to this sort of artifacts but there are inescapable limits. Ditto regarding the presence of Topo IB: the results of this work add credence to such a conclusion but alternatives based on horizontal gene transfer cannot be ruled out. I think the paper would become better balanced if these uncertainties were acknowledged, and the conclusions, especially, those at the end of the Abstract are toned down. In particular, the "support" of the Thaumaarcaheal rooting of the tree inferred from the phylogenetic analysis of this single gene is very weak, and it would be better to speak of the compatibility of the results with such rooting.
We think that the hypothesis of a HGT from present days eukaryotes to the ancestor of Thaumarchaeota is less likely than the hypothesis of the presence of a Topo IB gene in the ancestor of Eucarya and Archaea, followed by the loss of gene in the ancestor of Euryarchaeota/Crenarchaeota. However, as pointed out by referee two, we present both hypotheses and said carefully in the text that our phylogenetic analysis as the domain organisation of Topo IB homologues "strongly suggest".
Concerning the phylogenetic analyses, we used alternative methods to ML (as Bayesian methods), all the resulting trees strongly support the sister-grouping of Thaumarchaeota and Eucarya. We add this point in the text.
2) I also think that another adjustment, a less fundamental but, perhaps, even more badly needed one relates to the very "discovery" of the archaeal Topo IB. The protein sequence is very well conserved, so it is somewhat disingenuous to claim the finding of Topo IB as a discovery sensu strictu. The Cenarchaeum Topo IB is annotated in GenBank as such; it is another matter that the presence of this interesting gene in the Cenarchaeum genome is not highlighted in the primary paper (Hallam et al. PNAS 2006, 103: 18296) although "two topoisomerases" are mentioned. In any case, I do not think that it is proper to claim this finding in itself as a "discovery"; it would be much better to cite Hallam et al., and to explain the entire situation.
We cite the paper describing the genome of C. symbiosum and explain in the text, that one of the two DNA topoisomerases coding genes identified in the genome of C. symbiosum codes for a Topo IB, and that surprisingly no Topo IA coding gene was present in this genome.
Conversely, the ortholog from Nitrosopumilis is mistakenly annotated as some completely unrelated enzyme, and I think it is desirable to correct this (trivial) error. These corrections will not detract from the message of the present article but will make it better balanced.
We mention the fact that the gene coding for a putative homologue of TopoIB in N. maritimus was misannotated in this genome.
This succinct report presents a nice phylogenetic result that provides two important evolutionary insights. The first is that the identification of Topo IB topoisomerases within members of the recently proposed archaeal phylum Thaumarchaeota (together with a supporting phylogenetic analysis) indicates that a Topo IB enzyme was likely present in the common ancestor of eukaryotes and archaea. This potentially tells us two things. First, if the presence of Topo IB within archaea is restricted to the Thaumarchaea, it strengthens the view that this is a genuine phylum (as recently proposed by these authors – ref. ). In that paper, the authors presented evidence that the mesophilic archaeon, Crenarchaeum symbiosum did not group within the Crenarchaea, and that, in their trees, this species was likewise distinct from Euryarchaeota. If the basal position of Thaumarchaeota is correct, the implication is that Topo IB was lost early in archaeal evolution, prior to the divergence of Euryarchaea and Crenarchaea. While their results (in ref. ) indicated that C. symbiosum is basal to the archaeal tree, in the current paper, they nevertheless approach this with caution, and provide us with three different scenarios (Figure 1B) that serve as a valuable framework for evaluating the implications of the conservation of eukaryotic and archaeal Topo IB (the fourth, transfer from eukaryotes – their Figure 1B-a – can be ruled out on the phylogenetic results presented). Figure 1B is therefore a very welcome addition to this paper because it allows the reader to evaluate the data and phylogeny in Figure 1B with respect to several hypotheses. Too often we see only one possible hypothesis being presented (and one sometimes gets a sense that the analysis of the data in a particular way is a foregone conclusion), so it is nice to see that the authors have thought this through carefully, and are both aware of and open to the compexities of interpretation.
The second insight is that placement of this topoisomerase type in the common ancestor of archaea and eukaryotes strengthens the evidence that this ancestor had a DNA-based genome. This point might need brief explanation. While the naïve expectation is that DNA was present in the Last Universal Common Ancestor, the available comparative genomic data on enzymes involved in deoxyribonucleotide synthesis and DNA replication do not allow this conclusion to be readily drawn. In light of these conflicting data, Forterre recently proposed a model (ref. ) wherein each domain may have independently gained the capacity for DNA synthesis. The essence of the model (an arms race between cells and viruses) is very elegant, and invokes known processes (there are several cases where viruses are known to carry altered genomes – phage genomes with uracil instead of thymine, for example). It is exciting to see that the discovery of Thaumarchaeal Topo IB helps to improve our understanding of DNA origins in that its inclusion supports a less complex scenario (i.e. at most two independent gains).
CBA is the recipient of an Action Thématique et Incitative sur Programme (ATIP) of the French Centre National de la Recherche Scientifique. The work on DNA topoisomerases at the university Paris-Sud is supported by a grant from the Association de la Recherche contre le Cancer (ARC), PF is supported by funding from the Institut Universitaire de France (IUF)
- Champoux JJ: DNA topoisomerases: structure, function, and mechanism. Annu Rev Biochem. 2001, 70: 369-413. 10.1146/annurev.biochem.70.1.369.PubMedView ArticleGoogle Scholar
- Corbett KD, Berger JM: Structure, molecular mechanisms, and evolutionary relationships in DNA topoisomerases. Annu Rev Biophys Biomol Struct. 2004, 33: 95-118. 10.1146/annurev.biophys.33.110502.140357.PubMedView ArticleGoogle Scholar
- Wang JC: Cellular roles of DNA topoisomerases: a molecular perspective. Nat Rev Mol Cell Biol. 2002, 3: 430-440. 10.1038/nrm831.PubMedView ArticleGoogle Scholar
- Schoeffler AJ, Berger JM: DNA topoisomerases: harnessing and constraining energy to govern chromosome topology. Q Rev Biophys. 2008, 41: 41-101.PubMedView ArticleGoogle Scholar
- Forterre P: DNA topoisomerase V: a new fold of mysterious origin. Trends Biotechnol. 2006, 24: 245-247. 10.1016/j.tibtech.2006.04.006.PubMedView ArticleGoogle Scholar
- Forterre P, Gribaldo S, Gadelle D, Serre MC: Origin and evolution of DNA topoisomerases. Biochimie. 2007, 89: 427-446. 10.1016/j.biochi.2006.12.009.PubMedView ArticleGoogle Scholar
- Gadelle D, Filee J, Buhler C, Forterre P: Phylogenomics of type II DNA topoisomerases. Bioessays. 2003, 25: 232-242. 10.1002/bies.10245.PubMedView ArticleGoogle Scholar
- Aravind L, Leipe DD, Koonin EV: Toprim – a conserved catalytic domain in type IA and II topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. Nucleic Acids Res. 1998, 26: 4205-4213. 10.1093/nar/26.18.4205.PubMedPubMed CentralView ArticleGoogle Scholar
- Cheng C, Kussie P, Pavletich N, Shuman S: Conservation of structure and mechanism between eukaryotic topoisomerase I and site-specific recombinases. Cell. 1998, 92: 841-850. 10.1016/S0092-8674(00)81411-7.PubMedView ArticleGoogle Scholar
- Taneja B, Patel A, Slesarev A, Mondragon A: Structure of the N-terminal fragment of topoisomerase V reveals a new family of topoisomerases. Embo J. 2006, 25: 398-408. 10.1038/sj.emboj.7600922.PubMedPubMed CentralView ArticleGoogle Scholar
- Benarroch D, Claverie JM, Raoult D, Shuman S: Characterization of mimivirus DNA topoisomerase IB suggests horizontal gene transfer between eukaryal viruses and bacteria. J Virol. 2006, 80: 314-321. 10.1128/JVI.80.1.314-321.2006.PubMedPubMed CentralView ArticleGoogle Scholar
- Champoux JJ, Dulbecco R: An activity from mammalian cells that untwists superhelical DNA – a possible swivel for DNA replication (polyoma-ethidium bromide-mouse-embryo cells-dye binding assay). Proc Natl Acad Sci USA. 1972, 69: 143-146. 10.1073/pnas.69.1.143.PubMedPubMed CentralView ArticleGoogle Scholar
- Garinther WI, Schultz MC: Topoisomerase function during replication-independent chromatin assembly in yeast. Mol Cell Biol. 1997, 17: 3520-3526.PubMedPubMed CentralView ArticleGoogle Scholar
- Kim RA, Wang JC: Function of DNA topoisomerases as replication swivels in Saccharomyces cerevisiae. J Mol Biol. 1989, 208: 257-267. 10.1016/0022-2836(89)90387-2.PubMedView ArticleGoogle Scholar
- Brill SJ, DiNardo S, Voelkel-Meiman K, Sternglanz R: Need for DNA topoisomerase activity as a swivel for DNA replication for transcription of ribosomal RNA. Nature. 1987, 326: 414-416. 10.1038/326414a0.PubMedView ArticleGoogle Scholar
- Liu LF, Desai SD, Li TK, Mao Y, Sun M, Sim SP: Mechanism of action of camptothecin. Ann N Y Acad Sci. 2000, 922: 1-10.PubMedView ArticleGoogle Scholar
- Bauer WR, Ressner EC, Kates J, Patzke JV: A DNA nicking-closing enzyme encapsidated in vaccinia virus: partial purification and properties. Proc Natl Acad Sci USA. 1977, 74: 1841-1845. 10.1073/pnas.74.5.1841.PubMedPubMed CentralView ArticleGoogle Scholar
- Shuman S: Vaccinia virus DNA topoisomerase: a model eukaryotic type IB enzyme. Biochim Biophys Acta. 1998, 1400: 321-337.PubMedView ArticleGoogle Scholar
- Krogh BO, Shuman S: Catalytic mechanism of DNA topoisomerase IB. Mol Cell. 2000, 5: 1035-1041. 10.1016/S1097-2765(00)80268-3.PubMedView ArticleGoogle Scholar
- Tian L, Shuman S: Vaccinia topoisomerase mutants illuminate roles for Phe59, Gly73, Gln69 and Phe215. Virology. 2007, 359: 466-476. 10.1016/j.virol.2006.08.056.PubMedView ArticleGoogle Scholar
- Osheroff N: Unraveling the structure of the variola topoisomerase IB-DNA complex: a possible new twist on smallpox therapy. Mol Interv. 2006, 6: 245-248. 10.1124/mi.6.5.4.PubMedView ArticleGoogle Scholar
- Krogh BO, Shuman S: A poxvirus-like type IB topoisomerase family in bacteria. Proc Natl Acad Sci USA. 2002, 99: 1853-1858. 10.1073/pnas.032613199.PubMedPubMed CentralView ArticleGoogle Scholar
- Hallam SJ, Konstantinidis KT, Putnam N, Schleper C, Watanabe Y, Sugahara J, Preston C, de la Torre J, Richardson PM, DeLong EF: Genomic analysis of the uncultivated marine crenarchaeote Cenarchaeum symbiosum. Proc Natl Acad Sci USA. 2006, 103: 18296-18301. 10.1073/pnas.0608549103.PubMedPubMed CentralView ArticleGoogle Scholar
- Brochier-Armanet C, Boussau B, Gribaldo S, Forterre P: Mesophilic Crenarchaeota: proposal for a third archaeal phylum, the Thaumarchaeota. Nat Rev Microbiol. 2008, 6: 245-252. 10.1038/nrmicro1852.PubMedView ArticleGoogle Scholar
- Konneke M, Bernhard AE, de la Torre JR, Walker CB, Waterbury JB, Stahl DA: Isolation of an autotrophic ammonia-oxidizing marine archaeon. Nature. 2005, 437: 543-546. 10.1038/nature03911.PubMedView ArticleGoogle Scholar
- Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, et al: Environmental genome shotgun sequencing of the Sargasso Sea. Science. 2004, 304: 66-74. 10.1126/science.1093857.PubMedView ArticleGoogle Scholar
- Makarova KS, Sorokin AV, Novichkov PS, Wolf YI, Koonin EV: Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea. Biol Direct. 2007, 2: 33-10.1186/1745-6150-2-33.PubMedPubMed CentralView ArticleGoogle Scholar
- Kwapisz M, Beckouet F, Thuriaux P: Early evolution of eukaryotic DNA-dependent RNA polymerases. Trends Genet. 2008, 24: 211-215. 10.1016/j.tig.2008.02.002.PubMedView ArticleGoogle Scholar
- Lecompte O, Ripp R, Thierry JC, Moras D, Poch O: Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale. Nucleic Acids Res. 2002, 30: 5382-5390. 10.1093/nar/gkf693.PubMedPubMed CentralView ArticleGoogle Scholar
- Forterre P: Three RNA cells for ribosomal lineages and three DNA viruses to replicate their genomes: a hypothesis for the origin of cellular domain. Proc Natl Acad Sci USA. 2006, 103: 3669-3674. 10.1073/pnas.0510333103.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.