Skip to main content

Are we degenerate tetraploids? More genomes, new facts



Within the bilaterians, the appearance and evolution of vertebrates is accompanied by enormous changes in anatomical, morphological and developmental features. This evolution of increased complexity has been associated with two genome duplications (2R hypothesis) at the origin of vertebrates. However, in spite of extensive debate the validity of the 2R hypothesis remains controversial. The paucity of sequence data in early years of genomic era was an intrinsic obstacle in tracking the genome evolutionary history of chordates.


In this article I review the 2R hypothesis by taking into account the recent availability of genomic sequence data for an expanding range of animals. I argue here that genetic architecture of lower metazoans and representatives of major vertebrate and invertebrate lineages provides no support for the hypothesis relating the origin of vertebrates with widespread gene or genome duplications.


It appears that much of the genomic complexity of modern vertebrates is very ancient likely predating the origin of chordates or even the Bilaterian-Nonbilaterian divergence. The origin and evolution of vertebrates is partly accompanied by an increase in gene number. However, neither can we take this subtle increase in gene number as an only causative factor for evolution of phenotypic complexity in modern vertebrates nor we can take it as a reflection of polyplodization events early in their history.


This article was reviewed by Eugene Koonin, Joshua Cherry (nominated by David Lipman), and Jerzy Jurka.


To explain the genetic basis of major transitions in organismal evolution, in 1970 Susumu Ohno famously proposed that multiple rounds of whole genome duplications (2R hypothesis) had occurred during the early history of vertebrate lineage, driving the evolution of developmental and morphological complexity in vertebrates [1, 2]. Ohno,s idea was based solely on genome size differences, chromosomal topologies, and recent tetraploidization events in some fish and amphibians. Over the past decade the 2R hypothesis has gained extensive popularity among evolutionary and developmental biologist.

Proponents presented several lines of evidence in favor of entire genome duplication hypothesis in the early vertebrates. First, compared to model invertebrate genomes (fruit fly, nematode, sea squirt and amphioxus) the typical vertebrate genome possess more genes [3]. Second, the existence of paralogons in the human genome [4, 5]. Third, the conservation of gene synteny throughout vertebrates and their invertebrate ancestors and the spread of these anciently conserved syntenic fragments among multiple vertebrate chromosomes (vertebrate paralogons) [6, 7]. Fourth, the refinement of extensive gene duplication events early in vertebrate history through molecular-clock based approaches (absolute dating) [8, 9]. Fifth, the extrapolation of genome evolution scenarios in plant and yeast to genome evolution events in vertebrates [1012]. A further piece of evidence in favor of two rounds of whole genome duplication hypothesis emerges from the observation that protostome invertebrates (fruit fly) and deuterostome cephalochordate amphioxus possess single HOX cluster whereas the vertebrates have four or more clusters [1315]. Impressed by the fascination associated with this hypothesis some researchers have gone so far as to state " there is now incontrovertible evidence supporting the 2R hypothesis"[16].

Opponents of the 2R hypothesis argued that the current data is not compelling evidence of polyploidization and increase in the number of paralogous genes in vertebrates occurred as a result of small scale gene duplication events involving single genes and chromosomal segments, scattered at different times during the history of life [1721].

In spite of extensive empirical scrutiny, the 2R remains controversial. The intrinsic difficulty was the lack of broad phylogenetic representation in the available sequence data. However, the recent availability of sequence data from an expanding range of vertebrate and invertebrate species from interspersed time points (Figure 1) has provided an unprecedented opportunity to test two main competing hypotheses by employing the deepest and more diverse sampling of organisms.

Figure 1
figure 1

A phylogeny of animals. Evolutionary relationship, numbers of genes and divergence times of selected animals whose genomes have been sequenced. Along with the increasing overall gene number, the occurrence of single HOX gene cluster in invertebrate genome and four or more clusters in vertebrate genome is taken as evidence for two rounds of whole genome duplication at the base of vertebrate lineage (2R hypothesis). The double headed arrow underneath shows the proposed time windows of genome amplification events. Features not drawn to scale.

Presentation of the hypothesis

Rapid paralogous gene increase in the early stage of vertebrate evolution

The recently sequenced genomes for two morphologically simple invertebrates, sea anemone (a cnidarian model) and sea urchin (nonchordate deuterostome) provides a unique perspective on the evolution of animal genomes [22, 23]. The genomic complexity of sea anemone and sea urchin, in terms of gene contents and structure, suggests that many features of the genetic toolkit seen in modern vertebrates existed in the ancestral deuterostome genome or even in the common ancestor of bilaterians-nonbilaterians animals (Figure 1). Surprisingly this ancestral animal gene set has undergone extensive contraction in many protostomes and nonvertebrate chordate animals. Many gene families that were expanded deep in the history of life (before the bilaterian-nonbilaterian divergence) and whose descendants survive in modern vertebrates, appeared to be absent in model protostomes (fruit fly, nematode) and chordates (amphioxus, Ciona). For instance, Nematostella vectensis (sea anemone) which is a model animal from the basal metazoan phylum Cnidaria shares 11 of 12 Wnt genes found in humans, while fruit fly shares only 6 of 12 [24]. Similarly, the number of orthologous genes between the ascidian (nonvertebrate chordate) and the modern vertebrates are less than the number estimated between the vertebrates and the sea urchin (simple deuterostome) [23]. Inferring gene/genome duplication events at the base of vertebrate lineage, by using any of these highly derived invertebrate model outgroups, including amphioxus which was previously assumed as a good representative of ancestral chordate genome, would thus be problematic. For instance, defining the duplication time windows for human gene families (under the 1R/2R hypothesis) by using an invertebrate outgroup such as amphioxus/ciona or fruit fly could falsely place many of those gene expansion events that might have occurred deeper in the life history, at the base of vertebrate lineage [9, 25]. This sort of data could inaccurately show a rapid paralogous gene increase in the early stage of vertebrate evolution and could also lead to a misleading conclusion of big-bang events (genome duplications).

Morphological complexity and gene number

The homeobox genes are key components of developmental toolkit of animals, and it has been suggested that functional diversification of these genes through duplication and divergence was pivotal in the evolution of complex developmental and morphological features in bilaterian animals. If the origin of particular homeobox gene is in actuality associated with the evolution of particular animal trait, in that case one should expect an increase in the homeobox toolkit repertoire during evolutionary transitions from simple to complex life forms, like from basal diploblastic animals to triploblastic life forms, protostome to deuterostome and from simple chordates to modern vertebrates. However, recent data shows that all the major classes of homeodomain coding genes have undergone significant radiation deep in animal evolution, before the divergence of Cnidaria and Bilateria [26]. Surprisingly, the genome of morphologically simple nonbilaterian, the sea anemone possesses significantly more homeobox genes than the morphologically and developmentally complex fruit fly.

Furthermore, a fundamental problem with the hypothesis linking the sudden appearance of complex morphological traits in vertebrates with whole genome/extensive gene duplication events is that in defining the evolutionary steps in the phenotypic complexity, the extinct lineages were completely ignored. However, when fossils are taken into account, jumps in morphological complexity disappear and the proposed whole genome duplication in vertebrate history no longer is correlated with the origin of body plans [27]. Thus it appears that a correlation between the big jumps in the morphological complexity and proposed whole genome duplication in vertebrates is an artifact of incomplete taxonomic sampling.

Given the fact that there is no apparent relationship between the gene number and organismal complexity (Figure 1), the assumption that modern vertebrates have attained higher level of complexity in their body plans by expanding their genetic toolkit through big-bangs of genome amplification events is not credible. Instead the most plausible explanation of organismal complexity is to deploy each gene with particular developmental function at many times and places during development. This increase in the diversity of gene function may involve an increase in the number of transcription factors or an increase in the number of cis-acting regulatory elements that control region-specific expression of genes[28]. This sort of regulatory evolution creates many-fold increase in the complexity of developmental processes without drastic expansion of gene number [2932].

Paralogy regions in the human genome

The distinct chromosomal regions within a genome that contains a set of similar genes are known as paralogy regions or paralogons. Very often the genetic constitution of these intra-genomic homology blocks remains conserved across distantly related genomes (conserved synteny) [6, 7, 33]. The occurrence of two-fold, three-fold and four-fold paralogy regions in human and other vertebrates and their co-occurrence in distinct genomes is taken as an evidence that ancestral genome of modern vertebrates was shaped by two rounds of polyploidization (2R hypothesis) [4, 6, 25, 3436]. Alternative hypothesis suggests that the constituent multigene families of these anciently conserved vertebrate paralogy regions arose as a result of small scale duplications involving gene clusters and chromosomal segments which occurred at different time points during chordate evolution [17, 3739]. Whereas the spread of similar gene sets on distinct vertebrate chromosomes is the consequence of subsequent rearrangement events[17, 40]. The conservation of these blocks of genes both within and between species may reflect selective reasons [37]. For instance; to have genes of similar expression or function in the physical proximity on a chromosomal segment.

In fact, the occurrence of two-fold paralogy regions can be explained by many alternative explanatory scenarios, whereas the three and four-fold paralogy regions may be indicative of two rounds of ancient whole genome or whole chromosome duplication events. However, given the large size of human and other vertebrate chromosomes the fraction of gene families having three or four representatives on four-fold paralogy regions is very small [41]. For instance, human HOX cluster bearing chromosomes (Hsa2, 7, 12 and 17) contains in total ~4609 known coding genes (NCBI 36 assembly of human genome), whereas the HOX cluster paralogon harbors only 11 known multigene families with at least three of their members anciently linked to HOX clusters [17]. This illustrates that spatial organization of ~0.95% of coding contents of human HOX cluster bearing chromosomes favors the structuring of these chromosome through ancient tetraploidization events, whereas the history of remaining portion may not be in harmony with this scenario. Thus it is not plausible to take the presence of few similar gene sets on three or four human chromosomes as evidence that the chromosomes are related by large duplication.

Moreover, for identifying ancient paralogons in the vertebrate genomes some researchers have combined the map-self comparison approach with a comparison with a pre-duplication species. In these studies the coverage and distribution pattern of old paralogs [5] or anciently conserved syntenic fragments [6] in the human and other vertebrate genomes was subjected to statistical analysis to rule out the null hypothesis: that the constituent gene families of vertebrate paralogons have arisen by distinct duplication events and their members were brought together in three or four collinear regions on different chromosomes as a result of rearrangement of genomic segments including multiple contiguous genes [17, 37]. However, it is advisable to recognize that the statistical validation of distribution of old paralogs or anciently conserved (vertebrate-invertebrate) syntenic segments among multiple vertebrate chromosomes does not constitute the evidence for the mechanism of origin of paralogons. Therefore, special care should be taken in interpreting the sheer map distribution of a subset of ancient vertebrate genes as an illustration of polyploidization in vertebrate early evolutionary history.

The precise nature of those events that has created ancient paralogy regions (> 450 Mya) of vertebrate genome is hard to track because such events are obscured by long term evolutionary divergence and rearrangements. However, an important insight into these ancient events can be gained by tracing the genome evolutionary scenario of very recently diverged species. For instance, the availability of genome sequence data for multiple primate species has shown that that our own genetic material is expanded by ~5% during the past 35–40 million years (divergence of New and Old World monkeys) of evolution through segmental duplications (SDs) and rearrangement events [42, 43]. These recent human segmental duplications are large nearly identical copies of genomic DNA, ranging from 300 kb to 1 Mb in size, and are present in at least two genomic locations [44]. Detailed analysis has attributed several roles to SD events: creation of new genes in primates, expansion of multigene families, and triggering large scale chromosomal rearrangements in hominoid [44]. Huge impact of recent SDs in architecturing the primate genome over the short period of evolutionary time, apparently support the notion that small scale duplications and rearrangements might have remained a predominant mechanism in shaping the vertebrate genome throughout their evolutionary history [37, 38].

Now the question is that, what kept the similar set of genes together on distinct chromosomal regions or in distinct genomes? The preservation of intra-genomic and inter-genomic synteny over longer period of evolutionary time is unlikely to be the result of chance. Empirical data strongly support the notion that, there are adaptive reasons to keep certain configurations of genes together over evolutionary time [37]. For instance, higher order structural organization of chromosomes, might help in co-regulation of particular sets of neighboring genes in a coordinated manner by recruiting them to shared regions of gene expression [45]. Similarly, the gene regulatory elements spread across long regions impose critical constraint on genomic architecture and are known to have maintained exceptionally long syntenic blocks both within and across species [4648].

HOX cluster duplication and the history of vertebrate genome evolution

HOX family of homeobox containing transcriptional factors lies in adjacent positions and forms a cluster along a chromosome in animal genomes. All invertebrates including fruit fly and amphioxus have only one HOX gene cluster whereas all vertebrates possess more than one set of HOX clusters [49]. The fact that human and other mammals possess four coherent HOX gene clusters whereas their closest invertebrate relative amphioxus contain just a single set of collinear HOX genes, is taken as evidence that the origin of vertebrates coincided with two rounds of whole genome duplication events [10, 50]. Undoubtedly the HOX gene cluster duplication occurred specifically on the vertebrate lineage [13, 51]. However, conclusions regarding HOX gene clusters alone cannot to be extended to define the evolutionary history of entire genome. In fact the evolutionary history of human HOX clusters itself support the notion that they originated by regional duplication. Zhang and Nei [52] analyzed the phylogeny of mammalian HOX clusters and proposed two alternative topologies (((HOXC HOXD) HOXA) HOXB) and ((HOXC HOXD) (HOXA HOXB)). The former topology suggests three separate regional duplication steps (1→2→4 HOX clusters) whereas the later favors two rounds of whole genome duplication events (2 + 2 topology). Analysis of COL and ERBB multigene families, whose members are closely linked to four human HOX clusters, did not resolve the cluster duplication events decisively [37, 53]. The representatives of human SP gene family are very closely spaced to HOX clusters, with human SP1 gene mapping at approximately 526 kb centromeric to HOXC, SP2; at ~614 kb centromeric to HOXB, SP3; at ~2 Mb centromeric to HOXD and SP4; at ~5 Mb telomeric to HOXA (Figure 2). The close physical linkage of SP family members with each of human HOX clusters makes them an interesting test case to evaluate the HOX cluster duplication history. Thorough phylogenetic analysis of SP family members (employing broad taxonomic sampling) has revealed that SP1, SP2, SP3 and SP4 genes share their duplication history with HOX clusters and have arisen together with the HOX clusters through three separate segmental duplication steps [17] (Figure 2). Thus congruency in the phylogenies of human HOX clusters and SP genes not only resolved the cluster duplication events but also decisively rejected the assumption that four HOX clusters in tetrapods are the result of two rounds of whole genome duplication early in vertebrate history (Figure 2).

Figure 2
figure 2

Evolutionary history of Human HOX clusters and closely linked SP family members. Close physical linkage and congruency in the phylogenies of human SP gene family members and HOX clusters supports the view that vertebrate HOX gene clusters arose as a result of small scale gene duplication events involving chromosomal segments or gene clusters. 1d-3d, three rounds of segmental duplication. Features not drawn to scale.

Testing the hypothesis

I have questioned the validity of arguments adduced in support of Ohno's view that the complexity of vertebrate genomes originated by means of whole genome duplications. Instead, the hypothesis presented here suggests that the animal genomes evolved through small-scale duplication events scattered at different times over the history of life. This hypothesis is testable in the light of currently available extensive genomic data form an expanding range of vertebrate and invertebrate animals: 1) through comparison of total gene number and gene family size differences among modern vertebrate and basal invertebrate (for instance, sea anemone/sea urchin) genomes; 2) by estimating the fraction of vertebrate chromosomes occupied by three or four fold paralogy regions; 3) by conducting the phylogenetic analysis of multigene families with three or more of their representatives residing on four-fold paralogy regions (paralogons) in the human genome; 4) by estimating the genome evolutionary scenario of very recently diverged vertebrate species (for instance, primates among vertebrates). Evaluating the correlation between the organismal complexity and gene duplications may also be helpful in testing the validity of two main competing hypotheses [27] (small-scale duplication versus 2R hypothesis).

Very often the statistical evaluation of vertebrate genome evolutionary history is based on comparative data from few vertebrate and highly derived invertebrate genomes and thus could inadvertently lead to unfounded conclusions. Therefore, I propose that broadening of taxonomic sampling in statistical approaches by employing the currently available extensive genomic data, in particular from well preserved basal metazoan genomes and recently diverged vertebrate species, would provide a much clearer picture of animal genome evolution.

Implications of the hypothesis

Initial attempts to unravel the evolutionary aspects of our own genome have borne out highly controversial results. Some has suggested that drastic events (2 rounds of whole genome duplication/2R hypothesis) at the base of vertebrate lineage led to the greater genetic complexity in modern vertebrate genome whereas other researchers rejected the 2R hypothesis at all and suggested a continuous mode of small scale duplications (segmental/gene cluster). These alternative scenarios of vertebrate genome evolution are largely based on data from human and few other vertebrate and invertebrate genomes. The recent availability of additional vertebrate and invertebrate genomes has provided an unprecedented insight into the core evolutionary processes that had shaped our genome, deep in the history of life. Surveying of newly sequenced genomes from the deepest branches of life has revealed that many components of the genetic toolkit seen in modern vertebrates arose and diversified deep in animal history even before the origin of chordates. Indeed the origin and evolution of vertebrates is partly accompanied by an increase in gene number, for instance one coherent HOX cluster in amphioxus like invertebrate to four or more HOX clusters in modern vertebrates. However, neither can we take this subtle increase in gene number as an only causative factor for evolution of phenotypic complexity in modern vertebrates nor we can take it as a reflection of whole genome duplication events early in their history. In depth analysis of the genomic data from recently diverged primate species on markedly different phenotypic trajectories provides valuable clues to ancient genomic events. These data supports the notion that small scale duplications and rearrangements have remained a pervasive phenomenon, driving the vertebrate evolution both at phenotypic and genotypic level, throughout their history. I conclude, therefore, that the comparative genomic data from species that diverged early in metazoan evolution (such as Cnidarian-bilaterian) as well as from the very recently diverged animals (such as primates among vertebrates) provides no evidence in favor of the ancestral tetraploidy in vertebrates. In fact it appears that the 2R hypothesis is an artifact, invoked by the lack of phylogenetic breadth in the genome sequence data in early years of genomic era.

Reviewers' Comments

Reviewer report 1

Dr. Eugene V. Koonin

National Center for Biotechnology Information

National Library of Medicine

National Institute of Health

Bethesda, USA.

This is a critical overview of the 2R hypothesis (two rounds of whole genome duplication) on the origin of vertebrates. The conclusion that, to a large extent, is based on the unexpected genomic complexity of organizationally simple animals, such as sea anemone, and on the modest number of 2-fold and, particularly, 4-fold paralogons in vertebrate genomes, is that there is currently no basis to accept the 2R hypothesis. Instead, it is proposed that the vertebrate genome evolved by relatively small, regional duplications.

This is an old controversy to which this paper does not add any new analysis, only discussion, and that, in my opinion, somewhat perfunctory. As I see the situation, the jury is still out with regard to the 2R. It is important to realize that the 2R hypothesis has gone a long way since the days it was first proposed by Ohno. The 2R hypothesis now claims support from the comparisons of the gene order in the vertebrate paralogons and in Amphioxus, i.e., the duplications in vertebrates appear to be synteny-preserving rather than synteny-disrupting [6]. Even more importantly, perhaps, a global analysis of the distribution of old paralogs in vertebrate genomes has been claimed to support 2R [5, 16]. Thus, 2R is not sheer speculation, considerable effort has been undertaken to test this hypothesis, and there are strong claims of evidence that consistently supports it.

Personally, I have a certain epistemological sympathy for the position taken in this paper in the sense that I believe that, as a matter of principle, the piecemeal duplication model should be the null hypothesis of (in this case, vertebrate) genome evolution that has to be falsified in favor of WGD scenarios. I doubt that the current statistical argument for such falsification is overwhelming so that 2R is to be accepted as the final verdict. However, I also think that the evidence in support of 2R is rather diverse and rather substantial, so it needs to be addressed seriously rather than summarily dismissed, primarily, on the basis of the high complexity of primitive metazoan genomes which is not a logically consistent argument against 2R. I believe that, for a really critical assessment of the 2R hypothesis, the evolutionary genomics literature, and in particular, the evidence claimed in support of 2R should be examined in considerably greater detail and more carefully, with special attention to the underlying assumptions of the statistical models employed in the respective studies.

Author Response

I am thankful to Dr. Koonin for his comments on this manuscript.

1. It is advisable to recognize that the statistical support for the spread of old paralogs or anciently conserved (vertebrate-invertebrate) syntenic fragments among multiple vertebrate chromosomes [5, 6] does not constitute the evidence for the mechanism of origin of vertebrate paralogy regions. I propose therefore, that, special care should be taken in interpreting the sheer map distribution of a subset of ancient vertebrate genes as a strong support of polyploidization in vertebrate early evolutionary history.

2. In this article I do not intend to challenge the statistical models describing the vertebrate genome evolutionary events. My purpose here is to highlight the fact that well preserved genetic architecture of basal metazoans and comparative analysis of primate genomes (or any other group of vertebrates with comparable relatedness) casts serious doubt on the plausibility of the 2R hypothesis. Instead the recently sequenced genomes of animals from interspersed time points clearly shows that much of the genomic complexity seen in the modern vertebrates is very ancient than was previously anticipated (by 2R proponents). It appears that vertebrates had accomplished this genomic complexity through piecemeal duplications at widely different times over the evolution of life.

Statistical testing of vertebrate genome evolutionary scenarios is often based on comparative observations from few vertebrate and highly derived invertebrate genomes, and thus could inadvertently lead to unfounded conclusions. Therefore, I recommend that future statistical approaches to test hypothesis concerning vertebrate genome evolution, should take into account the newly sequenced genomes of basal metazoan animals and recently diverged vertebrate species (for instance primates).

3. In the light of your comments I have considerably expanded the survey of evolutionary genomics literature.

Reviewer report 2

Dr. Jerzy Jurka

President & Director

Genetic Information Research Institute

Mountain View, USA.

The author presents a critical review of the so-called "Ohno's hypothesis" or "2R hypothesis" postulating that the early vertebrate lineage underwent one or more complete genome duplications. The author argues that the genome sequence data do not support the 2R hypothesis. While I am not sure if the 2R hypothesis is falsifiable based on genomic data, I would support this publication if the author could include discussion of a recent paper in favor of the 2R hypothesis.

Masanori Kasahara, "The 2R hypothesis: an update", "Current Opinion in Immunology" (2007), doi:10.1016/j.coi.2007.07.009

Author response

I am thankful to Dr. Jurka for reviewing this manuscript.

In the revised manuscript, by keeping in view the suggestion of Dr. Jurka, I included the recent paper from Kasahara M. (2007) [16] and other articles favoring 2R.

Reviewer report 3

Dr. Joshua L. Cherry

National Center for Biotechnology Information

National Library of Medicine

National Institutes of Health

Bethesda, USA.

Nominated by David J Lipman, National Center for Biotechnology Information, NIH, Bethesda, USA.

This review article assesses the hypothesis of two whole-genome duplications in vertebrate evolution in light of recent sequence data. I agree with the article's conclusion that there is no good reason to believe this hypothesis. I have a few comments about some of the arguments presented and the implications of some of the language used.

I found the role of the SP gene family in resolving the history of HOX clusters to be unclear. In fact the argument is more tenuous than the discussion would suggest. How can phylogenetic analysis of SP reveal both that the SP genes "share their evolutionary history with HOX clusters" and that HOX genes have "arisenthrough three separate segmental duplication steps"? Knowing that the evolutionary histories are the same would presumably entail knowing the phylogeny of HOX, so that the SP phylogeny would provide no additional information about HOX. It is in fact simply assumed that the candidate HOX phylogeny that agrees with the SP phylogeny is the correct one. This is possible, but if other linked paralogs have different histories, as suggested by the cited references, it is far from certain.

I would add that it is too strong to say that the alternative rooted topology, ((HOXC HOXD) (HOXA HOXB)), "favors two rounds of whole genome duplication events". This topology is consistent with 2R, but also with three local duplication events. In the absence of whole-genome duplications it would not be surprising to find some sets of paralogs with this type of topology.

I am uncertain of the meaning, in paragraph 3 of Paralogy Regions in the Human Genome, of "those vertebrate species that have recent evolutionary origin." Because the species analyzed are primates, this might be taken to imply, incorrectly, that humans and our closest relatives are more recently evolved than other organisms and that evolution is a thing of the more distant past for other groups. Comparative analysis of primates can of course yield valuable information, but the same role could be played here by any other group of vertebrate species with comparable relatedness. Other expressions in the manuscript also suggest a ladder-like view of evolution, even if that is not the author's intent: "genetic architecture of deepest as well as most recent branches of animals" (Abstract); "evolutionary basal invertebrate" (paragraph 1 of Rapid Paralogous Gene Increase); "an ideal chordate ancestral genome" (same paragraph); "the deepest branches of life" (final paragraph).

Author response

I am grateful to Dr. Cherry for valuable comments and useful suggestions on this manuscript.

1. The most parsimonious explanation of the order of branching in HOX cluster and closely linked SP phylogenies is that, both of these gene families arose simultaneously (co-duplicated group) through three independent duplication steps (Figure 2). Other genes families (having three or four members linked to HOX clusters) in the HOX cluster paralogons, e.g. ERBB, COL, GLI, HH, SLC4A and others have recently been resolved into four discrete co-duplicated groups [17]. It has been shown that genes within each of these co-duplicated groups (of HOX cluster paralogons) are duplicated in concert with each other whereas the constituent genes of two different co-duplicated groups may not have duplicated simultaneously [17]. This observation is contrary to 2R scenario, which assumed that constituent gene families of HOX cluster paralogons arose simultaneously through two rounds of WGD.

2. I must agree that the SP phylogeny helps in understanding the phylogeny (duplication history) of HOX and would provide no additional information about HOX evolutionary history. Therefore, I replaced the term "share their evolutionary history" with the "share their duplication history" (HOX cluster duplication and the history of vertebrate genome evolution).

3. I must also agree with Dr. Cherry's argument that in the absence of whole-Genome duplications it would not be surprising to find some sets of paralogs with (AB)(CD) type topology.

4. In the revised manuscript I tried to erase a ladder-like view of evolution and have used precise terms to explain the evolutionary relatedness among animals.

5. Minor issues were also addressed in the revised manuscript.


  1. Ohno S: Evolution by gene duplication. 1970, Heidelberg: Springer-Verlag

    Chapter  Google Scholar 

  2. Ohno S: Ancient linkage groups and frozen accidents. Nature. 1973, 244: 259-262. 10.1038/244259a0.

    Article  Google Scholar 

  3. Sidow A: Gen(om)e duplications in the evolution of early vertebrates. Curr Opin Genet Dev. 1996, 6: 715-722. 10.1016/S0959-437X(96)80026-8.

    Article  PubMed  CAS  Google Scholar 

  4. Lundin LG, Larhammar D, Hallbook F: Numerous groups of chromosomal regional paralogies strongly indicate two genome doublings at the root of the vertebrates. J Struct Funct Genomics. 2003, 3: 53-63. 10.1023/A:1022600813840.

    Article  PubMed  CAS  Google Scholar 

  5. Dehal P, Boore JL: Two rounds of whole genome duplication in the ancestral vertebrate. Plos Biology. 2005, 3: e314-10.1371/journal.pbio.0030314.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Hufton AL, Groth D, Vingron M, Lehrach H, Poustka AJ, Panopoulou G: Early vertebrate whole genome duplications were predated by a period of intense genome rearrangement. Genome Res. 2008, 18: 1582-1591. 10.1101/gr.080119.108.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  7. Putnam NH, Butts T, Ferrier DEK, Furlong RF, Hellsten U, Kawashima T, Robinson-rechavi M, Shoguchi E, Terry A, Yu J-K, et al: The amphioxus genome and the evolution of the chordate karyotype. Nature. 2008, 453: 1064-1072. 10.1038/nature06967.

    Article  PubMed  CAS  Google Scholar 

  8. Panopoulou G, Hennig S, Groth D, Krause A, Poustka AJ, Herwig R, Vingron M, Lehrach H: New evidence for genome-wide duplications at the origin of vertebrates using an amphioxus gene set and completed animal genomes. Genome Res. 2003, 13: 1056-1066. 10.1101/gr.874803.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Vandepoele K, De Vos W, Taylor JS, Meyer A, Peer Van de Y: Major events in the genome evolution of vertebrates: paranome age and size differ considerably between ray-finned fishes and land vertebrates. Proc Natl Acad Sci USA. 2004, 101: 1638-1643. 10.1073/pnas.0307968100.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  10. Panopoulou G, Poustka AJ: Timing and mechanism of ancient vertebrate genome duplications – the adventure of a hypothesis. Trends Genet. 2005, 21: 559-567. 10.1016/j.tig.2005.08.004.

    Article  PubMed  CAS  Google Scholar 

  11. Kellis M, Birren BW, Lander ES: Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature. 2004, 428: 617-624. 10.1038/nature02424.

    Article  PubMed  CAS  Google Scholar 

  12. Bowers JE, Champon BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosome duplication events. Nature. 2003, 422: 433-438. 10.1038/nature01521.

    Article  PubMed  CAS  Google Scholar 

  13. Garcia-Fernandez J, Holland PW: Archetypal organization of the amphioxus Hox gene cluster. Nature. 1994, 370: 563-566. 10.1038/370563a0.

    Article  PubMed  CAS  Google Scholar 

  14. Amores A, Suzuki T, Yan YL, Pomeroy J, Singer A, Amemiya C, Postlethwait JH: Developmental Roles of Pufferfish Hox Clusters and Genome Evolution in the Ray-Fin Fish. Genome Res. 2004, 14: 1-10. 10.1101/gr.1717804.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  15. Pollard SL, Holland PW: Evidence for 14 homeobox gene clusters in human genome ancestry. Current Biology. 2000, 10: 1059-1062. 10.1016/S0960-9822(00)00676-X.

    Article  PubMed  CAS  Google Scholar 

  16. Kasahara M: The 2R hypothesis: an update. Curr Opin Immunol. 2007, 19: 547-552. 10.1016/j.coi.2007.07.009.

    Article  PubMed  CAS  Google Scholar 

  17. Abbasi AA, Grzeschik KH: An insight into the phylogenetic history of HOX linked gene families in vertebrates. BMC Evol Biol. 2007, 7: 239-10.1186/1471-2148-7-239.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Hughes AL, Friedman R: 2R or not 2R: testing hypotheses of genome duplication in early vertebrates. J Struct Funct Genomics. 2003, 3: 85-93. 10.1023/A:1022681600462.

    Article  PubMed  CAS  Google Scholar 

  19. Hughes AL: Phylogenies of developmentally important proteins do not support the hypothesis of two rounds of genome duplication early in vertebrate history. J Mol Evol. 1999, 48: 565-576. 10.1007/PL00006499.

    Article  PubMed  CAS  Google Scholar 

  20. Cotton JA, Page RD: The shape of human gene family phylogenies. BMC Evolutionary Biology. 2006, 6: 66-10.1186/1471-2148-6-66.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Friedman R, Hughes AL: Two pattern of Genome Organization in Mammals:The Chromosomal Distribution of Duplicate Genes in Human and Mouse. Mol Biol Evol. 2004, 21: 1008-1013. 10.1093/molbev/msh076.

    Article  PubMed  CAS  Google Scholar 

  22. Putnam NH, Srivastava M, Hellsten U, Dirks B, Chapman J, Salamov A, Terry A, Shapiro H, Lindquist E, Kapitonov VV, et al: Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization. Science. 2007, 317: 86-94. 10.1126/science.1139158.

    Article  PubMed  CAS  Google Scholar 

  23. Sodergren E, Weinstock GM, Davidson EH, Cameron RA, Gibbs RA, Angerer RC, Angerer LM, Arnone MI, Burgess DR, Burke RD, et al: The genome of the sea urchin Strongylocentrotus purpuratus. Science. 2006, 314: 941-952. 10.1126/science.1133609.

    Article  PubMed  Google Scholar 

  24. Kusserow A, Pang K, Sturm C, Hrouda M, Lentfer J, Schmidt HA, Technau U, von Haeseler A, Hobmayer B, Martindale MQ, et al: Unexpected complexity of the Wnt gene family in a sea anemone. Nature. 2005, 433: 156-160. 10.1038/nature03158.

    Article  PubMed  CAS  Google Scholar 

  25. McLysaght A, Hokamp K, Wolfe KH: Extensive genomic duplication during early chordate evolution. Nat Genet. 2002, 31: 200-204. 10.1038/ng884.

    Article  PubMed  CAS  Google Scholar 

  26. Ryan JF, Burton PM, Mazza ME, Kwong GK, Mullikin JC, Finnerty JR: The cnidarian-bilaterian ancestor possessed at least 56 homeoboxes: evidence from the starlet sea anemone, Nematostella vectensis. Genome Biol. 2006, 7: R64-10.1186/gb-2006-7-7-r64.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Donoghue PC, Purnell MA: Genome duplication, extinction and vertebrate evolution. Trends Ecol Evol. 2005, 20: 312-319. 10.1016/j.tree.2005.04.008.

    Article  PubMed  Google Scholar 

  28. Levine M, Tjian R: Transcription regulation and animal diversity. Nature. 2003, 424: 147-151. 10.1038/nature01763.

    Article  PubMed  CAS  Google Scholar 

  29. Carroll SB: Chance and necessity: the evolution of morphological complexity and diversity. Nature. 2001, 409: 1102-1109. 10.1038/35059227.

    Article  PubMed  CAS  Google Scholar 

  30. Carroll SB: Evolution at two levels: on genes and form. PLoS Biol. 2005, 3: e245-10.1371/journal.pbio.0030245.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Marcellini S, Simpson P: Two or Four Bristles: Functional Evolution of an Enhancer of scute in Drosophilidae. Plos Biol. 2006, 4: e386-10.1371/journal.pbio.0040386.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Prud'homme B, Gompel N, Carroll SB: Emerging principles of regulatory evolution. Proc Natl Acad Sci USA. 2007, 104 (Suppl 1): 8605-8612. 10.1073/pnas.0700488104.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Amores A, Force A, Yan YL, Joly L, Amemiya C, Fritz A, Ho RK, Langeland J, Prince V, Wang YL, et al: Zebrafish hox clusters and vertebrate genome evolution. Science. 1998, 282: 1711-1714. 10.1126/science.282.5394.1711.

    Article  PubMed  CAS  Google Scholar 

  34. Sundström G, Larsson TA, Larhammar D: Phylogenetic and chromosomal analyses of multiple gene families syntenic with vertebrate Hox clusters. BMC Evolutionary Biology. 2008, 8: 254-10.1186/1471-2148-8-254.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Larsson TA, Olsson F, Sundström G, Lundin LG, Brenner S, Venkatesh B, Larhammar D: Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor regions. BMC Evolutionary Biology. 2008, 8: 184-10.1186/1471-2148-8-184.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Hokamp K, McLysaght A, Wolfe KH: The 2R hypothesis and the human genome sequence. Journal of Structural and Functional Genomics. 2003, 3: 95-110. 10.1023/A:1022661917301.

    Article  PubMed  CAS  Google Scholar 

  37. Hughes AL, da Silva J, Friedman R: Ancient genome duplications did not structure the human Hox-bearing chromosomes. Genome Res. 2001, 11: 771-780. 10.1101/gr.GR-1600R.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  38. Martin AP: Increasing genomic complexity by gene duplication and the origin of vertebrates. AmNat. 1999, 154: 111-128.

    Google Scholar 

  39. Hughes AL: Phylogenetic tests of the hypothesis of block duplication of homologous genes on human chromosomes 6, 9, and 1. Mol Biol Evol. 1998, 15: 854-870.

    Article  PubMed  CAS  Google Scholar 

  40. Furlong RF, Holland PW: Polyploidy in vertebrate ancestry: Ohno and beyond. Biological Journal of the Linnean Society. 2004, 82: 425-430. 10.1111/j.1095-8312.2004.00329.x.

    Article  Google Scholar 

  41. Skrabanek L, Wolfe KH: Eukaryote genome duplication – where's the evidence?. Curr Opin Genet Dev. 1998, 8: 694-700. 10.1016/S0959-437X(98)80039-7.

    Article  PubMed  CAS  Google Scholar 

  42. Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, Adams MD, Myers EW, Li PW, Eichler EE: Recent segmental duplications in the human genome. Science. 2002, 297: 1003-1007. 10.1126/science.1072047.

    Article  PubMed  CAS  Google Scholar 

  43. Samonte RV, Eichler EE: Segmental duplications and the evolution of the primate genome. Nat Rev Genet. 2002, 3: 65-72. 10.1038/nrg705.

    Article  PubMed  CAS  Google Scholar 

  44. Bailey JA, Eichler EE: Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet. 2006, 7: 552-564. 10.1038/nrg1895.

    Article  PubMed  CAS  Google Scholar 

  45. Meaburn KJ, Misteli T: Cell biology: chromosome territories. Nature. 2007, 445: 379-781. 10.1038/445379a.

    Article  PubMed  CAS  Google Scholar 

  46. Goode DK, Snell P, Smith SF, Cooke JE, Elgar G: Highly conserved regulatory elements around the SHH gene may contribute to the maintenance of conserved synteny across human chromosome 7q36.3. Genomics. 2005, 86: 172-181. 10.1016/j.ygeno.2005.04.006.

    Article  PubMed  CAS  Google Scholar 

  47. Kikuta H, Laplante M, Navratilova P, Komisarczuk AZ, Engstrom PG, Fredman D, Akalin A, Caccamo M, Sealy I, Howe K, et al: Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates. Genome Res. 2007, 17: 545-555. 10.1101/gr.6086307.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  48. Lee AP, Koh EG, Tay A, Brenner S, Venkatesh B: Highly conserved syntenic blocks at the vertebrate Hox loci and conserved regulatory elements within and outside Hox gene clusters. Proc Natl Acad Sci USA. 2006, 103: 6994-6999. 10.1073/pnas.0601492103.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  49. Lemons D, McGinnis W: Genomic evolution of Hox gene clusters. Science. 2006, 313: 1918-1922. 10.1126/science.1132040.

    Article  PubMed  CAS  Google Scholar 

  50. Larhammar D, Lundin LG, Hallbook F: The human Hox-bearing chromosome regions did arise by block or chromosome (or even genome) duplications. Genome Res. 2002, 12: 1910-1920. 10.1101/gr.445702.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  51. Hoegg S, Meyer A: Hox clusters as models for vertebrate genome evolution. Trends Genet. 2005, 21: 421-424. 10.1016/j.tig.2005.06.004.

    Article  PubMed  CAS  Google Scholar 

  52. Zhang J, Nei M: Evolution of Antennapedia-class homeobox genes. Genetics. 1996, 142: 295-303.

    PubMed  CAS  PubMed Central  Google Scholar 

  53. Bailey WJ, Kim J, Wagner GP, Ruddle FH: Phylogenetic reconstruction of vertebrate Hox cluster duplications. Mol Biol Evol. 1997, 14: 843-853.

    Article  PubMed  CAS  Google Scholar 

Download references


I am grateful to the three reviewers for their constructive comments. I am also thankful to Saneela Amir (COMSATS Institute of Information Technology, Abbottabad, 22060, Pakistan) for improving the language. This study was supported by Institute of Human Genetics Philipps University Marburg, Germany, Higher Education Commission (HEC) of Pakistan and National Center for Bioinformatics, QAU, Islamabad.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Amir Ali Abbasi.

Additional information

Competing interests

The author declares that they have no competing interests.

Authors' contributions

AAA conceived and designed the project. AAA wrote and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Abbasi, A.A. Are we degenerate tetraploids? More genomes, new facts. Biol Direct 3, 50 (2008).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: