A mystery unveiled
© BioMed Central Ltd 2011
Published: 30 May 2011
Skip to main content
© BioMed Central Ltd 2011
Published: 30 May 2011
A recent phylogenomic study has provided new evidence for two ancient whole genome duplications in plants, with potential importance for the evolution of seed and flowering plants.
So why have these older events not been detected before? Plant genomes are highly dynamic and usually go through an intense phase of structural rearrangements and gene loss following duplication. Bioinformatics tools to detect within-genome colinearity can be used to find remnants of relatively recent WGDs, but, since colinearity fades with time, old(er) duplication events in plants generally can not be detected this way. In many cases, WGDs can also be detected by building age distributions of paralogs, where the number of paralogs is plotted against their age, which can be approximated by the number of synonymous substitutions per synonymous site (K S). A peak in such a distribution indicates a burst of duplications at about the same time, and is often interpreted as a WGD event. However, due to gene loss and saturation effects, K S age distributions become unreliable for the detection of older duplication events. To see whether they could find evidence for older WGDs in plants, Jiao et al.  used a phylogenomic approach and constructed thousands of evolutionary trees for gene families built from a collection of genes from sequenced genomes and more than 12.5 million new expressed-sequence-tag sequences from basal angiosperms and gymnosperms. A semi-parametric penalized likelihood approach was then applied to estimate the divergence time of duplicated genes or clades compared with some pre-set speciation times. Analysis of the distribution of inferred duplication times showed two clear peaks: one in the ancestor of all extant angiosperms, dated at 192 million years ago (mya), and one in the ancestor of all extant seed plants dated at 319 mya.
In addition, Jiao et al.  performed a gene ontology enrichment analysis for the genes that survived these two old WGDs, and found an overabundance of genes from several functional categories, including transferases and binding proteins, transcription factors and protein kinases. A biased retention of transcriptional and developmental regulators and proteins that form complexes is a recurrent observation following WGDs. Dosage balance effects can explain the specific retention of such genes [2, 3]. When a dosage-sensitive gene is duplicated on its own, the effects of its increased dosage with respect to its interaction partners or targets would be deleterious, and the duplication would be selected against . But if the gene and its interaction partners are duplicated together, as in a WGD, their relative dosage is preserved. As a matter of fact, the observation of biased gene retention following a duplication event, such as observed by Jiao et al. , is actually rather strong, albeit circumstantial, evidence that the duplication event did affect the whole genome. The consequences of these dosage balance effects is that post-WGD organisms are endowed with a collection of extra transcription factors, transporters and complexes that may not necessarily be useful immediately but that cannot be purged easily from the genome. In the long run, these non-adaptively preserved genes, which can be considered neutral byproducts or regulatory and developmental spandrels , may be co-opted for adaptive innovations. As previously shown , the total numbers of regulatory and developmental regulators added to plant genomes through WGDs is huge, and consequently WGDs may have spurred really important evolutionary innovations. In that respect, the older genome duplications described by Jiao et al.  could have been particularly important since they might have coincided with, respectively, the origin of the seed and the flower, two key developments in the evolution of land plants. For instance, the authors identified several tens of genes involved in flower developmental pathways, with at least one ancient duplication event before the divergence of monocots and dicots. Other examples include regulators of flowering time and seed germination, showing duplication patterns consistent with WGDs pre-dating the origin of seed plants and angiosperms.
Darwin was deeply bothered by what he perceived to be an abrupt origin and highly accelerated rate of diversification of flowering plants in the mid-Cretaceous period . Apart from increasing the diversification potential for morphologic innovation and biological novelty, WGDs often seem to be accompanied by marked and sudden increases in species richness . Although a link between WGDs and increased species diversity remains correlational rather than causal, mechanisms such as reciprocal gene loss or gene subfunctionalization might explain how gene and genome duplications facilitate the formation of novel species. Furthermore, it has been suggested recently that WGDs might increase the chance for species to survive extinction events . Therefore, the preferential retention of developmental genes and genes functioning in signaling and regulatory cascades, plus the fact that WGDs might facilitate speciation events while at the same time reducing the chance of extinction, sheds novel light on the evolution of one of the most important clades of organisms on this planet. Perhaps Darwin's 'abominable mystery' of the origin and radiation of the angiosperms might be just about to be solved.
number of synonymous substitutions per synonymous site
million years ago
whole genome duplication
I would like to thank Jeffrey A Fawcett and Xiaolian Zhu for help with Figure 1, and Steven Maere for a critical reading of the manuscript.