Genomes of extremophile crucifers: new platforms for comparative genomics and beyond
© BioMed Central Ltd 2012
Published: 16 August 2012
Skip to main content
© BioMed Central Ltd 2012
Published: 16 August 2012
Recent reports describe the genome sequencing of Thellungiella salsuginea and Thellungiella parvula, two extremophile crucifers closely related to the stress-sensitive model plant Arabidopsis thaliana.
Abiotic stresses, such as salinity, are important factors limiting plant productivity. Over millions of years, extremophile plants have evolved efficient ways of sustaining growth in challenging environments, and the study of these plants can provide valuable insights into the underlying mechanisms of their stress tolerance .
Structural genomic comparisons show that, despite significant differences in genome sizes, large regions of co-linearity are present between the four species [2, 4, 5]. Differences in genome size could be explained mainly by differences in the intergenic regions, where repeated sequences and transposable elements are commonly found. While in A. thaliana large numbers of microdeletions in these areas are suspected, resulting in a reduction of genome size , recent activity of transposable elements is thought to be one of the main reasons for the expansion of the T. salsuginea  and A. lyrata  genomes. The latter elements are also thought to be one of the factors at the origin of so-called taxonomically restricted, or orphan, genes and gene families, which are new genes that have recently arisen in a taxon. Wu et al.  show the T. salsuginea genome to contain 984 families of such genes, the functions of which still remain to be explored. Finally, tandem duplication, segmental duplication and retrotransposon-related gene duplication may act on genome structure, and also may be related to functional adaptation, both via modifying gene expression and by providing an opportunity for functional diversification. Retrotransposition as a source of gene duplication is found to be especially common in the extremophile T. salsuginea, relative to A. thaliana .
To elucidate how and which structural genomic changes may influence the stress tolerance of extremophile organisms, both Dassanayake et al.  and Wu et al.  take advantage of a priori knowledge of gene function, as inferred through sequence homology, by applying this functional information to a focus on copy number variations. A simple method for identifying the functional categories whose constituent genes are subject to recent expansion is to perform gene set enrichment analyses. These analyses are frequently based on gene ontology annotations and can be used to identify functional groups of genes (or annotations) that are statistically over- or underrepresented in one genome compared with another. In both Thellungiella studies [3, 4], gene set enrichment analysis shows that numerous categories of genes already known to be related to abiotic stress, including 'response to salt stress', 'abscisic acid stimulus', 'transporter activity' and 'development', are indeed overrepresented in the genomes of these halophytes compared with A. thaliana. This finding is of fundamental importance, as it demonstrates that positive selection for duplications of existing stress-response genes is likely to play a part in the adaptation to high abiotic stress environments.
Manual analysis and annotation of gene families that have undergone selective expansion in extremophiles is then used by Wu et al.  in T. salsuginea to highlight specific genes and processes involved in stress tolerance, including genes encoding transcription factors, abscisic acid biosynthetic enzymes, a key enzyme involved in wax production, and the sodium transporter HKT1. The latter gene is of particular interest in T. salsuginea as HKT1 has not only been triplicated with respect to A. thaliana, but one of the three copies has also undergone a significant increase in expression, most likely related to changes in the cis-regulatory region. Such changes have also been recorded for another sodium transporter encoded by the salt overly sensitive 1 (SOS1) gene. For this gene, both T. parvula and T. salsuginea copies exhibit homologous promoter regions and high expression, whereas the A. thaliana transporter differs in promoter sequence and is expressed at lower levels , providing an example of how genomic changes in non-coding sequences may affect physiology.
The recent work on Thellungiella [3, 4] shows that, even for closely related species, successful adaptation to abiotic stress is likely to involve the combination of numerous genomic changes related to known but also to novel genes, which are driven by various evolutionary mechanisms. The studies provide a starting point for a fresh approach, namely the genome-wide, and thus holistic, study of the molecular bases of adaptation, and pave the way for the use of systems biology tools to construct and model metabolic and regulatory networks. These models can, in a next step, be expanded to combine other forms of high-throughput data, such as metabolomic results and/or observations on small RNAs, DNA methylation and other forms of epigenetic regulation . At the same time, targeted approaches aiming to identify the function of individual genes are still needed, as even the best models will be incomplete until they are able to incorporate and understand the numerous orphan genes and gene families, which may well be the most innovative features of extremophile genomes. In the same vein, the most complete regulatory networks would remain hypothetical until backed up by experimental evidence.
We anticipate that the discussed studies, exploiting natural genetic variability of land plants to study the evolutionary processes of adaptation to extreme environments, will be inspirational for the development of similar approaches in other organisms. For instance, algae have also colonized environments spanning a wide range of abiotic factors. Meta-comparisons of the evolutionary mechanisms underlying adaptation across lineages and kingdoms will then provide insights into both conserved and specific mechanisms, and increase our understanding of the general principles underlying adaptation.