The circadian clock goes genomic
© BioMed Central Ltd 2013
Published: 24 June 2013
Skip to main content
© BioMed Central Ltd 2013
Published: 24 June 2013
Large-scale biology among plant species, as well as comparative genomics of circadian clock architecture and clock-regulated output processes, have greatly advanced our understanding of the endogenous timing system in plants.
Plants rely on an endogenous timekeeper to optimally prepare for the recurrent cycles of day and night, light and darkness, energy production and energy consumption, activity of pollinators, as well as seasonal changes that tell them when to flower or shed their leaves [1, 2]. The 'circadian' clockwork (from Latin circa diem, about one day) is entrained to the periodic light regime of the environment: plants use this information to control internal processes so that they take place at the most appropriate time of day for maximal output and performance. This global system works at various genomic levels.
Overall, the principles of rhythm generation in plants are the same as in mammals or Drosophila, but the components involved are largely different, pointing to independent origins of the timekeeping mechanisms. In mammals, the core loop comprises the transcription factors CLOCK and BMAL1, which activate the expression of Cryptochrome and Period genes. The PERIOD/CRYPTOCHROME complex, in turn, represses BMAL1/CLOCK-mediated transcription of their own genes. Additional feedback loops consisting of transcriptional activators and repressors interlock with this central loop to regulate the expression of the core clock genes (for a detailed description, see Zhang and Kay , Staiger and Köster , and Dibner et al. ).
Chronobiology, the discipline of endogenous timekeeping, went molecular with the first demonstration of mRNAs in pea plants that appeared at sunrise and disappeared at sunset, and continued to cycle with a 24-h rhythm even in the absence of a light-dark cycle . It was difficult to appreciate these circadian experiments as they were not just a 'minus light' sample compared with a 'plus light' sample, but required processing of many samples harvested around the clock. A major advance in this sort of approach was to move beyond a gene-by-gene examination. The first circadian microarray study was opportunely performed just after the compilation of the Arabidopsis genome [12, 13]. Cycling gene clusters could thus be linked to nearby non-coding DNA, and conserved elements in the upstream regions revealed phase-specific promoter elements [12, 14–16]. These studies provided valuable insights into the genome-wide mechanism of clock outputs for the first time. Groups of genes that are co-ordinately directed to certain times of the day pointed to entire pathways that were not previously known to be clock-regulated, such as the phenylpropanoid pathway .
Subsequently, many homologous genes were found to be clock-regulated and phased to similar times of day in poplar and rice, as they are in Arabidopsis . Furthermore, the same three major classes of cis-regulatory modules of Arabidopsis were found in poplar and rice. The morning module consists of the morning element (CCACAC), which confers expression at the beginning of the day, and a ubiquitous G-box (CACGTG) regulatory element associated with regulation by light and by the phytohormone abscisic acid. The evening module consists of the evening element (AAAATATCT), which confers expression at the end of the day, and the GATA motif, which is associated with light-regulated genes. The midnight modules come in three variants, ATGGCC (PBX), AAACCCT (TBX) and AAGCC (SBX). This points to a strong conservation of clock-regulated transcriptional networks between mono- and dicotyledonous species . As shown in Figure 1c, oscillations of the output genes can be accomplished through direct binding of rhythmically expressed clock proteins to phase modules in the promoters of output genes, or via intermediate transcription factors.
The information from numerous microarray experiments conducted under different light and temperature regimes by the community were assembled into the easy-to-use DIURNAL database . This site is widely consulted to check for rhythmic transcript patterns, reflecting the growing awareness of the importance of temporal programs in gene expression .
Rhythmically expressed genes in Arabidopsis were found to be over-represented among phytohormone- and stress-responsive pathways. This revealed that endogenous or environmental cues elicit reactions of different intensities depending on the time-of-day [15, 19]. This so-called 'gating' is thought to optimize the response to a plethora of stimuli impinging on the plant, and may be of particular relevance for sessile organisms . An example of this is how the PRR5, PRR7 and PRR9 proteins contribute to the cold stress response . These PRRs also contribute to coordinating the timing of the tricarboxylic acid cycle . In this way, one set of regulators directly link global gene expression patterns to rhythmic primary metabolism and stress signaling.
A similar systems-based approach identified the circadian clock as a key player in other facets of metabolism, since CCA1 regulates a network of nitrogen-responsive genes throughout the plant . CCA1 also has a role in coordination of the reactive oxygen species response that occurs each day as part of light harvesting for photosynthesis and the reaction to abiotic stress, such as the response to high salt . Another clock-optimized process is the regulation of plant immunity. The defense of Arabidopsis against Pseudomonas syringae or insects depends on the time-of-day of pathogen attack [24–26]. Furthermore, genes that are induced upon infection with the oomycete Hyaloperonospora arabidopsidis, which causes downy mildew disease, have more CCA1 binding sites in their promoters than expected . cca1 mutants show reduced resistance when infected at dawn. Since lhy mutants are not impaired in disease resistance, this points to a specific effect of the CCA1 clock protein rather than a general effect of the clock . Similarly, the RNA-binding protein AtGRP7 (Arabidopsis thaliana glycine-rich RNA binding protein 7), which is part of a negative feedback loop downstream of the core oscillator, plays a role in immunity [28–30].
Microarray analysis has also contributed to the question of whether there is one clock for all parts of the plant. Plants, unlike animals, do not have their circadian system organized into a master clock situated in the brain and 'slave' clocks in peripheral organs . However, the differential oscillatory patterns of core clock genes in Arabidopsis shoots and roots point to a distinct clock in roots that runs only on the morning loop .
Soon after discovering the effect of the clock on transcription, it became apparent that clock-controlled promoter activity does not always lead to detectable oscillations in mRNA steady-state abundance. This was attributable to a long half-life of the transcripts . In Arabidopsis, a global search for short-lived transcripts identified a suite of clock-controlled transcripts. For some of these, the mRNA stability changes over the circadian cycle . Corresponding factors that may coordinately regulate the half-life of sets of transcripts are yet to be identified, although candidates include RNA-binding proteins that themselves undergo circadian oscillations .
A prominent role for post-transcriptional control in circadian timekeeping was suggested by the long period phenotype of the prmt5 mutant defective in PROTEIN ARGININE METHYLTRANSFERASE 5 [36–38]. Among the protein substrates of PRMT5 are splicing factors, and thus PRMT5 has a global impact on splicing. Alternative splicing of the clock gene PRR9 is affected by loss of PRMT5 and the transcript isoform encoding functional PRR9 is barely detectable in prmt5 mutants, suggesting that the circadian defect may partly be caused by changes in PRR9 splicing . Additional splicing factors that affect circadian rhythms are SPLICEOSOMAL TIMEKEEPER LOCUS1, the SNW/Ski-interacting protein (SKIP) domain protein SKIP, and the paralogous RNA-binding proteins AtGRP7 and AtGRP8 [39–41]. Notably, AtGRP7 and AtGRP8 form a feedback loop through unproductive alternative splicing and decay of transcript isoforms with a premature termination codon, associating for the first time nonsense-mediated decay with the circadian system [42, 43].
In another approach, a high-resolution RT-PCR panel based on fluorescently labeled amplicons was used to systematically monitor alternative splicing of the core oscillator genes . Alternative splicing events were observed 63 times, and of these, at least 13 were affected by low temperature. This suggested that alternative splicing might serve to adjust clock function to temperature changes. More recently, RNA-Seq analyses identified alternative splicing of many clock genes, and an event leading to the retention of an intron in CCA1 was conserved across different plant species . In the future, a systematic comparison of alternative splicing networks (both for core clock genes and clock output genes) to the corresponding transcriptional programs will unravel the contribution of alternative splicing to the rhythms in transcript and protein abundance.
To date, the extent to which proteins undergo circadian oscillations in the plant cell has not been systematically studied. An initial proteomic study in rice revealed a difference in expression phases between mRNAs and proteins, suggesting regulation at the post-transcriptional, translational and post-translational levels . Uncoupling of protein rhythms from mRNA rhythms has also been observed in mouse liver, where 20% of soluble proteins show a rhythm in protein abundance but only half of them originate from rhythmic transcripts .
A prominent class of small noncoding RNAs are microRNAs (miRNAs), which are 19 to 22 nucleotide long single-stranded RNAs that base-pair with mRNA targets and thereby control the level of target transcripts or the level of translation of these mRNAs . miRNAs that oscillate across the circadian cycle have been widely described in mammals and Drosophila. In these organisms, miRNAs target clock components and play a role in entrainment or regulation of clock output [49, 50].
In Arabidopsis, a suite of miRNAs was interrogated for rhythmic expression. Using tiling arrays, miR157A, miR158A, miR160B and miR167D were found to be clock-controlled . On the other hand, miR171, miR398, miR168 and miR167 oscillate diurnally but are not controlled by the clock . The functional implications of these mRNA oscillations are not yet clear. Based on the prominent role miRNAs play in modulating the circadian clock in Drosophila or mammals, such a function is to be expected in plants, where miRNAs so far have a demonstrated role only in clock output, such as seasonal timing of flowering .
Another class of noncoding RNAs is naturally occurring antisense transcripts (NATs). In Arabidopsis, rhythmic NATs were detected for 7% of the protein coding genes using tiling arrays . Among these were the clock proteins LHY and CCA1, TOC1, PRR3, PRR5, PRR7 and PRR9. In the bread mold Neurospora crassa, NATs have been implicated in clock regulation. Suites of large antisense transcripts overlap the clock gene frequency in opposite phase to sense frq. These NATs are also induced by light and thus appear to play a role in entrainment by light signals . A causal role for noncoding RNAs in the plant circadian system has yet to be established.
Forward genetic screens of mutagenized plants carrying clock-controlled promoters fused to the LUCIFERASE reporter for aberrant timing of bioluminescence were instrumental to uncover the first clock genes, TOC1, ZEITLUPE and LUX/PCL1 [55–58]. Likely because of extensive redundancy in plant genomes, most other clock genes were identified by reverse genetic approaches and genome-wide studies. In fact, up to 5% of transcription factors have the capacity to contribute to proper rhythm generation . A yeast one hybrid screen of a collection of transcription factors for their binding to the CCA1/LHY regulatory regions revealed CIRCADIAN HIKING EXPEDITION (CHE) as a modulator of the clock .
These CHE studies attempted to bridge TOC1 with the regulation of CCA1/LHY, but failed to fully explain the effect of TOC1 on CCA1/LHY expression. Subsequently, chromatin immunoprecipitation (ChIP)-Seq showed that TOC1 directly associates with the CCA1 promoter, and this interaction is not dependent on CHE [61, 62]. Thus, while CHE is not generally seen as a core clock component, its analysis revealed that genomic approaches can feasibly interrogate the capacity of a given transcription factor to modulate clock performance. Genome-wide analysis of cis-elements in clock-controlled promoters should identify the motifs that control rhythmic RNA expression of a clock-controlled gene, and this facilitates the identification of the trans factors that create such rhythms (Figure 1c).
ChIP-Seq revealed that PRR5 functions as a transcriptional repressor to control the timing of target genes . It can be expected that the global DNA-binding activity of all core-clock components will be rapidly assembled and this will be associated with the roles of each factor in regulating global transcription, accounting for up to 30% of all transcripts .
So far, a number of clock components have been shown to be required to modify histones at the appropriate time. For example, CCA1 antagonizes H3Ac at the TOC1 promoter . In contrast, REVEILLE8 (RVE8), a MYB-like transcription factor similar to CCA1 and LHY, promotes H3Ac at the TOC1 promoter, predominantly during the day . However, it is unclear if CCA1 and RVE8 cause the histone modification at the TOC1 promoter, or if histone modification allows CCA1 or RVE8 to actively participate in regulation of TOC1 transcription, respectively. The underlying molecular mechanism of the temporal histone modification and components involved are currently elusive. Furthermore, it remains to be shown whether other histone modifications, such as phosphorylation, ubiquitination or sumoylation , also contribute to the clock gene expression and change across the day.
The availability of an ever-increasing number of sequenced plant genomes has made it possible to track down the evolution of core clock genes. The Arabidopsis core oscillator comprises families of proteins that are assumed to have partially redundant functions [1, 3]. The founding hypothesis was that the higher-land-plant clock derived from algae. The green alga Ostreococcus tauri, the smallest living eukaryote with its 12.5 Mb genome (10% of Arabidopsis) has only a CCA1 homolog, forming a simple two-component feedback-loop with a TOC1 homolog, the only PRR-like gene found in Ostreococcus . This supported that the hypothesis that the CCA1-TOC1 cycle is the ancestral oscillator (Figure 2).
Recent efforts to clone crop-domestication genes have revealed that ancient and modern breeding has selected variants in clock components. The most notable examples include the transitions of barley and wheat as cereals and alfalfa and pea as legumes from the Fertile Crescent to temperate Europe. This breeding and seed trafficking was arguably the greatest force in Europe leading the transition from nomadic to civilized lifestyles. It is known that ancestral barley and wheat are what are now called the winter varieties. The common spring varieties arose as late flowering cultivars, which profit from the extended light and warmth of European summers over that of the Middle East. That occurred from a single mutation in barley (Hordeum vulgare) in a PRR ortholog most similar to PRR7 termed Ppd-1 (Photoperiod-1) (Figure 2) . In wheat (Triticum aestivum), since it is polyploid and recessive mutations rarely have any phenotypic impact, breeders selected promoter mutations at PPD that led to dominant late-flowering . Interestingly, in the beet Beta vulgaris, a PRR7-like gene named BOLTING TIME CONTROL1 (BvBTC1) is involved in the regulation of bolting time, mediating responses to both long days and vernalization . Evolution at PRR7 is thus a recurrent event in plant domestication.
As barley (Hordeum vulgare) moved north, early flowering was selected in a late-flowering context due to the presence of the spring allele at ppdh1. Mutations in the barley ELF3 ortholog, termed EAM8 (Figure 2), were selected . Interestingly, the migration of bean and alfalfa to temperate Europe also coincided with ELF3 mutations . In Asia, rice varieties in domestication have also mapped to the ELF3 locus . It will be intriguing to assess the genome-wide population structure of clock gene variation as a possible driving force in species migration over latitude and altitude. Genome-wide efforts to explore this show that such studies have merit .
One identifying feature of plants within clades of multicellular organisms is the possibility of fertile polyploids. It is speculated that, over evolutionary time, all higher-land plants were at one time polyploid, and indeed, it has been estimated that up to 80% of extant plant species are in a non-diploid state . This raises several confounding features on the genome. For one, in autopolyploids, derived from an expansion of genomes derived from one species, the process of going from 2× to 4× obviously increases the copy number of all genes by twofold. One report to examine this comes from the comparison of the Brassica rapa oscillator repertory . On average, it is possible for this species to have threefold more of an individual gene over Arabidopsis. However, this is not always the case, as gene loss of these redundant copies has occurred at numerous loci . By examining the probability of gene presence, it has been shown that the retention of clock genes has been more highly favored than the retention of genes randomly sampled from the genome ; this was not a linkage disequilibrium effect, as even the neighboring genes, as known by synteny, were retained at a lower rate. Thus, Brassica rapa has gained fitness by keeping additional copies of clock genes (Figure 2). Why that is awaits testing.
In allopolyploids that arise from the intercrossing of species, the clock confronts allele choice issues between the potentially conflicting parental genomes. Allopolyploids are common in nature, are often easy to recreate in the lab, and are often more vigorous than the parents. Using a newly generated allopolyploid, the role of the clock in providing a genome-wide fitness was assessed [75, 76]. Epigenetic modification at two morning clock genes was found to associate with vigor through regulation of metabolic processes . In subsequent studies, this was further related to stress response pathways in a genome-wide analysis of mRNA decay . Thus, genome-wide polyploidy acts early on clock genes to partition metabolism and stress signaling.
High-throughput approaches have greatly advanced our understanding of the pervasive effect of the clock on the transcriptome and molecular underpinnings of rhythms in promoter activity. However, our knowledge of rhythms in protein abundance conferred by subsequent layers of regulation and of small RNA regulation in the plant circadian system is underdeveloped. Comparative genomics among different plant species have pointed to divergences in clock-output processes, and perhaps in the clock mechanism itself. Relating the orthologous function of a given clock protein across the function of the plant genomes will undoubtedly continue to require large-scale genomics.
Arabidopsis thaliana glycine-rich RNA binding protein
circadian clock associated 1
circadian hiking expedition
late elongated hypocotyl
naturally occurring antisense transcript
protein arginine methyltransferase 5
timing of CAB expression 1.
Research in our laboratories is supported by the DFG (STA653 and SPP1530) to DS and (DA1041/4, SFB635, and SPP1530) to SJD. JS and MJ both recognize Alexander von Humboldt support.