Detection and analysis of alternative splicing in Yarrowia lipolytica reveal structural constraints facilitating nonsense-mediated decay of intron-retaining transcripts
© Mekouar et al.; licensee BioMed Central Ltd. 2010
Received: 11 March 2010
Accepted: 23 June 2010
Published: 23 June 2010
Hemiascomycetous yeasts have intron-poor genomes with very few cases of alternative splicing. Most of the reported examples result from intron retention in Saccharomyces cerevisiae and some have been shown to be functionally significant. Here we used transcriptome-wide approaches to evaluate the mechanisms underlying the generation of alternative transcripts in Yarrowia lipolytica, a yeast highly divergent from S. cerevisiae.
Experimental investigation of Y. lipolytica gene models identified several cases of alternative splicing, mostly generated by intron retention, principally affecting the first intron of the gene. The retention of introns almost invariably creates a premature termination codon, as a direct consequence of the structure of intron boundaries. An analysis of Y. lipolytica introns revealed that introns of multiples of three nucleotides in length, particularly those without stop codons, were underrepresented. In other organisms, premature termination codon-containing transcripts are targeted for degradation by the nonsense-mediated mRNA decay (NMD) machinery. In Y. lipolytica, homologs of S. cerevisiae UPF1 and UPF2 genes were identified, but not UPF3. The inactivation of Y. lipolytica UPF1 and UPF2 resulted in the accumulation of unspliced transcripts of a test set of genes.
Y. lipolytica is the hemiascomycete with the most intron-rich genome sequenced to date, and it has several unusual genes with large introns or alternative transcription start sites, or introns in the 5' UTR. Our results suggest Y. lipolytica intron structure is subject to significant constraints, leading to the under-representation of stop-free introns. Consequently, intron-containing transcripts are degraded by a functional NMD pathway.
From a genomic point of view Yarrowia lipolytica is rather atypical among hemiascomycetous yeasts sequenced to date . Its genome is surprisingly large, consisting of six chromosomes, a total of about 20.5 Mb in size, more than one and a half times the size of the Saccharomyces cerevisiae genome and twice that of Kluyveromyces lactis. However, with an overall density of only one gene per 3 kb and 6,449 predicted protein-coding genes, the gene content of Y. lipolytica is similar to that of other hemiascomycetes. The complete genome has a mean G + C content of 49%, which is significantly higher than that in other yeast genomes [1, 2], with the exception of Eremothecium (Ashbyia) gossypii, which has a G + C content of 52% . The genome of Y. lipolytica is also unusual in several other ways: atypical structure of chromosomal origins of replication and centromeric DNA , large number of tRNA genes [1, 5], 5S rRNA genes dispersed throughout the genome [1, 6] and unique fusions between tRNA genes and 5S rRNA genes . Unlike most hemiascomycetes, in which ribosomal DNA loci are clustered into a single locus on one chromosomal arm, Y. lipolytica rDNA units, containing the 18S, 5.8S and 26S rRNA genes, are found in six subtelomeric clusters [1, 8], a distribution also observed in Pichia pastoris . Y. lipolytica is also unusual in having a highly diverse transposable element content [10–13]. Y. lipolytica genes also display an organization different from that of other hemiascomycetes, as some genes are interrupted by several spliceosomal introns, with up to five introns per gene [1, 14]. The total number of introns, first estimated at 742 in the 2004 annotation, has now reached 1,119 with the data presented in this study, and this number of introns is larger than that in any other hemiascomycetous genome sequenced to date (287 introns in S. cerevisiae ; 415 introns in Candida albicans ; 633 intron-containing genes in P. pastoris ). Thus, about 15% of the genes contain introns and the intron density is about 0.17.
Intron density varies considerably between eukaryotes , from a few introns per genome in Giardia , to more than eight per gene in humans . Y. lipolytica is thus considered to be an intron-poor species , but alternative splicing (AS) was fortuitously observed for the intron of the first gene of the Mutyl DNA transposon, for which a combination of alternative 5'-splice sites (5'ss) and 3'-splice sites (3'ss) is used . AS generally results from the combination of splice sites present in the pre-mRNA, and may occur through four basic modes: use of an alternative 5'ss, use of an alternative 3'ss, cassette-exon skipping and intron retention. AS is currently thought to occur in more than 60% of human genes [21–23], increasing the complexity of the transcriptome and leading to genetic or malignant diseases in some cases [24, 25]. By contrast, very few examples of AS resulting in the production of multiple proteins have been reported in yeasts, such as Schizosaccharomyces pombe  and S. cerevisiae [27, 28]. In a few additional cases, alternative transcripts have been predicted in S. cerevisiae [29–31] and C. albicans  although without supporting evidence for multiple functional proteins. Many other cases of alternative transcripts in yeasts, mostly identified by global transcriptomic approaches [32–34], involve intron retention and result in nonsense-containing mRNAs. These cases may result from inefficient splicing or missplicing  due to suboptimal splicing signals . These alternative transcripts were thought to be largely non-functional. However, in some cases, intron retention seems to be regulated by growth conditions, such as amino-acid starvation , or by a specific physiological state of the cells, such as meiosis [15, 38, 39]. Other examples of regulated splicing, in which the protein inhibits the splicing of its own pre-mRNA, include RPL30  and YRA1 [27, 41, 42].
Thus, the AS of mRNA generates two types of transcript: mRNAs to be translated into functional proteins (thereby increasing the complexity/diversity of the proteome) or nonsense-containing mRNAs that may generate truncated proteins potentially deleterious to the cell if translated. Nonsense-mediated mRNA decay (NMD) is a eukaryotic quality control mechanism that detects mRNAs with a premature termination codon (PTC), targeting them for degradation and thus preventing their translation (for review, see [43–45]). This RNA surveillance pathway is well documented in yeast, mammals, fruit flies, nematodes and plants [46, 47]. Different mechanisms of PTC recognition have been identified in different species, involving the exon-exon junction complex in mammals, and the distance between the PTC and the poly(A)-binding protein, also called the 'faux 3' UTR', in yeast and fruit fly . However, a unified model has also been proposed in recent studies .
When introns are retained, a PTC may be generated by the intron sequence itself or by the downstream exon sequence if the intron does not consist of a multiple of three nucleotides and thus generates a frameshift. This observation led Jaillon et al.  to suggest that introns are structured so as to favor their detection by the NMD pathway in cases of intron retention. These authors showed that, in different species from very different phyla, intron size was subjected to strong constraints leading to the counterselection of stop-less introns of size 3n (that is, consisting of a multiple of three nucleotides).
The mechanisms regulating AS and NMD are not fully understood. Yeasts are tractable unicellular models that could supply molecular information about such mechanisms. As Y. lipolytica has more introns than S. cerevisiae, it is likely to display more AS and thus to be useful for investigation of the associated molecular mechanisms. We therefore investigated, in this organism, the population of transcripts from intron-containing genes, and their likelihood of degradation by the NMD pathway, through a combination of several different experimental approaches.
cDNA sequencing shows Y. lipolytica to have four times as many introns as S. cerevisiae
We began our investigation of Y. lipolytica splicing by using cDNA sequencing to revisit the in silico predictions of intron-containing genes in this yeast. Three cDNA libraries were constructed from mRNAs obtained from cells grown under different conditions: exponential and stationary phases on YPD medium ('expo', 9,409 reads; and 'stat', 9,620 reads) and exponential phase on oleic acid medium ('oleic', 9,405 reads).
We found that 1,659 of the 28,434 cDNA sequences (5.8%) did not match the predicted coding sequence (CDS), with 455 of these sequences not matching the Y. lipolytica chromosome sequence but possibly corresponding to CDS in non-assembled contigs. Some of the remaining 1,204 non-matching sequences displayed a significant match with 21 of the 137 predicted pseudogenes in the sense (64 cDNA sequences) or anti-sense (22 cDNA sequences) orientation. The others corresponded to intergenic regions with no predicted genetic elements.
Another set of 1,053 cDNA sequences (3.7%) matched, in an anti-sense orientation, with 167 Y. lipolytica CDSs, one of which (YALI0A21351g) was highly represented, with 579 cDNA clones. YALI0A21351g has been predicted to encode a small gene product (89 amino acids) with no homolog in databases, and may therefore be a false open reading frame. The cDNA clones derived from the antisense transcripts may thus correspond to a non-coding RNA, the structure and function of which remain to be determined.
We found that 25,722 clones matched a CDS in the expected orientation: 8,936, 8,614 and 8,172 clones in the expo, stat and oleic libraries, respectively. About 59% of the predicted genes (3,818 of 6,449) were expressed and found in at least one library and about 70% of these expressed genes (2,647 genes) were represented by at least two different clones. Clone numbers per gene and per library are given in Additional file 1. A few genes (13 genes) were represented by more than 100 clones, but mostly by less than 200, in the different libraries. The major exceptions were YALI0D06237g and YALI0E15510g in the stat library, which had 713 clones (8.7% of the stat clones) and 679 clones (8.3% of the stat clones), respectively. YALI0D06237g encodes a putative sphingolipid delta 4 desaturase and YALI0E15510g a putative homeobox transcriptional repressor. Comparison between the cDNA sequences of the different libraries showed that only 20% of the sequenced cDNAs were expressed in all three growth conditions (Figure S1 in Additional file 2). About 12% of the sequenced cDNAs were specific to the oleic or stat libraries, but almost twice as many (22.6%) were specific to the expo library. However, these figures are only approximations, as cDNA library sequencing is certainly not the most sensitive way to quantify gene expression. Some overlap in expression patterns between the different conditions may therefore have been missed due to low levels of expression or cloning biases.
Distribution of introns and intron-containing genes in the E150 genome
Intron-containing genes (I-genes) with:
The number of predicted introns in the sequenced E150 genome increased from 742  to 1,083, and the number of intron-containing genes increased to 951. Most of these genes carry only one intron, but 109 multi-intronic genes with up to five introns were detected, most (93 of 109) carrying two introns (Table 1). The internal exons of the multi-intronic genes were mostly short, the shortest being only four nucleotides long, in YALI0E34170g, as validated by two cDNAs. Introns in 5' UTRs were not systematically predicted during in silico annotation by the Génolevures Consortium. Our data revealed the presence of at least 36 introns in these 5' non-coding regions of mRNAs, a number similar to that reported for S. cerevisiae . Thus, with 1,119 introns, Y. lipolytica is the hemiascomycete with the largest number of spliceosomal introns in its genome, with about four times as many introns as S. cerevisiae.
Y. lipolytica introns have several unique features
Several unique features were identified when the intron structure of Y. lipolytica was compared with that of other hemiascomycetous yeasts. First, the branch point (BP) and the 3'ss were found to form a combined sequence, with a mean interval of one nucleotide between the motifs (Figure S2a,b in Additional file 2). This finding was previously reported for a small subset of introns of strain W29  and for a larger subset of introns of Y. lipolytica sequenced strain [58, 59]. This juxtaposition may result from an evolutionary event that simplified the mechanism of spliceosomal assembly, combining the steps of BP and 3'ss recognition , as hypothesized for two other deep-branch eukaryotes, Trichomonas vaginalis and Giardia lamblia . Second, the consensus sequences at intron boundaries were also found to be unusual for yeasts. This was particularly true for the 5'ss, which had the sequence GTGAGT, rather than the GTATGT sequence found in most other hemiascomycetes [14, 58, 60, 61]. This 5'ss consensus, which is known to be essential for intron recognition by base-pairing to U1 snRNAs, is indeed perfectly complementary to both Y. lipolytica U1 RNAs (YALI0B14567r and YALI0B20936r; Figure S3 in Additional file 2). Third, the internal BP is less well conserved than in other hemiascomycetes sequenced to date, with only five highly conserved residues (CTAAC in more than 92% of the introns) and an upstream A less conserved (Actaac in more than 71%; Figure S2A in Additional file 2), rather than the seven (TACTAAC) reported for S. cerevisiae .
All intron patterns and sequences can be downloaded from the Génosplicing website .
Structural biases in Y. lipolytica introns
We investigated the distribution of introns as a function of the translation frame of upstream exons (an intron is considered to be in phase 0 if located between two codons and in phase 1 or 2 if it splits a codon after the first or second nucleotide, respectively), intron size and the number of in-frame stop codons. This analysis highlighted several constraints exerted on the introns interrupting CDS.
Second, introns of size 3n were underrepresented (29.4% of all introns versus 35.5% and 35.1% for 3n + 1 and 3n + 2, respectively; Figure 2c). This observation is consistent with the finding that stop-less 3n introns are counterselected in Paramecium tetraurelia . In Y. lipolytica, the underrepresentation of 3n introns seemed more marked if we considered only the first intron (28.3% versus 35.85% for each 3n + 1 and 3n + 2 intron), or if we considered only short introns of 41 to 60 nucleotides (25.5% versus 34.3% and 40.2% for 3n + 1 and 3n + 2 introns, respectively; Figure 1a). No statistically significant difference was found in the distribution of introns present in the 5' UTR: 11, 13 and 12 introns of size 3n, 3n + 1 and 3n + 2, respectively (Additional file 3).
Third, the proportion of introns containing in-frame stop codons was very high for 3n (93.7%), 3n + 1 (90.4%) and 3n + 2 introns (91.8%). The probability of an intron not containing a PTC (null expectation) in a non-constrained codon string is smaller than 0.05% for any string longer than 62 codons (Figure S4 in Additional file 2). We thus compared the distribution of PTCs in introns shorter than 186 nucleotides with the expected probability. The proportion of stop-containing introns was higher than would be expected by chance alone (Figure S4 in Additional file 2). Thus, stop-free introns are scarce (88 stop-free introns). Their distribution as a function of length and insertion frame was highly heterogeneous, with an overrepresentation of stop-free 3n + 1 introns inserted in phase 0 and of 3n + 2 introns in phase 1 (Figure 2d).
Y. lipolytica uses all modes of alternative splicing
AS events were sought by two different experimental approaches. First, transcripts of genes with multiple introns or with large introns (>900 bp) were investigated by RT-PCR. Subsequently, sequences obtained from cDNA libraries were screened for splicing variants.
RT-PCR was carried out on 93 genes of Y. lipolytica for which in silico predictions for more than one intron had been made at the beginning of this study (Additional file 5). For 68 of these genes, the predicted spliced transcript was confirmed and a single mRNA was detected. Two other gene models (YALI0F03817g and YALI0F31427g) were poorly predicted and, in both cases, the second intron was not spliced in any of the three RNA preparations. It was thus considered to be part of an exon, resulting in a monointronic gene model. In nine RT-PCRs, no result was obtained, due to an absence of PCR product or non-specific amplification. For two other predicted gene models (YALI0C07150g and YALI0D04554g), only partial data were obtained and we were able to confirm only the splicing of intron 2.
Genes bearing long introns (>900 bp)
Long introns are rare in S. cerevisiae, with all but five of the introns in this species being less than 700 nucleotides long and the largest intron being 1,002 bp long. In Y. lipolytica, gene model predictions indicate that there are 61 introns of more than 700 nucleotides in length, with a maximal intron size of 3,478 bp (see detailed analysis below). We focused on the genes with the largest introns, with a view to confirming these predictions. For this purpose, 17 introns exceeding 900 bp in length (from 901 to 1,551 bp) were reverse-transcribed and amplified with specific primers and mRNA extracted from cells grown under the three different sets of conditions. Thirteen of these introns were spliced as expected, one was not amplified (cDNA clones revealed a different gene model with no introns), two were found to have been poorly predicted (intron size larger than expected) and the last intron, in YALI0F32043g, was found to be a mosaic of five introns and exons (Additional file 6). Transcripts of this last gene displayed AS due to alternative 3'ss selection (extending exon 4 by 15 bases) and retention of the 60 nucleotides, stop-free intron 5 (Figure 4c; Additional file 5). The observed AS events did not generate in-frame stop codons and did not modify the translation phase. They may result in the generation of different, putatively functional proteins.
All these results demonstrate the efficient splicing of long introns not necessarily predicted in silico.
The different strategies used to detect alternative transcripts in Y. lipolytica revealed that such variants were generated from at least 88 genes (Additional files 7 and 8). All known modes of AS were observed: alternative 5'ss (3 genes) and 3'ss usage (6 genes), exon skipping (4 genes) and intron retention (76 genes). Alternative transcription start sites within or downstream of introns were detected in seven genes. Alternative transcripts were observed for 9.2% of the intron-containing genes, but for only 1.8% of these genes if intron retention was excluded. Most of the variants observed resulted from intron retention and, if only multi-intronic genes were considered, the intron retained was mostly the first intron (15 of 21 genes). In almost all cases, intron-containing transcripts revealed by our experimental approaches, carried a PTC. This type of mRNA is generally detected by the NMD pathway, a quality control mechanism that recognizes and degrades PTC-containing transcripts, preventing their translation.
The NMD machinery exists in Y. lipolytica, but some effectors are lacking
The presence and efficiency of NMD was investigated in Y. lipolytica. Homologs of UPF1 (YlUPF1, YALI0D23881g) and UPF2 (YlUPF2, YALI0E24629g) were detected in the Y. lipolytica genome by searches for similarity to known genes. UPF3, which is less well conserved in eukaryotes than UPF1 or UPF2, was not detected in the chromosomes or in any non-assembled reads, suggesting that this NMD effector is lacking or highly divergent in Y. lipolytica. We also looked for SMG1, SMG5, SMG6 and SMG7 (EBS1 in S. cerevisiae), but failed to detect any homologs in Y. lipolytica.
The YALI0D23881g and YALI0E24629g genes, encoding YlUPF1 and YlUPF2, were entirely deleted from the laboratory strain PO1d. None of the single-deletion mutants for these genes displayed a growth defect under the conditions tested, and no defect was observed for the double-mutant (Figure S5 in Additional file 2). This result is consistent with the absence of a growth defect in S. cerevisiae strains lacking the UPF1 or UPF2 gene [68, 69], suggesting that NMD is not an essential biological mechanism in yeasts.
We also focused on the homolog of YDR381W (YRA1), which is known to encode an mRNA not targeted by the NMD pathway in S. cerevisiae [41, 42]. Homologs of the YRA1 gene are conserved in all hemiascomycetous yeasts and present a long intron, the BP motif of which diverges from the canonical sequence in almost all species . The Y. lipolytica gene model for the YRA1 homolog (YALI0A20867g) follows this rule: the first intron is 850 bp long and its BP sequence, located three bases upstream of the 3'ss motif, is TGCTGAC. RT-PCR validated the splicing of the intron in wild-type and mutant strains but identified no transcripts in which intron 1 was retained. However, as the difference in the lengths of the spliced and unspliced forms may bias the PCR in favor of the spliced variant, northern blots were performed with probes binding to exons or the first intron (Figure 7b). Given that YRA1 mRNA degradation requires Xrn1p in S. cerevisiae , we also included a YlXRN1 mutant in our analysis. In hybridization studies, we observed a higher intensity of the bands corresponding to intron-retained transcripts in NMD- mutants only. No such increase in intensity was observed for the YlXRN1 mutant. This observation suggests that, as in S. cerevisiae, the YlYRA1 transcript is not efficiently spliced in Y. lipolytica. We also found that, by contrast to what has been reported for S. cerevisiae, unspliced transcripts were targeted by the NMD pathway, and their degradation seemed to be independent of the YlXrn1 protein. These results suggest that the S. cerevisiae YRA1 autoregulation mechanism based on the nuclear export and cytoplasmic Edc3p-mediated decay of the unspliced transcript , is probably not conserved in Y. lipolytica.
Hemiascomycetous yeasts are considered to have intron-poor genomes. We show here that despite this intron paucity, Y. lipolytica has four times as many introns as S. cerevisiae and is the hemiascomycetous genome with the largest number of intron-containing genes sequenced to date. The combination of approaches used made it possible to correct many predicted gene models, to identify new genes, such as SOA genes , to confirm the splicing of many introns, including both large introns and introns from weakly expressed genes, and to detect introns in 5' UTRs. From a structural point of view, the genome annotation of Y. lipolytica is now largely validated by experimental data and provides a reliable genome model complementary to that of S. cerevisiae.
We show here that Y. lipolytica produces alternative transcripts through several different mechanisms: intron retention, exon skipping, 3' and 5' alternative splice site usage and the use of alternative promoters. The frequency of AS in Y. lipolytica is not very high, particularly if intron retention is excluded from the analysis (1.8% of intron-containing genes), but remains higher than that reported for S. cerevisiae or other hemiascomycetous yeasts, in which few naturally occurring cases [16, 26, 27, 29, 31, 38, 39] or experimentally induced examples [70, 71] have been described. Additional cases have been detected in yeasts, thanks to the recent development of genome-wide technologies providing information about transcript polymorphism, such as tiling or RNA-seq approaches, but these cases mostly involve intron retention [32–34]. We report here a few interesting examples of exon skipping, alternative 3'ss usage or presence of an intronic gene, the expression of which depends on an alternative promoter. The situation is quite different in basidiomycetous yeasts, such as Cryptococcus neoformans, which has an intron-rich genome (mean of 5.3 introns per gene) and a high frequency of AS, with high levels of intron retention  but 4.2% of the transcripts nonetheless resulting from exon skipping and alternative 3'ss or 5'ss usage .
In Y. lipolytica, intron retention is the main model by which mRNA variants are generated, consistent with previous findings for ascomycetous fungi [74, 75]. However, the particular involvement of the first intron in intron retention has not been reported before, probably because AS was investigated mostly in hemiascomycetous yeasts with very few multi-intronic genes . It would be interesting to perform a similar analysis in other phyla of ascomycetous fungi or in basidiomycetes. If this phenomenon reflects an ancestral trait, then the bias should be more marked in filamentous fungi known to possess intron-rich genomes.
One of the key questions emerging from our study relates to whether intron retention in Y. lipolytica plays a physiological role, as observed for YRA1 or meiotic genes in S. cerevisiae, or reflects an underlying background of splicing failure. We addressed this question by investigating whether the retained introns were different from other introns, including, specifically, whether their 5'ss, 3'ss or BP were degenerate or whether the introns were particularly long, potentially accounting for the low splicing efficiency (Additional file 8). However, no bias was detected in primary structure, except that the first intron of YRA1 had a degenerate BP, as it does in S. cerevisiae . More recently, YRA1 splicing inhibition has been reported to be regulated by YRA1 exon 1, in a size-dependent but sequence-independent manner . We thus investigated the size of exon 1 (coding exon plus 5' UTR) in inefficiently spliced transcripts but, again, no bias was detected. Another possibility, requiring further investigation for Y. lipolytica introns, stems from the reported correlation between splicing efficiency and the spatial distance between 5'ss and BP [76, 77]. It has been suggested that a zipper stem in the secondary structure of three large introns of S. cerevisiae shortens the S1 distance and facilitates spliceosome assembly .
However, as the first intron is more often retained than downstream introns, it is tempting to speculate that, in most cases, intron retention probably results from a defect in the kinetics of spliceosome recruitment by the polymerase or in spliceosome assembly. Indeed, in S. cerevisiae , as in other eukaryotes, splicing is mostly cotranscriptional . It has also been shown that the efficiency of splicing factor recruitment during transcription may influence splicing efficiency  and that the carboxy-terminal domain (CTD) of the large subunit of polymerase II is involved in this mechanism. We can thus speculate that retention of the first intron of transcripts may result from a defect in the recruitment of the spliceosome by polymerase II during transcription. We are currently investigating this hypothesis in Y. lipolytica and determining whether there is a correlation between intron retention and the binding kinetics of introns, splicing factors and CTD.
Almost all the observed unspliced transcripts included a premature termination codon. During the first round of translation (before degradation by NMD), the ribosome is thus likely to be rapidly stopped by the PTC because the introns concerned were mostly located at the 5' end of the CDS, close to the start codon. A statistical analysis of the structural characteristics of Y. lipolytica introns (intron size, frame of integration within the coding sequence, PTC) revealed that up to 93% of introns generated a PTC, whereas only about 30% of introns in Paramecium generate PTCs . This high percentage is due to both intron size and, in half the cases, the sequence of intron boundaries. The presence of stop codons in 5'ss motifs is unusual for yeast introns, as the most frequent motif in the hemiascomycete genomes sequenced to date is GTATGT [14, 61], but the 5'ss motif in Y. lipolytica is GTGAGT. This observation highlights a specific evolution of intron features in Y. lipolytica, as proposed for various intron-poor lineages with strong 5'ss . Introns of size 3n were also found to be underrepresented, and stop-free 3n introns were particularly strongly underrepresented, as previously reported for other eukaryotes . If retained, 3n stop-free introns do not change the translation frame and are thus considered as coding sequences that may affect the structure and activity of the resulting protein, with possible deleterious consequences for the cell. In Y. lipolytica, the small number of 3n introns was particularly pronounced for short introns, probably reflecting an ancestral situation in which introns were numerous and short . Intron size has increased during the course of evolution in yeasts, including Y. lipolytica in particular, to a much greater extent than in filamentous ascomycetes. This size increase has increased the likelihood of introns containing a PTC, potentially limiting the need for specific constraints on intron size (3n, 3n + 1, 3n + 2).
In eukaryotes, mRNAs with PTCs are subject to NMD, a quality-control mechanism directing PTC-containing transcripts for degradation to prevent their translation (for reviews, see [43–45]). As most Y. lipolytica transcripts containing retained introns also contain PTCs, this would suggest that such transcripts are mostly targeted by NMD. This may account for their lack of detection in assays with wild-type strains, in which they were probably degraded by the NMD pathway too rapidly for detection. We therefore investigated whether NMD was active in Y. lipolytica. Genes for only two of the core NMD factors were detected in the sequenced strain, YlUPF1 and YlUPF2. This situation is exceptional among eukaryotes, as all other organisms in which NMD has been studied possess at least three major effectors (UPF1/SMG2, UPF2/SMG3 and UPF3/SMG4). Deleting YlUPF1 or YlUPF2 resulted in a significant increase in the proportion of unspliced transcripts for some, but not all intron-containing genes. This result confirms the existence of a functional NMD pathway in Y. lipolytica. However, the absence of significant growth defects in YlUPF1 and YlUPF2 mutants suggests that NMD is not an essential mechanism, as in S. cerevisiae and Caenorhabditis elegans, whereas it has been shown to be essential in plants and metazoans (for a review, see ). We now aim to determine, at the whole-genome scale, which genes are targets of NMD and how this pathway is regulated in this yeast.
We present here an extensive survey of the transcriptome of a yeast chosen for this study on the basis of its phylogenetic position, far removed from all other hemiascomycetous yeasts sequenced to date. This in-depth analysis of the transcriptome made it possible to improve the structural annotation of the Y. lipolytica genome and identified complex cases of alternative transcripts. With a genome slightly more complex than that of S. cerevisiae in terms of gene structure, together with its genetic and biochemical tractability, Y. lipolytica may be a valuable organism for studies of the regulation of AS and its impact on the evolution of gene structure. Although considered an intron-poor species, Y. lipolytica nonetheless displays significant biases in its intron structure, generating PTCs in cases of intron retention. However, further comparative studies at a larger phylogenetic scale are clearly required to determine whether the modeling of intron-containing genes corresponds to an ancestral characteristic or to an evolutionary phenomenon acquired in this particular lineage.
Materials and methods
Strains and media
Y. lipolytica strains E150 (CLIB122, MATB his-1 leu2-270 ura3-302 xpr2-322) and PO1d (CLIB139, MatA leu2-270 ura3-302 xpr2-322) were routinely grown at 28°C on YPD (yeast extract, peptone and glucose, 10 g/l each) or YNB (1.7 g/l Yeast Nitrogen Base (Difco, Detroit, MI, USA), 10 g/l glucose) supplemented for auxotrophy if necessary. Oleic acid medium was prepared as follows: 1.7 g/l Yeast Nitrogen Base (Difco), 50 g/l NH4Cl, 50 mM PO4NaK pH 6.8, a 100 ml/l emulsion of oleic acid (oleic acid 20% (v/v), 0.625% (v/v) Tween 40), 0.8 g/l yeast extract, 10 g/l glucose. Growth phenotypes were investigated for mutant strains of PO1d, on YPD and YNB medium, at 28°C and 18°C.
RNA extraction and RT-PCR
The RNeasy Midi Kit (Qiagen, Courtaboeuf, France) was used to extract total RNA from cells grown in three different conditions: exponential growth phase in YPD (called 'expo'), stationary phase in YPD ('stat') and exponential growth phase in oleic acid medium ('oleic'). DNA contamination was eliminated with the Turbo DNA-free kit (Applied Biosystems/Ambion, Austin, Texas, USA). RT-PCR was performed with Ready-To-Go™ RT-PCR Beads (GE Healthcare Life Sciences, Orsay, France) and PCR control with PuReTaq Ready-To-Go™ PCR Beads (GE Healthcare Life Sciences). Primers for RT-PCR were designed so as to obtain a 200-bp amplicon after splicing. The resulting amplicons were subsequently inserted into a Bluescript plasmid and sequenced to identify the different splicing variants. The relative intensities of RT-PCR products were estimated from ethidium bromide-stained gels, with the ImageJ software developed at the National Institutes of Health .
About 20 μg of RNA was separated by electrophoresis in a 1.2% agarose gel in 1× FA buffer (20 mM morpholinepropanesulfonic acid, 5 mM sodium acetate, 1 mM EDTA, pH 7) supplemented with 1.8% formaldehyde. After electrophoresis, the RNAs were transferred onto GeneScreen nylon membranes (Perkin-Elmer Life Sciences, Courtaboeuf, France), as previously described . DNA probes were amplified by PCR from the genomic DNA of strain E150. PCR products were purified by electrophoresis in a 1% low-melting point agarose gel. DNA probes were labeled with [α-32P]dCTP, with the Amersham Megaprime™ DNA labeling kit (GE Healthcare Life Sciences, Orsay, France), and hybridizations were performed in Denhardt's solution-containing buffer at 65°C . Final washes were performed at 65°C, in 0.2 × SSC (1 × SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-0.1% sodium dodecyl sulfate.
cDNA library construction and sequencing
Total RNA was extracted from cells grown in three different sets of culture conditions (expo, stat and oleic; see above). We isolated mRNA from the total RNA preparation with the Oligotex mRNA kit (Qiagen) and the three libraries were constructed with the CloneMiner™ cDNA Library Construction Kit (Invitrogen, Cergy Pontoise, France), based on Gateway® technology. The resulting libraries were highly enriched in full-length, oriented clones. We sequenced 28,434 clones (9,409, 9,620 and 9,405 clones for the expo, stat and oleic libraries, respectively) by the Sanger method, first from the 5' end of the cloning cDNAs and then from the 3' end for 1,414 chosen clones. For 1,004 of these 1,414 selected clones, 5' and 3' sequences have been assembled, whereas for the remaining 410 clones, the 5' and 3' sequences were deposited individually in the EMBL database. The accession numbers for the resulting 28,844 sequences are [EMBL:FP671140-EMBL:FP680548], [EMBL:FP680607-EMBL:FP690338] and [EMBL:FP690350-EMBL:FP700052] for the expo, stat and oleic libraries, respectively.
The complete deletion of Y. lipolytica genes (YlUPF1, YALI0D23881g; YlUPF2, YALI0E24629g; YlXRN1, YALI0C23144g) was performed as previously described . Primers for the PCR amplification of promoter (P) and terminator (T) regions are listed in Additional file 9. The ML and/or MU cassettes  were introduced into the PT cassette. PO1d cells were transformed by the lithium acetate method , with about 400 ng of purified DNA from the disruption cassettes. Transformants were selected on YNB medium supplemented with NH4Cl (5 g/l), sodium potassium phosphate buffer, pH 6.8 (50 mM), agar (2%) and uracyl (100 mg/ml) or leucine (100 mg/ml). Gene deletion was checked by PCR, with primers external to the disruption cassette, upstream from P and downstream from T.
Auxotrophic mutants were complemented with the URA3 or LEU2 cassettes, for comparison of their growth rates with that of the wild-type strain, W29.
Genome sequence and sequence analysis
At the beginning of this study, the genome annotation of Y. lipolytica strain E150 included 6,703 CDSs (genes and pseudogenes ) and 742 introns (First annotation version 3 July 2004). The genomic sequence and the different versions of the genome annotation of Y. lipolytica strain E150, including the version updated with our data, are available from the Génolevures database .
The sequences of the cDNA clones were compared with sequences in a nucleotide sequence database of Y. lipolytica CDS using BLAST . Only the first hit was considered if the expected value was lower than 1.e-100 or between 1.e-50 and 1.e-100 with an identity score exceeding 95%.
3' splice site
5' splice site
nonsense-mediated mRNA decay
premature termination codon
We thank Stefan Kerscher for providing gene models experimentally validated for genes of the complex I, and Emmanuelle Beyne, Marek Elias and Dominique Swennen for gene model detection or modification. We also thank Stéphanie Kervestin, Olivier Jaillon and our colleagues from the Génolevures Consortium for helpful discussions, and Donald White of the ABIES doctoral school and Julie Sappa of Alex Edelman and Associates for their help in correcting the English version of the manuscript. This work was funded by the GDR CNRS 2354 'Génolevures-3' and the ANR 'Genarise' (ANR-05-BLAN-0331) programs.
- Dujon B, Sherman D, Fischer G, Durrens P, Casaregola S, Lafontaine I, De Montigny J, Marck C, Neuvéglise C, Talla E, Goffard N, Frangeul L, Aigle M, Anthouard V, Babour A, Barbe V, Barnay S, Blanchin S, Beckerich JM, Beyne E, Bleykasten C, Boisramé A, Boyer J, Cattolico L, Confanioleri F, De Daruvar A, Despons L, Fabre E, Fairhead C, Ferry-Dumazet H, et al: Genome evolution in yeasts. Nature. 2004, 430: 35-44. 10.1038/nature02579.PubMedView ArticleGoogle Scholar
- Souciet JL, Dujon B, Gaillardin C, Johnston M, Baret PV, Cliften P, Sherman DJ, Weissenbach J, Westhof E, Wincker P, Jubin C, Poulain J, Barbe V, Ségurens B, Artiguenave F, Anthouard V, Vacherie B, Val ME, Fulton RS, Minx P, Wilson R, Durrens P, Jean G, Marck C, Martin T, Nikolski M, Rolland T, Seret ML, Casarégola S, Despons L, et al: Comparative genomics of protoploid Saccharomycetaceae. Genome Res. 2009, 19: 1696-1709. 10.1101/gr.091546.109.PubMedPubMed CentralView ArticleGoogle Scholar
- Dietrich FS, Voegeli S, Brachat S, Lerch A, Gates K, Steiner S, Mohr C, Pöhlmann R, Luedi P, Choi S, Wing RA, Flavier A, Gaffney TD, Philippsen P: The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome. Science. 2004, 304: 304-307. 10.1126/science.1095781.PubMedView ArticleGoogle Scholar
- Vernis L, Poljak L, Chasles M, Uchida K, Casarégola S, Käs E, Matsuoka M, Gaillardin C, Fournier P: Only centromeres can supply the partition system required for ARS function in the yeast Yarrowia lipolytica. J Mol Biol. 2001, 305: 203-217. 10.1006/jmbi.2000.4300.PubMedView ArticleGoogle Scholar
- Marck C, Kachouri-Lafond R, Lafontaine I, Westhof E, Dujon B, Grosjean H: The RNA polymerase III-dependent family of genes in hemiascomycetes: comparative RNomics, decoding strategies, transcription and evolutionary implications. Nucleic Acids Res. 2006, 34: 1816-1835. 10.1093/nar/gkl085.PubMedPubMed CentralView ArticleGoogle Scholar
- Bergeron J, Drouin G: The evolution of 5S ribosomal RNA genes linked to the rDNA units of fungal species. Curr Genet. 2008, 54: 123-131. 10.1007/s00294-008-0201-2.PubMedView ArticleGoogle Scholar
- Acker J, Ozanne C, Kachouri-Lafond R, Gaillardin C, Neuvéglise C, Marck C: Dicistronic tRNA-5S rRNA genes in Yarrowia lipolytica: an alternative TFIIIA-independent way for expression of 5S rRNA genes. Nucleic Acids Res. 2008, 36: 5832-5844. 10.1093/nar/gkn549.PubMedPubMed CentralView ArticleGoogle Scholar
- Fournier P, Gaillardin C, Persuy MA, Klootwijk J, van Heerikhuizen H: Heterogeneity in the ribosomal family of the yeast Yarrowia lipolytica: genomic organization and segregation studies. Gene. 1986, 42: 273-282. 10.1016/0378-1119(86)90231-3.PubMedView ArticleGoogle Scholar
- De Schutter K, Lin Y, Tiels P, Van Hecke A, Glinka S, Weber-Lehmann J, Rouzé P, Van de Peer Y, Callewaert N: Genome sequence of the recombinant protein production host Pichia pastoris. Nat Biotechnol. 2009, 27: 561-566. 10.1038/nbt.1544.PubMedView ArticleGoogle Scholar
- Casaregola S, Neuvéglise C, Bon E, Gaillardin C: Ylli, a non-LTR retrotransposon L1 family in the dimorphic yeast Yarrowia lipolytica. Mol Biol Evol. 2002, 19: 664-677.PubMedView ArticleGoogle Scholar
- Kovalchuk A, Senam S, Mauersberger S, Barth G: Tyl6, a novel Ty3/gypsy-like retrotransposon in the genome of the dimorphic fungus Yarrowia lipolytica. Yeast. 2005, 22: 979-991. 10.1002/yea.1287.PubMedView ArticleGoogle Scholar
- Neuvéglise C, Feldmann H, Bon E, Gaillardin C, Casaregola S: Genomic evolution of the long terminal repeat retrotransposons in hemiascomycetous yeasts. Genome Res. 2002, 12: 930-943. 10.1101/gr.219202.PubMedPubMed CentralView ArticleGoogle Scholar
- Neuvéglise C, Chalvet F, Wincker P, Gaillardin C, Casaregola S: Mutator-like element in the yeast Yarrowia lipolytica displays multiple alternative splicings. Eukaryot Cell. 2005, 4: 615-624. 10.1128/EC.4.3.615-624.2005.PubMedPubMed CentralView ArticleGoogle Scholar
- Bon E, Casaregola S, Blandin G, Llorente B, Neuvéglise C, Munsterkotter M, Guldener U, Mewes HW, Van Helden J, Dujon B, Gaillardin C: Molecular evolution of eukaryotic genomes: hemiascomycetous yeast spliceosomal introns. Nucleic Acids Res. 2003, 31: 1121-1135. 10.1093/nar/gkg213.PubMedPubMed CentralView ArticleGoogle Scholar
- Juneau K, Palm C, Miranda M, Davis RW: High-density yeast-tiling array reveals previously undiscovered introns and extensive regulation of meiotic splicing. Proc Natl Acad Sci USA. 2007, 104: 1522-1527. 10.1073/pnas.0610354104.PubMedPubMed CentralView ArticleGoogle Scholar
- Mitrovich QM, Tuch BB, Guthrie C, Johnson AD: Computational and experimental approaches double the number of known introns in the pathogenic yeast Candida albicans. Genome Res. 2007, 17: 492-502. 10.1101/gr.6111907.PubMedPubMed CentralView ArticleGoogle Scholar
- Jeffares DC, Mourier T, Penny D: The biology of intron gain and loss. Trends Genet. 2006, 22: 16-22. 10.1016/j.tig.2005.10.006.PubMedView ArticleGoogle Scholar
- Vanácová S, Yan W, Carlton JM, Johnson PJ: Spliceosomal introns in the deep-branching eukaryote Trichomonas vaginalis. Proc Natl Acad Sci USA. 2005, 102: 4430-4435. 10.1073/pnas.0407500102.PubMedPubMed CentralView ArticleGoogle Scholar
- Collins JE, Wright CL, Edwards CA, Davis MP, Grinham JA, Cole CG, Goward ME, Aguado B, Mallya M, Mokrab Y, Huckle EJ, Beare DM, Dunham I: A genome annotation-driven approach to cloning the human ORFeome. Genome Biol. 2004, 5: R84-10.1186/gb-2004-5-10-r84.PubMedPubMed CentralView ArticleGoogle Scholar
- Stajich JE, Dietrich FS, Roy SW: Comparative genomic analysis of fungal genomes reveals intron-rich ancestors. Genome Biol. 2007, 8: R223-10.1186/gb-2007-8-10-r223.PubMedPubMed CentralView ArticleGoogle Scholar
- Johnson JM, Castle J, Garrett-Engele P, Kan Z, Loerch PM, Armour CD, Santos R, Schadt EE, Stoughton R, Shoemaker DD: Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays. Science. 2003, 302: 2141-2144. 10.1126/science.1090100.PubMedView ArticleGoogle Scholar
- Kim E, Magen A, Ast G: Different levels of alternative splicing among eukaryotes. Nucleic Acids Res. 2007, 35: 125-131. 10.1093/nar/gkl924.PubMedPubMed CentralView ArticleGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.PubMedView ArticleGoogle Scholar
- Srebrow A, Kornblihtt AR: The connection between splicing and cancer. J Cell Sci. 2006, 119: 2635-2641. 10.1242/jcs.03053.PubMedView ArticleGoogle Scholar
- Venables JP: Unbalanced alternative splicing and its significance in cancer. Bioessays. 2006, 28: 378-386. 10.1002/bies.20390.PubMedView ArticleGoogle Scholar
- Habara Y, Urushiyama S, Tani T, Ohshima Y: The fission yeast prp10(+) gene involved in pre-mRNA splicing encodes a homologue of highly conserved splicing factor, SAP155. Nucleic Acids Res. 1998, 26: 5662-5669. 10.1093/nar/26.24.5662.PubMedPubMed CentralView ArticleGoogle Scholar
- Preker PJ, Kim KS, Guthrie C: Expression of the essential mRNA export factor Yra1p is autoregulated by a splicing-dependent mechanism. RNA. 2002, 8: 969-980. 10.1017/S1355838202020046.PubMedPubMed CentralView ArticleGoogle Scholar
- Juneau K, Nislow C, Davis RW: Alternative splicing of PTC7 in Saccharomyces cerevisiae determines protein localization. Genetics. 2009, 183: 185-194. 10.1534/genetics.109.105155.PubMedPubMed CentralView ArticleGoogle Scholar
- Davis CA, Grate L, Spingola M, Ares MJ: Test of intron predictions reveals novel splice sites, alternatively spliced mRNAs and new introns in meiotically regulated genes of yeast. Nucleic Acids Res. 2000, 28: 1700-1706. 10.1093/nar/28.8.1700.PubMedPubMed CentralView ArticleGoogle Scholar
- Rodríguez-Navarro S, Igual JC, Pérez-Ortín JE: SRC1: an intron-containing yeast gene involved in sister chromatid segregation. Yeast. 2002, 19: 43-54. 10.1002/yea.803.PubMedView ArticleGoogle Scholar
- Miura F, Kawaguchi N, Sese J, Toyoda A, Hattori M, Morishita S, Ito T: A large-scale full-length cDNA analysis to explore the budding yeast transcriptome. Proc Natl Acad Sci USA. 2006, 103: 17846-17851. 10.1073/pnas.0605645103.PubMedPubMed CentralView ArticleGoogle Scholar
- David L, Huber W, Granovskaia M, Toedling J, Palm CJ, Bofkin L, Jones T, Davis RW, Steinmetz LM: A high-resolution map of transcription in the yeast genome. Proc Natl Acad Sci USA. 2006, 103: 5320-5325. 10.1073/pnas.0601091103.PubMedPubMed CentralView ArticleGoogle Scholar
- Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M: The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008, 320: 1344-1349. 10.1126/science.1158441.PubMedPubMed CentralView ArticleGoogle Scholar
- Wilhelm BT, Marguerat S, Watt S, Schubert F, Wood V, Goodhead I, Penkett CJ, Rogers J, Bähler J: Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature. 2008, 453: 1239-1243. 10.1038/nature07002.PubMedView ArticleGoogle Scholar
- Roy SW, Irimia M: Intron mis-splicing: no alternative?. Genome Biol. 2008, 9: 208-10.1186/gb-2008-9-2-208.PubMedPubMed CentralView ArticleGoogle Scholar
- Sayani S, Janis M, Lee CY, Toesca I, Chanfreau GF: Widespread impact of nonsense-mediated mRNA decay on the yeast intronome. Mol Cell. 2008, 31: 360-370. 10.1016/j.molcel.2008.07.005.PubMedPubMed CentralView ArticleGoogle Scholar
- Pleiss JA, Whitworth GB, Bergkessel M, Guthrie C: Rapid, transcript-specific changes in splicing in response to environmental stress. Mol Cell. 2007, 27: 928-937. 10.1016/j.molcel.2007.07.018.PubMedPubMed CentralView ArticleGoogle Scholar
- Engebrecht JA, Voelkel-Meiman K, Roeder GS: Meiosis-specific RNA splicing in yeast. Cell. 1991, 66: 1257-1268. 10.1016/0092-8674(91)90047-3.PubMedView ArticleGoogle Scholar
- Nakagawa T, Ogawa H: The Saccharomyces cerevisiae MER3 gene, encoding a novel helicase-like protein, is required for crossover control in meiosis. EMBO J. 1999, 18: 5714-5723. 10.1093/emboj/18.20.5714.PubMedPubMed CentralView ArticleGoogle Scholar
- Vilardell J, Chartrand P, Singer RH, Warner JR: The odyssey of a regulated transcript. RNA. 2000, 6: 1773-1780. 10.1017/S135583820000145X.PubMedPubMed CentralView ArticleGoogle Scholar
- Preker PJ, Guthrie C: Autoregulation of the mRNA export factor Yra1p requires inefficient splicing of its pre-mRNA. RNA. 2006, 12: 994-1006. 10.1261/rna.6706.PubMedPubMed CentralView ArticleGoogle Scholar
- Dong S, Li C, Zenklusen D, Singer RH, Jacobson A, He F: YRA1 autoregulation requires nuclear export and cytoplasmic Edc3p-mediated degradation of its pre-mRNA. Mol Cell. 2007, 25: 559-573. 10.1016/j.molcel.2007.01.012.PubMedPubMed CentralView ArticleGoogle Scholar
- Behm-Ansmant I, Kashima I, Rehwinkel J, Saulière J, Wittkopp N, Izaurralde E: mRNA quality control: an ancient machinery recognizes and degrades mRNAs with nonsense codons. FEBS Lett. 2007, 581: 2845-2853. 10.1016/j.febslet.2007.05.027.PubMedView ArticleGoogle Scholar
- Rehwinkel J, Raes J, Izaurralde E: Nonsense-mediated mRNA decay: Target genes and functional diversification of effectors. Trends Biochem Sci. 2006, 31: 639-646. 10.1016/j.tibs.2006.09.005.PubMedView ArticleGoogle Scholar
- Stalder L, Mühlemann O: The meaning of nonsense. Trends Cell Biol. 2008, 18: 315-321. 10.1016/j.tcb.2008.04.005.PubMedView ArticleGoogle Scholar
- Kertész S, Kerényi Z, Mérai Z, Bartos I, Pálfy T, Barta E, Silhavy D: Both introns and long 3'-UTRs operate as cis-acting elements to trigger nonsense-mediated decay in plants. Nucleic Acids Res. 2006, 34: 6147-6157. 10.1093/nar/gkl737.PubMedPubMed CentralView ArticleGoogle Scholar
- Kerényi Z, Mérai Z, Hiripi L, Benkovics A, Gyula P, Lacomme C, Barta E, Nagy F, Silhavy D: Inter-kingdom conservation of mechanism of nonsense-mediated mRNA decay. EMBO J. 2008, 27: 1585-1595. 10.1038/emboj.2008.88.PubMedPubMed CentralView ArticleGoogle Scholar
- Amrani N, Ganesan R, Kervestin S, Mangus DA, Ghosh S, Jacobson A: A faux 3'-UTR promotes aberrant termination and triggers nonsense-mediated mRNA decay. Nature. 2004, 432: 112-118. 10.1038/nature03060.PubMedView ArticleGoogle Scholar
- Mühlemann O, Eberle AB, Stalder L, Zamudio Orozco R: Recognition and elimination of nonsense mRNA. Biochim Biophys Acta. 2008, 1779: 538-549.PubMedView ArticleGoogle Scholar
- Jaillon O, Bouhouche K, Gout JF, Aury JM, Noel B, Saudemont B, Nowacki M, Serrano V, Porcel BM, Ségurens B, Le Mouël A, Lepère G, Schächter V, Bétermier M, Cohen J, Wincker P, Sperling L, Duret L, Meyer E: Translational control of intron splicing in eukaryotes. Nature. 2008, 451: 359-362. 10.1038/nature06495.PubMedView ArticleGoogle Scholar
- Desfougères T, Haddouche R, Fudalej F, Neuvéglise C, Nicaud J: SOA genes encode proteins controlling lipase expression in response to triacylglycerol utilization in the yeast Yarrowia lipolytica. FEMS Yeast Res. 2009, 10: 93-103. 10.1111/j.1567-1364.2009.00590.x.PubMedView ArticleGoogle Scholar
- Génolevures Database. [http://www.genolevures.org/yali.html]
- Kupfer DM, Drabenstot SD, Buchanan KL, Lai H, Zhu H, Dyer DW, Roe BA, Murphy JW: Introns and splicing elements of five diverse fungi. Eukaryot Cell. 2004, 3: 1088-1100. 10.1128/EC.3.5.1088-1100.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Spingola M, Grate L, Haussler D, Ares MJ: Genome-wide bioinformatic and molecular analysis of introns in Saccharomyces cerevisiae. RNA. 1999, 5: 221-234. 10.1017/S1355838299981682.PubMedPubMed CentralView ArticleGoogle Scholar
- Lin K, Zhang D: The excess of 5' introns in eukaryotic genomes. Nucleic Acids Res. 2005, 33: 6522-6527. 10.1093/nar/gki970.PubMedPubMed CentralView ArticleGoogle Scholar
- Mourier T, Jeffares DC: Eukaryotic intron loss. Science. 2003, 300: 1393-10.1126/science.1080559.PubMedView ArticleGoogle Scholar
- Hong X, Scofield DG, Lynch M: Intron size, abundance, and distribution within untranslated regions of genes. Mol Biol Evol. 2006, 23: 2392-2404. 10.1093/molbev/msl111.PubMedView ArticleGoogle Scholar
- Schwartz SH, Silva J, Burstein D, Pupko T, Eyras E, Ast G: Large-scale comparative analysis of splicing signals and their corresponding splicing factors in eukaryotes. Genome Res. 2008, 18: 88-103. 10.1101/gr.6818908.PubMedPubMed CentralView ArticleGoogle Scholar
- Irimia M, Roy SW: Evolutionary convergence on highly-conserved 3' intron structures in intron-poor eukaryotes and insights into the ancestral eukaryotic genome. PLoS Genet. 2008, 4: e1000148-10.1371/journal.pgen.1000148.PubMedPubMed CentralView ArticleGoogle Scholar
- Irimia M, Penny D, Roy SW: Coevolution of genomic intron number and splice sites. Trends Genet. 2007, 23: 321-325. 10.1016/j.tig.2007.04.001.PubMedView ArticleGoogle Scholar
- Lopez PJ, Séraphin B: Genomic-scale quantitative analysis of yeast pre-mRNA splicing: implications for splice-site recognition. RNA. 1999, 5: 1135-1137. 10.1017/S135583829999091X.PubMedPubMed CentralView ArticleGoogle Scholar
- Génosplicing. [http://genome.jouy.inra.fr/genosplicing/index.html]
- Fedorov A, Suboch G, Bujakov M, Fedorova L: Analysis of nonuniformity in intron phase distribution. Nucleic Acids Res. 1992, 20: 2553-2557. 10.1093/nar/20.10.2553.PubMedPubMed CentralView ArticleGoogle Scholar
- Ruvinsky A, Eskesen ST, Eskesen FN, Hurst LD: Can codon usage bias explain intron phase distributions and exon symmetry?. J Mol Evol. 2005, 60: 99-104. 10.1007/s00239-004-0032-9.PubMedView ArticleGoogle Scholar
- Long M, de Souza SJ, Rosenberg C, Gilbert W: Relationship between "proto-splice sites" and intron phases: evidence from dicodon analysis. Proc Natl Acad Sci USA. 1998, 95: 219-223. 10.1073/pnas.95.1.219.PubMedPubMed CentralView ArticleGoogle Scholar
- Whamond GS, Thornton JM: An analysis of intron positions in relation to nucleotides, amino acids, and protein secondary structure. J Mol Biol. 2006, 359: 238-247. 10.1016/j.jmb.2006.03.029.PubMedView ArticleGoogle Scholar
- Long M, de Souza SJ, Gilbert W: The yeast splice site revisited: new exon consensus from genomic analysis. Cell. 1997, 91: 739-740. 10.1016/S0092-8674(00)80462-6.PubMedView ArticleGoogle Scholar
- Leeds P, Wood JM, Lee BS, Culbertson MR: Gene products that promote mRNA turnover in Saccharomyces cerevisiae. Mol Cell Biol. 1992, 12: 2165-2177.PubMedPubMed CentralView ArticleGoogle Scholar
- Cui Y, Hagan KW, Zhang S, Peltz SW: Identification and characterization of genes that are required for the accelerated degradation of mRNAs containing a premature translational termination codon. Genes Dev. 1995, 9: 423-436. 10.1101/gad.9.4.423.PubMedView ArticleGoogle Scholar
- Howe KJ, Kane CM, Ares MJ: Perturbation of transcription elongation influences the fidelity of internal exon inclusion in Saccharomyces cerevisiae. RNA. 2003, 9: 993-1006. 10.1261/rna.5390803.PubMedPubMed CentralView ArticleGoogle Scholar
- Romfo CM, Alvarez CJ, van Heeckeren WJ, Webb CJ, Wise JA: Evidence for splice site pairing via intron definition in Schizosaccharomyces pombe. Mol Cell Biol. 2000, 20: 7955-7970. 10.1128/MCB.20.21.7955-7970.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- McGuire AM, Pearson MD, Neafsey DE, Galagan JE: Cross-kingdom patterns of alternative splicing and splice recognition. Genome Biol. 2008, 9: R50-10.1186/gb-2008-9-3-r50.PubMedPubMed CentralView ArticleGoogle Scholar
- Loftus BJ, Fung E, Roncaglia P, Rowley D, Amedeo P, Bruno D, Vamathevan J, Miranda M, Anderson IJ, Fraser JA, Allen JE, Bosdet IE, Brent MR, Chiu R, Doering TL, Donlin MJ, D'Souza CA, Fox DS, Grinberg V, Fu J, Fukushima M, Haas BJ, Huang JC, Janbon G, Jones SJ, Koo HL, Krzywinski MI, Kwon-Chung JK, Lengeler KB, Maiti R, et al: The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science. 2005, 307: 1321-1324. 10.1126/science.1103773.PubMedPubMed CentralView ArticleGoogle Scholar
- Ebbole DJ, Jin Y, Thon M, Pan H, Bhattarai E, Thomas T, Dean R: Gene discovery and gene expression in the rice blast fungus, Magnaporthe grisea: analysis of expressed sequence tags. Mol Plant Microbe Interact. 2004, 17: 1337-1347. 10.1094/MPMI.2004.17.12.1337.PubMedView ArticleGoogle Scholar
- Galagan JE, Henn MR, Ma L, Cuomo CA, Birren B: Genomics of the fungal kingdom: insights into eukaryotic biology. Genome Res. 2005, 15: 1620-1631. 10.1101/gr.3767105.PubMedView ArticleGoogle Scholar
- Goguel V, Rosbash M: Splice site choice and splicing efficiency are positively influenced by pre-mRNA intramolecular base pairing in yeast. Cell. 1993, 72: 893-901. 10.1016/0092-8674(93)90578-E.PubMedView ArticleGoogle Scholar
- Rogic S, Montpetit B, Hoos HH, Mackworth AK, Ouellette BF, Hieter P: Correlation between the secondary structure of pre-mRNA introns and the efficiency of splicing in Saccharomyces cerevisiae. BMC Genomics. 2008, 9: 355-10.1186/1471-2164-9-355.PubMedPubMed CentralView ArticleGoogle Scholar
- Görnemann J, Kotovic KM, Hujer K, Neugebauer KM: Cotranscriptional spliceosome assembly occurs in a stepwise fashion and requires the cap binding complex. Mol Cell. 2005, 19: 53-63. 10.1016/j.molcel.2005.05.007.PubMedView ArticleGoogle Scholar
- Kornblihtt AR, de la Mata M, Fededa JP, Munoz MJ, Nogues G: Multiple links between transcription and splicing. RNA. 2004, 10: 1489-1498. 10.1261/rna.7100104.PubMedPubMed CentralView ArticleGoogle Scholar
- Kornblihtt AR: Promoter usage and alternative splicing. Curr Opin Cell Biol. 2005, 17: 262-268. 10.1016/j.ceb.2005.04.014.PubMedView ArticleGoogle Scholar
- ImageJ. [http://rsbweb.nih.gov/ij/index.html]
- Zimmermann M, Fournier P: Electrophoretic karyotyping of yeasts. Nonconventional Yeasts in Biotechnology. 1996, Berlin, Germany: Springer-Verlag, 101-116. Wolf KView ArticleGoogle Scholar
- Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: a Laboratory Manual. 1989, Cold Spring Harbor NY: Cold Spring Harbor Laboratory Press, 2Google Scholar
- Fickers P, Le Dall MT, Gaillardin C, Thonart P, Nicaud JM: New disruption cassettes for rapid gene disruption and marker rescue in the yeast Yarrowia lipolytica. J Microbiol Methods. 2003, 55: 727-737. 10.1016/j.mimet.2003.07.003.PubMedView ArticleGoogle Scholar
- Le Dall MT, Nicaud JM, Gaillardin C: Multiple-copy integration in the yeast Yarrowia lipolytica. Curr Genet. 1994, 26: 38-44. 10.1007/BF00326302.PubMedView ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.PubMedView ArticleGoogle Scholar
- Crooks GE, Hon G, Chandonia J, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.PubMedPubMed CentralView ArticleGoogle Scholar
- Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18: 6097-6100. 10.1093/nar/18.20.6097.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.