Insights into snoRNA biogenesis and processing from PAR-CLIP of snoRNA core proteins and small RNA sequencing
- Shivendra Kishore†1,
- Andreas R Gruber†1,
- Dominik J Jedlinski1,
- Afzal P Syed1,
- Hadi Jorjani1 and
- Mihaela Zavolan1Email author
© Kishore et al.; licensee BioMed Central Ltd. 2012
Received: 18 March 2013
Accepted: 26 May 2013
Published: 26 May 2013
In recent years, a variety of small RNAs derived from other RNAs with well-known functions such as tRNAs and snoRNAs, have been identified. The functional relevance of these RNAs is largely unknown. To gain insight into the complexity of snoRNA processing and the functional relevance of snoRNA-derived small RNAs, we sequence long and short RNAs, small RNAs that co-precipitate with the Argonaute 2 protein and RNA fragments obtained in photoreactive nucleotide-enhanced crosslinking and immunoprecipitation (PAR-CLIP) of core snoRNA-associated proteins.
Analysis of these data sets reveals that many loci in the human genome reproducibly give rise to C/D box-like snoRNAs, whose expression and evolutionary conservation are typically less pronounced relative to the snoRNAs that are currently cataloged. We further find that virtually all C/D box snoRNAs are specifically processed inside the regions of terminal complementarity, retaining in the mature form only 4-5 nucleotides upstream of the C box and 2-5 nucleotides downstream of the D box. Sequencing of the total and Argonaute 2-associated populations of small RNAs reveals that despite their cellular abundance, C/D box-derived small RNAs are not efficiently incorporated into the Ago2 protein.
We conclude that the human genome encodes a large number of snoRNAs that are processed along the canonical pathway and expressed at relatively low levels. Generation of snoRNA-derived processing products with alternative, particularly miRNA-like, functions appears to be uncommon.
Small nucleolar RNAs (snoRNAs) are a specific class of small non-protein coding RNAs that are best known for their function as guides of modifications (2'-O-methylation and pseudouridylation) of other non-protein coding RNAs such as ribosomal, small nuclear and transfer RNAs (rRNAs, snRNAs and tRNAs, respectively) [1–3]. Based on sequence and structural features, snoRNAs are divided into two classes. C/D box snoRNAs share the consensus C (RUGAUGA, R = A or G) and D (CUGA) box motifs, which are brought into close proximity by short regions of complementarity between the snoRNA 5' and 3' ends [4, 5] and are bound by the four core proteins of the small ribonucleoprotein complex (snoRNP), namely 15.5K, NOP56, NOP58 and Fibrillarin (FBL) [6–8] during snoRNA maturation. Fibrillarin is the methyltransferase that catalyzes the 2'-O-methylation of the ribose in target RNAs . Most C/D box snoRNAs also contain additional conserved C' and D' motifs located in the central region of the snoRNA. The other class of snoRNAs is defined by a double-hairpin structure with two single-stranded H (ANANNA, N = A, C, G or U) and ACA box domains , and are therefore called H/ACA box snoRNAs. They associate with four conserved proteins, Dyskerin (DKC1), Nhp2, Nop10 and Gar1, to form snoRNPs that are functionally active in pseudouridylation. Although all four H/ACA proteins are necessary for efficient pseudouridylation , it is Dyskerin that provides the pseudouridine synthase activity . While H/ACA and C/D box snoRNAs accumulate in the nucleolus, some snoRNAs reside in the nucleoplasmic Cajal bodies (CBs) where they guide modifications of snRNAs  and are called small Cajal body-specific RNAs (scaRNAs). In addition to the typical H/ACA snoRNA features, vertebrate H/ACA box scaRNAs carry a CB localization signal called CAB box (UGAG) in the loop of their 5' and/or 3' hairpins .
Immediately upstream of the D box and/or the D' box, C/D box snoRNAs contain 10 to 21 nucleotide-long antisense elements that are complementary to sites in their target RNAs [13–15]. The nucleotide in the target RNA that is complementary to the fifth nucleotide upstream from the D/D' box of the snoRNA is targeted for 2'-O-methylation by the snoRNP [14, 15]. H/ACA box snoRNAs contain two antisense elements termed pseudouridylation pockets, located in the 5' and 3' hairpin domains of the snoRNA [16, 17]. Substrate uridines are selected through base-pairing interactions between the pseudouridylation pocket and target RNA sequences that flank the targeted uridine.
Deep-sequencing studies revealed a surprising diversity of small RNAs derived from non-coding RNAs (ncRNAs) known as small derived RNAs (sdRNAs) with well-established functions such as tRNAs [18, 19], Y RNAs , vault RNAs , ribosomal RNAs , spliceosomal RNAs  and snoRNAs [24–26]. In fact, the profile of sequenced reads observed for some of these small RNA species are very characteristic and have even been used for ncRNA gene finding based on sequencing data [27, 28]. The majority of C/D box and H/ACA snoRNAs seems to be extensively processed, producing stable small RNAs from the termini of the mature snoRNA  and the processing pattern is conserved across cell types . Thus, it appears that snoRNAs are versatile molecules that give rise to snoRNA-derived miRNAs [24, 31], other small RNAs [25, 29] or longer processing fragments .
To gain insight into the complexity of snoRNA processing and the functional relevance of the derived sdRNAs, we undertook a comprehensive characterization of products generated from snoRNA loci, combining high-throughput sequencing of long and short RNA fragments with photoactivatable-ribonucleoside-enhanced cross-linking and immunoprecipitation (PAR-CLIP) of core snoRNA-associated proteins and with data from Argonaute 2 (Ago2) immunoprecipitation sequencing (IP-seq) experiments. We found that many loci in the human genome can give rise to C/D box-like snoRNAs. Among the novel snoRNAs that we identified are very short sequences, extending little beyond the C and D boxes, which are essential for the binding of core snoRNA proteins. Compared to the snoRNAs that are already known, the novel snoRNA candidates exhibit a lower level of evolutionary conservation and a lower expression level. These findings indicate that the C/D box snoRNA structure evolves relatively easily and that C/D box snoRNA-like molecules are produced from many more genomic loci than are currently annotated. We further found that C/D box snoRNAs are very specifically processed inside the regions of terminal complementarity, retaining in the mature form only four to five nucleotides upstream of the C box and two to five nucleotides downstream of the D box. Sequencing of the small RNA population as well as of the small RNAs isolated after Ago2 immunoprecipitation revealed that despite their cellular abundance, C/D box-derived small RNAs are not efficiently incorporated into the Ago2 protein. Our extensive data thus indicate that, contrary to previous suggestions [25, 33], snoRNA-derived small RNAs that carry out non-canonical, particularly miRNA-like, functions are rare.
PAR-CLIP of C/D box and H/ACA box snoRNP core proteins identifies their RNA binding partners
Summary of CLIPZ mapping statistics and annotation categories for PAR-CLIP samples.
NOP58 rep A
NOP58 rep B
Ago2 rep A
HuR rep A
T→C mutations among all observed mutations
Annotation summary of the top 200 clusters inferred from PAR-CLIP experiments with snoRNA core proteins.
C/D box snoRNAs
H/ACA box snoRNAs
NOP58 rep A
NOP58 rep B
Ago2 rep A
HuR rep A
Binding patterns of core proteins on snoRNAs
We further asked whether the binding pattern of Fibrillarin reflected in the abundance of CLIP reads differs between guide regions of the snoRNAs that have a target annotated in snoRNA-LBME-db and orphan guide regions. For guide regions, we took the nine nucleotides upstream of the D and D' boxes and as a reference we compared the coverage of the D and D' boxes themselves (Figure 1F). We found that guide regions with a known target and their associated D/D' boxes generally have a higher coverage compared to those that are orphan (70% compared to 40% positive z-scores of the average coverage per position in the guide region relative to the entire snoRNA, Figure 1G). This could indicate that the binding to the target renders the snoRNA-core protein complex more accessible to cross-linking.
For H/ACA box snoRNAs we found that Dyskerin strongly prefers the H box nucleotides (Figure 1G), which in 70% of the snoRNAs have a positive z-score for coverage compared to the entire snoRNA. This is expected because these snoRNAs are highly structured, with most nucleotides being engaged in base pairs in the two hairpin stems and a few nucleotides are free to interact with the proteins.
Identification of novel snoRNA genes from PAR-CLIP and small RNA sequencing
We screened the top 500 clusters from each PAR-CLIP library that did not overlap with known ncRNAs, mRNAs or repeat elements for potentially novel snoRNA genes. To identify H/ACA box genes we employed the SnoReport program , while for C/D box snoRNA detection we applied a custom approach searching for a C box motif (RUGAUGA, R = A or G; allowing one mismatch) at the 5' end and a D box motif (MUGA, M = A or C) at the 3' end, requiring that a terminal stem of at least four canonical base pairs can be formed by the nucleotides flanking the C and D boxes. We combined these computational screens with isolation and sequencing of the 20 to 200 nucleotide RNA fraction from HEK293 cells, which provides evidence for expression of the predicted snoRNAs. Requiring a minimal average coverage per nucleotide of at least 1 tag per million (TPM) in least one type-specific CLIP library as well as in the small RNA-seq library, we identified 77 and 20 putative C/D and H/ACA box snoRNAs, respectively (Additional files 2 and 3). We additionally screened 14 distinct small RNA sequence libraries from the recently released ENCODE data  and found that more than 75% of our putative C/D box snoRNAs were detected in at least one cell type other than HEK293 (see Additional file 4). We further tested the expression of the 20 most abundantly sequenced candidate snoRNAs by Northern blotting (see Additional file 5). Nine of the twenty candidates were also detectable in this assay, while an additional nine C/D box snoRNAs are supported by the ENCODE data (see Additional file 4).
To determine whether the candidates we identified as described are entirely novel snoRNA genes or so far undescribed homologs of known snoRNAs, we performed a BLAST search against the snoRNA genes from snoRNA-LBME-db (requiring an E-value ≤ 10-3). We further compared the loci of the putative snoRNAs with the snoRNA annotation available in ENSEMBL release 65 , which is based on automatic annotation with sequence/structure models available in the Rfam database . Out of the 20 H/ACA box snoRNA candidates, 18 show sequence or structural homology to known snoRNAs, while candidates ZL4 (annotated as nc053 in , but not classified as a snoRNA by the authors) and ZL36 appear to be novel H/ACA box snoRNAs without a known homolog. The homology search additionally revealed that ZL4 is conserved until Xenopus tropicalis.
Of the 77 C/D box snoRNAs, only seven showed sequence homology to known C/D box snoRNA genes, but in one case (ZL1) the homology consisted solely of a long GU-rich region. The evolutionary conservation of the guide regions of five of these snoRNAs (ZL11, ZL109, ZL126, ZL127 and ZL132) suggests that they target the same nucleotides on ribosomal RNA as their homologs. A sixth snoRNA, ZL142, appears to be a human homolog of the GGN68 snoRNA of chickens [43, 44]. An additional comparison with the results of another large snoRNA analysis , revealed that ZL2 and ZL107 have been previously described as SNORD41B and Z39, respectively. In order to further characterize the 69 potentially novel C/D box snoRNAs (including ZL1, which only had homology with a known snoRNA in a GU-rich region), we first asked whether their C and D boxes are evolutionarily conserved (Additional file 1). To this end, we computed their average position-wise phastCons scores , which we obtained from the UCSC genome browser. Five candidates including ZL1 showed an average phastCons score per nucleotide higher than 0.25 for C and D box nucleotides. A comprehensive homology search of vertebrate genomes allowed us to trace the evolutionary origin of these snoRNAs and to annotate C' and D' boxes as well as putative guide regions based on sequence conservation. ZL1 is highly conserved in vertebrates including Petromyzon marinus, while for ZL5, ZL6, ZL8 and ZL24 we were not able to retrieve any homologs outside of mammals.
Our screen could further identify a snoRNA with mixed C/D box and H/ACA box structure. SCARNA21, a computationally predicted H/ACA box snoRNA , is surrounded by conserved C and D box elements enclosed by a terminal stem structure (Additional file 7). Northern blot analysis revealed that the prevalent form in the cells is the one that contains the C/D box elements and not the short form, which would be the single H/ACA box snoRNA.
Target prediction for newly identified snoRNA genes
We were especially interested to find out whether the non-conserved C/D box-like snoRNAs and in particular the mini-snoRNAs, could guide 2'-O-methylations. To this end, we took a simple approach searching for 8-mer Watson-Crick complementarity between the putative guide regions upstream of the D boxes to ribosomal and spliceosomal RNAs. We did indeed identify seven putative interaction sites, but none of these are known modification sites (Additional file 2). Thus, the targets of these C/D box-like snoRNAs remain to be identified.
Non-canonical RNA partners of core snoRNA proteins
Although snoRNAs are best known for guiding modifications of rRNAs, snRNAs and tRNAs [1–3], some evidence has emerged for the involvement of full-length mature snoRNAs also in other biological processes such as alternative splicing . To investigate this possibility, we searched our PAR-CLIP data sets for RNAs that were abundantly cross-linked, yet not known to associate with the core snoRNA proteins. In contrast to the HuR PAR-CLIP that we performed before , the PAR-CLIP experiments conducted with C/D box snoRNP core proteins repeatedly identified several non-coding RNAs including vault RNA 1-2, 7SK RNA and 7SL RNA as well as H/ACA box snoRNAs. Similarly, in the Dyskerin PAR-CLIP we observed cross-linking of several C/D box snoRNAs.
We performed primer extension experiments to determine potential sites for 2'-O-methyl and pseudouridine modification in prominent ncRNAs such as 7SK RNA, 7SL RNA and vault RNA 1-2 (see Additional file 9 for primer extension assays and Additional file 10 for a catalog of identified modifications sites and target predictions). Indeed, we found that all three of these RNA species carry modifications. Vault RNA 1-2 contains four 2'-O-methyl sites, 7SK RNA carries at least six 2'-O-methyl sites and one pseudouridylation site, and 7SL RNA contains several sites of pseudouridylation. Additionally, we sought to determine whether C/D box and H/ACA box snoRNAs guide modifications on each other. We thus performed 2'-O-methylation primer extension assays on SNORA61 and pseudouridylation assays on SNORD16 and SNORD35A. We found that SNORA61 potentially carries one 2'-O-methylation, while SNORD16 and SNORD35A carry two and six pseudouridylated residues, respectively. To identify C/D box snoRNAs that could guide the observed 2'-O-methylations, we searched for 8-mer complementarity upstream of D and D' boxes of C/D box and C/D box-like snoRNAs, but we did not find sequences complementary to the modification sites. To predict guiding H/ACA box snoRNAs we employed the program RNAsnoop using stringent filtering criteria. We identified potential guiding H/ACA box snoRNAs for 7SK RNA residue Ψ250 and 7SL RNA residue Ψ226.
Previous studies reported that snoRNAs may function in alternative splicing [32, 51] and we also repeatedly observed cross-linking of C/D box core proteins to regions that are annotated as exons of protein coding genes. To determine whether these mRNA regions are targeted by snoRNAs, we selected, from the top 1,000 clusters located in mRNA exons in NOP58 libraries, the 157 that were present in both NOP58 replicates and a third CLIP library with at least 10 TPM per nucleotide (Additional file 11). We identified complementarities to the 8-mer guide regions of snoRNAs in 79 of these clusters. In contrast, in shuffled CLIPed regions we only found 60 complementarities to snoRNA guide regions (average of 100 simulations on shuffled sequences). Thus, the mRNA sequences that we isolated in the CLIP experiments are consistent with the possibility that snoRNAs act as guides in some steps of mRNA processing.
snoRNA processing patterns
Functional annotation of sequencing reads obtained in sRNA sequencing and HeLa Ago2 IP sequencing.
HEK293 sRNA sequencing (18 to 30 nucleotides)
HeLa Ago2 immunoprecipitation sequencing (asynchronous cells)
HeLa Ago2 immunoprecipitation sequencing (mitotic cells)
C/D box snoRNAs
H/ACA box snoRNAs
Small RNAs derived from C/D box snoRNA termini appear to be abundant in the cells, and can be incorporated into Argonaute proteins to act as miRNAs . To determine the relative participation of various small RNA classes in the Argonaute-dependent gene silencing, we immunopurified Ago2 from HeLa cells and sequenced the associated small RNA fraction. We found that, as expected, miRNAs constitute the most abundant RNA class that associates with Ago2 (approximately 90%), while C/D box snoRNAs account only for 0.005% of the IP-seq reads (Table 3). Assuming that overall proportions of small RNAs derived from tRNAs and snoRNAs are fairly constant across cell types, we can estimate the efficiency with which small RNAs (from the total small RNA pool) are incorporated in the Argonaute proteins. We found, for example, that although small RNAs derived from tRNAs are 5.6 times more abundant than C/D box derived snoRNAs, tRNA fragments are 40 times more abundant in the Ago2-associated fraction. Thus, tRNA-derived small RNAs appear to be more efficiently incorporated in Ago2 than C/D box snoRNA fragments. This is consistent with observations that tRNAs are cleaved by nucleases such as Angiogenin and even Dicer to generate processing fragments that are active in translation regulation [54, 55]. Similarly, small RNAs derived from H/ACA box snoRNAs are 5.5 times less abundant than small RNAs derived from C/D box snoRNAs in the total RNA fraction, but are 5.2 times more efficiently picked up by Ago2. The H/ACA box snoRNA SCARNA15, which has been shown to be processed into smaller fragments that act as microRNAs , is represented in this library with 3,636 reads, 29% of all reads mapped to H/ACA box snoRNA loci (see Additional file 12 for a full listing of all snoRNAs). The C/D box snoRNA with the highest number of reads in the Ago2 IP library is SNORD1A with 1,140 reads, but the majority of C/D box snoRNAs are represented by less than 50 reads.
Of all categories of small RNAs, C/D box snoRNA fragments are those that show the strongest nuclear retention, and are found in the cytoplasm with only low frequency . Thus, this physical separation could account for the low frequency of association between C/D box snoRNA-derived RNAs and Ago2. We therefore wondered whether the association of this abundant class of RNA fragments with Ago2 increases in the mitotic phase of the cell cycle, when the nuclear membrane is dissolved. We collected HeLa cells that were in the mitotic phase through mitotic shake off, immunopurified Ago2 and again sequenced the Ago2-associated small RNA fraction. We found that, indeed, the relative abundance of C/D box-derived fragments in Argonaute increased in this condition (Table 3), to 0.054% relative to 0.005%. Nonetheless, these results indicate that C/D box snoRNAs do not generally carry out miRNA-like functions, and that the number of H/ACA box snoRNAs with a dual function is very limited.
To gain insight into the processing of snoRNAs and the functions of snoRNA-derived small RNAs, we performed PAR-CLIP experiments with snoRNP core proteins. Analysis of PAR-CLIP reads showed that C/D box core proteins Fibrillarin, NOP56 and NOP58 have a very similar binding pattern, overlapping with the box elements. Excluding snoRNA families SNORD113 to SNORD116, which are multi-copy families and do not have guide complementarity to rRNAs or snRNAs, snoRNA-LBME-db currently lists 153 C/D box snoRNAs, of which 40 and 78 have a guide region targeting a known modification at the D box and D' box, respectively. Evolutionary conservation profiles of the remaining putative guide regions suggest that most of them are not functional. In support of this concept, our analysis revealed that C/D box core proteins cross-linked more effectively to guide regions that are known to have a target compared to orphan guide regions.
Combining computational prediction with data from small RNA sequencing and PAR-CLIP we identified novel C/D and H/ACA box snoRNAs, and assigned guiding snoRNAs to several modifications on rRNAs and snRNAs that were previously described as orphans. In addition to these bona fide snoRNAs, we uncovered a group of C/D box-like snoRNAs that only have a C and a D box as opposed to the common C-D'-C'-D architecture. These C/D box-like snoRNAs are only weakly conserved and most of them are expressed at low levels. The unusual architecture and the weak evolutionary conservation are likely reasons why these RNA species have not been uncovered by computational ncRNA gene finders . Some of the identified C/D box-like snoRNAs are extremely short, one being only 27 nucleotides in length, leaving hardly enough space for a guide region. The requirements for C/D box snoRNA biogenesis appear to be simply the presence of C and D boxes and a short region of complementarity flanking these boxes, leading probably to the production of many snoRNA-like molecules as the C/D box core proteins scan intronic regions of pre-mRNAs. An interesting lead to follow in further investigating the potential function of the C/D box-like snoRNAs originating in the introns of many genes comes from a recent study conducted in Drosophila, in which Schubert and colleagues showed that snoRNAs are required for maintenance of higher-order structures of chromatin accessibility .
In our PAR-CLIP experiments we also repeatedly cross-linked ncRNAs that are not usual snoRNA targets. We observed H/ACA box snoRNAs in PAR-CLIP experiments targeting the C/D box core proteins. Vice versa, we found C/D box snoRNAs in the PAR-CLIP targeting Dyskerin, which is an essential component of H/ACA box snoRNPs. Primer extension assays indicated that these snoRNAs carry modifications that would be expected from the protein complexes to which they were cross-linked, but we were, in general, not able to identify snoRNAs that could guide these modifications. One drawback may be that in the case of the 2'-O-methyl primer extension assays we cannot be sure that it was indeed a 2'-O-methyl modification as opposed to any other nucleoside modification that caused the stoppage of the reverse transcriptase. However, we can be fairly certain that we identified bona fide pseudouridylation sites. Particularly, in the case of SNORD35A we were able to identify five putative pseudouridylated residues but no convincing guiding sequence in a known H/ACA box snoRNA. This suggests either that even more snoRNAs remain to be identified or that these pseudouridylations are caused by a protein-only mechanism not requiring guidance by H/ACA box snoRNAs.
The processing patterns of snoRNAs have raised substantial interest and some controversy in recent years [30, 32, 59]. We strikingly found that snoRNA excision out of the intron follows a well-defined pattern leaving mature snoRNAs with four to five nucleotides upstream of the C box, and two to five nucleotides downstream of the D box, irrespective of the length of the terminal closing stem. Our data support the observations of Darzacq and Kiss  that the terminal stem serves to bring the C and D box elements into close proximity so as to be more easily recognized by snoRNP proteins, which then protect the snoRNA from further trimming by the exosome, but may not be needed for the functional, mature snoRNA. This implies that the core proteins actively protect and stabilize the maturing snoRNA.
We further quantified the abundance of snoRNA-derived small RNAs in HEK293 cells, and consistent with other studies , we found that small RNAs derived from the ends of C/D box snoRNAs are indeed abundant. However, we did not find evidence that these sdRNAs efficiently associate with Ago2 to act as microRNAs, even in conditions when the accessibility of these sdRNAs to Ago2 should be higher, such as in mitotic cells. We thus conclude that a microRNA-like function of snoRNA-derived small RNAs is an exception rather than a rule. Most of the sdRNAs from C/D box snoRNAs originate from the termini of mature snoRNAs, and hence carry C and D box motifs. It might be that snoRNA core proteins are still attached to these fragments, protect them from total degradation, sequester them in the nucleus and prevent these sdRNAs from being loaded into Ago2.
Deep-sequencing-based studies revealed a very complex landscape of transcription and processing of RNAs. The non-canonical products identified initially in such studies raises the question of additional, yet unknown, functions of molecules that have been studied for many years. What has become apparent more recently, however, is that deep sequencing allows us to construct a very detailed picture of the kinetics of processing various classes of RNAs and of their interactions with proteins that protect them from degradation. Intersection of many data sets such as those generated in our study will eventually reveal kinetic and regulatory aspects of cellular processes at a fine level of detail.
Materials and methods
PAR-CLIP was performed with HEK293 Flp-In cells (Invitrogen). Cells were grown in thirty 15-cm cell culture plates per experiment to approximately 80% confluency. At 12 h before harvest, 4-thiouridine (Sigma) was added to the cells to a final concentration of 100 µM. PAR-CLIP was carried out as described previously . For immunoprecipitation, antibodies were coupled to protein-A or protein-G Dynabeads (Invitrogen). Antibodies used against endogenous proteins were α-NOP58 (sc-23705 from Santa Cruz Biotechnology), α-Dyskerin H-300 (sc-48794, Santa Cruz Biotechnology), α-Dyskerin C-15 (sc-26982, Santa Cruz Biotechnology) and α-Fibrillarin AFB01 monoclonal antibody line 72B9, lot 011 (from Cytoskeleton, Inc, AFB01). The α-Ago2 (11A9) monoclonal antibody was a gift from Gunter Meister. For PAR-CLIP with NOP56 we used a HEK293 cell line with a stably integrated FLAG-NOP56 fusion gene and IP was done with monoclonal α-FLAG antibody M2 from Sigma. For one Fibrillarin targeted PAR-CLIP the immunoprecipitated complexes were treated with micrococcal nuclease (MNase, from New England Biolabs) for 5 min at 37°C . After SDS-PAGE, gels were blotted onto nitrocellulose membranes to reduce the background from free RNAs . The PAR-CLIP libraries were prepared as described in Additional file 13 and submitted to deep sequencing on an Illumina HiSeq 2000.
The reads obtained from PAR-CLIP experiments were mapped to the human genome (hg19 assembly from UCSC, February 2009) and annotated with the CLIPZ server . Reads marked with the CLIPZ annotation categories 'fungal', 'bacterial,' or 'vector' were discarded and only reads that mapped uniquely to the genome were used in the analyses. The library size was scaled to 1,000,000 for all samples to obtain a normalized expression value (tags per million).
Small RNA sequencing
Small RNA sequencing libraries were prepared from size-selected RNAs of 18 to 30 nucleotides (sdRNA sequencing) and 20 to 200 nucleotides (sRNA sequencing). HEK293 total RNA was extracted and treated with DNase. Next, 20 units of T4 polynucleotide kinase and 2 µl of [γ-32P] ATP (10 µCi/µl) were used to radiolabel 10 µg of RNA at the 5'-ends. The RNA was separated together with a radiolabeled 20-nucleotide ladder on a 12% polyacrylamide gel, the bands corresponding to 18 to 30 nucleotides (for sdRNA sequencing libraries) or 20 to 200 nucleotides (for sRNA sequencing libraries) were excised, the RNA was extracted overnight in a 0.4-M NaCl solution and finally precipitated with ethanol. Small RNA libraries were prepared according to a published protocol  and sequenced on an Illumina HiSeq 2000 instrument, for 36 (sdRNA sequencing) and 150 cycles (sRNA sequenicng library). Adaptor removal was done with the CLIPZ server, and the mapping to the human genome was then done with the Segemehl software (v. 0.1.3) with parameters '-D 1 -A 90' . The Gene Expression Omnibus (GEO) accession number for the PAR-CLIP and sRNA-seq data is GSE43666.
Identification of novel C/D snoRNAs and H/ACA snoRNAs from PAR-CLIP and small RNA sequencing data
For each PAR-CLIP library we inferred binding regions of the proteins of interest by clustering reads whose corresponding loci were at most 25 nucleotides apart. To annotate known snoRNA and scaRNA genes we first retrieved sequences from the snoRNA-LBME-db , mapped them to the human genome (a list of motif and secondary structure annotated snoRNAs is available in Additional file 13). The 500 binding regions that accumulated the highest number of reads in each individual CLIP library, but did not overlap with known snoRNA or scaRNA genes, ncRNA genes or repeat elements, were screened for novel snoRNA candidates. We used SnoReport  to detect H/ACA box snoRNAs, while for detection of C/D box snoRNAs we searched for protein-binding regions that contained motifs corresponding to the C box (RTGATGA; allowing one mismatch) and to the two most common D box motifs (CTGA and ATGA). Sequences that contained both a C box and a D box motif were extended by ten nucleotides in order to search for a terminal closing stem. If a compact closing stem composed of at least four canonical base pairs with at least two G-C/C-G base pairs was found, the sequence was considered a snoRNA candidate. To evaluate the specificity of our C/D box snoRNA gene finding approach, we applied the same procedure to two types of clusters of PAR-CLIP reads from the NOP58 rep A sample both extended by 25 nucleotides on each side. First were the top 100 clusters (defined in terms of the number of reads associated with the cluster) that overlapped with C/D box snoRNA annotation, which served as a positive control. In this set, our program reported 80 sequences as putative snoRNAs. The second type of cluster contained the top 100 clusters that overlap with mRNA exon annotation. These should not contain snoRNAs, and indeed, we only obtained five putative C/D box snoRNAs candidates. Similarly low numbers of snoRNA candidates were obtained from randomized sequences (not shown). Altogether, these tests indicated that our method has very good specificity. In contrast, the number of predictions we obtained from CLIPed clusters without a known annotation was 11 for the top 100 such clusters.
Candidates that showed expression of at least 1 TPM per nucleotide in the 20 to 200 nucleotides small RNA sequencing run (only uniquely mapped reads that covered at least 50% of the candidate snoRNA sequence were considered), and had at least 1 TPM per nucleotide in at least one of the type-specific CLIP libraries were considered putative snoRNAs. They were consecutively numbered, and named as 'ZL#'. To further validate the newly found snoRNAs, we searched for evidence of expression in recently published small RNA-seq libraries from the ENCODE project . Files with the genome coordinates of mapped reads (BAM files) were obtained from the ENCODE data coordination center at UCSC  and uniquely mapping reads were used for the analysis. In addition, we selected the 20 candidate C/D box snoRNAs with the highest read count in our data for validation by Northern blotting (see Additional file 13 for details on the experiment). To evaluate the evolutionary conservation of the putative snoRNAs, we carried out a homology search against the vertebrate genomes available in the UCSC genome browser. Once an initial set of homologs was identified, we built sequence/structure models and continued to search for more distant homologs with the Infernal software .
Detection of 2'-O-ribose-methylated and pseudouridylated residues
To identify 2'-O-methylated residues we used a reverse transcriptase-based method coupled with polyacrylamide gel analysis as described in . The method is based on the observation that cDNA synthesis is noticeably impaired in the presence of a 2'-O-methyl when deoxynucleotide triphosphate fragments (dNTPs) are limiting [65, 66], giving rise to a characteristic pattern of gel banding immediately preceding the 2'-O-methyls, with strong bands at low dNTP concentrations (0.004 mM) , becoming weaker with increasing concentrations of dNTPs.
To map pseudouridines in candidate RNAs we used a method that relies on chemical modification of RNA bases with N-cyclohexyl-N'-β (4-methyl morpholinium) -ethylcarbodiimide (CMC) . The method involves carbodiimide adduct formation with U, G and pseudouridine followed by mild alkali treatment, which removes the adduct from U and G but not from the N-3 of pseudouridine. This modification results in the blockage of reverse transcription one residue 3' of the pseudouridine on the sequencing gel. For a detailed description of assays used to map 2'-O-methyls and pseudouridines see Additional file 13. As a proof of principle, we first applied these assays to the spliceosomal RNA U6, which is known to carry 2'-O-methylated and pseudouridylidated residues. In addition to the well-documented sites, we also observed novel 2'-O-methyl sites that have not been previously reported so far (Additional file 14).
To predict C/D box snoRNAs that could guide 2'-O-methylation, we searched for 8-mer complementarity (only canonical base pairs allowed) to regions immediately or one nucleotide upstream of the D and D' boxes of C/D box and C/D box-like snoRNAs. To predict H/ACA box snoRNAs that could guide pseudouridylations, we used the program RNAsnoop . We first determined for each H/ACA snoRNA stem an energy cutoff value by running simulations on 1,000 random sequences of length 100. Only if an RNAsnoop prediction had an energy value lower than 90% of the random sequences, and at least three canonical base pairs on each side of the binding pocket, did we consider it as a hit.
Ago2 immunoprecipitation sequencing of asynchronous and mitotic cells
Mitotic cells were collected using mitotic shake-off [68, 69], a technique based on the observation that cells become rounded and more easily detachable from the culture vessel as they progress into metaphase during mitosis . Details of the experimental setup are given in Additional file 13. To be able to confirm microscopically that we collected mitotic cells we used HeLa cells with the human histone H2B gene fused to green fluorescent protein (see Additional file 15).
Ago2 was immunoprecipitated from mitotic and asynchronous cells; the Ago2-associated RNAs were extracted and used to prepare cDNA libraries as described above , which were then submitted to deep sequencing. Adaptor removal was with the CLIPZ server, and reads were then mapped with Segemehl as described above. In the analysis of small RNA libraries (Ago2-IP and HEK293 sdRNA sequencing (18 to 30 nucleotides)), we considered both uniquely and multi-mapping reads that were annotated based on their mapping to genes in one of the following categories: tRNAs (from the UCSC Table Browser), microRNAs (from mirBase) and snRNAs (from ENSEMBL release 59), C/D box snoRNAs and H/ACA box snoRNAs (curated data set from this work).
cross-linking and immunoprecipitation
photoactivatable-ribonucleoside-enhanced cross-linking and immunoprecipitation
small Cajal body-specific RNA
small derived RNA
small nucleolar RNA
small nucleolar ribonucleoprotein
small nuclear RNA
tags per million
The authors thank Erich Nigg (Biozentrum) for providing the HeLa cells in which the histone H2B gene is fused to green fluorescent protein and for advice on the isolation of mitotic cells. We are grateful to Gunter Meister for the anti-Ago2 antibody and Ina Nissen and Christian Beisel from the Quantitative Genomics Facility of the D-BSSE of ETH Zurich, for help with deep sequencing. All computations were carried out on the [BC]2 HPC infrastructure at the University of Basel. Work in the Zavolan lab is supported by the University of Basel and the Swiss National Science Foundation (grant #31003A 127307). SK acknowledges the support of the Gebert Rüf Foundation Rare Diseases Program (grant GRS-046/11).
- Decatur W, Fournier M: rRNA modifications and ribosome function. Trends Biochem Sci. 2002, 27: 344-51. 10.1016/S0968-0004(02)02109-6.PubMedView ArticleGoogle Scholar
- Darzacq X, Jády B, Verheggen C, Kiss A, Bertrand E, Kiss T: Cajal body-specific small nuclear RNAs: a novel class of 2'-O-methylation and pseudouridylation guide RNAs. EMBO J. 2002, 21: 2746-56. 10.1093/emboj/21.11.2746.PubMedPubMed CentralView ArticleGoogle Scholar
- Clouet d'Orval B, Bortolin ML, Gaspin C, Bachellerie JP: Box C/D RNA guides for the ribose methylation of archaeal tRNAs. The tRNATrp intron guides the formation of two ribose-methylated nucleosides in the mature tRNATrp. Nucleic Acids Res. 2001, 29: 4518-29. 10.1093/nar/29.22.4518.PubMedView ArticleGoogle Scholar
- Tollervey D, Kiss T: Function and synthesis of small nucleolar RNAs. Curr Opin Cell Biol. 1997, 9: 337-42. 10.1016/S0955-0674(97)80005-1.PubMedView ArticleGoogle Scholar
- Darzacq X, Kiss T: Processing of intron-encoded box C/D small nucleolar RNAs lacking a 5',3'-terminal stem structure. Mol Cell Biol. 2000, 20: 4522-31. 10.1128/MCB.20.13.4522-4531.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Brown JW, Echeverria M, Qu LH: Plant snoRNAs: functional evolution and new modes of gene expression. Trends Plant Sci. 2003, 8: 42-9.PubMedView ArticleGoogle Scholar
- Kiss T: Small nucleolar RNA-guided post-transcriptional modification of cellular RNAs. EMBO J. 2001, 20: 3617-22. 10.1093/emboj/20.14.3617.PubMedPubMed CentralView ArticleGoogle Scholar
- McKeegan KS, Debieux CM, Boulon S, Bertrand E, Watkins NJ: A dynamic scaffold of pre-snoRNP factors facilitates human box C/D snoRNP assembly. Mol Cell Biol. 2007, 27: 6782-93. 10.1128/MCB.01097-07.PubMedPubMed CentralView ArticleGoogle Scholar
- Tollervey D, Lehtonen H, Jansen R, Kern H, Hurt EC: Temperature-sensitive mutations demonstrate roles for yeast fibrillarin in pre-rRNA processing, pre-rRNA methylation, and ribosome assembly. Cell. 1993, 72: 443-57. 10.1016/0092-8674(93)90120-F.PubMedView ArticleGoogle Scholar
- Kiss T, Fayet-Lebaron E, Jády BE: Box H/ACA small ribonucleoproteins. Mol Cell. 2010, 37: 597-606. 10.1016/j.molcel.2010.01.032.PubMedView ArticleGoogle Scholar
- Lafontaine DL, Bousquet-Antonelli C, Henry Y, Caizergues-Ferrer M, Tollervey D: The box H + ACA snoRNAs carry Cbf5p, the putative rRNA pseudouridine synthase. Genes Dev. 1998, 12: 527-37. 10.1101/gad.12.4.527.PubMedPubMed CentralView ArticleGoogle Scholar
- Richard P, Darzacq X, Bertrand E, Jády BE, Verheggen C, Kiss T: A common sequence motif determines the Cajal body-specific localization of box H/ACA scaRNAs. EMBO J. 2003, 22: 4283-93. 10.1093/emboj/cdg394.PubMedPubMed CentralView ArticleGoogle Scholar
- Nicoloso M, Qu LH, Michot B, Bachellerie JP: Intron-encoded, antisense small nucleolar RNAs: the characterization of nine novel species points to their direct role as guides for the 2'-O-ribose methylation of rRNAs. J Mol Biol. 1996, 260: 178-95. 10.1006/jmbi.1996.0391.PubMedView ArticleGoogle Scholar
- Kiss-László Z, Henry Y, Bachellerie JP, Caizergues-Ferrer M, Kiss T: Site-specific ribose methylation of preribosomal RNA: a novel function for small nucleolar RNAs. Cell. 1996, 85: 1077-88. 10.1016/S0092-8674(00)81308-2.PubMedView ArticleGoogle Scholar
- Cavaillé J, Nicoloso M, Bachellerie JP: Targeted ribose methylation of RNA in vivo directed by tailored antisense RNA guides. Nature. 1996, 383: 732-5. 10.1038/383732a0.PubMedView ArticleGoogle Scholar
- Ganot P, Bortolin ML, Kiss T: Site-specific pseudouridine formation in preribosomal RNA is guided by small nucleolar RNAs. Cell. 1997, 89: 799-809. 10.1016/S0092-8674(00)80263-9.PubMedView ArticleGoogle Scholar
- Bortolin ML, Ganot P, Kiss T: Elements essential for accumulation and function of small nucleolar RNAs directing site-specific pseudouridylation of ribosomal RNAs. EMBO J. 1999, 18: 457-69. 10.1093/emboj/18.2.457.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee Y, Shibata Y, Malhotra A, Dutta A: A novel class of small RNAs: tRNA-derived RNA fragments (tRFs). Genes Dev. 2009, 23: 2639-49. 10.1101/gad.1837609.PubMedPubMed CentralView ArticleGoogle Scholar
- Haussecker D, Huang Y, Lau A, Parameswaran P, Fire A, Kay M: Human tRNA-derived small RNAs in the global regulation of RNA silencing. RNA. 2010, 16: 673-95. 10.1261/rna.2000810.PubMedPubMed CentralView ArticleGoogle Scholar
- Nicolas F, Hall A, Csorba T, Turnbull C, Dalmay T: Biogenesis of Y RNA-derived small RNAs is independent of the microRNA pathway. FEBS Lett. 2012, 586: 1226-30. 10.1016/j.febslet.2012.03.026.PubMedView ArticleGoogle Scholar
- Persson H, Kvist A, Vallon-Christersson J, Medstrand P, Borg A, Rovira C: The non-coding RNA of the multidrug resistance-linked vault particle encodes multiple regulatory small RNAs. Nat Cell Biol. 2009, 11: 1268-71. 10.1038/ncb1972.PubMedView ArticleGoogle Scholar
- Zywicki M, Bakowska-Zywicka K, Polacek N: Revealing stable processing products from ribosome-associated small RNAs by deep-sequencing data analysis. Nucleic Acids Res. 2012, 40: 4013-24. 10.1093/nar/gks020.PubMedPubMed CentralView ArticleGoogle Scholar
- Kawaji H, Nakamura M, Takahashi Y, Sandelin A, Katayama S, Fukuda S, Daub C, Kai C, Kawai J, Yasuda J, Carninci P, Hayashizaki Y: Hidden layers of human small RNAs. BMC Genomics. 2008, 9: 157-10.1186/1471-2164-9-157.PubMedPubMed CentralView ArticleGoogle Scholar
- Ender C, Krek A, Friedlander M, Beitzinger M, Weinmann L, Chen W, Pfeffer S, Rajewsky N, Meister G: A human snoRNA with microRNA-like functions. Mol Cell. 2008, 32: 519-28. 10.1016/j.molcel.2008.10.017.PubMedView ArticleGoogle Scholar
- Taft R, Glazov E, Lassmann T, Hayashizaki Y, Carninci P, Mattick J: Small RNAs derived from snoRNAs. RNA. 2009, 15: 1233-40. 10.1261/rna.1528909.PubMedPubMed CentralView ArticleGoogle Scholar
- Shen M, Eyras E, Wu J, Khanna A, Josiah S, Rederstorff M, Zhang M, Stamm S: Direct cloning of double-stranded RNAs from RNase protection analysis reveals processing patterns of C/D box snoRNAs and provides evidence for widespread antisense transcript expression. Nucleic Acids Res. 2011, 39: 9720-30. 10.1093/nar/gkr684.PubMedPubMed CentralView ArticleGoogle Scholar
- Jung C, Hansen M, Makunin I, Korbie D, Mattick J: Identification of novel non-coding RNAs using profiles of short sequence reads from next generation sequencing data. BMC Genomics. 2010, 11: 77-10.1186/1471-2164-11-77.PubMedPubMed CentralView ArticleGoogle Scholar
- Langenberger D, Pundhir S, Ekstrøm C, Stadler P, Hoffmann S, Gorodkin J: deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns. Bioinformatics. 2012, 28: 17-24. 10.1093/bioinformatics/btr598.PubMedPubMed CentralView ArticleGoogle Scholar
- Li Z, Ender C, Meister G, Moore P, Chang Y, John B: Extensive terminal and asymmetric processing of small RNAs from rRNAs, snoRNAs, snRNAs, and tRNAs. Nucleic Acids Res. 2012, 40: 6787-99. 10.1093/nar/gks307.PubMedPubMed CentralView ArticleGoogle Scholar
- Scott M, Ono M, Yamada K, Endo A, Barton G, Lamond A: Human box C/D snoRNA processing conservation across multiple cell types. Nucleic Acids Res. 2012, 40: 3676-88. 10.1093/nar/gkr1233.PubMedPubMed CentralView ArticleGoogle Scholar
- Li W, Saraiya A, Wang C: The profile of snoRNA-derived microRNAs that regulate expression of variant surface proteins in Giardia lamblia. Cell Microbiol. 2012, 14: 1455-73. 10.1111/j.1462-5822.2012.01811.x.PubMedPubMed CentralView ArticleGoogle Scholar
- Kishore S, Khanna A, Zhang Z, Hui J, Balwierz P, Stefan M, Beach C, Nicholls R, Zavolan M, Stamm S: The snoRNA MBII-52 (SNORD 115) is processed into smaller RNAs and regulates alternative splicing. Hum Mol Genet. 2010, 19: 1153-64. 10.1093/hmg/ddp585.PubMedPubMed CentralView ArticleGoogle Scholar
- Brameier M, Herwig A, Reinhardt R, Walter L, Gruber J: Human box C/D snoRNAs with miRNA like functions: expanding the range of regulatory RNAs. Nucleic Acids Res. 2011, 39: 675-86. 10.1093/nar/gkq776.PubMedPubMed CentralView ArticleGoogle Scholar
- Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M, Jungkamp A, Munschauer M, Ulrich A, Wardle G, Dewell S, Zavolan M, Tuschl T: Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010, 141: 129-41. 10.1016/j.cell.2010.03.009.PubMedPubMed CentralView ArticleGoogle Scholar
- Kishore S, Jaskiewicz L, Burger L, Hausser J, Khorshid M, Zavolan M: A quantitative analysis of CLIP methods for identifying binding sites of RNA-binding proteins. Nat Methods. 2011, 8: 559-64. 10.1038/nmeth.1608.PubMedView ArticleGoogle Scholar
- Khorshid M, Rodak C, Zavolan M: CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins. Nucleic Acids Res. 2011, 39: D245-52. 10.1093/nar/gkq940.PubMedPubMed CentralView ArticleGoogle Scholar
- Lestrade L, Weber M: snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. 2006, 34: D158-62. 10.1093/nar/gkj002.PubMedPubMed CentralView ArticleGoogle Scholar
- Hertel J, Hofacker I, Stadler P: SnoReport: computational identification of snoRNAs with unknown targets. Bioinformatics. 2008, 24: 158-64. 10.1093/bioinformatics/btm464.PubMedView ArticleGoogle Scholar
- Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi A, Tanzer A, Lagarde J, Lin W, Schlesinger F, Xue C, Marinov GK, Khatun J, Williams BA, Zaleski C, Rozowsky J, Roder M, Kokocinski F, Abdelhamid RF, Alioto T, Antoshechkin I, Baer MT, Bar NS, Batut P, Bell K, Bell I, Chakrabortty S, Chen X, Chrast J, Curado J, et al: Landscape of transcription in human cells. Nature. 2012, 489: 101-8. 10.1038/nature11233.PubMedPubMed CentralView ArticleGoogle Scholar
- ENSEMBL release 65. [http://www.ensembl.org]
- Burge S, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki E, Eddy S, Gardner P, Bateman A: Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 2013, 41: D226-32. 10.1093/nar/gks1005.PubMedPubMed CentralView ArticleGoogle Scholar
- Yan D, He D, He S, Chen X, Fan Z, Chen R: Identification and analysis of intermediate size noncoding RNAs in the human fetal brain. PLoS One. 2011, 6: e21652-10.1371/journal.pone.0021652.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang Y, Wang J, Huang S, Zhu X, Liu J, Yang N, Song D, Wu R, Deng W, Skogerbo G, Wang XJ, Chen R, Zhu D: Systematic identification and characterization of chicken (Gallus gallus) ncRNAs. Nucleic Acids Res. 2009, 37: 6562-74. 10.1093/nar/gkp704.PubMedPubMed CentralView ArticleGoogle Scholar
- Marz M, Gruber AR, Höner Zu Siederdissen C, Amman F, Badelt S, Bartschat S, Bernhart SH, Beyer W, Kehr S, Lorenz R, Tanzer A, Yusuf D, Tafer H, Hofacker IL, Stadler PF: Animal snoRNAs and scaRNAs with exceptional structures. RNA Biol. 2011, 8: 938-46. 10.4161/rna.8.6.16603.PubMedPubMed CentralView ArticleGoogle Scholar
- Yang J, Zhang X, Huang Z, Zhou H, Huang M, Zhang S, Chen Y, Qu L: snoSeeker: an advanced computational package for screening of guide and orphan snoRNA genes in the human genome. Nucleic Acids Res. 2006, 34: 5112-23. 10.1093/nar/gkl672.PubMedPubMed CentralView ArticleGoogle Scholar
- Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005, 15: 1034-50. 10.1101/gr.3715005.PubMedPubMed CentralView ArticleGoogle Scholar
- Schattner P, Barberan-Soler S, Lowe T: A computational screen for mammalian pseudouridylation guide H/ACA RNAs. RNA. 2006, 12: 15-25. 10.1261/rna.2210406.PubMedPubMed CentralView ArticleGoogle Scholar
- Tafer H, Kehr S, Hertel J, Hofacker IL, Stadler PF: RNAsnoop: efficient target prediction for H/ACA snoRNAs. Bioinformatics. 2010, 26: 610-16. 10.1093/bioinformatics/btp680.PubMedView ArticleGoogle Scholar
- Kehr S, Bartschat S, Stadler PF, Tafer H: PLEXY: efficient target prediction for box C/D snoRNAs. Bioinformatics. 2011, 27: 279-80. 10.1093/bioinformatics/btq642.PubMedView ArticleGoogle Scholar
- Karijolich J, Yu Y: Spliceosomal snRNA modifications and their function. RNA Biol. 2010, 7: 192-204. 10.4161/rna.7.2.11207.PubMedPubMed CentralView ArticleGoogle Scholar
- Kishore S, Stamm S: The snoRNA HBII-52 regulates alternative splicing of the serotonin receptor 2C. Science. 2006, 311: 230-2. 10.1126/science.1118265.PubMedView ArticleGoogle Scholar
- Berninger P, Jaskiewicz L, Khorshid M, Zavolan M: Conserved generation of short products at piRNA loci. BMC Genomics. 2011, 12: 46-10.1186/1471-2164-12-46.PubMedPubMed CentralView ArticleGoogle Scholar
- Valen E, Preker P, Andersen PR, Zhao X, Chen Y, Ender C, Dueck A, Meister G, Sandelin A, Jensen TH: Biogenic mechanisms and utilization of small RNAs derived from human protein-coding genes. Nat Struct Mol Biol. 2011, 18: 1075-82. 10.1038/nsmb.2091.PubMedView ArticleGoogle Scholar
- Cole C, Sobala A, Lu C, Thatcher SR, Bowman A, Brown JW, Green PJ, Barton GJ, Hutvagner G: Filtering of deep sequencing data reveals the existence of abundant Dicer-dependent small RNAs derived from tRNAs. RNA. 2009, 15: 2147-60. 10.1261/rna.1738409.PubMedPubMed CentralView ArticleGoogle Scholar
- Yamasaki S, Ivanov P, Hu GF, Anderson P: Angiogenin cleaves tRNA and promotes stress-induced translational repression. J Cell Biol. 2009, 185: 35-42. 10.1083/jcb.200811106.PubMedPubMed CentralView ArticleGoogle Scholar
- Liao J, Ma L, Guo Y, Zhang Y, Zhou H, Shao P, Chen Y, Qu L: Deep sequencing of human nuclear and cytoplasmic small RNAs reveals an unexpectedly complex subcellular distribution of miRNAs and tRNA 3' trailers. PLoS One. 2010, 5: e10563-10.1371/journal.pone.0010563.PubMedPubMed CentralView ArticleGoogle Scholar
- Bernhart SH, Hofacker IL: From consensus structure prediction to RNA gene finding. Brief Funct Genomic Proteomic. 2009, 8: 461-71. 10.1093/bfgp/elp043.PubMedView ArticleGoogle Scholar
- Schubert T, Pusch M, Diermeier S, Benes V, Kremmer E, Imhof A, Längst G: Df31 protein and snoRNAs maintain accessible higher-order structures of chromatin. Mol Cell. 2012, 48: 434-44. 10.1016/j.molcel.2012.08.021.PubMedView ArticleGoogle Scholar
- Scott MS, Ono M: From snoRNA to miRNA: dual function regulatory non-coding RNAs. Biochimie. 2011, 93: 1987-92. 10.1016/j.biochi.2011.05.026.PubMedPubMed CentralView ArticleGoogle Scholar
- Ule J, Ule A, Spencer J, Williams A, Hu J, Cline M, Wang H, Clark T, Fraser C, Ruggiu M, Zeeberg B, Kane D, Weinstein J, Blume J, Darnell R: Nova regulates brain-specific splicing to shape the synapse. Nat Genet. 2005, 37: 844-52. 10.1038/ng1610.PubMedView ArticleGoogle Scholar
- Hafner M, Landgraf P, Ludwig J, Rice A, Ojo T, Lin C, Holoch D, Lim C, Tuschl T: Identification of microRNAs and other small regulatory RNAs using cDNA library sequencing. Methods. 2008, 44: 3-12. 10.1016/j.ymeth.2007.09.009.PubMedPubMed CentralView ArticleGoogle Scholar
- Hoffmann S, Otto C, Kurtz S, Sharma C, Khaitovich P, Vogel J, Stadler P, Hackermüller J: Fast mapping of short sequences with mismatches, insertions and deletions using index structures. PLoS Comput Biol. 2009, 5: e1000502-10.1371/journal.pcbi.1000502.PubMedPubMed CentralView ArticleGoogle Scholar
- ENCODE data coordination center at UCSC. [http://genome.ucsc.edu/ENCODE/downloads.html]
- Nawrocki EP, Kolbe DL, Eddy SR: Infernal 1.0: inference of RNA alignments. Bioinformatics. 2009, 25: 1335-7. 10.1093/bioinformatics/btp157.PubMedPubMed CentralView ArticleGoogle Scholar
- Maden BE, Corbett ME, Heeney PA, Pugh K, Ajuh PM: Classical and novel approaches to the detection and localization of the numerous modified nucleotides in eukaryotic ribosomal RNA. Biochimie. 1995, 77: 22-9. 10.1016/0300-9084(96)88100-4.PubMedView ArticleGoogle Scholar
- Maden BE: Mapping 2'-O-methyl groups in ribosomal RNA. Methods. 2001, 25: 374-82. 10.1006/meth.2001.1250.PubMedView ArticleGoogle Scholar
- Ofengand J, Del Campo M, Kaya Y: Mapping pseudouridines in RNA molecules. Methods. 2001, 25: 365-73. 10.1006/meth.2001.1249.PubMedView ArticleGoogle Scholar
- Morla AO, Draetta G, Beach D, Wang JY: Reversible tyrosine phosphorylation of cdc2: dephosphorylation accompanies activation during entry into mitosis. Cell. 1989, 58: 193-203. 10.1016/0092-8674(89)90415-7.PubMedView ArticleGoogle Scholar
- Pines J, Hunter T: Isolation of a human cyclin cDNA: evidence for cyclin mRNA and protein regulation in the cell cycle and for interaction with p34cdc2. Cell. 1989, 58: 833-46. 10.1016/0092-8674(89)90936-7.PubMedView ArticleGoogle Scholar
- Elvin P, Evans CW: Cell adhesiveness and the cell cycle: correlation in synchronized Balb/c 3T3 cells. Biol Cell. 1983, 48: 1-9.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.