EST analyses predict the existence of a population of chimeric microRNA precursor-mRNA transcripts expressed in normal human and mouse tissues
© BioMed Central Ltd 2003
Published: 18 June 2003
A significant population of expressed sequence tags (ESTs) encodes chimeric transcripts containing microRNA (miRNA) precursor sequences as well as pieces of adjacent mRNAs in sense orientation. These chimeric transcripts may potentially be involved in miRNA biosynthesis, and/or affect expression of adjacent mammalian mRNAs.
Hundreds of microRNAs (miRNAs) have recently been discovered in species ranging from plants to humans. They are encoded by genes that express transcripts of single or clustered miRNA precursors of around 70 nucleotides in size, which form imperfect hairpin structures and are further processed to 17-23 nucleotide miRNAs by the action of Dicer . The miRNAs appear to have quite diverse roles: some induce translational arrest, whereas others induce RNA interference (RNAi). Although miRNAs are clearly important in the genome, the biology of miRNA precursors and their transcription is still not well understood. I examined 90 previously characterized miRNAs expressed in mouse and human [2, 3] for their homology to sequences present in the NCBI Entrez EST database, and found that many expressed sequence tags (ESTs) encoded chimeric miRNA precursor transcripts that also contained pieces of mRNAs.
Less than half the miRNAs examined (41 of 90) had sequences that exactly matched (in either orientation) one or more ESTs in the publicly accessible NIH database; 36 of these miRNAs matched ESTs in human, rat or mouse. This might reflect a lack of coverage of ESTs in this database, but more probably it reflects the fact that EST sequencing strategies favor long, stable, poly (A)+ transcripts which may not be general features of miRNA transcription and processing pathways .
ESTS encoding chimeric miRNA precursor mRNA transcripts
Number of ESTs with mRNA
Any with poly(A)+ tail?
Length of mRNA contained in nucleotides
Location of mRNA contained
Human normal amnion
NM_030938 vesicular membrane protein 1
Human pigmented retinal epithelium
AF070569* clone 24659
Mouse lactating mammary gland
XM_124678† mini chromosome maintenance deficient 7
Rat mixed tissues
NM_139104 estrogen-regulated protein
Human normal placenta
NM_147207 ischemia related factor vof-16
Human metastatic chondrosarcoma
XM_173924† hypothetical protein
It is important to emphasize that most of the ESTs were fundamentally different from the reference mRNAs whose sequences they shared; that is, most contained sequences external to the reference mRNA (with no known variants including them), and/or were spliced differently from the reference mRNA. It is known that mammalian miRNA precursors can be located within introns of both protein-coding and noncoding genes , so any EST that expresses the mRNA along with retained introns might erroneously appear to be 'chimeric'. Although four of the miRNA sequences described here do reside within introns, in at least three of these the ESTs described do not appear simply to represent mRNA sequences that contain retained introns (the other, miRNA 124a, cannot be assessed because it lacks corresponding genomic clones to identify intronic borders). In three cases (miRNA 21/104, 22 and 125b) the miRNA precursor sequences are located in intergenic regions, beyond the borders of the reference mRNAs. The miRNA precursor hairpin was generally not located at one end of the EST, but had flanking 5' and 3' sequences. For each miRNA listed in Table 1, at least one of the corresponding ESTs was derived from normal fetal or adult tissues, though some were also found expressed in cancer tissue. Most involve mRNAs that encode well-characterized protein products, though two correspond to hypothetical proteins.
The ESTs encoding miRNAs and mRNA pieces were apparently transcribed by RNA polymerase II, as many had poly(A)+ tails. RNA polymerase III is unlikely to be responsible for transcribing the chimeric transcripts, as the majority of the ESTs in Table 1 have internal stretches of four or more Ts in sense orientation that are thought to act as termination signals for the polymerase. However, as potential RNA polymerase III termination signals were encountered in only one miRNA precursor hairpin region, this polymerase may still be involved in transcribing primary miRNA precursor transcripts in other situations.
Examination of ESTs is fraught with potential problems, including cloning artifacts, uncertain orientation, and inclusion of unprocessed or aberrantly processed transcripts. However, chimeric miRNA-mRNA transcripts were detected for numerous miRNAs, with multiple EST isolates, and from several different tissues and different species, so they are likely to represent a regular phenomenon. Furthermore, many had poly (A)+ tails and were spliced, indicating that they can be extensively processed.
It is uncertain whether these chimeric transcripts are further processed to functionally active miRNAs. This is a possibility, as Zeng and Cullen reported that certain miRNA precursors could be processed effectively when they were expressed as RNA polymerase II transcripts containing flanking 5' and 3' sequences . Yet this would still not explain why so many of the matching ESTs also transcribed into the neighboring mRNA. Is this a clue to a distinctive function for chimeric transcripts, or does this reflect the nature of controls on their transcription and termination?
Another intriguing question is why, in each case, the miRNA sequence was oriented in the same direction as the mRNA, regardless of whether the miRNA was positioned upstream, within, or downstream of the mRNA sequence itself. Certainly, miRNAs are not in general forbidden from being located on the opposite strand of mRNAs. For example, in the course of the present study, miRNA 127 was found to be encoded by a precursor (EST BE294363) that lies on the opposite strand from the protein-coding region of a transposon-associated polyprotein mRNA that makes Gag protein and reverse transcriptase (XM_090919). A miRNA in this location would be in a position to inhibit transposon function by perfect antisense pairing to the polyprotein mRNA followed by RNAi. One conceivable function of chimeric transcripts that express pieces of mRNA in sense orientation may be to downregulate the endogenous mRNAs via sense co-suppression , as previously proposed for chimeric transcripts arising from the antisense promoter of L1 retrotransposons [8, 9].
Finally, it is not clear how the RNA polymerase II transcription of these ESTs is regulated. Is it governed by external sequences, perhaps related to the neighboring mRNA, or by internal sequences related to the miRNA precursor hairpins? Of the chimeric ESTs that could be lined up against genomic clones of the same species, I could not identify any obvious polymerase II promoter regions using the ProScan algorithm [10, 11] either within or up to 4 kb upstream of the EST sequences (the exception being ESTs matching miRNA 22, which resides upstream of a mRNA 5' UTR). As an alternative, one can conceive of the possibility that at least some miRNA precursor hairpins (that is, those associated with chimeric transcripts) may bind specific proteins that affect RNA polymerase II transcription. There is already evidence that proteins may recognize the loops of miRNA precursor hairpins to allow their transport into the cytoplasm . Hairpins may also arguably have some activity as internal promoters or enhancers, since Llave et al.  tested a single plant pre-miRNA construct lacking an exogenous promoter and found that it had detectable, albeit limited, expression when transfected into cells.
Ultimately, the purpose of bioinformatic analyses is to suggest new laboratory experiments. Identifying a population of chimeric ESTs within GenBank is merely the starting point for asking whether one can validate and characterize full-length endogenous chimeric transcripts made within cells. If so, then it will be possible to learn how these relate to other potential biosynthetic routes for miRNA production, how their transcription is regulated, and what functions (if any) they may have.
This work was supported by NIH grants LM07292, DA15450 and the Human Brain Project. I thank Vetle Torvik for assistance with batch BLAST searches.
- Ambros V: microRNAs: tiny regulators with great potential. Cell. 2001, 107: 823-826.PubMedView ArticleGoogle Scholar
- Lagos-Quintana M, Rauhut R, Yalcin A, Meyer J, Lendeckel W, Tuschl T: Identification of tissue-specific microRNAs from mouse. Curr Biol. 2002, 12: 735-739. 10.1016/S0960-9822(02)00809-6.PubMedView ArticleGoogle Scholar
- Mourelatos Z, Dostie J, Paushkin S, Sharma A, Charroux B, Abel L, Rappsilber J, Mann M, Dreyfuss G: miRNPs: a novel class of ribonucleoproteins containing numerous microRNAs. Genes Dev. 2002, 16: 720-728. 10.1101/gad.974702.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee Y, Jeon K, Lee JT, Kim S, Kim VN: MicroRNA maturation: stepwise processing and subcellular localization. EMBO J. 2002, 21: 4663-4670. 10.1093/emboj/cdf476.PubMedPubMed CentralView ArticleGoogle Scholar
- Lagos-Quintana M, Rauhut R, Meyer J, Borkhardt A, Tuschl T: New microRNAs from mouse and human. RNA. 2003, 9: 175-179. 10.1261/rna.2146903.PubMedPubMed CentralView ArticleGoogle Scholar
- Zeng Y, Cullen B: Sequence requirements for micro RNA processing and function in human cells. RNA. 2003, 9: 112-123. 10.1261/rna.2780503.PubMedPubMed CentralView ArticleGoogle Scholar
- Zamore PD: Ancient pathways programmed by small RNAs. Science. 2002, 296: 1265-1269. 10.1126/science.1072457.PubMedView ArticleGoogle Scholar
- Nigumann P, Redik K, Matlik K, Speek M: Many human genes are transcribed from the antisense promoter of L1 retrotransposon. Genomics. 2002, 79: 628-634. 10.1006/geno.2002.6758.PubMedView ArticleGoogle Scholar
- Speek M: Antisense promoter of human L1 retrotransposon drives transcription of adjacent cellular genes. Mol Cell Biol. 2001, 21: 1973-1985. 10.1128/MCB.21.6.1973-1985.2001.PubMedPubMed CentralView ArticleGoogle Scholar
- Prestridge DS: Predicting Pol II promoter sequences using transcription factor binding sites. J Mol Biol. 1995, 249: 923-932. 10.1006/jmbi.1995.0349.PubMedView ArticleGoogle Scholar
- Web Promoter Scan Service. [http://bimas.cit.nih.gov/molbio/proscan/index.html]
- Kawasaki H, Taira K: Short hairpin type of dsRNAs that are controlled by tRNA(Val) promoter significantly induce RNAi-mediated gene silencing in the cytoplasm of human cells. Nucleic Acids Res. 2003, 31: 700-707. 10.1093/nar/gkg158.PubMedPubMed CentralView ArticleGoogle Scholar
- Llave C, Xie Z, Kasschau KD, Carrington JC: Cleavage of Scarecrow-like mRNA targets directed by a class of Arabidopsis miRNA. Science. 2002, 297: 2053-2056. 10.1126/science.1076311.PubMedView ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410. 10.1006/jmbi.1990.9999.PubMedView ArticleGoogle Scholar
- NCBI BLAST Server. [http://www.ncbi.nlm.nih.gov/BLAST]
- Mathews DH, Sabina J, Zuker M, Turner DH: Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J Mol Biol. 1999, 288: 911-940. 10.1006/jmbi.1999.2700.PubMedView ArticleGoogle Scholar
- mfold. [http://www.bioinfo.rpi.edu/applications/mfold/old/rna]