Fig. 7 | Genome Biology

Fig. 7

From: Long non-coding RNAs display higher natural expression variation than protein-coding genes in healthy humans

Increasing donor number identifies more lncRNA loci. a Example of a highly variable LCL lncRNA locus, lcl1580, that is not in public annotations. GENCODE-v19 annotates lncRNA RP11-555G19.1 and protein-coding gene AP003062.1 transcribed in the antisense direction to lcl1580 (top). Normalized non-strand-specific PolyA+ RNA-seq signal for three donors is displayed (scaling from 0 to 0.6). RPKM of the *transcript isoform is shown for each sample. b Analysis overview. GEUVADIS project LCL RNA-seq data from 120 donors was used to create 30 data pools (each with 100 million reads from two female (red) and two male (blue) donors) and to assemble 30 transcriptomes (Methods). An increasing number of assemblies (corresponding to 4 to 120 donors) was merged to serve as input into the de novo lncRNA and mRNA identification pipeline (Additional file 1: Figure S1A). This created a series of LCL de novo lncRNA and mRNA annotations from an increasing number of donors. c Number of LCL de novo lncRNA (green) and mRNA (blue) loci annotated with increasing donor number. Left: Y-axis for lncRNA loci (green). Right: Y-axis for mRNA loci (blue). The range of values is set to 3,500 on both Y-axes. Maximum number of lncRNA / mRNA loci annotated (at 120 donors): 4,166 / 12,857. Error bars: standard deviation of loci number between three replicates of random picking for each number of assemblies used (Additional file 11C)
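
The pooling and random-picking scheme in panels b and c can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (`make_pools`, `pick_replicates`, `loci_sd`), the donor labels, the fixed random seed, and the use of the sample (n - 1) standard deviation are all assumptions made for illustration. The assembly, merging, and lncRNA/mRNA identification steps are not modeled here.

```python
import random
import statistics


def make_pools(females, males, per_sex=2):
    """Group donors into pools of `per_sex` female plus `per_sex` male donors."""
    n_pools = min(len(females), len(males)) // per_sex
    return [
        females[i * per_sex:(i + 1) * per_sex] + males[i * per_sex:(i + 1) * per_sex]
        for i in range(n_pools)
    ]


def pick_replicates(n_assemblies, k, n_replicates=3, seed=0):
    """Draw `n_replicates` random subsets of `k` assembly indices (no repeats within a subset)."""
    rng = random.Random(seed)  # fixed seed is an assumption, for reproducibility only
    return [sorted(rng.sample(range(n_assemblies), k)) for _ in range(n_replicates)]


def loci_sd(counts):
    """Sample standard deviation of loci counts across replicates (assumed error-bar definition)."""
    return statistics.stdev(counts)


# Hypothetical donor labels: 60 female and 60 male donors give 30 pools of 4.
females = [f"F{i}" for i in range(60)]
males = [f"M{i}" for i in range(60)]
pools = make_pools(females, males)

# Three random picks of 5 assemblies (i.e. 20 donors) out of the 30 available.
picks = pick_replicates(len(pools), k=5)
```

In such a setup, each subset in `picks` would be merged and passed to the identification pipeline, and `loci_sd` would be applied to the three resulting loci counts to obtain one error bar.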
