Figure 2 | Genome Biology

Figure 2

From: Determining the quality and complexity of next-generation sequencing data without a reference genome

Evaluating data quality for mRNA sequencing samples across different laboratories. (A) Scatter plot showing, for each sample, the median pairwise Spearman correlation for exon quantification versus the median k-mer distance (K distance) after scaling. Problematic samples are highlighted in different colors. (B) Histogram of the median K distance (scaled) for each individual sample. (C) Distribution of the median K distance (scaled) for each sequencing laboratory (indicated by different colors). (D) Scatter plot of the median pairwise Spearman correlation for exon quantification versus the K distance (smoothed and scaled). (E) Histogram of the median K distance (smoothed and scaled) for each individual sample. (F) Distribution of the median K distance (smoothed and scaled) for each sequencing laboratory (indicated by different colors). (G) Scatter plot of the total number of reads per sample versus the K distance of 9-mers (scaled). The poly2 fitted line and the 95% confidence intervals are indicated. (H) Scatter plot of the total number of reads per sample versus the K distance of 12-mers (scaled). (I) Scatter plot of the total number of reads per sample versus the K distance of 12-mers (smoothed and scaled). Lab, laboratory; QC, quality control.
