Skip to main content
Figure 4 | Genome Biology

Figure 4

From: Determining the quality and complexity of next-generation sequencing data without a reference genome

Figure 4

k -mer distances in whole exome sequencing data are associated with data quality and choice of capture protocol. (A) PCA of pairwise distance measures. Blue circles indicate samples with poor capture performance. The red circles highlight the WE10_F1L3_NIM sample, which suffers from multiple problems. Samples that passed the QC measures are indicated by different types of black circle based on the choice of capture kit (Nimblegen or Agilent SureSelect). (B) Hierarchical clustering of pairwise k-mer distance measures across WES samples. Different clusters are indicated by color. AGI, Agilent SureSelect; NIM, Nimblegen; PCA, principal component analysis; QC, quality control; WES, whole exome sequencing.

Back to article page