Skip to main content
Fig. 2 | Genome Biology

Fig. 2

From: Terminating contamination: large-scale search identifies more than 2,000,000 contaminated entries in GenBank

Fig. 2

Results of contamination within the RefSeq and GenBank. a Distribution of contaminated species in RefSeq across five kingdoms: Bacteria and Archaea (violet), Fungi (yellow), Metazoa (red), Viridiplantae (green) and other Eukaryotes (turquoise). b Sankey plot of the top 13 contaminated species in RefSeq. We show the taxonomic ranks domain, kingdom, phylum, and species. Numbers shown above each taxonomic node indicate the total number of contaminated sequences. The tree uses the same color code for kingdoms as in a. c, d Same as a, b but for GenBank

Back to article page