Skip to main content
Figure 10 | Genome Biology

Figure 10

From: Community transcriptomics reveals universal patterns of protein sequence conservation in natural microbial communities

Figure 10

Database-independent cluster statistics. (a) Size and (b) percentage identity of clusters containing amino acid sequences present only in DNA datasets or in both DNA + RNA datasets from five representative samples. Cluster sizes are based on counts of only the DNA-derived sequences within each cluster type. Numbers in legends indicate mean cluster size (a) and mean amino acid identity (b). Amino acid sequences were clustered above a threshold identity of 55%.

Back to article page