Skip to main content


Fig. 4 | Genome Biology

Fig. 4

From: Hundreds of novel composite genes and chimeric genes with bacterial origins contributed to haloarchaeal evolution

Fig. 4

Distribution of the 320 composite gene families in Haloarchaea. The heatmap represents the presence (black line) or absence (white line) of a given composite gene family in Haloarchaea genomes (each line represents a given genome, each column represents a gene family). Haloarchaea genomes are colored with respect to their classification into major clades according to the study by [26] (red: clade B, blue: clade A, green: clade C, yellow: clade D, and black: unassigned). The colored horizontal top bar (a) indicates the mean percentage of protein identity of each gene family (red > 80%, orange > 60%, yellow > 40%, white > 25%). The colored horizontal top bar (b) indicates the type of composite family (red: clusters 3, 4, 6, and 9, blue: clusters 1 and 10, white: clusters 2, 5, 7, and 8). The colored horizontal top bar (c) indicates the functional annotation of the gene families according to COG categories (red: metabolism, blue: information storage and processing, green: cellular processes and signaling, white: poorly characterized). A hierarchical clustering has been performed both on columns and rows using the Jaccard distance and a complete linkage method. The hierarchical clustering of the protein families (columns) highlights two distinct sets of proteins, proteins that are widespread (2) and those with a sparse distribution (1)

Back to article page