Fig. 4From: Hundreds of novel composite genes and chimeric genes with bacterial origins contributed to haloarchaeal evolutionDistribution of the 320 composite gene families in Haloarchaea. The heatmap represents the presence (black line) or absence (white line) of a given composite gene family in Haloarchaea genomes (each line represents a given genome, each column represents a gene family). Haloarchaea genomes are colored with respect to their classification into major clades according to the study by [26] (red: clade B, blue: clade A, green: clade C, yellow: clade D, and black: unassigned). The colored horizontal top bar (a) indicates the mean percentage of protein identity of each gene family (red > 80%, orange > 60%, yellow > 40%, white > 25%). The colored horizontal top bar (b) indicates the type of composite family (red: clusters 3, 4, 6, and 9, blue: clusters 1 and 10, white: clusters 2, 5, 7, and 8). The colored horizontal top bar (c) indicates the functional annotation of the gene families according to COG categories (red: metabolism, blue: information storage and processing, green: cellular processes and signaling, white: poorly characterized). A hierarchical clustering has been performed both on columns and rows using the Jaccard distance and a complete linkage method. The hierarchical clustering of the protein families (columns) highlights two distinct sets of proteins, proteins that are widespread (2) and those with a sparse distribution (1)Back to article page