Comparisons of protein length, intergenic region length, exon number and gene family size in the E. invadens and E. histolytica genomes. In all plots, the black line represents E. invadens and the red line represents E. histolytica. (a) Distribution of protein lengths for the translated coding sequences. (b) Distribution of intergenic sequence lengths. For display purposes, the x-axis is plotted on a log scale. (c) Distribution of the number of exons per gene. (d) Distribution of the number of putative paralogous genes belonging to multi-gene families. Gene families were defined based on shared functional domains (see Materials and methods for a detailed description).