Skip to main content

Table 1 Genomes and motifs

From: Characterizing and measuring bias in sequence data

   GC extremes Special motifs
Sample Genome size GC ≤ 10% GC ≥ 75% GC ≥ 85% (AT)15 G|C ≥ 80% Bad promoters
P. falciparum 23,263,391 10,030,724 (43%) 0 0 1,258,098 (5.4%) 0 -
E. coli 4,638,920 0 2,705 (0.058%) 0 0 0 -
R. sphaeroides 4,131,450 0 2,479,536 (60%) 90,207 (2.2%) 0 0 -
Human 2,684,573,005 6,228,029 (0.23%) 20,669,681 (0.77%) 2,980,450 (0.11%) 1,253,245 (0.047%) 802,554 (0.030%) 190,041 (0.0071%)
  1. For each genome sequenced as part of this work, we show its size in bases, along with the number of bases of each bias motif (see text). Only unambiguous (A, C, T, or G) bases from each reference are included. Plasmids, mitochondria, and sex chromosomes were excluded from the counts.