Bipartition (Split) | Pattern KOOCSS | Number of SNDs | Relative frequency |
---|
((KSSOO) C) | 111211 | 50,354 | 38.73 |
((KSSC)(OO)) | 122111 | 19,678 | 15.14 |
((KOOC)(SS)) | 111122 | 18,490 | 14.22 |
(K(OOSSC)) | 122222 | 14,115 | 10.86 |
((KSS)(OOC)) = KS | 122211 | 9,882 | 7.60 |
((KOO)(SSC)) = KO | 111222 | 6,890 | 5.30 |
((KC)(OOSS)) = KC | 122122 | 5,874 | 4.52 |
- Common single nucleotide differences have two alleles. Each such nucleotide difference separates the six genomes into two classes. Pattern codes are represented as 6-tuples of ones and twos (for allele 1 and allele 2) in the following order: (K) E. coli K-12 MG1655, (O) E. coli O157:H7 EDL933, (O) E. coli O157:H7 Sakai strain RIMD0509952, (C) E. coli CFT073, (S) Shigella flexneri 2A 301, and (S) Shigella flexneri 2A 2457T. By convention, K-12 is always allele one. For brevity, key groupings are denoted as KS, KO, or KC. The remaining 3.6% SNDs come in over 50 different patterns, including one quadripartition. See appendix 1 in Additional data file 1 for additional frequencies.