Table 1 Performance comparison of different clustering tools and datasets

From: RabbitTClust: enabling fast clustering analysis of millions of bacteria genomes with MinHash sketches

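| Dataset | Tool | Time | SpeedUp (a) | Memory (GB) | NMI |
|---|---|---|---|---|---|
| bact-RefSeq | MeShClust3 | > 14 days | - | - | - |
| bact-RefSeq | Gclust | - | - | OOM (b) | - |
| bact-RefSeq | Mash & Mothur | 365m14s | 66.4 | 7.33 | 0.961 |
| bact-RefSeq | clust-mst | 5m30s | - | 10.70 | 0.961 |
| bact-RefSeq | clust-greedy | 5m05s | - | 4.83 | 0.959 |
| sub-Bact | MeShClust3 | 3,096m18s | 2,996.4 | 139.17 | 0.920 |
| sub-Bact | Gclust | 1,502m05s | 1,454.6 | 156.35 | 0.812 |
| sub-Bact | Mash & Mothur | 4m37s | 4.5 | 1.19 | 0.973 |
| sub-Bact | clust-mst | 1m02s | - | 5.77 | 0.973 |
| sub-Bact | clust-greedy | 59s | - | 5.17 | 0.970 |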

  1. (a) SpeedUp: speedup of the clust-mst module of RabbitTClust over the listed tool
  2. (b) OOM: out of memory
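
The SpeedUp column can be checked against the Time column. The following is a minimal sketch, assuming (as the values in the table suggest, though the footnote does not state it explicitly) that SpeedUp is the listed tool's runtime divided by the clust-mst runtime on the same dataset. The helper names `to_seconds` and `speedup` are hypothetical and are not part of RabbitTClust.

```python
import re


def to_seconds(t: str) -> int:
    """Convert a table time such as '3,096m18s' or '59s' to seconds."""
    m = re.fullmatch(r"(?:([\d,]+)m)?(\d+)s", t)
    if m is None:
        raise ValueError(f"unrecognized time: {t!r}")
    # Minutes are optional and may contain a thousands separator.
    minutes = int(m.group(1).replace(",", "")) if m.group(1) else 0
    return minutes * 60 + int(m.group(2))


def speedup(tool_time: str, clust_mst_time: str) -> float:
    """Assumed definition: tool runtime divided by clust-mst runtime."""
    return to_seconds(tool_time) / to_seconds(clust_mst_time)


# Times copied from the table above.
print(round(speedup("365m14s", "5m30s"), 1))    # Mash & Mothur, bact-RefSeq
print(round(speedup("3,096m18s", "1m02s"), 1))  # MeShClust3, sub-Bact
print(round(speedup("4m37s", "1m02s"), 1))      # Mash & Mothur, sub-Bact
```

Because the table reports times rounded to the second, values recomputed this way may differ slightly from the published ones; for example, Gclust on sub-Bact comes out at about 1,453.6 rather than 1,454.6.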