Table 1. Time, disk space, peak RAM, and aggregate peak RAM needed to construct the variant index on the 1000 Genomes and TCGA (OV, LUAD, and BRCA) data using VariantStore and VG toolkit. ∗VG toolkit could not build a GBWT index embedding all sample paths for the TCGA data; the space reported for it is for the XG index, which does not contain any path information. We constructed the indexes for all 24 chromosomes (1–22, X, and Y) in parallel. The time and peak RAM reported are for the largest chromosome (usually chromosome 1 or 2). The disk space reported is the total space on disk for all 24 chromosomes. The aggregate peak RAM is the aggregate of the peak RAM over all 24 processes.

From: VariantStore: an index for large-scale genomic variant search

| Dataset | System | Time | Disk space | Peak RAM | Peak RAM (aggregate) |
|---|---|---|---|---|---|
| 1000 Genomes | VariantStore | 3 h 25 min | 41 GB | 8.8 GB | 153 GB |
| 1000 Genomes | VG toolkit | 11 h 10 min | 50 GB | 37 GB | 450 GB |
| TCGA (OV) | VariantStore | 1 h 5 min | 3.4 GB | 1.1 GB | 17.45 GB |
| TCGA (OV) | VG toolkit | | 11 GB ∗ | | |
| TCGA (LUAD) | VariantStore | 1 h 20 min | 3.5 GB | 2.3 GB | 36.05 GB |
| TCGA (LUAD) | VG toolkit | | 12 GB ∗ | | |
| TCGA (BRCA) | VariantStore | 4 h 36 min | 4.2 GB | 3.2 GB | 53.21 GB |
| TCGA (BRCA) | VG toolkit | | 14 GB ∗ | | |
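
As a reading aid, the 1000 Genomes rows can be turned into ratios of VG toolkit to VariantStore for each metric. The sketch below is only illustrative arithmetic on the values in Table 1. The dictionary keys and the `ratios` helper are names made up for this example and do not belong to either tool. The TCGA rows are left out because VG toolkit reports only the XG index size there, which is not a like-for-like comparison.

```python
# Values copied from the 1000 Genomes rows of Table 1.
# Key names are hypothetical labels chosen for this example.
variantstore = {
    "time_min": 3 * 60 + 25,     # 3 h 25 min
    "disk_gb": 41,
    "peak_ram_gb": 8.8,
    "peak_ram_agg_gb": 153,
}
vg_toolkit = {
    "time_min": 11 * 60 + 10,    # 11 h 10 min
    "disk_gb": 50,
    "peak_ram_gb": 37,
    "peak_ram_agg_gb": 450,
}


def ratios(baseline, other):
    """Return other/baseline for each metric, rounded to two decimals."""
    return {key: round(other[key] / baseline[key], 2) for key in baseline}


vg_over_vs = ratios(variantstore, vg_toolkit)
for metric, ratio in vg_over_vs.items():
    print(f"{metric}: VG toolkit / VariantStore = {ratio:.2f}x")
```

On these numbers, VG toolkit takes about 3.3 times as long and about 4.2 times the peak RAM of VariantStore on the 1000 Genomes data, while the disk footprint differs by only about 1.2 times.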