Skip to main content

Table 2 HiC-Pro performance and comparison with hiclib

From: HiC-Pro: an optimized and flexible pipeline for Hi-C data processing

Dataset

IMR90

IMR90

IMR90

IMR90_CCL186

Number of reads

397,200,000

397,200,000

397,200,000

1,535,222,082

Pipeline

hiclib

HiC-Pro

HiC-Pro parallel

HiC-Pro parallel

Number of input files

10

10

84

160

Number of jobs

1

1

42

80

Number of CPUs per job

8

8

4

4

Maximum memory

10

7

7

24

Wall time

28:24

14:32

02:15

11:49

 Mapping

22:03

10:31

00:21

05:56

 Filtering

00:30

03:10

00:05

00:36

 Merge

 

00:20

00:18

00:50

 Contacts maps

01:45

00:15

00:15

00:42

Normalization

04:06

01:16

01:16

03:49

  1. HiC-Pro was run on the IMR90 Hi-C dataset from Dixon et al. and Rao et al. in order to generate contact maps at resolutions of 20 kb, 40 kb, 150 kb, 500 kb and 1 Mb. Contact maps at 5 kb were also generated for the IMR90_CCL186 dataset. The CPU time for each step of the pipeline is reported and compared with the hiclib Python library. The reported results include time of writing contact maps in text format. Times are minutes:seconds