Skip to main content

Table 1 Results of simulations indicating the reconstruction efficiency of PEMer for heterozygous deletions of different sizes

From: PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data

SV size Single cutoff Multi-cutoff Simplified multi-cutoff Multi-cutoff* Simplified multi-cutoff*
1000 3 (4) 3 (4) 3 (4) 3 (4) 3 (4)
2000 12 (13) 23 (26) 21 (23) 11 (13) 6 (6)
3000 52 (57) 61 (68) 61 (68) 49 (52) 44 (46)
4000 84 (85) 85 (86) 85 (86) 80 (82) 80 (82)
5000 91 (93) 91 (93) 91 (93) 91 (93) 91 (93)
6000 92 (92) 92 (92) 92 (92) 92 (92) 92 (92)
10000 88 (91) 88 (91) 88 (91) 88 (91) 88 (91)
Total 422 (435) 443 (460) 441 (457) 414 (427) 404 (414)
False positives (chromosome 2) 31 (31) 31 (31) 26 (31) 5 (4) 2 (1)
  1. Each row displays statistics for reconstructed heterozygous deletions of a particular size, derived from simulations of 454/Roche based PEM data. Columns show numbers of identified SVs for each reconstruction strategy. Numbers in parentheses correspond to simulated SV reconstructions without sequencing error. All SVs were reconstructed at an effective span coverage λ = 5× (where, λ was assessed after optimal paired-end placement) of a simulated diploid chromosome 2. Note that for the single cutoff strategy events ≥ 5 kb were reconstructed at a level near the theoretical maximum of 95% derived from the Poisson approximation (see supplementary methods and notes in Additional data file 1). However, the multi-cutoff and simplified multi-cutoff strategies outperformed the single cutoff strategy in detecting SVs < 4 kb. *We also applied alternative - that is, optimal - cutoff parameters, for which the sensitivity is similar to the single cutoff, but for which a false positive rate of approximately 5% was observed.