Skip to main content

Table 6 Predicted proportion of exonic variants from two gene subsets (tumor suppressor versus oncogenes) that disrupt pre-mRNA splicing in human genetic disease (Inherited disease that is, germline and Cancer that is, somatic) and also identified in the general population (1000 Genomes project participants)

From: MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing

Data set

Proportion of SAVs in data set (predicted SAVs/total variants)

Tumor suppressor

Oncogenes

Inherited disease

25.3% (1,130/4,463)

10.9% (132/1,207)

Cancer

16.0% (1,612/10,082)

10.9% (525/4,831)

1000 Genomes

7.4% (84/1,133)

8.0% (49/612)

  1. The somatic Cancer data set includes driver and passenger mutations recorded in COSMIC [63]. The 1000 Genomes Project data set was derived from the 1000 Genomes Project without any MAF filter having been applied, that is, all rare and common variants were included. The proportion of predicted SAVs for each data set is shown, together with the frequencies of predicted SAVs; the sizes of the data sets are shown in parentheses.