Skip to main content

Table 1 Results for 7-gram model using entire dataset

From: ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes

Location

Code

Precision

Sensitivity

FPR

Specificity

MCC

Cytoplasm

CYT

0.828

0.775

0.020

0.980

0.777

Cytoskeleton

CSK

0.882

0.452

0.001

0.999

0.629

Endoplasmic Reticulum

END

0.961

0.789

0.001

0.999

0.867

Extracellular

EXC

0.949

0.939

0.021

0.979

0.921

Golgi Apparatus

GOL

0.891

0.550

0.001

0.999

0.697

Lysosome

LYS

0.953

0.855

0.000

1.000

0.902

Mitochrondria

MIT

0.964

0.799

0.003

0.997

0.867

Nuclear

NUC

0.807

0.906

0.048

0.952

0.821

Plasma Membrane

PLA

0.883

0.958

0.043

0.957

0.892

Perixosome

POX

0.938

0.748

0.000

1.000

0.836

Single-localized % overall accuracy

89.03

Multi-localized % overall accuracy (at least 1 correct)

81.88

Multi-localized % overall accuracy (both correct)

59.70

  1. The performance results of ngLOC on a tenfold cross-validation are displayed. The overall accuracy is also reported for multi-localized sequences, comparing at least one localization predicted correctly against both localizations predicted correctly. FPR, false positive rate; MCC, Matthews correlation coefficient.