Skip to main content

Table 4 Misclassification rates based on random splitting

From: Supervised clustering of genes

Leukemia q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 6.58% 4.62% 4.21% 3.75% 3.33% 3.38% 3.25%
Aggregated trees 6.58% 6.12% 3.71% 3.54% 2.79% 2.71% 2.62%
Breast q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 1.00% 0.75% 0.75% 1.00% 0.83% 1.00% 1.00%
Aggregated trees 1.00% 1.58% 1.67% 2.33% 2.58% 2.42% 3.00%
Prostate q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 14.47% 11.68% 9.62% 7.97% 7.26% 6.94% 6.91%
Aggregated trees 14.47% 16.47% 10.32% 8.79% 8.12% 8.00% 7.79%
Colon q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 23.35% 20.35% 19.10% 16.95% 16.45% 16.05% 15.95%
Aggregated trees 23.35% 21.80% 19.70% 18.10% 16.95% 16.20% 16.45%
SRBCT q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 1.33% 0.48% 0.43% 0.48% 0.76% 0.95% 1.05%
Aggregated trees 5.76% 0.95% 0.71% 1.10% 1.76% 1.90% 2.14%
Lymphoma q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 2.15% 2.20% 1.50% 0.85% 0.65% 0.50% 0.50%
Aggregated trees 3.45% 2.45% 1.40% 0.80% 0.25% 0.20% 0.30%
Brain q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 31.21% 27.50% 26.36% 24.71% 23.86% 23.71% 23.36%
Aggregated trees 35.43% 28.43% 24.43% 22.14% 19.64% 18.29% 16.86%
NCI q = 1 q = 2 q = 3 q = 5 q = 10 q = 15 q = 20
Nearest neighbor 45.25% 40.25% 37.90% 34.80% 32.10% 30.50% 29.65%
Aggregated trees 51.85% 42.35% 38.05% 34.05% 29.30% 27.75% 26.50%
  1. Misclassification rates for out-of-sample classification with q gene clusters as features, based on N = 100 random divisions into learning set (two thirds of the data) and test set (one third of the data).