Table 5 Benchmark misclassification rates

From: Supervised clustering of genes

| Dataset | Classifier | q = 1 | q = 2 | q = 3 | q = 5 | q = 10 | q = 15 | q = 20 |
|---|---|---|---|---|---|---|---|---|
| Leukemia | Nearest neighbor | 6.33% | 4.79% | 4.50% | 4.08% | 3.67% | 3.75% | 3.79% |
| Leukemia | Aggregated trees | 8.50% | 6.04% | 4.54% | 3.92% | 4.83% | 6.79% | 8.46% |
| Breast | Nearest neighbor | 1.08% | 0.83% | 0.92% | 1.17% | 1.33% | 1.50% | 1.58% |
| Breast | Aggregated trees | 5.42% | 2.50% | 1.83% | 2.42% | 4.17% | 5.42% | 8.33% |
| Prostate | Nearest neighbor | 13.24% | 10.68% | 9.15% | 8.44% | 7.76% | 8.18% | 7.85% |
| Prostate | Aggregated trees | 25.47% | 21.29% | 18.56% | 17.44% | 16.65% | 17.65% | 18.94% |
| Colon | Nearest neighbor | 23.40% | 21.95% | 20.15% | 18.90% | 16.65% | 16.25% | 15.70% |
| Colon | Aggregated trees | 30.95% | 29.70% | 30.20% | 31.20% | 33.55% | 34.15% | 34.90% |
| SRBCT | Nearest neighbor | 1.76% | 0.86% | 0.81% | 1.05% | 1.19% | 1.43% | 1.48% |
| SRBCT | Aggregated trees | 4.38% | 2.00% | 2.62% | 3.95% | 6.48% | 6.95% | 8.43% |
| Lymphoma | Nearest neighbor | 2.43% | 2.29% | 1.76% | 1.05% | 0.81% | 0.81% | 0.86% |
| Lymphoma | Aggregated trees | 4.38% | 2.81% | 2.10% | 1.00% | 0.81% | 1.05% | 1.24% |
| Brain | Nearest neighbor | 30.79% | 29.07% | 29.50% | 27.57% | 28.50% | 28.00% | 27.50% |
| Brain | Aggregated trees | 40.14% | 35.29% | 34.64% | 33.50% | 34.36% | 34.79% | 35.29% |
| NCI | Nearest neighbor | 39.63% | 34.89% | 32.84% | 31.95% | 30.68% | 29.74% | 28.95% |
| NCI | Aggregated trees | 56.58% | 49.53% | 44.84% | 42.42% | 39.21% | 39.05% | 37.79% |

Benchmark misclassification rates for out-of-sample classification using the same genes from the q clusters as features, but without averaging them. Rates are based on N = 100 random divisions of the data into a learning set (two thirds) and a test set (one third).
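
The evaluation protocol named in the table note (repeated random divisions into a learning set and a test set, with the misclassification rate averaged over divisions) can be illustrated with a minimal sketch. This is not the authors' code. The function names `nearest_neighbor_predict` and `benchmark_error`, the 1-nearest-neighbor rule with Euclidean distance, and the plain (non-stratified) random split are all assumptions made for illustration. The note does not specify how the splits were drawn, and the gene selection step that produces the feature matrix is not shown.

```python
import numpy as np


def nearest_neighbor_predict(X_train, y_train, X_test):
    """Label each test sample with the class of its closest training sample.

    Assumption: squared Euclidean distance on the raw feature values.
    """
    diffs = X_test[:, None, :] - X_train[None, :, :]
    dists = (diffs ** 2).sum(axis=2)          # shape (n_test, n_train)
    return y_train[dists.argmin(axis=1)]


def benchmark_error(X, y, n_splits=100, test_fraction=1 / 3, seed=0):
    """Average test-set misclassification rate over random divisions.

    X: (n_samples, n_genes) expression values of the selected genes.
    y: (n_samples,) class labels.
    Assumption: each division is a plain random permutation, not stratified.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    n_test = int(round(n * test_fraction))
    errors = []
    for _ in range(n_splits):
        perm = rng.permutation(n)
        test_idx, train_idx = perm[:n_test], perm[n_test:]
        y_pred = nearest_neighbor_predict(X[train_idx], y[train_idx], X[test_idx])
        errors.append(np.mean(y_pred != y[test_idx]))
    return float(np.mean(errors))
```

Calling `benchmark_error(X, y)` with the default arguments corresponds to N = 100 divisions with one third of the samples held out. The returned value is a fraction, so multiply by 100 to compare with the percentages in the table.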