Table 4 Cluster accuracy and stability on yeast galactose data

From: Clustering gene-expression data with repeated measurements

(a) Cluster accuracy*
Algorithm	Similarity measure/model	Average	SD-weighted	CV-weighted	FITSS	True mean	IMM
Centroid linkage	Distance	0.968	0.849	0.802	0.968	0.159	NA
MCLUST-HC	NA	0.968	NA	NA	0.968	0.806	NA
Complete linkage	Distance	0.957	0.968	0.957	0.643	0.695	NA
Complete linkage	Spherical	NA	NA	NA	NA	NA	0.968
Complete linkage	Elliptical	NA	NA	NA	NA	NA	0.968
Centroid linkage	Correlation	0.942	0.807	0.753	0.942	0.942	NA
k-means	Correlation	0.871	0.640	0.827	NA	0.897	NA
Average linkage	Spherical	NA	NA	NA	NA	NA	0.897
Average linkage	Elliptical	NA	NA	NA	NA	NA	0.897
Average linkage	Distance	0.858	0.858	0.847	0.869	0.159	NA
Average linkage	Correlation	0.866	0.817	0.841	0.865	0.857	NA
k-means	Distance	0.857	0.857	0.767	NA	0.159	NA
Complete linkage	Correlation	0.677	0.724	0.730	0.503	0.744	NA

(b) Cluster stability†
Algorithm	Similarity measure/model	Average	SD-weighted	CV-weighted	FITSS	IMM
Complete linkage	Elliptical	NA	NA	NA	NA	0.998
Complete linkage	Spherical	NA	NA	NA	NA	0.991
Average linkage	Distance	0.820	0.985	0.914	0.650	NA
MCLUST-HC	NA	0.963	NA	NA	0.916	NA
Complete linkage	Distance	0.927	0.937	0.830	0.441	NA
Centroid linkage	Distance	0.893	0.924	0.841	0.893	NA
Average linkage	Spherical	NA	NA	NA	NA	0.923
k-means	Distance	0.905	0.867	0.798	NA	NA
Average linkage	Elliptical	NA	NA	NA	NA	0.895
Centroid linkage	Correlation	0.889	0.758	0.644	0.889	NA
Average linkage	Correlation	0.842	0.842	0.855	0.828	NA
k-means	Correlation	0.799	0.709	0.781	NA	NA
Complete linkage	Correlation	0.655	0.700	0.666	0.577	NA

*Each entry shows the adjusted Rand index of the corresponding clustering approach with the four functional categories. A high adjusted Rand index represents close agreement with the external knowledge.

†Each entry shows the average adjusted Rand index of the original clustering result with clusters from ten synthetic re-measured datasets. A high average adjusted Rand index means that clusters from the synthetic re-measured data are in close agreement with clusters from the original dataset. The external knowledge is not used in evaluating cluster stability.

For both parts of the table, the maximum adjusted Rand index of each row is shown in bold. The algorithms (rows) are sorted in descending order of the maximum average adjusted Rand index in each row.
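As an illustration of the two quantities tabulated above, the following is a minimal sketch of how an adjusted Rand index and an averaged stability score could be computed. It assumes the standard Hubert and Arabie form of the index; the function names `adjusted_rand_index` and `average_stability` are hypothetical and are not taken from the article, whose own implementation is not shown here.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two partitions of the same items.

    Assumes the standard Hubert and Arabie formulation (an assumption,
    not confirmed by the table itself).
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("label sequences must have the same length")

    total_pairs = comb(len(labels_a), 2)
    if total_pairs == 0:
        # Fewer than two items: the partitions agree trivially.
        return 1.0

    # Contingency counts: items shared by cluster i of A and cluster j of B.
    cell_counts = Counter(zip(labels_a, labels_b))
    a_counts = Counter(labels_a)
    b_counts = Counter(labels_b)

    sum_cells = sum(comb(c, 2) for c in cell_counts.values())
    sum_a = sum(comb(c, 2) for c in a_counts.values())
    sum_b = sum(comb(c, 2) for c in b_counts.values())

    expected = sum_a * sum_b / total_pairs  # agreement expected by chance
    max_index = (sum_a + sum_b) / 2         # largest attainable agreement
    if max_index == expected:
        # Degenerate case, e.g. both partitions are a single cluster.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def average_stability(original_labels, remeasured_label_sets):
    """Mean adjusted Rand index of the original clustering against each
    clustering obtained from a re-measured dataset (hypothetical helper)."""
    scores = [
        adjusted_rand_index(original_labels, labels)
        for labels in remeasured_label_sets
    ]
    return sum(scores) / len(scores)
```

Under these assumptions, an accuracy entry in part (a) would correspond to `adjusted_rand_index(cluster_labels, functional_categories)`, and a stability entry in part (b) to `average_stability(cluster_labels, labels_from_ten_synthetic_datasets)`. The index does not depend on how clusters are numbered, so relabelled but otherwise identical partitions score 1.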

ISSN: 1474-760X
