TY - JOUR
T1 - Dunn's index for cluster tendency assessment of pharmacological data sets
AU - Rivera-Borroto, Oscar Miguel
AU - Rabassa-Gutiérrez, Mónica
AU - Grau-Ábalo, Ricardo del Corazón
AU - Marrero-Ponce, Yovani
AU - García de la Vega, José Manuel
PY - 2012/4
Y1 - 2012/4
N2 - Cluster tendency assessment is an important stage in cluster analysis. In this sense, a group of promising techniques named visual assessment of tendency (VAT) has emerged in the literature. The presence of clusters can be detected easily through the direct observation of a dark blocks structure along the main diagonal of the intensity image. Alternatively, if the Dunn's index for a single linkage partition is greater than 1, then it is a good indication of the blocklike structure. In this report, the Dunn's index is applied as a novel measure of tendency on 8 pharmacological data sets, represented by machine-learning-selected molecular descriptors. In all cases, observed values are less than 1, thus indicating a weak tendency for data to form compact clusters. Other results suggest that there is an increasing relationship between the Dunn's index as a measure of cluster separability and the classification accuracy of various cluster algorithms tested on the same data sets.
AB - Cluster tendency assessment is an important stage in cluster analysis. In this sense, a group of promising techniques named visual assessment of tendency (VAT) has emerged in the literature. The presence of clusters can be detected easily through the direct observation of a dark blocks structure along the main diagonal of the intensity image. Alternatively, if the Dunn's index for a single linkage partition is greater than 1, then it is a good indication of the blocklike structure. In this report, the Dunn's index is applied as a novel measure of tendency on 8 pharmacological data sets, represented by machine-learning-selected molecular descriptors. In all cases, observed values are less than 1, thus indicating a weak tendency for data to form compact clusters. Other results suggest that there is an increasing relationship between the Dunn's index as a measure of cluster separability and the classification accuracy of various cluster algorithms tested on the same data sets.
KW - Classification accuracy
KW - Cluster analysis
KW - Cluster tendency
KW - Clusters overlap
KW - Dunn's index
KW - Pharmacological data sets
KW - VAT techniques
UR - http://www.scopus.com/inward/record.url?scp=84859112247&partnerID=8YFLogxK
U2 - 10.1139/Y2012-002
DO - 10.1139/Y2012-002
M3 - Article
C2 - 22443093
AN - SCOPUS:84859112247
SN - 0008-4212
VL - 90
SP - 425
EP - 433
JO - Canadian Journal of Physiology and Pharmacology
JF - Canadian Journal of Physiology and Pharmacology
IS - 4
ER -