TY - JOUR
T1 - A comparison of indices for identifying the number of clusters in hierarchical clustering
T2 - A study on cognition in schizophrenia patients
AU - Islam, Md Atiqul
AU - Alizadeh, Behrooz Z.
AU - van den Heuvel, Edwin R.
AU - Bruggeman, Richard
AU - Cahn, Wiepke
AU - de Haan, Lieuwe
AU - Kahn, René S.
AU - Meijer, Carin
AU - Myin-Germeys, Inez
AU - van Os, Jim
AU - Wiersma, Durk
PY - 2015/4/3
Y1 - 2015/4/3
N2 - Finding clusters in a complex dataset is not straightforward. Different indices were developed to quantify the number of clusters. Their performances were studied using unrealistic simulations, since they were considered at low dimensions. We investigated 14 indices for eight-dimensional data using simulations based on cognition measures. We focused on hierarchical clustering with Ward’s agglomerative technique. Results indicated that Duda and Hart, Hartigan and Gap/pc were best performing. They estimated the number of clusters within ±1 with high probabilities. Duda and Hart index was most consistent, while Gap/pc and WGap/pc together made a good distinction between single and multiple clusters.
AB - Finding clusters in a complex dataset is not straightforward. Different indices were developed to quantify the number of clusters. Their performances were studied using unrealistic simulations, since they were considered at low dimensions. We investigated 14 indices for eight-dimensional data using simulations based on cognition measures. We focused on hierarchical clustering with Ward’s agglomerative technique. Results indicated that Duda and Hart, Hartigan and Gap/pc were best performing. They estimated the number of clusters within ±1 with high probabilities. Duda and Hart index was most consistent, while Gap/pc and WGap/pc together made a good distinction between single and multiple clusters.
KW - Cluster analysis
KW - cluster indices
KW - hierarchical clustering
KW - homogeneous subgroups
KW - number of clusters
UR - http://www.scopus.com/inward/record.url?scp=85026827819&partnerID=8YFLogxK
U2 - 10.1080/23737484.2015.1103670
DO - 10.1080/23737484.2015.1103670
M3 - Article
AN - SCOPUS:85026827819
VL - 1
SP - 98
EP - 113
JO - Communications in Statistics Case Studies Data Analysis and Applications
JF - Communications in Statistics Case Studies Data Analysis and Applications
IS - 2
ER -