dc.contributor.author | Zogheib, Bashar | |
dc.date.accessioned | 2020-04-10T14:16:58Z | |
dc.date.available | 2020-04-10T14:16:58Z | |
dc.date.issued | 2019 | |
dc.identifier.uri | https://doi.org/10.1080/24709360.2019.1615770 | |
dc.identifier.uri | https://dspace.auk.edu.kw/handle/11675/5752 | |
dc.description.abstract | Finding the number of clusters in a data set is considered as one of the fundamental problems in cluster analysis. This paper integrates maximum clustering similarity (MCS), for finding the optimal number of clusters, into R© statistical software through the package MCSim. The similarity between the two clustering methods is calculated at the same number of clusters, using Rand [Objective criteria for the evaluation of clustering methods. J Am Stat Assoc. 1971;66:846–850.] and Jaccard [The distribution of the flora of the alpine zone. New Phytologist. 1912;11:37–50.] indices, corrected for chance agreement. The number of clusters at which the index attains its maximum with most frequency is a candidate for the optimal number of clusters. Unlike other criteria, MCS can be used with circular data. Seven clustering algorithms, existing in R©, are implemented in MCSim. A graph of the number of clusters vs. clusters similarity using corrected similarity indices is produced. Values of the similarity indices and a clustering tree (dendrogram) are produced. Several examples including simulated, real, and circular data sets are presented to show how MCSim successfully works in practice. | |
dc.publisher | Taylor & Francis | |
dc.relation.journal | Biostatistics and Epidemiology | |
dc.title | How many clusters exist? Answer via maximum clustering similarity implemented in R | |
dc.type | Journal Article | |
dcterms.bibliographicCitation | Albatineh, A. N., Wilcox, M. L., Zogheib, B., & Niewiadomska-Bugaj, M. (2019). How many clusters exist? Answer via maximum clustering similarity implemented in R. Biostatistics & Epidemiology, 3(1), 62-79. | |
dc.journal.volume | 3 | |
dc.journal.issue | 1 | |
dc.article.pages | 62-79 | |