arXiv:2303.14366 [stat.ML]

Hybrid Fuzzy-Crisp Clustering Algorithm: Theory and Experiments

Akira R. Kinjo, Daphne Teck Ching Lai

Published 2023-03-25, Version 1

Because its membership function is strictly positive, the conventional fuzzy c-means clustering method can give clusters imbalanced influence when they differ vastly in size. That is, an exceptionally large cluster drags all the other clusters toward its center, however far apart they are. To solve this problem, we propose a hybrid fuzzy-crisp clustering algorithm based on a target function that combines linear and quadratic terms of the membership function. In this algorithm, the membership of a data point to a cluster is automatically set to exactly zero if the data point is "sufficiently" far from the cluster center. We present the algorithm along with its geometric interpretation. The algorithm is tested on twenty simulated datasets and five real-world datasets from the UCI repository and compared with conventional fuzzy and crisp clustering methods. The proposed algorithm is shown to outperform the conventional methods on imbalanced datasets and can be competitive on more balanced datasets.
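The abstract does not state the target function itself, so the following is only an illustrative sketch, not the authors' algorithm. It assumes one simple objective of the described kind for a single data point: minimize sum_k (u_k * d_k + (lam / 2) * u_k^2) subject to u_k >= 0 and sum_k u_k = 1, where d_k is the squared distance to cluster center k. Under that assumption the minimizer is the Euclidean projection of -d / lam onto the probability simplex, which produces memberships that are exactly zero for distant clusters. The names `project_to_simplex`, `hybrid_memberships`, and `lam` are hypothetical.

```python
import numpy as np


def project_to_simplex(v):
    """Euclidean projection of a 1-D array v onto the probability simplex."""
    n = v.size
    u = np.sort(v)[::-1]              # entries in descending order
    css = np.cumsum(u) - 1.0          # cumulative sums shifted by the simplex total
    ks = np.arange(1, n + 1)
    cond = u - css / ks > 0           # which prefixes can stay strictly positive
    rho = ks[cond][-1]                # number of nonzero memberships
    tau = css[rho - 1] / rho          # common shift applied to all entries
    return np.maximum(v - tau, 0.0)


def hybrid_memberships(x, centers, lam):
    """Memberships of point x under the ASSUMED linear-plus-quadratic objective.

    Minimizes sum_k (u_k * d_k + 0.5 * lam * u_k**2) over the simplex,
    where d_k is the squared Euclidean distance from x to center k.
    """
    d = np.sum((centers - x) ** 2, axis=1)
    return project_to_simplex(-d / lam)


x = np.array([0.0, 0.0])
centers = np.array([[0.0, 0.0], [1.0, 0.0], [10.0, 0.0]])
u = hybrid_memberships(x, centers, lam=2.0)
print(u)  # memberships 0.75, 0.25, and exactly 0 for the far center
```

In this toy setting a larger `lam` spreads membership over more clusters, while a smaller `lam` moves the assignment toward a crisp one; how the paper's own algorithm controls this trade-off is not specified in the abstract.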

Related articles:
arXiv:1612.05730 [stat.ML] (Published 2016-12-17)
Towards Wide Learning: Experiments in Healthcare
arXiv:1308.1196 [stat.ML] (Published 2013-08-06, updated 2018-02-23)
The Group Lasso for Design of Experiments
arXiv:1910.05484 [stat.ML] (Published 2019-10-12)
Bayesian Optimization using Pseudo-Points