Centroid-Based Clustering with αβ-Divergences
- PMID: 33266911
- PMCID: PMC7514678
- DOI: 10.3390/e21020196
Centroid-Based Clustering with αβ-Divergences
Abstract
Centroid-based clustering is a widely used technique within unsupervised learning algorithms in many research fields. The success of any centroid-based clustering relies on the choice of the similarity measure under use. In recent years, most studies focused on including several divergence measures in the traditional hard k-means algorithm. In this article, we consider the problem of centroid-based clustering using the family of α β -divergences, which is governed by two parameters, α and β . We propose a new iterative algorithm, α β -k-means, giving closed-form solutions for the computation of the sided centroids. The algorithm can be fine-tuned by means of this pair of values, yielding a wide range of the most frequently used divergences. Moreover, it is guaranteed to converge to local minima for a wide range of values of the pair ( α , β ). Our theoretical contribution has been validated by several experiments performed with synthetic and real data and exploring the ( α , β ) plane. The numerical results obtained confirm the quality of the algorithm and its suitability to be used in several practical applications.
Keywords: centroid-based clustering; k-means algorithm; musical genre clustering; unsupervised classification; αβ-divergence.
Conflict of interest statement
The authors declare no conflict of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Figures







Similar articles
-
Analysis of k-means clustering approach on the breast cancer Wisconsin dataset.Int J Comput Assist Radiol Surg. 2016 Nov;11(11):2033-2047. doi: 10.1007/s11548-016-1437-9. Epub 2016 Jun 16. Int J Comput Assist Radiol Surg. 2016. PMID: 27311823
-
An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.Comput Biol Med. 2017 Dec 1;91:213-221. doi: 10.1016/j.compbiomed.2017.10.014. Epub 2017 Oct 23. Comput Biol Med. 2017. PMID: 29100115
-
Centroid neural network for unsupervised competitive learning.IEEE Trans Neural Netw. 2000;11(2):520-8. doi: 10.1109/72.839021. IEEE Trans Neural Netw. 2000. PMID: 18249781
-
Comprehensive analysis of clustering algorithms: exploring limitations and innovative solutions.PeerJ Comput Sci. 2024 Aug 29;10:e2286. doi: 10.7717/peerj-cs.2286. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39314716 Free PMC article. Review.
-
Optimized Clustering Algorithms for Large Wireless Sensor Networks: A Review.Sensors (Basel). 2019 Jan 15;19(2):322. doi: 10.3390/s19020322. Sensors (Basel). 2019. PMID: 30650551 Free PMC article. Review.
Cited by
-
A Method for Unsupervised Semi-Quantification of Inmunohistochemical Staining with Beta Divergences.Entropy (Basel). 2022 Apr 13;24(4):546. doi: 10.3390/e24040546. Entropy (Basel). 2022. PMID: 35455209 Free PMC article.
-
Information Theory Applications in Signal Processing.Entropy (Basel). 2019 Jul 3;21(7):653. doi: 10.3390/e21070653. Entropy (Basel). 2019. PMID: 33267367 Free PMC article.
References
-
- Amari S. α-Divergence Is Unique, Belonging to Both f-Divergence and Bregman Divergence Classes. IEEE Trans. Inf. Theory. 2009;55:4925–4931. doi: 10.1109/TIT.2009.2030485. - DOI
-
- Taneja I.J., Kumar P. Relative Information of Type s, CsiszáR’s F-divergence, and Information Inequalities. Inf. Sci. 2004;166:105–125. doi: 10.1016/j.ins.2003.11.002. - DOI
-
- Cichocki A., Amari S.i. Families of Alpha-Beta- and Gamma-Divergences: Flexible and Robust Measures of Similarities. Entropy. 2010;12:1532–1568. doi: 10.3390/e12061532. - DOI
-
- Banerjee A., Merugu S., Dhillon I.S., Ghosh J. Clustering with Bregman Divergences. J. Mach. Learn. Res. 2005;6:1705–1749.
-
- Nielsen F., Nock R. Sided and Symmetrized Bregman Centroids. IEEE Trans. Inf. Theory. 2009;55:2882–2904. doi: 10.1109/TIT.2009.2018176. - DOI
Grants and funding
LinkOut - more resources
Full Text Sources