Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 12;20(6):e0326145.
doi: 10.1371/journal.pone.0326145. eCollection 2025.

Federated k-means based on clusters backbone

Affiliations

Federated k-means based on clusters backbone

Zilong Deng et al. PLoS One. .

Abstract

Federated clustering is a distributed clustering algorithm that does not require the transmission of raw data and is widely used. However, it struggles to handle Non-IID data effectively because it is difficult to obtain accurate global consistency measures under Non-Independent and Identically Distributed (Non-IID) conditions. To address this issue, we propose a federated k-means clustering algorithm based on a cluster backbone called FKmeansCB. First, we add Laplace noise to all the local data, and run k-means clustering on the client side to obtain cluster centers, which faithfully represent the cluster backbone (i.e., the data structures of the clusters). The cluster backbone represents the client's features and can approximatively capture the features of different labeled data points in Non-IID situations. We then upload these cluster centers to the server. Subsequently, the server aggregates all cluster centers and runs the k-means clustering algorithm to obtain global cluster centers, which are then sent back to the client. Finally, the client assigns all data points to the nearest global cluster center to produce the final clustering results. We have validated the performance of our proposed algorithm using six datasets, including the large-scale MNIST dataset. Compared with the leading non-federated and federated clustering algorithms, FKmeansCB offers significant advantages in both clustering accuracy and running time.

PubMed Disclaimer

Conflict of interest statement

No authors have competing interests.

Figures

Fig 1
Fig 1. A basic federated clustering framework.
Fig 2
Fig 2. The framework of the proposed FKmeansCB.
Fig 3
Fig 3. Running time comparison.
Fig 4
Fig 4. The influence of parametersk1 and k2 on clustering results.
Fig 5
Fig 5. Clustering results under different numbers of clients.
Fig 6
Fig 6. Clustering results on R15 and MNIST by using FKmeansCB.

References

    1. Ikotun AM, Ezugwu AE, Abualigah L, Abuhaija B, Heming J. K-means clustering algorithms: a comprehensive review, variants analysis, and advances in the era of big data. Inf Sci. 2023;622:178–210. doi: 10.1016/j.ins.2022.11.139 - DOI
    1. Liu X. Simplemkkm: simple multiple kernel k-means. IEEE Trans Pattern Anal Mach Intell. 2022;45:5174–86. - PubMed
    1. Xia C, Hua J, Tong W, Zhong S. Distributed K-means clustering guaranteeing local differential privacy. Comput Secur. 2020;90:101699.
    1. Tabianan K, Velu S, Ravi V. K-means clustering approach for intelligent customer segmentation using customer purchase behavior data. Sustainability. 2022;14(2022):7243.
    1. Basar S, Ali M, Ochoa-Ruiz G, Zareei M, Waheed A, Adnan A. Unsupervised color image segmentation: a case of RGB histogram based K-means clustering initialization. PLoS One. 2020;15(10):e0240015. doi: 10.1371/journal.pone.0240015 - DOI - PMC - PubMed

LinkOut - more resources