Reference Vector-guided Evolutionary Algorithm for cluster analysis of single-cell transcriptomes
- PMID: 40499345
- DOI: 10.1016/j.cmpb.2025.108873
Abstract
Background and objective: Single-cell RNA-sequencing (scRNA-seq) has revolutionized transcriptomic studies by providing detailed insights into gene expression profiles at the single-cell level. This technology allows researchers to capture expression patterns of thousands of genes across hundreds or thousands of individual cells. Clustering is a crucial step in the analysis of scRNA-seq data, since it enables the identification of distinct cell populations based on their transcriptomic profiles and serves as a foundation for downstream analysis. Given that clustering scRNA-seq data is a challenging task that involves different conflicting objectives, our goal is to tackle it from a multi-objective optimization perspective.
Methods: This study proposes a Reference Vector-guided Evolutionary Algorithm for Cluster Analysis of Single-cell Transcriptomes (RVEA-CAST), which treats clustering as a multi-objective optimization problem. The approach optimizes three objectives: clustering deviation, clustering compactness, and the Davies-Bouldin index. RVEA-CAST incorporates three problem-aware mutation operators, each designed to improve one of these objectives, orchestrated by a multi-objective search engine based on reference vectors.
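To make the objectives more concrete, the following minimal sketch scores a candidate clustering on two of them. It is an illustration under stated assumptions, not the RVEA-CAST implementation: the abstract does not define its objectives formally, so `overall_deviation` uses a common definition from the multi-objective clustering literature (sum of Euclidean distances from each point to its cluster centroid), and `davies_bouldin` uses the standard Davies-Bouldin formula. Clustering compactness is omitted because its definition cannot be inferred from the abstract. All function names are hypothetical.

```python
import math
from collections import defaultdict


def euclid(a, b):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def centroids(points, labels):
    """Map each cluster label to the mean of its member points."""
    groups = defaultdict(list)
    for point, label in zip(points, labels):
        groups[label].append(point)
    return {
        label: [sum(coords) / len(members) for coords in zip(*members)]
        for label, members in groups.items()
    }


def overall_deviation(points, labels):
    """Assumed definition: total distance of points to their own centroid."""
    cents = centroids(points, labels)
    return sum(euclid(p, cents[l]) for p, l in zip(points, labels))


def davies_bouldin(points, labels):
    """Standard Davies-Bouldin index (lower is better).

    Requires at least two clusters with distinct centroids.
    """
    cents = centroids(points, labels)
    dist_sum = defaultdict(float)
    count = defaultdict(int)
    for p, l in zip(points, labels):
        dist_sum[l] += euclid(p, cents[l])
        count[l] += 1
    # Scatter: mean distance of a cluster's points to its centroid.
    scatter = {l: dist_sum[l] / count[l] for l in cents}
    keys = list(cents)
    worst_ratios = [
        max(
            (scatter[i] + scatter[j]) / euclid(cents[i], cents[j])
            for j in keys
            if j != i
        )
        for i in keys
    ]
    return sum(worst_ratios) / len(keys)


# Toy data: two well-separated groups of two points each.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
good = [0, 0, 1, 1]  # groups the nearby points together
bad = [0, 1, 0, 1]   # mixes the two groups
```

On this toy data both scores are lower for `good` than for `bad`, which is the direction a minimizing search would favor. In a multi-objective setting such scores would be returned together as a vector rather than merged into a single value.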
Results: RVEA-CAST is evaluated on ten real scRNA-seq datasets using standard clustering evaluation metrics, namely Normalized Mutual Information (NMI) and Adjusted Rand Index (ARI). The results show improved performance and robustness compared with previously proposed methods. Specifically, statistically significant improvements of up to 66.7% and 261.5% were achieved for NMI and ARI, respectively. Furthermore, an analysis of differentially expressed genes in the predicted and real clusters showed greater agreement between the RVEA-CAST solutions and the actual cell populations, underscoring the biological relevance of the approach.
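For readers unfamiliar with the two evaluation metrics, the sketch below computes ARI and NMI from a pair of label lists using only the Python standard library. It is illustrative rather than the authors' evaluation code. Two assumptions are made: NMI is normalized by the arithmetic mean of the two label entropies (one common convention; the abstract does not say which was used), and the reported percentages are read as relative improvements over a baseline score, which `relative_improvement` (a hypothetical helper) expresses.

```python
from collections import Counter
from math import comb, log


def adjusted_rand_index(true, pred):
    """Adjusted Rand Index: 1.0 for identical partitions, about 0 for random ones."""
    n = len(true)
    sum_cells = sum(comb(c, 2) for c in Counter(zip(true, pred)).values())
    sum_true = sum(comb(c, 2) for c in Counter(true).values())
    sum_pred = sum(comb(c, 2) for c in Counter(pred).values())
    expected = sum_true * sum_pred / comb(n, 2)
    maximum = (sum_true + sum_pred) / 2
    if maximum == expected:  # degenerate case, e.g. a single cluster in both
        return 1.0
    return (sum_cells - expected) / (maximum - expected)


def entropy(labels):
    """Shannon entropy (natural log) of a label assignment."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def normalized_mutual_info(true, pred):
    """NMI, normalized by the arithmetic mean of the two entropies (assumed)."""
    n = len(true)
    count_true, count_pred = Counter(true), Counter(pred)
    mutual_info = sum(
        (c / n) * log(c * n / (count_true[t] * count_pred[p]))
        for (t, p), c in Counter(zip(true, pred)).items()
    )
    denom = (entropy(true) + entropy(pred)) / 2
    return mutual_info / denom if denom > 0 else 1.0


def relative_improvement(new, old):
    """Percentage gain of `new` over a positive baseline `old` (assumed reading)."""
    return (new - old) / old * 100.0


truth = [0, 0, 1, 1]
same_partition = [1, 1, 0, 0]  # identical grouping, different label names
crossed = [0, 1, 0, 1]         # splits every true cluster in half
```

Both metrics ignore the label names themselves, so `same_partition` scores as a perfect match against `truth`. Under the relative-improvement reading, a gain of 261.5% corresponds to a score about 3.6 times the baseline.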
Conclusions: The results indicate that RVEA-CAST is an effective and versatile approach for clustering scRNA-seq data, outperforming existing methods across diverse biological scenarios both in widely used clustering evaluation metrics and in biological relevance.
Keywords: Cluster analysis; Multi-objective optimization; Reference Vector-guided Evolutionary Algorithm; ScRNA-seq; Single-cell transcriptome.
Copyright © 2025 The Authors. Published by Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of competing interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.