Topological data analysis and machine learning for COVID-19 detection in CT scan lung images
- PMID: 40170109
- PMCID: PMC11963280
- DOI: 10.1186/s42490-025-00089-1
Abstract
COVID-19 has claimed the lives of thousands over the past years. Although pathogenic laboratory testing is the established standard, it has a notable false-negative rate. Consequently, alternative diagnostic approaches are urgently needed. In response to this need for accurate and parameter-free methods for COVID-19 identification, particularly in lung images, we introduce an approach that combines topological data analysis with machine learning. The methodology extracts persistent homology features from lung images, capturing the intrinsic topological properties of the data. These features then serve as inputs to several machine learning classifiers. Our primary objective is to achieve high accuracy in COVID-19 detection while demonstrating the effectiveness of these topological features. The experimental results show that the Random Forest classifier and the Support Vector Machine (SVM) outperform the other models in classifying CT scan lung images, with an accuracy of 97.5% for the Random Forest model and an AUC above 0.99 for the SVM. Two further experiments, one running the model on the same data without the topological features and one applying the same model with topological features to other data, showed the efficiency of these features in the classification task.
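The feature-extraction step described above can be illustrated with a small sketch. The abstract does not state which filtration or homology dimensions the authors use, so everything here is an assumption made for illustration: a sublevel-set filtration on pixel intensities, 4-connectivity, 0-dimensional homology only, and three summary statistics as features. The function names `sublevel_h0_persistence` and `persistence_features` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple


def sublevel_h0_persistence(image: List[List[float]]) -> List[Tuple[float, float]]:
    """Finite (birth, death) pairs of 0-dimensional sublevel-set persistence.

    Assumption: pixels are 4-connected. The component born at the global
    minimum never dies and is therefore not reported.
    """
    rows, cols = len(image), len(image[0])
    # Add pixels in order of increasing intensity (the sublevel filtration).
    order = sorted((image[r][c], r, c) for r in range(rows) for c in range(cols))
    parent = {}  # union-find forest over pixels added so far
    birth = {}   # birth value stored at each component root

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    pairs = []
    for value, r, c in order:
        node = (r, c)
        parent[node] = node
        birth[node] = value
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            neighbor = (r + dr, c + dc)
            if neighbor not in parent:
                continue  # outside the image or not yet in the filtration
            a, b = find(node), find(neighbor)
            if a == b:
                continue
            # Elder rule: the younger component (larger birth) dies here.
            if birth[a] < birth[b]:
                a, b = b, a
            if birth[a] < value:  # skip zero-persistence pairs
                pairs.append((birth[a], value))
            parent[a] = b
    return pairs


def persistence_features(pairs: List[Tuple[float, float]]) -> List[float]:
    """Summary vector: number of pairs, total persistence, maximum persistence."""
    lifetimes = [death - birth_value for birth_value, death in pairs]
    if not lifetimes:
        return [0, 0.0, 0.0]
    return [len(lifetimes), sum(lifetimes), max(lifetimes)]


# Toy 3x3 "image": four dark corners separated by a bright cross.
toy = [
    [0, 5, 1],
    [5, 5, 5],
    [2, 5, 3],
]
toy_pairs = sublevel_h0_persistence(toy)
toy_features = persistence_features(toy_pairs)
```

In a full pipeline, one such feature vector per CT image would form a row of the input matrix for a classifier, for example scikit-learn's `RandomForestClassifier` or `SVC`. Libraries such as GUDHI also compute higher-dimensional persistence on images, which this sketch omits.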
Keywords: COVID-19 detection; Lung images; Machine learning; Topological data analysis.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Ethics approval was deemed unnecessary under national regulations, as no relevant legislation exists in the country of origin of the data. The data were anonymized prior to use, and all reasonable steps were taken to ensure confidentiality and adherence to ethical standards for research involving medical data. Accordance: As clinical/pathological data are analyzed in this study, we confirm that all methods were performed in accordance with the Declaration of Helsinki. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.