Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 2;7(1):4.
doi: 10.1186/s42490-025-00089-1.

Topological data analysis and machine learning for COVID-19 detection in CT scan lung images

Affiliations

Topological data analysis and machine learning for COVID-19 detection in CT scan lung images

Rabih Assaf et al. BMC Biomed Eng. .

Abstract

COVID-19 has claimed the lives of thousands over the past years. Although pathogenic laboratory testing is the established standard, it carries a significant drawback with a notable rate of false negatives. Consequently, there is an urgent need for alternative diagnostic approaches to combat this threat. In response to this pressing need for accurate and parameter-free methods for COVID-19 identification, particularly within lung images, we introduce a novel approach that combines the principles of topological data analysis with the capabilities of machine learning. Our proposed methodology entails the extraction of persistent homology features from lung images, effectively capturing the intrinsic topological properties inherent in the data. These extracted persistent homology features then serve as inputs for various machine learning methods employed for classification purposes. Our primary objective is to achieve exceptional accuracy in the detection of COVID-19 all while showcasing the effectiveness of these topological features. The experimental results demonstrate that the Random Forest Classifier and the Support Vector Machine models outperform the rest, showcasing their effectiveness in classifying CT scan lung images with remarkable precision-an accuracy rate of 97.5% for the Random Forest model and an AUC score that surpasses 0.99 for the SVM. Results of the model on the same data after exclusion of the topological features and on other data with application of the same model with topological features showed the efficiency of these features in the classification task.

Keywords: COVID-19 detection; Lung images; Machine learning; Topological data analysis.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: The need for ethics approval is deemed unnecessary according to national regulations, no relevant legislation exists in the country of the origin of the data. The data was anonymized prior to use, and all reasonable steps were taken to ensure confidentiality and adherence to ethical standards for research involving medical data. Accordance: As clinical/pathological data is analyzed in this study, we confirm that all methods were performed in accordance with the Declaration of Helsinki. Consent for publication: Not applicable Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The flowchart for the computation of persistent homology classes
Fig. 2
Fig. 2
An example of a pixels’ complex composed from vertices, edges and squares
Fig. 3
Fig. 3
Example of filtration scheme of the pixels complex in Fig. 2
Fig. 4
Fig. 4
The flowchart for PHML modeling. It entails the selection of specific cubical complexes based on the data types in use. By tuning an appropriate filtration parameter, one can apply persistent homology (PH) analysis through custom software such as PHAT. The output generated by PH is then converted into feature vectors, which are subsequently integrated with supervised machine learning methods
Fig. 5
Fig. 5
Images (a) and (b) illustrate chest CT scans of individuals without any abnormalities, whereas images (c) and (d) represent scans of patients diagnosed with COVID-19

Similar articles

Cited by

References

    1. Ciotti M, et al. The COVID-19 pandemic. Crit Rev Clin Lab Sci. 2020;57:365–88. - PubMed
    1. Shi Y, et al. An overview of COVID-19. J Zhejiang Univ Sci B. 2020;21:343–60. - PMC - PubMed
    1. Peaper DR, Kerantzas CA, Durant TJS. Advances in molecular infectious diseases testing in the time of COVID-19. Clin Biochem. 2023;117:94–101. - PMC - PubMed
    1. Stolberg-Stolberg J, et al. COVID-19 rapid molecular point-of-care testing is effective and cost-beneficial for the acute care of trauma patients. Eur J Trauma Emerg Surg. 2023;49:487–93. - PMC - PubMed
    1. Hayden MK, et al. The infectious diseases Society of America Guidelines on the Diagnosis of COVID-19: antigen Testing. Clinl Infect Dis. 2023. 10.1093/cid/ciad032. - PubMed

LinkOut - more resources