Explaining machine learning based diagnosis of COVID-19 from routine blood tests with decision trees and criteria graphs
- PMID: 33812263
- PMCID: PMC7962588
- DOI: 10.1016/j.compbiomed.2021.104335
Explaining machine learning based diagnosis of COVID-19 from routine blood tests with decision trees and criteria graphs
Abstract
The sudden outbreak of coronavirus disease 2019 (COVID-19) revealed the need for fast and reliable automatic tools to help health teams. This paper aims to present understandable solutions based on Machine Learning (ML) techniques to deal with COVID-19 screening in routine blood tests. We tested different ML classifiers in a public dataset from the Hospital Albert Einstein, São Paulo, Brazil. After cleaning and pre-processing the data has 608 patients, of which 84 are positive for COVID-19 confirmed by RT-PCR. To understand the model decisions, we introduce (i) a local Decision Tree Explainer (DTX) for local explanation and (ii) a Criteria Graph to aggregate these explanations and portrait a global picture of the results. Random Forest (RF) classifier achieved the best results (accuracy 0.88, F1-score 0.76, sensitivity 0.66, specificity 0.91, and AUROC 0.86). By using DTX and Criteria Graph for cases confirmed by the RF, it was possible to find some patterns among the individuals able to aid the clinicians to understand the interconnection among the blood parameters either globally or on a case-by-case basis. The results are in accordance with the literature and the proposed methodology may be embedded in an electronic health record system.
Keywords: COVID–19; Criteria graph; Decision tree; Explainable artificial intelligence; Machine learning.
Copyright © 2021 Elsevier Ltd. All rights reserved.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures










Similar articles
-
Development and External Validation of a Machine Learning Tool to Rule Out COVID-19 Among Adults in the Emergency Department Using Routine Blood Tests: A Large, Multicenter, Real-World Study.J Med Internet Res. 2020 Dec 2;22(12):e24048. doi: 10.2196/24048. J Med Internet Res. 2020. PMID: 33226957 Free PMC article.
-
Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study.J Med Syst. 2020 Jul 1;44(8):135. doi: 10.1007/s10916-020-01597-4. J Med Syst. 2020. PMID: 32607737 Free PMC article.
-
Prediction of diagnosis and prognosis of COVID-19 disease by blood gas parameters using decision trees machine learning model: a retrospective observational study.Med Gas Res. 2022 Apr-Jun;12(2):60-66. doi: 10.4103/2045-9912.326002. Med Gas Res. 2022. PMID: 34677154 Free PMC article.
-
An Overview of Supervised Machine Learning Methods and Data Analysis for COVID-19 Detection.J Healthc Eng. 2021 Nov 22;2021:4733167. doi: 10.1155/2021/4733167. eCollection 2021. J Healthc Eng. 2021. PMID: 34853669 Free PMC article. Review.
-
A Survey of COVID-19 Diagnosis Using Routine Blood Tests with the Aid of Artificial Intelligence Techniques.Diagnostics (Basel). 2023 May 16;13(10):1749. doi: 10.3390/diagnostics13101749. Diagnostics (Basel). 2023. PMID: 37238232 Free PMC article. Review.
Cited by
-
Comparison of machine learning techniques to handle imbalanced COVID-19 CBC datasets.PeerJ Comput Sci. 2021 Aug 12;7:e670. doi: 10.7717/peerj-cs.670. eCollection 2021. PeerJ Comput Sci. 2021. PMID: 34458574 Free PMC article.
-
Integrating routine blood biomarkers and artificial intelligence for supporting diagnosis of silicosis in engineered stone workers.Bioeng Transl Med. 2024 Jun 28;9(6):e10694. doi: 10.1002/btm2.10694. eCollection 2024 Nov. Bioeng Transl Med. 2024. PMID: 39545094 Free PMC article.
-
Deep Generative Learning-Based 1-SVM Detectors for Unsupervised COVID-19 Infection Detection Using Blood Tests.IEEE Trans Instrum Meas. 2021 Nov 25;71:2500211. doi: 10.1109/TIM.2021.3130675. eCollection 2022. IEEE Trans Instrum Meas. 2021. PMID: 35582656 Free PMC article.
-
Identification of Pediatric Bacterial Gastroenteritis From Blood Counts and Interviews Based on Machine Learning.Cureus. 2023 Aug 17;15(8):e43644. doi: 10.7759/cureus.43644. eCollection 2023 Aug. Cureus. 2023. PMID: 37600437 Free PMC article.
-
A novel explainable COVID-19 diagnosis method by integration of feature selection with random forest.Inform Med Unlocked. 2022;30:100941. doi: 10.1016/j.imu.2022.100941. Epub 2022 Apr 6. Inform Med Unlocked. 2022. PMID: 35399333 Free PMC article.
References
-
- World Health Organization . 2020. Coronavirus Disease (Covid-19) Pandemic.https://www.who.int/emergencies/diseases/novel-coronavirus-2019 URL.
-
- Meng Z., Wang M., Song H., Guo S., Zhou Y., Li W., Zhou Y., Li M., Song X., Zhou Y., et al. medRxiv; 2020. Development and Utilization of an Intelligent Application for Aiding Covid-19 Diagnosis.
-
- Bullock J., Pham K.H., Lam C.S.N., Luengo-Oroz M., et al. 2020. Mapping the Landscape of Artificial Intelligence Applications against Covid-19; p. 11336. arXiv preprint arXiv:2003.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical