Development, evaluation, and validation of machine learning models for COVID-19 detection based on routine blood tests
- PMID: 33079698
- DOI: 10.1515/cclm-2020-1294
Abstract
Objectives: The rRT-PCR test, the current gold standard for the detection of coronavirus disease (COVID-19), has known shortcomings, such as long turnaround times, potential reagent shortages, false-negative rates of around 15-20%, and the need for expensive equipment. The hematochemical values of routine blood exams could offer a faster and less expensive alternative.
Methods: Three different training datasets of hematochemical values from 1,624 patients (52% COVID-19 positive), admitted to San Raphael Hospital (OSR) from February to May 2020, were used to develop machine learning (ML) models: the complete OSR dataset (72 features: complete blood count (CBC), biochemical, coagulation, blood gas analysis and CO-oximetry values, age, sex, and specific symptoms at triage) and two sub-datasets (the COVID-specific and CBC datasets, with 32 and 21 features respectively). A further 58 cases (50% COVID-19 positive) from another hospital, and 54 negative cases collected in 2018 at OSR, were used for internal-external and external validation.
Results: We developed five ML models. On the complete OSR dataset, the area under the receiver operating characteristic curve (AUC) of the algorithms ranged from 0.83 to 0.90; on the COVID-specific dataset, from 0.83 to 0.87; and on the CBC dataset, from 0.74 to 0.86. The validations also achieved good results: AUC ranged from 0.75 to 0.78 on the mixed cohort from the other hospital, and specificity ranged from 0.92 to 0.96 on the negative-only 2018 cohort.
Conclusions: ML can be applied to blood tests as both an adjunct and an alternative to rRT-PCR for the fast and cost-effective identification of COVID-19-positive patients. This is especially useful in developing countries, or in countries facing a rise in infections.
Keywords: COVID-19; SARS-CoV-2; blood laboratory tests; complete blood count; gradient boosted decision tree; machine learning.
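The abstract reports AUC for the cohorts that contain both positive and negative cases, and specificity for the validation set made up only of negative patients. The following is a minimal sketch of how these two metrics are commonly computed, not the authors' code. The function names, the 0.5 decision threshold, and all scores are illustrative assumptions; none of the numbers come from the study.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive case outscores a
    random negative case; ties count as half (Mann-Whitney formulation)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def specificity(labels: Sequence[int], scores: Sequence[float],
                threshold: float = 0.5) -> float:
    """TN / (TN + FP): share of negative cases scored below the threshold.
    The 0.5 default is an assumption made for this sketch."""
    neg_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not neg_scores:
        raise ValueError("specificity needs at least one negative case")
    true_neg = sum(1 for s in neg_scores if s < threshold)
    return true_neg / len(neg_scores)


# Made-up toy scores (hypothetical model outputs), NOT data from the study.
mixed_labels = [1, 1, 1, 0, 0, 0]
mixed_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.1]

# A cohort with negatives only, loosely analogous to a pre-pandemic sample.
negative_only_labels = [0, 0, 0, 0]
negative_only_scores = [0.2, 0.1, 0.7, 0.3]

auc_mixed = roc_auc(mixed_labels, mixed_scores)
spec_negative_only = specificity(negative_only_labels, negative_only_scores)
print(f"AUC on mixed cohort: {auc_mixed:.3f}")
print(f"Specificity on negative-only cohort: {spec_negative_only:.2f}")
```

A cohort without positive cases has no positive-versus-negative pairs to rank, so AUC is undefined there (the sketch raises an error), which is why a negative-only validation set can support a specificity estimate but not an AUC.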