Machine learning-assisted prediction of pneumonia based on non-invasive measures
- PMID: 35968461
- PMCID: PMC9371749
- DOI: 10.3389/fpubh.2022.938801
Machine learning-assisted prediction of pneumonia based on non-invasive measures
Abstract
Background: Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases via non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diagnostic uncertainty associated with bacterial pneumonia. Herein, this study conducted several experiments using eight machine learning models to predict pneumonia based on biomarkers, laboratory parameters, and physical features.
Methods: We perform machine-learning analysis on 535 different patients, each with 45 features. Data normalization to rescale all real-valued features was performed. Since it is a binary problem, we categorized each patient into one class at a time. We designed three experiments to evaluate the models: (1) feature selection techniques to select appropriate features for the models, (2) experiments on the imbalanced original dataset, and (3) experiments on the SMOTE data. We then compared eight machine learning models to evaluate their effectiveness in predicting pneumonia.
Results: Biomarkers such as C-reactive protein and procalcitonin demonstrated the most significant discriminating power. Ensemble machine learning models such as RF (accuracy = 92.0%, precision = 91.3%, recall = 96.0%, f1-Score = 93.6%) and XGBoost (accuracy = 90.8%, precision = 92.6%, recall = 92.3%, f1-score = 92.4%) achieved the highest performance accuracy on the original dataset with AUCs of 0.96 and 0.97, respectively. On the SMOTE dataset, RF and XGBoost achieved the highest prediction results with f1-scores of 92.0 and 91.2%, respectively. Also, AUC of 0.97 was achieved for both RF and XGBoost models.
Conclusions: Our models showed that in the diagnosis of pneumonia, individual clinical history, laboratory indicators, and symptoms do not have adequate discriminatory power. We can also conclude that the ensemble ML models performed better in this study.
Keywords: decision support system (DSS); electronic health records (EHR); machine learning; non-invasive measures; pneumonia.
Copyright © 2022 Effah, Miao, Drokow, Agboyibor, Qiao, Wu, Miao and Wang.
Conflict of interest statement
Author YaW was employed by Center of Health Management, General Hospital of Anyang Iron and Steel Group Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures













Similar articles
-
Hospital mortality prediction in traumatic injuries patients: comparing different SMOTE-based machine learning algorithms.BMC Med Res Methodol. 2023 Apr 22;23(1):101. doi: 10.1186/s12874-023-01920-w. BMC Med Res Methodol. 2023. PMID: 37087425 Free PMC article.
-
Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133. J Med Internet Res. 2020. PMID: 32866108 Free PMC article.
-
Comparing large scale and selected feature learning for community acquired pneumonia prognosis prediction using clinical data: a stacked ensemble approach.Sci Rep. 2025 Apr 10;15(1):12319. doi: 10.1038/s41598-025-95941-8. Sci Rep. 2025. PMID: 40210962 Free PMC article.
-
Predicting post-stroke pneumonia using deep neural network approaches.Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1. Int J Med Inform. 2019. PMID: 31629312
-
A machine learning approach for mortality prediction only using non-invasive parameters.Med Biol Eng Comput. 2020 Oct;58(10):2195-2238. doi: 10.1007/s11517-020-02174-0. Epub 2020 Jul 20. Med Biol Eng Comput. 2020. PMID: 32691219 Review.
Cited by
-
Efficient federated learning for pediatric pneumonia on chest X-ray classification.Sci Rep. 2024 Oct 7;14(1):23272. doi: 10.1038/s41598-024-74491-5. Sci Rep. 2024. PMID: 39375440 Free PMC article.
-
Machine Learning Approaches for the Prediction of Hepatitis B and C Seropositivity.Int J Environ Res Public Health. 2023 Jan 29;20(3):2380. doi: 10.3390/ijerph20032380. Int J Environ Res Public Health. 2023. PMID: 36767747 Free PMC article.
-
Determinants of community-acquired pneumonia among under-five children in Awi Zone, Northwest Ethiopia.Front Public Health. 2025 May 1;13:1511263. doi: 10.3389/fpubh.2025.1511263. eCollection 2025. Front Public Health. 2025. PMID: 40376063 Free PMC article.
-
Predicting omicron pneumonia severity and outcome: a single-center study in Hangzhou, China.Front Med (Lausanne). 2023 May 26;10:1192376. doi: 10.3389/fmed.2023.1192376. eCollection 2023. Front Med (Lausanne). 2023. PMID: 37305146 Free PMC article.
-
Prediction of HELLP Syndrome Severity Using Machine Learning Algorithms-Results from a Retrospective Study.Diagnostics (Basel). 2023 Jan 12;13(2):287. doi: 10.3390/diagnostics13020287. Diagnostics (Basel). 2023. PMID: 36673097 Free PMC article.
References
-
- O'Brien KL, Baggett HC, Brooks WA, Feikin DR, Hammitt LL, Higdon MM, et al. . Causes of severe pneumonia requiring hospital admission in children without HIV infection from Africa and Asia: the PERCH multi-country case-control study. Lancet. (2019) 394:757–79. 10.1016/S0140-6736(19)30721-4 - DOI - PMC - PubMed
-
- World Health Organization pneumonia vaccine trial investigator' group . Standardization of Interpretation of Chest Radiographs for the Diagnosis of Pneumonia in Children. (2001). p. 1–39.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Research Materials