Prediction of bacteremia using routine hematological and metabolic parameters based on logistic regression and random forest models
- PMID: 40792110
- PMCID: PMC12336153
- DOI: 10.3389/fcimb.2025.1605485
Prediction of bacteremia using routine hematological and metabolic parameters based on logistic regression and random forest models
Abstract
Background: This study aimed to evaluate the predictive utility of routine hematological, inflammatory, and metabolic markers for bacteremia and to compare the classification performance of logistic regression and random forest models.
Methods: A retrospective study was conducted on 287 inpatients who underwent blood culture testing at Fuding Hospital, Fujian University of Traditional Chinese Medicine between March and August 2024. Patients were divided into bacteremia (n = 137) and non-bacteremia (n = 150) groups based on blood culture results. Hematological indices, inflammatory markers (e.g., C-reactive protein (CRP), procalcitonin (PCT)), metabolic indices (e.g., glucose, cholesterol) and nutritional markers (e.g., albumin) were analyzed. Univariate and multivariate binary logistic regression analyses were used to identify independent risk factors. Logistic regression and random forest models were developed using 33 features with a 70:30 train-test split and evaluated using the receiver operating characteristic (ROC) curves, confusion matrices and standard classification.
Results: Hemoglobin, cholesterol, and albumin levels were significantly lower in the bacteremia group, while platelet count, CRP, PCT, glucose, and triglycerides were significantly elevated (all p < 0.05). Logistic regression identified platelet count (Odds ratios (OR) = 1.003, 95% confidence interval (CI): 1.001-1.006), PCT (OR = 1.032, 95% CI: 1.004-1.060), triglycerides (OR = 1.740, 95% CI: 1.052-2.879), and low cholesterol (OR = 0.523, 95% CI: 0.383-0.714) as independent risk factors. The area under the ROC curve (AUC) was 0.75 for the random forest model and 0.74 for logistic regression, with recall rates of 0.69 and 0.60, respectively.
Conclusion: Routine laboratory markers integrated into machine learning models demonstrated potential for early bacteremia prediction. Random forest exhibited superior sensitivity compared to logistic regression, suggesting its potential utility as a clinical screening tool.
Keywords: bacteremia; biomarkers; blood culture; logistic regression; machine learning; random forest.
Copyright © 2025 Wang, Zhuo, Lv, Shi, Yao, Zhang and Shi.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures


Similar articles
-
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23. Clin Orthop Relat Res. 2024. PMID: 39051924
-
[Predictive value of plasma exosome count for the prognosis in septic patients].Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Jul;36(7):687-692. doi: 10.3760/cma.j.cn121430-20230717-00529. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024. PMID: 39223881 Chinese.
-
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238. JMIR Med Inform. 2025. PMID: 40512995 Free PMC article.
-
Accuracy of routine laboratory tests to predict mortality and deterioration to severe or critical COVID-19 in people with SARS-CoV-2.Cochrane Database Syst Rev. 2024 Aug 6;8(8):CD015050. doi: 10.1002/14651858.CD015050.pub2. Cochrane Database Syst Rev. 2024. PMID: 39105481 Free PMC article.
-
Procalcitonin to initiate or discontinue antibiotics in acute respiratory tract infections.Cochrane Database Syst Rev. 2017 Oct 12;10(10):CD007498. doi: 10.1002/14651858.CD007498.pub3. Cochrane Database Syst Rev. 2017. PMID: 29025194 Free PMC article.
References
-
- Evans L., Rhodes A., Alhazzani W., Antonelli M., Coopersmith C. M., French C., et al. (2021). Prescott HC et al: Surviving Sepsis Campaign: International Guidelines for Management of Sepsis and Septic Shock 2021. Crit. Care Med. 49, e1063–e1143. doi: 10.1097/CCM.0000000000005337, PMID: - DOI - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous