Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 28;3(1):21.
doi: 10.1038/s44259-025-00090-7.

Integrating socioeconomic deprivation indices and electronic health record data to predict antimicrobial resistance

Affiliations

Integrating socioeconomic deprivation indices and electronic health record data to predict antimicrobial resistance

Marlon I Diaz et al. NPJ Antimicrob Resist. .

Abstract

We developed machine learning models to predict the presence of AMR organisms in blood cultures obtained at the first patient encounter, offering a new and inspiring direction for antimicrobial resistance management. Three supervised machine learning classifiers were used: penalized logistic regression, random forest, and XGBoost, which were used to classify five AMR organisms: ESBL, CRE, AmpC, MRSA, and VRE. The random forest and XGBoost models performed best, with AUC-ROC values of 0.70 and 92.9% negative predictive value, respectively. The multi-class random forest model's AUC-ROC values ranged from 0.80-0.95. Our models highlight how the combination of ADI and SVI increased the predictive power. This approach could reduce costs and mitigate the global public health threat posed by antibiotic-resistant infections. Machine learning techniques can predict antimicrobial-resistant infections in suspected cultures using patient data from EHRs, enabling clinicians to make targeted prescribing decisions and mitigate resistance development.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Combined AUC ROC for binary classification models.
The combined area under the receiver operating characteristics curves (AUC ROC) for the logistic regression, random forest, and XGBoost models with the logistic regression represented by the dark blue line, random forest with the cyan line, and XGBoost with the magenta line.
Fig. 2
Fig. 2
Feature importance for Random Forest Model.
Fig. 3
Fig. 3. AUC ROC for multi-class classification of 5 AMR organisms of interest.
AUC ROC plots for the three described models: logistic regression, random forest, and XGBoost. a ROC-AUC curve of ESBL; b ROC-AUC curve of CRE; c ROC-AUC curve of VRE; d ROC-AUC curve of MRSA; e ROC-AUC curve of AmpC. For each plot, the AUC ROCs for each model of interest are shown with the following color palette: a cyan solid line for Random Forest model, a light purple solid line for Logistic Regression, a pink solid line for XGBoost.

References

    1. Singer, M. et al. The Third International Consensus definitions for sepsis and septic shock (Sepsis-3). Jama315, 801–810 (2016). - PMC - PubMed
    1. Gotts, J. E., Matthay, M. A. Sepsis: pathophysiology and clinical management. BMJ.:i1585. (2016) - PubMed
    1. Rudd, K. E. et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study. Lancet395, 200–211 (2020). - PMC - PubMed
    1. Rhodes, A. et al. Surviving Sepsis Campaign: international guidelines for management of sepsis and septic shock: 2016. Intensive Care Med.43, 304–377 (2017). - PubMed
    1. Tsachouridou, O. et al. Mortality due to multidrug-resistant Gram-negative bacteremia in an endemic region: no better than a toss of a coin. Microorganisms11, 1711 (2023). - PMC - PubMed

LinkOut - more resources