Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 5:49:101163.
doi: 10.1016/j.lanepe.2024.101163. eCollection 2025 Feb.

Development and validation of artificial intelligence models for early detection of postoperative infections (PERISCOPE): a multicentre study using electronic health record data

Collaborators, Affiliations

Development and validation of artificial intelligence models for early detection of postoperative infections (PERISCOPE): a multicentre study using electronic health record data

Siri L van der Meijden et al. Lancet Reg Health Eur. .

Abstract

Background: Postoperative infections significantly impact patient outcomes and costs, exacerbated by late diagnoses, yet early reliable predictors are scarce. Existing artificial intelligence (AI) models for postoperative infection prediction often lack external validation or perform poorly in local settings when validated. We aimed to develop locally valid models as part of the PERISCOPE AI system to enable early detection, safer discharge, and more timely treatment of patients.

Methods: We developed and validated XGBoost models to predict postoperative infections within 7 and 30 days of surgery. Using retrospective pre-operative and intra-operative electronic health record data from 2014 to 2023 across various surgical specialities, the models were developed at Hospital A and validated and updated at Hospitals B and C in the Netherlands and Belgium. Model performance was evaluated before and after updating using the two most recent years of data as temporal validation datasets. Main outcome measures were model discrimination (area under the receiver operating characteristic curve (AUROC)), calibration (slope, intercept, and plots), and clinical utility (decision curve analysis with net benefit).

Findings: The study included 253,010 surgical procedures with 23,903 infections within 30-days. Discriminative performance, calibration properties, and clinical utility significantly improved after updating. Final AUROCs after updating for Hospitals A, B, and C were 0.82 (95% confidence interval (CI) 0.81-0.83), 0.82 (95% CI 0.81-0.83), and 0.91 (95% CI 0.90-0.91) respectively for 30-day predictions on the temporal validation datasets (2022-2023). Calibration plots demonstrated adequate correspondence between observed outcomes and predicted risk. All local models were deemed clinically useful as the net benefit was higher than default strategies (treat all and treat none) over a wide range of clinically relevant decision thresholds.

Interpretation: PERISCOPE can accurately predict overall postoperative infections within 7- and 30-days post-surgery. The robust performance implies potential for improving clinical care in diverse clinical target populations. This study supports the need for approaches to local updating of AI models to account for domain shifts in patient populations and data distributions across different clinical settings.

Funding: This study was funded by a REACT EU grant from European Regional Development Fund (ERDF) and Kansen voor West.

Keywords: Artificial intelligence; Clinical utility; Model updating; Multi-centre validation; Postoperative infection.

PubMed Disclaimer

Conflict of interest statement

BFG is currently CEO and major shareholder of Healthplus.ai B.V. and subsidiaries. BFG has also received consulting fees from Philips NV and Edwards Lifesciences LLC. SLvdM and LJS are employees at Healthplus.ai B.V. SLvdM and LJS own stock options in Healthplus.ai B.V. HvG is an advisor for Healthplus.ai. MGJdB is the current president of the Dutch foundation for Antimicrobial Policies (Dutch acronym SWAB) and honorary secretary of ESCMID's Study Group on Antimicrobial Policies (ESGAP).

Figures

Fig. 1
Fig. 1
Calibration plots of final, locally updated models on the validation datasets (years 2022–2023): a) Hospital A – 7 day predictions, b) Hospital A – 30 day predictions, c) Hospital B – 7 day predictions, d) Hospital B – 30 day predictions, e) Hospital C – 7 day predictions, and f) Hospital C – 30 day predictions. The calibration curves (upper plots) show the agreement between the predicted probabilities by the model and the outcome of patients for those predictions. The histograms in the lower plot show the distributions of predictions.
Fig. 2
Fig. 2
Decision curves showing the net benefit for PERISCOPE's models before and after updating compared to ‘treat all’ and ‘treat none’ patients for validation datasets (years 2022–2023): a) Hospital A – 7 day predictions, b) Hospital A – 30 day predictions, c) Hospital B – 7 day predictions, d) Hospital B – 30 day predictions, e) Hospital C – 7 day predictions, and f) Hospital C – 30 day predictions. The unit of net benefit is ‘true positives’. At lower decision thresholds, the end-user is more worried about the disease, i.e., willing to accept more false positives, and at higher decision thresholds, the end-user is more worried about the intervention following the prediction. To be of clinical value, net benefit should be higher than zero and default strategies in the established range of clinically relevant risk thresholds.
Fig. 3
Fig. 3
Model explainability in terms of SHAP (SHapley Additive exPlanations) values for: a) Hospital A – 30 day predictions, b) Hospital B – 30 day predictions, and c) Hospital C – 30 day predictions. For Hospitals B and C, updated models were used for predictions. In the SHAP plot, red dots represent high feature values and blue dots represent low feature values. Dots on the right side of the y-axis indicate a contribution to a higher predicted risk of infection, while dots on the left side indicate a lower risk. Grey dots signify missing feature values.

References

    1. Eappen S., Lane B.H., Rosenberg B., et al. Relationship between occurrence of surgical complications and hospital finances. JAMA. 2013;309(15):1599–1606. - PubMed
    1. Wan Y.I., Patel A., Achary C., et al. Postoperative infection and mortality following elective surgery in the International Surgical Outcomes Study (ISOS) Br J Surg. 2021;108(2):220–227. - PubMed
    1. de Vries E.N., Ramrattan M.A., Smorenburg S.M., Gouma D.J., Boermeester M.A. The incidence and nature of in-hospital adverse events: a systematic review. Qual Saf Health Care. 2008;17(3):216–223. - PMC - PubMed
    1. Ghaferi A.A., Birkmeyer J.D., Dimick J.B. Hospital volume and failure to rescue with high-risk surgery. Med Care. 2011;49(12):1076–1081. - PubMed
    1. Gawande A.A., Thomas E.J., Zinner M.J., Brennan T.A. The incidence and nature of surgical adverse events in Colorado and Utah in 1992. Surgery. 1999;126(1):66–75. - PubMed

LinkOut - more resources