Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 21;13(1):1224.
doi: 10.1038/s41598-023-27418-5.

Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks

Affiliations

Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks

Benjamin Shickel et al. Sci Rep. .

Abstract

Accurate prediction of postoperative complications can inform shared decisions regarding prognosis, preoperative risk-reduction, and postoperative resource use. We hypothesized that multi-task deep learning models would outperform conventional machine learning models in predicting postoperative complications, and that integrating high-resolution intraoperative physiological time series would result in more granular and personalized health representations that would improve prognostication compared to preoperative predictions. In a longitudinal cohort study of 56,242 patients undergoing 67,481 inpatient surgical procedures at a university medical center, we compared deep learning models with random forests and XGBoost for predicting nine common postoperative complications using preoperative, intraoperative, and perioperative patient data. Our study indicated several significant results across experimental settings that suggest the utility of deep learning for capturing more precise representations of patient health for augmented surgical decision support. Multi-task learning improved efficiency by reducing computational resources without compromising predictive performance. Integrated gradients interpretability mechanisms identified potentially modifiable risk factors for each complication. Monte Carlo dropout methods provided a quantitative measure of prediction uncertainty that has the potential to enhance clinical trust. Multi-task learning, interpretability mechanisms, and uncertainty metrics demonstrated potential to facilitate effective clinical implementation.

PubMed Disclaimer

Conflict of interest statement

The authors have no competing interests as defined by Nature Research, or other interests that might be perceived to influence the results and/or discussion reported in this paper.

Figures

Figure 1
Figure 1
Classification accuracy compared with baseline models. Shown are area under the receiver operating characteristic curve (AUROC) results for random forest and XGBoost models, individual deep learning models independently trained on each outcome, and a combined multi-task jointly trained on all outcomes, using only preoperative features (a), only intraoperative features (b), and both preoperative and intraoperative features (c). A comparison of multi-task deep learning results at three stages of prediction is shown in (d).
Figure 2
Figure 2
Temporal integrated gradients feature attributions for example patient experiencing prolonged mechanical ventilation. The multi-task deep learning model correctly predicted elevated risk of prolonged mechanical ventilation after integrating multivariate intraoperative time series. Physiological time series labeled by variable (left) and value range (right). Implementation of integrated gradients highlighted physiological patterns important for updated risk prediction, including a rapid increase in heart rate and ETCO2, fluctuations in PIP, and changes in SPO2. ETCO2, end-tidal carbon dioxide; PIP, peak inspiratory pressure; SPO2, blood oxygen saturation.
Figure 3
Figure 3
Data processing pipeline and deep learning model architecture. Patient-level input variables were split into static preoperative data and temporal intraoperative data. Preoperative variables were split into continuous, binary, and high-cardinality features and followed variable-specific preprocessing procedures. Deep learning model architecture utilized a data fusion design combining latent representations of high-frequency intraoperative data (from a bidirectional recurrent neural network) and static preoperative patient data (from fully connected layers) for eventual multi-task prediction of nine postoperative complications.

References

    1. Elixhauser A, Andrews RM. Profile of inpatient operating room procedures in US hospitals in 2007. Arch. Surg. 2010;145:1201–1208. doi: 10.1001/archsurg.2010.269. - DOI - PubMed
    1. Dimick JB, et al. Hospital costs associated with surgical complications: a report from the private-sector National Surgical Quality Improvement Program. J. Am. Coll. Surg. 2004;199:531–537. doi: 10.1016/j.jamcollsurg.2004.05.276. - DOI - PubMed
    1. Healey MA, Shackford SR, Osler TM, Rogers FB, Burns E. Complications in surgical patients. Arch. Surg. 2002;137:611–617. doi: 10.1001/archsurg.137.5.611. - DOI - PubMed
    1. Cohen ME, Liu Y, Ko CY, Hall BL. An examination of American College of Surgeons NSQIP surgical risk calculator accuracy. J. Am. Coll. Surg. 2017;224:787–795e1. doi: 10.1016/j.jamcollsurg.2016.12.057. - DOI - PubMed
    1. Leeds IL, et al. Eye of the beholder: risk calculators and barriers to adoption in surgical trainees. Surgery. 2018;164:1117–1123. doi: 10.1016/j.surg.2018.07.002. - DOI - PMC - PubMed

Publication types