Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 19;31(6):3563-3578.
doi: 10.3390/curroncol31060262.

Testing Machine Learning Models to Predict Postoperative Ileus after Colorectal Surgery

Affiliations

Testing Machine Learning Models to Predict Postoperative Ileus after Colorectal Surgery

Garry Brydges et al. Curr Oncol. .

Abstract

Background: Postoperative ileus (POI) is a common complication after colorectal surgery, leading to increased hospital stay and costs. This study aimed to explore patient comorbidities that contribute to the development of POI in the colorectal surgical population and compare machine learning (ML) model accuracy to existing risk instruments. Study Design: In a retrospective study, data were collected on 316 adult patients who underwent colorectal surgery from January 2020 to December 2021. The study excluded patients undergoing multi-visceral resections, re-operations, or combined primary and metastatic resections. Patients lacking follow-up within 90 days after surgery were also excluded. Eight different ML models were trained and cross-validated using 29 patient comorbidities and four comorbidity risk indices (ASA Status, NSQIP, CCI, and ECI). Results: The study found that 6.33% of patients experienced POI. Age, BMI, gender, kidney disease, anemia, arrhythmia, rheumatoid arthritis, and NSQIP score were identified as significant predictors of POI. The ML models with the greatest accuracy were AdaBoost tuned with grid search (94.2%) and XG Boost tuned with grid search (85.2%). Conclusions: This study suggests that ML models can predict the risk of POI with high accuracy and may offer a new frontier in early detection and intervention for postoperative outcome optimization. ML models can greatly improve the prediction and prevention of POI in colorectal surgery patients, which can lead to improved patient outcomes and reduced healthcare costs. Further research is required to validate and assess the replicability of these results.

Keywords: artificial intelligence; colorectal cancer; comorbidities; machine learning; neural networks; postoperative ileus.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Demographics—Age (A) and BMI (B) by Gender. Blue: Male, Orange: Female.
Figure 2
Figure 2
Postoperative Ileus by Surgery Type (A) and Surgical Approach (B).
Figure 2
Figure 2
Postoperative Ileus by Surgery Type (A) and Surgical Approach (B).
Figure 3
Figure 3
SHAP Summary Plot for Co-Morbidity Contribution to POI.
Figure 4
Figure 4
Receiver Operating Characteristics for ML Models Predicting POI. Area under the receiver–operator curve for each machine learning (ML) model was calculated. (a) logistic regression model. (b) Random forest model. (c) Bagging classifier Model. (d) AdaBoost (tuned) model. (e) XGBoost (tuned) model. ROC curves for grid and random search tuning for both AdaBoost and XGboost classifier models are similar and so not represented in the figure. ROC curves for stacking classifier were overfitted (as were bagging classifier and random forest models—included as an example) and not represented in the figure. The diagonal red line divides the ROC space. Points above the diagonal represent good classification results (better than random); points below the line represent bad results (worse than random).Any model which is overfitted would not be useful for accurate predictions or conclusions, as it would not discriminate from the training data and defeats the very purpose of machine learning. Thus, based on accuracy, only the highest functioning models were included due to overfitting. For example, panel (b,c) are overfitting [5,9,11,12].
Figure 4
Figure 4
Receiver Operating Characteristics for ML Models Predicting POI. Area under the receiver–operator curve for each machine learning (ML) model was calculated. (a) logistic regression model. (b) Random forest model. (c) Bagging classifier Model. (d) AdaBoost (tuned) model. (e) XGBoost (tuned) model. ROC curves for grid and random search tuning for both AdaBoost and XGboost classifier models are similar and so not represented in the figure. ROC curves for stacking classifier were overfitted (as were bagging classifier and random forest models—included as an example) and not represented in the figure. The diagonal red line divides the ROC space. Points above the diagonal represent good classification results (better than random); points below the line represent bad results (worse than random).Any model which is overfitted would not be useful for accurate predictions or conclusions, as it would not discriminate from the training data and defeats the very purpose of machine learning. Thus, based on accuracy, only the highest functioning models were included due to overfitting. For example, panel (b,c) are overfitting [5,9,11,12].

References

    1. Merath K., Hyer J.M., Mehta R., Farooq A., Bagante F., Sahara K., Tsilimigras D.I., Beal E., Paredes A.Z., Wu L., et al. Use of Machine Learning for Prediction of Patient Risk of Postoperative Complications After Liver, Pancreatic, and Colorectal Surgery. J. Gastrointest. Surg. 2020;24:1843–1851. doi: 10.1007/s11605-019-04338-2. - DOI - PubMed
    1. Steinberg S.M., Popa M.R., Michalek J.A., Bethel M.J., Ellison E.C. Comparison of Risk Adjustment Methodologies in Surgical Quality Improvement. Surgery. 2008;144:662–669. doi: 10.1016/j.surg.2008.06.010. - DOI - PubMed
    1. Bellman R. Control Theory. Sci. Am. 1964;211:186–201. doi: 10.1038/scientificamerican0964-186. - DOI - PubMed
    1. Baloch Z.Q., Raza S.A., Pathak R., Marone L., Ali A. Machine Learning Confirms Nonlinear Relationship between Severity of Peripheral Arterial Disease, Functional Limitation and Symptom Severity. Diagnostics. 2020;10:515. doi: 10.3390/diagnostics10080515. - DOI - PMC - PubMed
    1. Lones M.A. How to Avoid Machine Learning Pitfalls: A Guide for Academic Researchers. arXiv. 20232108.02497