Int J Gen Med. 2023 May 18:16:1909-1925.

doi: 10.2147/IJGM.S408770. eCollection 2023.

Ten-Year Multicenter Retrospective Study Utilizing Machine Learning Algorithms to Identify Patients at High Risk of Venous Thromboembolism After Radical Gastrectomy

Yuan Liu^#¹, Chen Song^#¹, Zhiqiang Tian¹, Wei Shen¹

Affiliation

¹ Department of General Surgery, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, People's Republic of China.

^# Contributed equally.

PMID: 37228741
PMCID: PMC10202705
DOI: 10.2147/IJGM.S408770

Yuan Liu et al. Int J Gen Med. 2023.

Abstract

Purpose: This study aims to construct a machine learning model that can recognize preoperative, intraoperative, and postoperative high-risk indicators and predict the onset of venous thromboembolism (VTE) in patients.

Patients and methods: A total of 1239 patients diagnosed with gastric cancer were enrolled in this retrospective study, among whom 107 patients developed VTE after surgery. We collected 42 characteristic variables of gastric cancer patients from the database of Wuxi People's Hospital and Wuxi Second People's Hospital between 2010 and 2020, including patients' demographic characteristics, chronic medical history, laboratory test characteristics, surgical information, and patients' postoperative conditions. Four machine learning algorithms, namely, extreme gradient boosting (XGBoost), random forest (RF), support vector machine (SVM), and k-nearest neighbor (KNN), were employed to develop predictive models. We also utilized Shapley additive explanation (SHAP) for model interpretation and evaluated the models using k-fold cross-validation, receiver operating characteristic (ROC) curves, calibration curves, decision curve analysis (DCA), and external validation metrics.
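The abstract does not say how the k folds were formed. With 107 VTE cases among 1239 patients, one common choice is to stratify the folds by outcome so that each fold holds a similar share of events. The following is a minimal sketch of that idea using only the Python standard library; the function name `stratified_kfold_indices`, the choice of k = 5, and the use of stratification itself are assumptions for illustration, not details taken from the study.

```python
import random
from collections import defaultdict


def stratified_kfold_indices(labels, k=5, seed=0):
    """Split sample indices into k folds with a similar class ratio in each fold.

    Hypothetical helper for illustration; the study's actual splitting
    procedure is not described in the abstract.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)

    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class out round-robin so every fold gets its share.
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    return folds


# Toy labels that only reproduce the class counts reported in the abstract:
# 107 VTE cases (1) and 1132 non-cases (0). No patient data is involved.
labels = [1] * 107 + [0] * (1239 - 107)
folds = stratified_kfold_indices(labels, k=5)

for i, fold in enumerate(folds):
    cases = sum(labels[j] for j in fold)
    print(f"fold {i}: n={len(fold)}, VTE cases={cases}")
```

Each fold then serves once as the held-out set while the model is fitted on the remaining folds.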

Results: The XGBoost algorithm demonstrated superior performance compared to the other three prediction models. The area under the curve (AUC) value for XGBoost was 0.989 in the training set and 0.912 in the validation set, indicating high prediction accuracy. Furthermore, the AUC value of the external validation set was 0.85, signifying good extrapolation of the XGBoost prediction model. The results of SHAP analysis revealed that several factors, including higher body mass index (BMI), history of adjuvant radiotherapy and chemotherapy, T-stage of the tumor, lymph node metastasis, central venous catheter use, high intraoperative bleeding, and long operative time, were significantly associated with postoperative VTE.
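For readers unfamiliar with the metric, the AUC can be read as the probability that a randomly chosen patient who developed VTE receives a higher predicted risk than a randomly chosen patient who did not. The sketch below computes it directly from that definition with the Python standard library; the function name `roc_auc` and the toy scores are illustrative assumptions and are unrelated to the study data.

```python
def roc_auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up example: two non-cases and two cases with predicted risks.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(toy_labels, toy_scores))  # 3 of 4 pairs are ordered correctly
```

The pairwise loop is quadratic in the number of patients, which is fine for a cohort of this size; rank-based implementations give the same value faster.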

Conclusion: The XGBoost-based model developed in this study predicts postoperative VTE in patients after radical gastrectomy and can thereby assist clinicians in making informed clinical decisions.

Keywords: gastrectomy; gastric neoplasms; machine learning; prediction model; risk factors; venous thromboembolism.

Conflict of interest statement

The authors report no conflicts of interest in this work.

Figures

**Figure 1**
Flow diagram of patients included in the study.

**Figure 2**
The variable influence factor ranking plots of the four models. (A) Variable importance ranking diagram of the XGBoost model. (B) Variable importance ranking diagram of the RF model. (C) Variable importance ranking diagram of the SVM model. (D) Variable importance ranking diagram of the KNN model.

**Figure 3**
Evaluation of the four models for predicting VTE. (A) ROC curves for the training set of the four models. (B) ROC curves for the validation set of the four models. (C) Calibration plots of the four models. The 45-degree dashed line in each plot represents the ideal correspondence between the predicted (x-axis) and observed (y-axis) probabilities of complications. The closer a model's calibration curve lies to this dashed line, the higher its predictive accuracy. (D) DCA curves of the four models. The point of intersection between the red curve and the “All” curve represents the baseline or starting point, while the point of intersection between the red curve and the “None” curve indicates the decision node where the corresponding patients may derive benefit.
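The DCA panel compares each model with the "All" and "None" strategies in terms of net benefit. As a rough sketch of how such curves are usually computed, the snippet below applies the standard net benefit formula, true positives per patient minus false positives per patient weighted by the odds of the threshold probability. The function names and the toy data are assumptions for illustration; the abstract does not state which software or thresholds the authors used.

```python
def net_benefit(y_true, probs, threshold):
    """Net benefit of treating patients whose predicted risk is >= threshold."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, probs) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, probs) if p >= threshold and y == 0)
    return tp / n - (fp / n) * threshold / (1 - threshold)


def net_benefit_treat_all(y_true, threshold):
    """Net benefit of the 'All' strategy: treat every patient."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Made-up outcomes and predicted risks for ten hypothetical patients.
toy_outcomes = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]
toy_risks = [0.9, 0.2, 0.1, 0.6, 0.4, 0.05, 0.3, 0.1, 0.2, 0.15]

for pt in (0.1, 0.3, 0.5):
    model = net_benefit(toy_outcomes, toy_risks, pt)
    treat_all = net_benefit_treat_all(toy_outcomes, pt)
    # The 'None' strategy always has a net benefit of 0.
    print(f"threshold={pt}: model={model:.3f}, all={treat_all:.3f}, none=0.000")
```

A model is considered useful over the range of thresholds where its net benefit exceeds both the "All" and the "None" strategies.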

**Figure 4**
Internal validation of the XGBoost model. (A) ROC curve of the XGBoost model for the training set. (B) ROC curve of the XGBoost model for the validation set. (C) ROC curve of the XGBoost model for the test set. (D) External validation of the XGBoost model.

**Figure 5**
SHAP summary plot. The risk factors are ranked on the y-axis according to their significance, which is determined by the mean of their absolute Shapley values. The higher the risk factor appears on the plot, the more crucial it is for the model.
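The ranking rule described in this caption, ordering features by the mean of their absolute Shapley values, can be sketched in a few lines of standard-library Python. The SHAP values and feature names below are invented purely to show the arithmetic and are not results from the study; in practice the per-patient values would come from a SHAP explainer applied to the fitted model.

```python
def rank_by_mean_abs_shap(shap_values, feature_names):
    """Rank features by mean absolute SHAP value, most important first.

    shap_values: one row per patient, one column per feature.
    """
    n = len(shap_values)
    mean_abs = {
        name: sum(abs(row[j]) for row in shap_values) / n
        for j, name in enumerate(feature_names)
    }
    return sorted(mean_abs.items(), key=lambda item: item[1], reverse=True)


# Hypothetical SHAP values for three patients and three features.
toy_features = ["BMI", "operative_time", "age"]
toy_shap = [
    [0.30, -0.10, 0.02],
    [-0.25, 0.20, -0.01],
    [0.40, -0.05, 0.03],
]

for name, value in rank_by_mean_abs_shap(toy_shap, toy_features):
    print(f"{name}: {value:.3f}")
```

Taking absolute values before averaging keeps features whose contributions are large but of mixed sign from cancelling out in the ranking.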

**Figure 6**
SHAP force plot. The explanatory variables are ordered along the horizontal axis based on the absolute value of their impact, with blue representing features that negatively affect disease prediction, as indicated by a decrease in SHAP values, and red representing features that positively affect disease prediction, as indicated by an increase in SHAP values. (A) Predictive Analysis of Patient I. (B) Predictive Analysis of Patient II. (C) Predictive Analysis of Patient III.

Cited by

Leveraging machine learning for enhanced and interpretable risk prediction of venous thromboembolism in acute ischemic stroke care.
Jiang Y, Li A, Li Z, Li Y, Li R, Zhao Q, Li G. PLoS One. 2025 Mar 18;20(3):e0302676. doi: 10.1371/journal.pone.0302676. eCollection 2025. PMID: 40100876.
Artificial intelligence in clinical thrombosis and hemostasis: A review.
Kuan YKI, Kok YJ, Liu NSH, Ong BJA, Chee YJ, Xu C, Chow M, Ramanathan K, Dalan R, Ho P, Fan BE. Res Pract Thromb Haemost. 2025 Jul 24;9(5):102984. doi: 10.1016/j.rpth.2025.102984. eCollection 2025 Jul. PMID: 40837028. Review.
