Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 18:16:1909-1925.
doi: 10.2147/IJGM.S408770. eCollection 2023.

Ten-Year Multicenter Retrospective Study Utilizing Machine Learning Algorithms to Identify Patients at High Risk of Venous Thromboembolism After Radical Gastrectomy

Affiliations

Ten-Year Multicenter Retrospective Study Utilizing Machine Learning Algorithms to Identify Patients at High Risk of Venous Thromboembolism After Radical Gastrectomy

Yuan Liu et al. Int J Gen Med. .

Abstract

Purpose: This study aims to construct a machine learning model that can recognize preoperative, intraoperative, and postoperative high-risk indicators and predict the onset of venous thromboembolism (VTE) in patients.

Patients and methods: A total of 1239 patients diagnosed with gastric cancer were enrolled in this retrospective study, among whom 107 patients developed VTE after surgery. We collected 42 characteristic variables of gastric cancer patients from the database of Wuxi People's Hospital and Wuxi Second People's Hospital between 2010 and 2020, including patients' demographic characteristics, chronic medical history, laboratory test characteristics, surgical information, and patients' postoperative conditions. Four machine learning algorithms, namely, extreme gradient boosting (XGBoost), random forest (RF), support vector machine (SVM), and k-nearest neighbor (KNN), were employed to develop predictive models. We also utilized Shapley additive explanation (SHAP) for model interpretation and evaluated the models using k-fold cross-validation, receiver operating characteristic (ROC) curves, calibration curves, decision curve analysis (DCA), and external validation metrics.

Results: The XGBoost algorithm demonstrated superior performance compared to the other three prediction models. The area under the curve (AUC) value for XGBoost was 0.989 in the training set and 0.912 in the validation set, indicating high prediction accuracy. Furthermore, the AUC value of the external validation set was 0.85, signifying good extrapolation of the XGBoost prediction model. The results of SHAP analysis revealed that several factors, including higher body mass index (BMI), history of adjuvant radiotherapy and chemotherapy, T-stage of the tumor, lymph node metastasis, central venous catheter use, high intraoperative bleeding, and long operative time, were significantly associated with postoperative VTE.

Conclusion: The machine learning algorithm XGBoost derived from this study enables the development of a predictive model for postoperative VTE in patients after radical gastrectomy, thereby assisting clinicians in making informed clinical decisions.

Keywords: gastrectomy; gastric neoplasms; machine learning; prediction model; risk factors; venous thromboembolism.

PubMed Disclaimer

Conflict of interest statement

The authors report no conflicts of interest in this work.

Figures

Figure 1
Figure 1
Flow diagram of patients included in the study.
Figure 2
Figure 2
The variable influence factor ranking plots of the four models. (A) Variable importance ranking diagram of the XGBoost model. (B) Variable importance ranking diagram of the RF model. (C) Variable importance ranking diagram of the SVM model. (D) Variable importance ranking diagram of the KNN model.
Figure 3
Figure 3
Evaluation of the four models for predicting VTE. (A) ROC curves for the training set of the four models. (B) ROC curves for the validation set of the four models. (C) Calibration plots of the four models. The 45-degree dashed line in each plot represents the ideal correspondence between the predicted (x-axis) and observed (y-axis) probabilities of complications. The closer the distance between the two curves, the higher the predictive accuracy. (D) DCA curves of the four models. The point of intersection between the red curve and the “All” curve represents the baseline or starting point, while the point of intersection between the red curve and the “None” curve indicates the decision node where the corresponding patients may derive benefit.
Figure 4
Figure 4
Internal validation of the XGBoost model. (A) ROC curve of the XGBoost model for the training set. (B) ROC curve of the XGBoost model for the validation set. (C) ROC curve of the XGBoost model for the test set. (D) External validation of the XGBoost model.
Figure 5
Figure 5
SHAP summary plot. The risk factors are ranked on the y-axis according to their significance, which is determined by the mean of their absolute Shapley values. The higher the risk factor appears on the plot, the more crucial it is for the model.
Figure 6
Figure 6
SHAP force plot. The explanatory variables are ordered along the horizontal axis based on the absolute value of their impact, with blue representing features that negatively affect disease prediction, as indicated by a decrease in SHAP values, and red representing features that positively affect disease prediction, as indicated by an increase in SHAP values. (A) Predictive Analysis of Patient I. (B) Predictive Analysis of Patient II. (C) Predictive Analysis of Patient III.

Similar articles

Cited by

References

    1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. 2019;69(1):7–34. doi:10.3322/caac.21551 - DOI - PubMed
    1. Feng RM, Zong YN, Cao SM, Xu RH. Current cancer situation in China: good or bad news from the 2018 global cancer statistics? Cancer Commun. 2019;39(1):22. doi:10.1186/s40880-019-0368-6 - DOI - PMC - PubMed
    1. Adachi Y, Shiraishi N, Shiromizu A, Bandoh T, Aramaki M, Kitano S. Laparoscopy-assisted Billroth I gastrectomy compared with conventional open gastrectomy. Arch Surg. 2000;135(7):806–810. doi:10.1001/archsurg.135.7.806 - DOI - PubMed
    1. Caruso S, Patriti A, Roviello F, et al. Laparoscopic and robot-assisted gastrectomy for gastric cancer: current considerations. World J Gastroenterol. 2016;22(25):5694–5717. doi:10.3748/wjg.v22.i25.5694 - DOI - PMC - PubMed
    1. Braumann C, Jacobi CA, Menenakos C, Ismail M, Rueckert JC, Mueller JM. Robotic-assisted laparoscopic and thoracoscopic surgery with the da Vinci system: a 4-year experience in a single institution. Surg Laparosc Endosc Percutan Tech. 2008;18(3):260–266. doi:10.1097/SLE.0b013e31816f85e5 - DOI - PubMed