Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 19;15(1):463.
doi: 10.1007/s12672-024-01327-z.

Utilizing machine learning algorithms for predicting risk factors for bone metastasis from right-sided colon carcinoma after complete mesocolic excision: a 10-year retrospective multicenter study

Affiliations

Utilizing machine learning algorithms for predicting risk factors for bone metastasis from right-sided colon carcinoma after complete mesocolic excision: a 10-year retrospective multicenter study

Yuan Liu et al. Discov Oncol. .

Abstract

Background: Bone metastasis (BM) occurs when colon cancer cells disseminate from the primary tumor site to the skeletal system via the bloodstream or lymphatic system. The emergence of such bone metastases typically heralds a significantly poor prognosis for the patient. This study's primary aim is to develop a machine learning model to identify patients at elevated risk of bone metastasis among those with right-sided colon cancer undergoing complete mesocolonectomy (CME).

Patients and methods: The study cohort comprised 1,151 individuals diagnosed with right-sided colon cancer, with a subset of 73 patients presenting with bone metastases originating from the colon. We used univariate and multivariate regression analyses as well as four machine learning algorithms to screen variables for 38 characteristic variables such as patient demographic characteristics and surgical information. The study employed four distinct machine learning algorithms, namely, extreme gradient boosting (XGBoost), random forest (RF), support vector machine (SVM), and k-nearest neighbor algorithm (KNN), to develop the predictive model. Additionally, the model was assessed using receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis (DCA), while Shapley additive explanation (SHAP) was utilized to visualize and analyze the model.

Results: The XGBoost algorithm performed the best performance among the four prediction models. In the training set, the XGBoost algorithm had an area under curve (AUC) value of 0.973 (0.953-0.994), an accuracy of 0.925 (0.913-0.936), a sensitivity of 0.921 (0.902-0.940), and a specificity of 0.908 (0.894-0.922). In the validation set, the XGBoost algorithm had an AUC value of 0.922 (0.833-0.995), an accuracy of 0.908 (0.889-0.926), a sensitivity of 0.924 (0.873-0.975), and a specificity of 0.883 (0.810-0.956). Furthermore, the AUC value of 0.83 for the external validation set suggests that the XGBoost prediction model possesses strong extrapolation capabilities. The results of SHAP analysis identified alkaline phosphatase (ALP) levels, tumor size, invasion depth, lymph node metastasis, lung metastasis, and postoperative neutrophil-to-lymphocyte ratio (NLR) levels as significant risk factors for BM from right-sided colon cancer subsequent to CME.

Conclusion: The prediction model for BM from right-sided colon cancer developed using the XGBoost machine learning algorithm in this study is both highly precise and clinically valuable.

Keywords: Bone metastasis; Colonic neoplasms; Machine learning; Prognosis; Risk factor.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow diagram of patients included in the study
Fig. 2
Fig. 2
The variable ranking plots of the four models. A Variable importance ranking diagram of the XGBoost model. B Variable importance ranking diagram of the RF model. C Variable importance ranking diagram of the SVM model. D Variable importance ranking diagram of the KNN model
Fig. 3
Fig. 3
Evaluation of the four models for predicting BM. A ROC curves for the training set of the four models. B ROC curves for the validation set of the four models. C Calibration plots of the four models. The 45° dotted line on each graph represents the perfect match between the observed (y-axis) and predicted (x-axis) complication probabilities. A closer distance between two curves indicates greater accuracy. D DCA curves of the four models. The intersection of the red curve and the all curve is the starting point, and the intersection of the red curve and the None curve is the node within which the corresponding patients can benefit
Fig. 4
Fig. 4
Internal validation of the XGBoost model. A ROC curve of the XGBoost model for the training set. B ROC curve of the XGBoost model for the validation set. C ROC curve of the XGBoost model for the test set. D External validation of the XGBoost model
Fig. 5
Fig. 5
SHAP summary plot. Risk factors are arranged along the y-axis based on their importance, which is given by the mean of their absolute Shapley values. The higher the risk factor is positioned in the plot, the more important it is for the model
Fig. 6
Fig. 6
SHAP force plot. The contributing variables are arranged in the horizontal line, sorted by the absolute value of their impact. Blue represents features that have a negative effect on disease prediction, with a decrease in SHAP values; red represents features that have a positive effect on disease prediction, with an increase in SHAP values. A Predictive Analysis of Patient I. B Predictive Analysis of Patient II. C Predictive Analysis of Patient III. D Predictive Analysis of Patient IV

Similar articles

Cited by

References

    1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424. 10.3322/caac.21492. - PubMed
    1. Schlüter K, Gassmann P, Enns A, Korb T, Hemping-Bovenkerk A, Hölzen J, Haier J. Organ-specific metastatic tumor cell adhesion and extravasation of colon carcinoma cells with different metastatic potential. Am J Pathol. 2006;169:1064–73. 10.2353/ajpath.2006.050566. - PMC - PubMed
    1. Kanthan R, Loewy J, Kanthan SC. Skeletal metastases in colorectal carcinomas: a Saskatchewan profile. Dis Colon Rectum. 1999;42:1592–7. 10.1007/bf02236213. - PubMed
    1. Rades D, Bartscht T, Janssen S, Bajrovic A, Segedin B, Schild SE. Forecasting survival probabilities after radiotherapy of metastatic epidural spinal cord compression from colorectal cancer in the elderly. Anticancer Res. 2016;36:1829–33. - PubMed
    1. Santini D, Tampellini M, Vincenzi B, Ibrahim T, Ortega C, Virzi V, Silvestris N, Berardi R, Masini C, Calipari N, et al. Natural history of bone metastasis in colorectal cancer: final results of a large Italian bone metastases study. Ann Oncol. 2012;23:2072–7. 10.1093/annonc/mdr572. - PubMed

LinkOut - more resources