XGBoost algorithm and logistic regression to predict the postoperative 5-year outcome in patients with glioma
- PMID: 36110992
- PMCID: PMC9469144
- DOI: 10.21037/atm-22-3384
XGBoost algorithm and logistic regression to predict the postoperative 5-year outcome in patients with glioma
Abstract
Background: Glioma is the most common primary intracranial tumor with poor prognosis. The prediction of glioma prognosis has not been well investigated. XGBoost algorithm has been widely used in and data analysis. The predictive value of XGBoost algorithm in glioma remains unclear. This current study used the XGBoost algorithm to construct a predictive model for postoperative outcomes of glioma patients.
Methods: Patients with glioma who underwent surgery from January 2006 to April 2017 were retrospectively included in this study. Clinical and follow-up data were collected. The XGBoost model and multivariate logistic regression analysis model were used to screen the factors related to postoperative outcomes, and the results of the two models were compared. The area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, specificity, and Youden index were calculated to evaluate the predictive value of the XGBoost model.
Results: A total of 638 patients were included. In total, 336 (52.7%) cases died within 5 years after the operation. Multivariate logistic regression analysis showed that age, gender, World Health Organization (WHO) grade, extent of tumor resection, Karnofsy performance score (KPS), tumor diameter, and whether postoperative radiotherapy and chemotherapy were administered, were the most important risk factors for death within 5 years after surgery in glioma patients. The XGBoost model showed that the top 5 factors related to death of glioma patients within 5 years after surgery were WHO grade (30 points), extent of tumor resection (19 points), postoperative radiotherapy and chemotherapy (16 points), KPS (14 points), and age (11 points). The AUC of the XGBoost model for predicting the death of glioma patients within 5 years after surgery was 0.803 [95% confidence interval (CI): 0.718-0.832], and the sensitivity and specificity were 0.894 and 0.581, respectively. The Youden index was 0.475. The AUC of the multivariate logistic regression model was 0.738 (95% CI: 0.704-0.781), the sensitivity and specificity were 0.785 and 0.632, respectively, and the Youden index was 0.417.
Conclusions: Compared with multivariate logistic regression model, XGBoost model has better performance in predicting the risk of death within 5 years after surgery in patients with glioma.
Keywords: Glioma; XGBoost model; death; multivariate logistic analysis; risk factors.
2022 Annals of Translational Medicine. All rights reserved.
Conflict of interest statement
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://atm.amegroups.com/article/view/10.21037/atm-22-3384/coif). The authors have no conflicts of interest to declare.
Figures


Similar articles
-
Development and validation of an XGBoost model to predict 5-year survival in elderly patients with intrahepatic cholangiocarcinoma after surgery: a SEER-based study.J Gastrointest Oncol. 2022 Dec;13(6):3290-3299. doi: 10.21037/jgo-22-1238. J Gastrointest Oncol. 2022. PMID: 36636060 Free PMC article.
-
Prediction Model of Postoperative Severe Hypocalcemia in Patients with Secondary Hyperparathyroidism Based on Logistic Regression and XGBoost Algorithm.Comput Math Methods Med. 2022 Jul 25;2022:8752826. doi: 10.1155/2022/8752826. eCollection 2022. Comput Math Methods Med. 2022. PMID: 35924110 Free PMC article. Clinical Trial.
-
[Risk factors for death in elderly patients admitted to intensive care unit after elective abdominal surgery: a consecutive 5-year retrospective study].Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2021 Dec;33(12):1453-1458. doi: 10.3760/cma.j.cn121430-20210804-00118. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2021. PMID: 35131012 Chinese.
-
[Combined prognostic value of serum lactic acid, procalcitonin and severity score for short-term prognosis of septic shock patients].Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2021 Mar;33(3):281-285. doi: 10.3760/cma.j.cn121430-20201113-00715. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2021. PMID: 33834968 Chinese.
-
XGBoost Machine Learning Algorithm for Prediction of Outcome in Aneurysmal Subarachnoid Hemorrhage.Neuropsychiatr Dis Treat. 2022 Mar 29;18:659-667. doi: 10.2147/NDT.S349956. eCollection 2022. Neuropsychiatr Dis Treat. 2022. PMID: 35378822 Free PMC article.
Cited by
-
A longitudinal cohort study uncovers plasma protein biomarkers predating clinical onset and treatment response of rheumatoid arthritis.Nat Commun. 2025 Jul 21;16(1):6692. doi: 10.1038/s41467-025-62032-1. Nat Commun. 2025. PMID: 40691443 Free PMC article.
-
Comparison of intracranial pressure prediction in hydrocephalus patients among linear, non-linear, and machine learning regression models in Thailand.Acute Crit Care. 2023 Aug;38(3):362-370. doi: 10.4266/acc.2023.00094. Epub 2023 Aug 18. Acute Crit Care. 2023. PMID: 37652865 Free PMC article.
-
Predicting 30-Day Cardiotoxicity in Patients Receiving Immune Checkpoint Inhibitors: An Observational Study Utilizing XGBoost.Cardiovasc Toxicol. 2025 Jul;25(7):994-1006. doi: 10.1007/s12012-025-09990-6. Epub 2025 Apr 10. Cardiovasc Toxicol. 2025. PMID: 40208554
-
Interpretable prediction of hospital mortality in bleeding critically ill patients based on machine learning and SHAP.BMC Med Inform Decis Mak. 2025 Jul 15;25(1):263. doi: 10.1186/s12911-025-03101-9. BMC Med Inform Decis Mak. 2025. PMID: 40665292 Free PMC article.
-
Decoding Wilson disease: a machine learning approach to predict neurological symptoms.Front Neurol. 2024 Jun 19;15:1418474. doi: 10.3389/fneur.2024.1418474. eCollection 2024. Front Neurol. 2024. PMID: 38966086 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous