Automated machine learning predicts liver metastases in patients with early-onset gastroenteropancreatic neuroendocrine tumors
- PMID: 40672078
- PMCID: PMC12260961
- DOI: 10.21037/jgo-2024-946
Automated machine learning predicts liver metastases in patients with early-onset gastroenteropancreatic neuroendocrine tumors
Abstract
Background: The incidence of early-onset gastroenteropancreatic neuroendocrine tumors (GEP-NETs) is increasing, with liver metastases often occurring early and adversely affecting prognosis. This study aimed to develop a predictive model for liver metastases detection in patients with early-onset GEP-NETs (<50 years) using an automated machine learning (AutoML) approach.
Methods: A retrospective analysis was conducted on patients diagnosed with early-onset GEP-NETs [2000-2021] using data from the Surveillance, Epidemiology, and End Results (SEER) database. Patients were randomly divided into a training set (n=8,983) and a validation set (n=3,819) in a 7:3 ratio. A nomogram-based scoring system was constructed using least absolute shrinkage and selection operator (LASSO) and logistic regression. AutoML was applied to build predictive models using gradient boosting machine (GBM), generalized linear model (GLM), deep learning (DL), and distributed random forest (DRF) algorithms. Model performance was assessed using receiver operating characteristic (ROC), calibration, decision curve analysis (DCA), and interpretability tools including SHapley Additive exPlanations (SHAP), partial dependence plots (PDPs), and locally interpretable model-agnostic explanations (LIME) plots.
Results: A total of 12,802 patients were included, of whom 1,187 (9.3%) developed liver metastases, comprising 851 (9.5%) and 336 (8.8%) cases in the training and validation sets, respectively. Comparative analyses demonstrated that the AutoML models outperformed traditional logistic regression models, with the GBM algorithm achieving the highest performance. The GBM model achieved an area under the curve (AUC) of 0.961 in the training set and 0.953 in the validation set. Tumor location was identified as the most important predictor in the GBM model, followed by surgery, tumor size, chemotherapy, and T-staging.
Conclusions: The AutoML model leveraging the GBM algorithm provides a robust and clinically valuable tool for the early prediction of liver metastases in patients with early-onset GEP-NETs.
Keywords: Automated machine learning (AutoML); age; gastroenteropancreatic neuroendocrine tumors (GEP-NETs); liver metastasis; predictive modeling.
Copyright © 2025 AME Publishing Company. All rights reserved.
Conflict of interest statement
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://jgo.amegroups.com/article/view/10.21037/jgo-2024-946/coif). The authors have no conflicts of interest to declare.
Figures





Similar articles
-
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733. J Med Internet Res. 2025. PMID: 40418571 Free PMC article.
-
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23. Clin Orthop Relat Res. 2024. PMID: 39051924
-
An endoscopic ultrasound-based interpretable deep learning model and nomogram for distinguishing pancreatic neuroendocrine tumors from pancreatic cancer.Sci Rep. 2025 Jan 27;15(1):3383. doi: 10.1038/s41598-024-84749-7. Sci Rep. 2025. PMID: 39870667 Free PMC article.
-
GEP-NET radiomics: a systematic review and radiomics quality score assessment.Eur Radiol. 2022 Oct;32(10):7278-7294. doi: 10.1007/s00330-022-08996-w. Epub 2022 Jul 26. Eur Radiol. 2022. PMID: 35882634
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Research Materials