Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 30;16(3):937-949.
doi: 10.21037/jgo-2024-946. Epub 2025 Jun 18.

Automated machine learning predicts liver metastases in patients with early-onset gastroenteropancreatic neuroendocrine tumors

Affiliations

Automated machine learning predicts liver metastases in patients with early-onset gastroenteropancreatic neuroendocrine tumors

Fuli Gao et al. J Gastrointest Oncol. .

Abstract

Background: The incidence of early-onset gastroenteropancreatic neuroendocrine tumors (GEP-NETs) is increasing, with liver metastases often occurring early and adversely affecting prognosis. This study aimed to develop a predictive model for liver metastases detection in patients with early-onset GEP-NETs (<50 years) using an automated machine learning (AutoML) approach.

Methods: A retrospective analysis was conducted on patients diagnosed with early-onset GEP-NETs [2000-2021] using data from the Surveillance, Epidemiology, and End Results (SEER) database. Patients were randomly divided into a training set (n=8,983) and a validation set (n=3,819) in a 7:3 ratio. A nomogram-based scoring system was constructed using least absolute shrinkage and selection operator (LASSO) and logistic regression. AutoML was applied to build predictive models using gradient boosting machine (GBM), generalized linear model (GLM), deep learning (DL), and distributed random forest (DRF) algorithms. Model performance was assessed using receiver operating characteristic (ROC), calibration, decision curve analysis (DCA), and interpretability tools including SHapley Additive exPlanations (SHAP), partial dependence plots (PDPs), and locally interpretable model-agnostic explanations (LIME) plots.

Results: A total of 12,802 patients were included, of whom 1,187 (9.3%) developed liver metastases, comprising 851 (9.5%) and 336 (8.8%) cases in the training and validation sets, respectively. Comparative analyses demonstrated that the AutoML models outperformed traditional logistic regression models, with the GBM algorithm achieving the highest performance. The GBM model achieved an area under the curve (AUC) of 0.961 in the training set and 0.953 in the validation set. Tumor location was identified as the most important predictor in the GBM model, followed by surgery, tumor size, chemotherapy, and T-staging.

Conclusions: The AutoML model leveraging the GBM algorithm provides a robust and clinically valuable tool for the early prediction of liver metastases in patients with early-onset GEP-NETs.

Keywords: Automated machine learning (AutoML); age; gastroenteropancreatic neuroendocrine tumors (GEP-NETs); liver metastasis; predictive modeling.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://jgo.amegroups.com/article/view/10.21037/jgo-2024-946/coif). The authors have no conflicts of interest to declare.

Figures

Figure 1
Figure 1
The flowchart of this study. DL, deep learning; DRF, distributed random forest; GBM, gradient boosting machine; GEP-NENs, gastroenteropancreatic neuroendocrine neoplasms; GLM, generalized linear model; LASSO, least absolute shrinkage and selection operator.
Figure 2
Figure 2
Nomogram of logistic regression model for predicting liver metastasis in patients with early-onset gastroenteropancreatic neuroendocrine tumors. N, node.
Figure 3
Figure 3
The ROC curves (A,D), calibration curves (B,E), and decision curves (C,F) of the logistic regression model in the training and validation sets. AUC, area under the curve; CI, confidence interval; ROC, receiver operating characteristic
Figure 4
Figure 4
Variable importance and SHAP of the GBM model in the training cohort. Panel (A) shows that tumor location is the most important feature. As illustrated in (B), when the variable value approaches 1, the likelihood of liver metastasis increases for the patient. GBM, gradient boosting machine; N, node; SHAP, SHapley Additive exPlanations; T, tumor.
Figure 5
Figure 5
LIME visualization of variable importance in a random sample from the validation cohort (demonstrating the impact of key variables on individual predictions; p0 represents no liver metastasis, and p1 represents liver metastasis). DX, diagnosis; LIME, locally interpretable model-agnostic explanations; N, node; T, tumor.

Similar articles

References

    1. Fernandes CJ, Leung G, Eads JR, et al. Gastroenteropancreatic Neuroendocrine Tumors. Gastroenterol Clin North Am 2022;51:625-47. 10.1016/j.gtc.2022.06.002 - DOI - PubMed
    1. Riihimäki M, Hemminki A, Sundquist K, et al. The epidemiology of metastases in neuroendocrine tumors. Int J Cancer 2016;139:2679-86. 10.1002/ijc.30400 - DOI - PubMed
    1. Tran CG, Sherman SK, Chandrasekharan C, et al. Surgical Management of Neuroendocrine Tumor Liver Metastases. Hematol Oncol Clin North Am 2025;39:37-53. 10.1016/j.hoc.2024.08.001 - DOI - PubMed
    1. Xu Q, Yin B, Han X, et al. Long-term Outcomes of Surgical Treatment for Pancreatic Neuroendocrine Neoplasm With Synchronous Hepatic Metastasis: A Multicenter Retrospective Cohort Study. Pancreas 2025;54:e179-87. 10.1097/MPA.0000000000002417 - DOI - PMC - PubMed
    1. Sharma A, Muralitharan M, Ramage J, et al. Current Management of Neuroendocrine Tumour Liver Metastases. Curr Oncol Rep 2024;26:1070-84. 10.1007/s11912-024-01559-w - DOI - PMC - PubMed

LinkOut - more resources