Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Jul 1;15(1):22347.
doi: 10.1038/s41598-025-09083-y.

Muscle-Driven prognostication in gastric cancer: A multicenter deep learning framework integrating Iliopsoas and erector spinae radiomics for 5-Year survival prediction

Affiliations
Multicenter Study

Muscle-Driven prognostication in gastric cancer: A multicenter deep learning framework integrating Iliopsoas and erector spinae radiomics for 5-Year survival prediction

Yuan Hong et al. Sci Rep. .

Abstract

This study developed a 5-year survival prediction model for gastric cancer patients by combining radiomics and deep learning, focusing on CT-based 2D and 3D features of the iliopsoas and erector spinae muscles. Retrospective data from 705 patients across two centers were analyzed, with clinical variables assessed via Cox regression and radiomic features extracted using deep learning. The 2D model outperformed the 3D approach, leading to feature fusion across five dimensions, optimized via logistic regression. Results showed no significant association between clinical baseline characteristics and survival, but the 2D model demonstrated strong prognostic performance (AUC ~ 0.8), with attention heatmaps emphasizing spinal muscle regions. The 3D model underperformed due to irrelevant data. The final integrated model achieved stable predictive accuracy, confirming the link between muscle mass and survival. This approach advances precision medicine by enabling personalized prognosis and exploring 3D imaging feasibility, offering insights for gastric cancer research.

Keywords: Deep learning; Gastric cancer; Radical gastrectomy; Sarcopenia.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no potential conflict of interest in the research, writing, and publication of this article. Financial support: This study was supported by a grant from the Graduate Student Research and Practice Innovation Program of Anhui Medical University (YJS20230080), National Natural Science Foundation of China (82403333).

Figures

Fig. 1
Fig. 1
Flowchart of patient enrollment. Of 603 initially screened gastrectomy patients, 511 completed 5-year follow-up (32 withdrew, 57 lost to follow-up, 3 unrelated deaths).
Fig. 2
Fig. 2
Flow chart of muscle segmentation. The pipeline processes abdominal CT images through three stages: (1) original/abdominal window phases, (2) automated segmentation using a U-NET model (TotalSegmentation), and (3) generation of three muscle subgroups (bilateral psoas, erector spinae, and combined muscles). Six standardized input datasets (3 muscle groups × 2 CT phases) were produced for subsequent analysis.
Fig. 3
Fig. 3
Radiomics workflow for 5-year survival prediction after gastrectomy. The pipeline consists of three main stages: (1) preprocessing of clinical and imaging data, (2) multi-modal 2D neural network screening (14 architectures tested) with feature fusion, and (3) predictive model evaluation using 11 machine learning classifiers. The optimal 2D-based fusion strategy demonstrated superior performance over 3D approaches in external validation (n = 194).
Fig. 4
Fig. 4
Performance evaluation of optimal 2D neural networks across five imaging modalities. (A-E) The receiver operating characteristic (ROC) and decision curve analysis (DCA) results demonstrate the predictive performance of selected models: a.Muscle fusion: ResNet18 for both non-enhanced (A) and enhanced (B) modalities (AUC > 0.6). b.Iliopsoas: MNASNet0_5 (C, non-enhanced) and ResNet152 (D, enhanced). c.Erector spinae: ResNet50 (E, enhanced only; original phase models excluded due to accuracy ≈ 0.5). d.Each modality is presented in three subplots: (1) ROC curves with AUC and 95% CI, (2) training set DCA, and (3) test set DCA. Model selection was based on strict criteria excluding underperforming models (AUC < 0.5) and prioritizing those with consistent train-test performance and low overfitting risk.
Fig. 5
Fig. 5
Performance comparison of 3D neural networks using ROC curves. The evaluation of DenseNet121 and DenseNet201 models on abdominal CT scans showed limited classification capability, with AUC values below 0.5 for both models. While 3D architectures can capture spatial information effectively, these results suggest they may not be optimal for this particular task. The ROC curves (left: DenseNet121; right: DenseNet201) with 95% confidence intervals demonstrate this performance limitation, leading to their exclusion from the final model integration.
Fig. 6
Fig. 6
Performance comparison of machine learning classifiers after multimodal feature fusion. The evaluation of 11 classifiers on fused neural network features showed logistic regression (LR) and support vector machine (SVM) as top performers. (A-B) ROC curves demonstrate classification accuracy of LR and SVM models. (C-F) DCA curves validate clinical utility in both training and test sets. (G) Boxplot summarizes cross-validated accuracy across all classifiers. The final selection of LR as optimal classifier was based on its balanced performance and practical stability in clinical application scenarios.
Fig. 7
Fig. 7
2D attention heatmaps based on neural networks. Models consistently focused on paravertebral muscles during classification, shown for (A) erector spinae, (B) iliopsoas, and (C) combined muscle groups.
Fig. 8
Fig. 8
Visualization of attention heatmaps for correctly and incorrectly predicted cases. (A) Heatmaps from correctly predicted cases demonstrate model focus on clinically relevant muscle areas. (B) Heatmaps from misclassified cases show scattered or non-informative attention regions.

Similar articles

References

    1. Park, S. E. et al. Loss of skeletal muscle mass during palliative chemotherapy is a poor prognostic factor in patients with advanced gastric cancer. Sci Rep. ;10(1):17683. Published 2020 Oct 19. (2020). 10.1038/s41598-020-74765-8 - PMC - PubMed
    1. Fukuda, Y. et al. Sarcopenia is associated with severe postoperative complications in elderly gastric cancer patients undergoing gastrectomy. Gastric Cancer. 19 (3), 986–993. 10.1007/s10120-015-0546-4 (2016). - PubMed
    1. Ricciardolo, A. A. et al. Strong impact of sarcopenia as a risk factor of survival in resected gastric cancer patients: first Italian report of a bicentric study. Updates Surg.74 (1), 283–293. 10.1007/s13304-021-01175-4 (2022). - PubMed
    1. Ozcan, S. G. G. et al. Clinical significance of sarcopenia in patients undergoing treatment for gastric cancer. Rev. Assoc. Med. Bras. (1992)69(12), e20230568. 10.1590/1806-9282.20230568 (2023) (Published 2023 Nov 13). - PMC - PubMed
    1. Yang, W. et al. Quantifying skeletal muscle wasting during chemoradiotherapy with Jacobian calculations for the prediction of survival and toxicity in patients with gastric cancer. Eur. J. Surg. Oncol.46 (7), 1254–1261. 10.1016/j.ejso.2020.03.223 (2020). - PubMed

Publication types

LinkOut - more resources