Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 19:15:1398685.
doi: 10.3389/fimmu.2024.1398685. eCollection 2024.

Machine learning to predict distant metastasis and prognostic analysis of moderately differentiated gastric adenocarcinoma patients: a novel focus on lymph node indicators

Affiliations

Machine learning to predict distant metastasis and prognostic analysis of moderately differentiated gastric adenocarcinoma patients: a novel focus on lymph node indicators

Kangping Yang et al. Front Immunol. .

Abstract

Background: Moderately differentiated gastric adenocarcinoma (MDGA) has a high risk of metastasis and individual variation, which strongly affects patient prognosis. Using large-scale datasets and machine learning algorithms for prediction can improve individualized treatment. The specific efficacy of several lymph node indicators in predicting distant metastasis (DM) and patient prognosis in MDGA remains obscure.

Methods: We collected data from MDGA patients from the SEER database from 2010 to 2019. Additionally, we collected data from MDGA patients in China. We used nine machine learning algorithms to predict DM. Subsequently, we used Cox regression analysis to determine the risk factors affecting overall survival (OS) and cancer-specific survival (CSS) in DM patients and constructed nomograms. Furthermore, we used logistic regression and Cox regression analyses to assess the specific impact of six lymph node indicators on DM incidence and patient prognosis.

Results: We collected data from 5,377 MDGA patients from the SEER database and 109 MDGC patients from hospitals. T stage, N stage, tumor size, primary site, number of positive lymph nodes, and chemotherapy were identified as independent risk factors for DM. The random forest prediction model had the best overall predictive performance (AUC = 0.919). T stage, primary site, chemotherapy, and the number of regional lymph nodes were identified as prognostic factors for OS. Moreover, T stage, number of regional lymph nodes, primary site, and chemotherapy were also influential factors for CSS. The nomograms showed good predictive value and stability in predicting the 1-, 3-, and 5-year OS and CSS in DM patients. Additionally, the log odds of a metastatic lymph node and the number of negative lymph nodes may be risk factors for DM, while the regional lymph node ratio and the number of regional lymph nodes are prognostic factors for OS.

Conclusion: The random forest prediction model accurately identified high-risk populations, and we established OS and CSS survival prediction models for MDGA patients with DM. Our hospital samples demonstrated different characteristics of lymph node indicators in terms of distant metastasis and prognosis.

Keywords: distant metastasis; lymph node indicators; machine learning; moderately differentiated gastric adenocarcinoma; nomogram; prognosis.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Flowchart of the data screening process. The figure shows the process of filtering eligible patient data from the SEER database.
Figure 2
Figure 2
Data analysis guide. The figure shows the procedure of this study for processing and analyzing the screened data.
Figure 3
Figure 3
Results of Pearson correlation analysis for each variable (A) and ranking of importance of predictive model characteristics (B). The results of Pearson correlation analysis for each variable (A) showed that all variables existed independently of each other. The predictive model characteristics (B) were the number of Reg LNs, N stage, T stage, chemotherapy, age, tumor size, and primary site, in order of importance.
Figure 4
Figure 4
Receiver operating characteristic (ROC) curves for the training set, test set, and external validation set prediction models. (A) Training set; (B) test set; (C) external validation set. The aggregation of the previous ROC curves for the RF model (D). AUC, area under the ROC curve; RF, random forest; LR, logistic regression; LASSO, least absolute shrinkage and selection operator; SVM, support vector machine; KNN, K-nearest neighbor; NBC, naive Bayes classifier; ANN, artificial neural network.
Figure 5
Figure 5
Nomograms for 1-, 3-, and 5-year OS (A) and CSS (B) in MDCA patients with DM.
Figure 6
Figure 6
Evaluation of the ability of the nomogram to predict OS. ROC curves validating the OS prediction nomogram for 1-, 3-, and 5-year OS in the training set (A) and validation set (B). Calibration curves validating the OS prediction nomograms for 1-, 3-, and 5-year OS in the training set (C–E) and validation set (F–H). Decision curve analysis validating the OS prediction nomogram for 1-, 3-, and 5-year OS in the training set (I–K) and validation set (L–N).
Figure 7
Figure 7
Evaluation of the ability of the nomogram to predict CSS. ROC curves validating the CSS prediction nomogram for 1-, 3-, and 5-year RFS in the training set (A) and validation set (B). Calibration curves validating the CSS prediction nomograms for 1-, 3-, and 5-year survival in the training cohort (C–E) and validation cohort (F–H). Decision curve analysis validating the CSS prediction nomogram for 1-, 3-, and 5-year RFS in the training set (I–K) and validation set (L–N).

Similar articles

Cited by

References

    1. Lin Y, Zheng Y, Wang HL, Wu J. Global patterns and trends in gastric cancer incidence rates (1988-2012) and predictions to 2030. Gastroenterology. (2021) 161:116–27.e8. doi: 10.1053/j.gastro.2021.03.023 - DOI - PubMed
    1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. . Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660 - DOI - PubMed
    1. López Sala P, Leturia Etxeberria M, Inchausti Iguíñiz E, Astiazaran Rodríguez A, Aguirre Oteiza MI, Zubizarreta Etxaniz M. Gastric adenocarcinoma: A review of the TNM classification system and ways of spreading. Radiologia. (2023) 65:66–80. doi: 10.1016/j.rxeng.2022.10.011 - DOI - PubMed
    1. Nagtegaal ID, Odze RD, Klimstra D, Paradis V, Rugge M, Schirmacher P, et al. . The 2019 WHO classification of tumours of the digestive system. Histopathology. (2020) 76:182–8. doi: 10.1111/his.13975 - DOI - PMC - PubMed
    1. Xiang Y, Yao L. Retrospective analysis of diagnosis and treatment of gastric cancer at Huzhou central hospital. Altern Ther Health Med. (2023) 29:302–9. - PubMed

LinkOut - more resources