Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2023 Oct;149(13):12177-12189.
doi: 10.1007/s00432-023-05123-0. Epub 2023 Jul 10.

Deep learning model for predicting the survival of patients with primary gastrointestinal lymphoma based on the SEER database and a multicentre external validation cohort

Affiliations
Multicenter Study

Deep learning model for predicting the survival of patients with primary gastrointestinal lymphoma based on the SEER database and a multicentre external validation cohort

Feifan Wang et al. J Cancer Res Clin Oncol. 2023 Oct.

Abstract

Purpose: Due to the rarity of primary gastrointestinal lymphoma (PGIL), the prognostic factors and optimal management of PGIL have not been clearly defined. We aimed to establish prognostic models using a deep learning algorithm for survival prediction.

Methods: We collected 11,168 PGIL patients from the Surveillance, Epidemiology, and End Results (SEER) database to form the training and test cohorts. At the same time, we collected 82 PGIL patients from three medical centres to form the external validation cohort. We constructed a Cox proportional hazards (CoxPH) model, random survival forest (RSF) model, and neural multitask logistic regression (DeepSurv) model to predict PGIL patients' overall survival (OS).

Results: The 1-, 3-, 5-, and 10-year OS rates of PGIL patients in the SEER database were 77.1%, 69.4%, 63.7%, and 50.3%, respectively. The RSF model based on all variables showed that the top three most important variables for predicting OS were age, histological type, and chemotherapy. The independent risk factors for PGIL patient prognosis included sex, age, race, primary site, Ann Arbor stage, histological type, symptom, radiotherapy, and chemotherapy, according to the Lasso regression analysis. Using these factors, we built the CoxPH and DeepSurv models. The DeepSurv model's C-index values were 0.760 in the training cohort, 0.742 in the test cohort, and 0.707 in the external validation cohort, which demonstrated that the DeepSurv model performed better compared to the RSF model (0.728) and the CoxPH model (0.724). The DeepSurv model accurately predicted 1-, 3-, 5- and 10-year OS. Both calibration curves and decision curve analysis curves demonstrated the superior performance of the DeepSurv model. We developed the DeepSurv model as an online web calculator for survival prediction, which can be accessed at http://124.222.228.112:8501/ .

Conclusions: This DeepSurv model with external validation is superior to previous studies in predicting short-term and long-term survival and can help us make better-individualized decisions for PGIL patients.

Keywords: Deep learning; DeepSurv; Machine learning; Survival analysis; primary gastrointestinal lymphoma.

PubMed Disclaimer

Conflict of interest statement

The authors have no relevant financial or non-financial interests to disclose.

Figures

Fig. 1
Fig. 1
Flowchart of PGIL patient selection in the SEER database
Fig. 2
Fig. 2
Kaplan–Meier survival analysis curves of OS in PGIL patients stratified by A sex, B age, C race, D marital status, E primary site, F Ann Arbor stage, G histological type, H symptom, I distant metastasis, J surgery, K radiotherapy, and L chemotherapy
Fig. 3
Fig. 3
A Error rate curve of RSF. B Variable importance score. CN The contribution of variables to mortality in each subgroup
Fig. 4
Fig. 4
A Lasso regression coefficients. B Lasso tenfold cross-validation
Fig. 5
Fig. 5
The 1-, 3-, 5-, and 10-year calibration curves for OS in the training (A) and test (B) cohorts in the CoxPH model. The 1-, 3-, 5-, and 10-year ROC curves for OS in the training (C), test (D) and external validation (E) cohorts in the CoxPH model. DCA of the CoxPH model for OS in the training (F), test (G) and external validation (H) cohorts in the CoxPH model
Fig. 6
Fig. 6
A Loss convergence graph for the DeepSurv model. The 1-, 3-, 5-, and 10-year calibration curves for OS in the training (B) and test (C) cohorts in the DeepSurv model. The 1-, 3-, 5-, and 10-year ROC curves for OS in the training (D), test (E) and external validation (F) cohorts in the DeepSurv model. DCA of the DeepSurv for OS in the training (G), test (H) and external validation (I) cohorts in the DeepSurv model
Fig. 7
Fig. 7
The online web-based calculator of the DeepSurv model for predicting OS in PGIL patients

Similar articles

Cited by

References

    1. Adham D, Abbasgholizadeh N, Abazari M (2017) Prognostic factors for survival in patients with gastric cancer using a random survival forest. Asian Pac J Cancer Prev 18(1):129–134. 10.22034/APJCP.2017.18.1.129 - PMC - PubMed
    1. Alvarez-Lesmes J, Chapman JR, Cassidy D, Zhou Y, Garcia-Buitrago M, Montgomery EA, Lossos IS, Sussman D, Poveda J (2021) Gastrointestinal tract lymphomas. Arch Pathol Lab Med 145(12):1585–1596. 10.5858/arpa.2020-0661-RA - PubMed
    1. Bautista-Quach MA, Ake CD, Chen M, Wang J (2012) Gastrointestinal lymphomas: morphology, immunophenotype and molecular features. J Gastrointest Oncol 3(3):209–225. 10.3978/j.issn.2078-6891.2012.024 - PMC - PubMed
    1. Cheung MC, Housri N, Ogilvie MP, Sola JE, Koniaris LG (2009) Surgery does not adversely affect survival in primary gastrointestinal lymphoma. J Surg Oncol 100(1):59–64. 10.1002/jso.21298 - PubMed
    1. Choi KH, Lee HH, Jung SE, Park KS, Joo-Hyun O, Jeon YW, Choi BO, Cho SG (2020) Analysis of the response time to involved-field radiotherapy in primary gastrointestinal low-grade B-cell lymphoma. Radiat Oncol 15(1):210. 10.1186/s13014-020-01649-6 - PMC - PubMed

LinkOut - more resources