Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 2:13:1049787.
doi: 10.3389/fonc.2023.1049787. eCollection 2023.

A comparison of machine learning models and Cox proportional hazards models regarding their ability to predict the risk of gastrointestinal cancer based on metabolic syndrome and its components

Affiliations

A comparison of machine learning models and Cox proportional hazards models regarding their ability to predict the risk of gastrointestinal cancer based on metabolic syndrome and its components

Tao Thi Tran et al. Front Oncol. .

Abstract

Background: Little is known about applying machine learning (ML) techniques to identify the important variables contributing to the occurrence of gastrointestinal (GI) cancer in epidemiological studies. We aimed to compare different ML models to a Cox proportional hazards (CPH) model regarding their ability to predict the risk of GI cancer based on metabolic syndrome (MetS) and its components.

Methods: A total of 41,837 participants were included in a prospective cohort study. Incident cancer cases were identified by following up with participants until December 2019. We used CPH, random survival forest (RSF), survival trees (ST), gradient boosting (GB), survival support vector machine (SSVM), and extra survival trees (EST) models to explore the impact of MetS on GI cancer prediction. We used the C-index and integrated Brier score (IBS) to compare the models.

Results: In all, 540 incident GI cancer cases were identified. The GB and SSVM models exhibited comparable performance to the CPH model concerning the C-index (0.725). We also recorded a similar IBS for all models (0.017). Fasting glucose and waist circumference were considered important predictors.

Conclusions: Our study found comparably good performance concerning the C-index for the ML models and CPH model. This finding suggests that ML models may be considered another method for survival analysis when the CPH model's conditions are not satisfied.

Keywords: Korea; gastrointestinal cancer; machine learning; metabolic syndrome; prospective cohort study.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Flow chart of the study participants. Among 41,837 participants recruited, 41,121 participants were linked to the Korea Central Cancer Registry. Our final analysis included 24,139 participants after exclusion of 1,754 participants with incomplete questionnaires, 2,100 participants with a diagnosis of any cancer before recruitment, 6 participants aged <20 years, and 13,122 participants who lacked information on individual characteristics.
Figure 2
Figure 2
Time-dependent AUC. We presented the time-dependent AUC of the CPH model and five ML models, namely, the random survival forest, survival trees, gradient boosting, extra survival trees, and survival support vector machine models. The vertical axis is the time-dependent AUC. The horizontal axis is follow-up (year).

References

    1. Arnold M, Abnet CC, Neale RE, Vignat J, Giovannucci EL, McGlynn KA, et al. . Global burden of 5 major types of gastrointestinal cancer. Gastroenterology (2020) 159:335–49. doi: 10.1053/j.gastro.2020.02.068 - DOI - PMC - PubMed
    1. Lu L, Mullins CS, Schafmayer C, Zeißig S, Linnebacher M. A global assessment of recent trends in gastrointestinal cancer and lifestyle-associated risk factors. Cancer Commun (Lond) (2021) 41:1137–51. doi: 10.1002/cac2.12220 - DOI - PMC - PubMed
    1. Hong S, Won YJ, Lee JJ, Jung KW, Kong HJ, Im JS, et al. . Cancer statistics in Korea: Incidence, mortality, survival, and prevalence in 2018. Cancer Res Treat (2021) 53:301–15. doi: 10.4143/crt.2021.291 - DOI - PMC - PubMed
    1. Yoo H, Kim H, Lee JH, Lee KS, Choi MJ, Song HR, et al. . Study on the relevance of metabolic syndrome and incidence of gastric cancer in Korea. Int J Environ Res Public Health (2019) 16:1101. doi: 10.3390/ijerph16071101 - DOI - PMC - PubMed
    1. Saklayen MG. The global epidemic of the metabolic syndrome. Curr Hypertens Rep (2018) 20:12. doi: 10.1007/s11906-018-0812-z - DOI - PMC - PubMed

LinkOut - more resources