Predicting lung cancer survival based on clinical data using machine learning: A review
- PMID: 37625260
- DOI: 10.1016/j.compbiomed.2023.107338
Predicting lung cancer survival based on clinical data using machine learning: A review
Abstract
Machine learning has gained popularity in predicting survival time in the medical field. This review examines studies utilizing machine learning and data-mining techniques to predict lung cancer survival using clinical data. A systematic literature review searched MEDLINE, Scopus, and Google Scholar databases, following reporting guidelines and using the COVIDENCE system. Studies published from 2000 to 2023 employing machine learning for lung cancer survival prediction were included. Risk of bias assessment used the prediction model risk of bias assessment tool. Thirty studies were reviewed, with 13 (43.3%) using the surveillance, epidemiology, and end results database. Missing data handling was addressed in 12 (40%) studies, primarily through data transformation and conversion. Feature selection algorithms were used in 19 (63.3%) studies, with age, sex, and N stage being the most chosen features. Random forest was the predominant machine learning model, used in 17 (56.6%) studies. While the number of lung cancer survival prediction studies is limited, the use of machine learning models based on clinical data has grown since 2012. Consideration of diverse patient cohorts and data pre-processing are crucial. Notably, most studies did not account for missing data, normalization, scaling, or standardized data, potentially introducing bias. Therefore, a comprehensive study on lung cancer survival prediction using clinical data is needed, addressing these challenges.
Keywords: Artificial intelligence; Data mining; Feature selection; Lung cancer; Machine learning; Survival prediction.
Copyright © 2023 The Authors. Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of competing interest The authors declare no conflict of interest.
Similar articles
-
Use of machine learning to predict bladder cancer survival outcomes: a systematic literature review.Expert Rev Pharmacoecon Outcomes Res. 2023 Jul-Dec;23(7):761-771. doi: 10.1080/14737167.2023.2224963. Epub 2023 Jun 19. Expert Rev Pharmacoecon Outcomes Res. 2023. PMID: 37306511
-
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320. Health Technol Assess. 2001. PMID: 12065068
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
-
Eliciting adverse effects data from participants in clinical trials.Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2. Cochrane Database Syst Rev. 2018. PMID: 29372930 Free PMC article.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.
Cited by
-
Integration of machine learning and experimental validation to identify the prognostic signature related to diverse programmed cell deaths in breast cancer.Front Oncol. 2025 Jan 6;14:1505934. doi: 10.3389/fonc.2024.1505934. eCollection 2024. Front Oncol. 2025. PMID: 39834939 Free PMC article.
-
Advancements and future trends in machine learning for lung cancer: a comprehensive bibliometric analysis.Clin Transl Oncol. 2025 Jun 4. doi: 10.1007/s12094-025-03945-7. Online ahead of print. Clin Transl Oncol. 2025. PMID: 40465135 Review.
-
Ensemble machine learning models for lung cancer incidence risk prediction in the elderly: a retrospective longitudinal study.BMC Cancer. 2025 Jan 22;25(1):126. doi: 10.1186/s12885-025-13562-w. BMC Cancer. 2025. PMID: 39844064 Free PMC article.
-
Predicting Neoplastic Polyp in Patients With Gallbladder Polyps Using Interpretable Machine Learning Models: Retrospective Cohort Study.Cancer Med. 2025 Mar;14(5):e70739. doi: 10.1002/cam4.70739. Cancer Med. 2025. PMID: 40052528 Free PMC article.
-
Monte Carlo Gradient Boosted Trees for Cancer Staging: A Machine Learning Approach.Cancers (Basel). 2025 Jul 24;17(15):2452. doi: 10.3390/cancers17152452. Cancers (Basel). 2025. PMID: 40805154 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical