Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct;20(5):850-866.
doi: 10.1016/j.gpb.2022.11.003. Epub 2022 Dec 1.

Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis

Affiliations
Review

Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis

Yawei Li et al. Genomics Proteomics Bioinformatics. 2022 Oct.

Abstract

The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively characterized lung cancer through the use of different perspectives from these accrued data. In this review, we provide an overview of machine learning-based approaches that strengthen the varying aspects of lung cancer diagnosis and therapy, including early detection, auxiliary diagnosis, prognosis prediction, and immunotherapy practice. Moreover, we highlight the challenges and opportunities for future applications of machine learning in lung cancer.

Keywords: Feature extraction; Imaging dataset; Immunotherapy; Omics dataset; Prediction.

PubMed Disclaimer

Conflict of interest statement

The authors have declared no competing interests.

Figures

Figure 1
Figure 1
Applications of ML model in lung cancer We presented an overview of ML methodologies for different aspects of lung cancer therapies, including CAD from imaging datasets, lung cancer early detection based on sequencing technologies, data integration and biomarker extraction from multi-omics datasets, treatment response and prognosis prediction, and immunotherapy studies. ML, machine learning; IC50, half-maximal inhibitory concentration; HLA, human leukocyte antigen; CT, computed tomography; MALDI, matrix-assisted laser desorption/ionization; DL, deep learning; cfDNA, cell-free DNA; CAD, computer-aided diagnosis; CNV, copy number variation; RECIST, Response Evaluation Criteria in Solid Tumors; TIL, tumor-infiltrating lymphocyte.
Figure 2
Figure 2
Feature-based CAD and DL-based CAD systems Differences in the development process of feature-based CAD systems and CNN-based CAD systems. Compared with feature-based CAD systems, the DL-based CAD systems can automatically retrieve and extract intrinsic features of a suspicious nodule. CNN, convolutional neural network; LR, logistic regression; SVM, support vector machine; RF, random forest.
Figure 3
Figure 3
Omics analysis in lung cancer studies Different sequencing techniques allow for the simultaneous measurement of multiple molecular features of a biological sample. To improve efficiency and reduce overfitting, statistical and ML tools perform differential analysis or feature selection. Further ML models concatenate the obtained omics features with clinical features as input for lung cancer diagnostic/prognostic prediction. DEG, differentially expressed gene; RFE, recursive feature elimination; UAF, univariate association filtering.
Figure 4
Figure 4
Diagram of ML applications in treatment response and survival prediction

References

    1. Thai A.A., Solomon B.J., Sequist L.V., Gainor J.F., Heist R.S. Lung cancer. Lancet. 2021;398:535–554. - PubMed
    1. Svoboda E. Artificial intelligence is improving the detection of lung cancer. Nature. 2020;587:S20–S22. - PubMed
    1. Ling S., Hu Z., Yang Z., Yang F., Li Y., Lin P., et al. Extremely high genetic diversity in a single tumor points to prevalence of non-darwinian cell evolution. Proc Natl Acad Sci U S A. 2015;112:E6496–E6505. - PMC - PubMed
    1. International Cancer Genome Consortium, Hudson T.J., Anderson W., Artez A., Barker A.D., Bell C., et al. International network of cancer genome projects. Nature. 2010;464:993–998. - PMC - PubMed
    1. Cancer Genome Atlas Research Network, Weinstein J.N., Collisson E.A., Mills G.B., Shaw K.R., Ozenberger B.A., et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45:1113–1120. - PMC - PubMed

Publication types