Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 9;13(24):6210.
doi: 10.3390/cancers13246210.

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Affiliations

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Sébastien Benzekry et al. Cancers (Basel). .

Abstract

Background: Immune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs.

Methods: Patients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response.

Results: Overall, 298 patients were enrolled. The overall response rate and DCR were 15.3% and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p < 0.0001; OR 1.8, p < 0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophil-to-lymphocyte ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 ± 0.04, sensitivity 0.58 ± 0.08; specificity 0.78 ± 0.06; positive predictive value 0.70 ± 0.08; negative predictive value 0.68 ± 0.06; AUC 0.74 ± 0.03.

Conclusion: Combination of simple clinical and biological data could accurately predict disease control rate at the individual level.

Keywords: blood counts; lung cancer; machine learning; prediction; response; survival.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Exploratory data analysis. (A) Boxplots of continuous variables. (B) Barplots of categorical variables. BMI = body mass index, NLR = neutrophil-to-lymphocyte ratio, PLR = platelets-to-lymphocytes ratio, CR = complete response, PR = partial response, SD = stable disease and PD = progressive disease. Stars indicate statistical significance: **: p < 0.01, ***: p < 0.001, ****: p < 0.0001, n.s. = non-significant.
Figure 2
Figure 2
Variable selection. (A) Feature importance based on random forest classification and mean decrease in accuracy. (B) Accuracy score of incremental logistic regression models built on an increasing number of predictors (i.e., the first one contains only hemoglobin, the second hemoglobin and NLR, etc.). NLR = neutrophil-to-lymphocyte ratio. PLR = platelet-to-lymphocyte ratio. BMI = body mass index.
Figure 3
Figure 3
Machine learning algorithms predictive performances. (A) Receiver-operator curves for prediction on test sets from each fold of the outer cross-validation loop, for each model. AUC = area under the curve. (B) Precision (positive-predictive value)–recall (sensitivity) curves. (C) Main performance metrics for each algorithm. (D) Decision tree obtained after tuning and training. Each node shows: the predicted class (0 = PD, 1 = CR + PR + SD), the predicted probability of response and the percentage of total observations in the node.

References

    1. Planchard D., Popat S., Kerr K., Novello S., Smit E.F., Faivre-Finn C., Mok T.S., Reck M., Van Schil P.E., Hellmann M.D., et al. Metastatic non-small cell lung cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2018;29((Suppl. 4)):iv192–iv237. doi: 10.1093/annonc/mdy275. - DOI - PubMed
    1. Reck M., Rodríguez-Abreu D., Robinson A.G., Hui R., Csőszi T., Fülöp A., Gottfried M., Peled N., Tafreshi A., Cuffe S., et al. Pembrolizumab versus Chemotherapy for PD-L1-Positive Non-Small-Cell Lung Cancer. N. Engl. J. Med. 2016;375:1823–1833. doi: 10.1056/NEJMoa1606774. - DOI - PubMed
    1. Gooden M.J.M., de Bock G.H., Leffers N., Daemen T., Nijman H.W. The prognostic influence of tumour-infiltrating lymphocytes in cancer: A systematic review with meta-analysis. Br. J. Cancer. 2011;105:93–103. doi: 10.1038/bjc.2011.189. - DOI - PMC - PubMed
    1. Balkwill F., Mantovani A. Inflammation and cancer: Back to Virchow? Lancet. 2001;357:539–545. doi: 10.1016/S0140-6736(00)04046-0. - DOI - PubMed
    1. Hopkins A.M., Rowland A., Kichenadasse G., Wiese M.D., Gurney H., McKinnon R.A., Karapetis C.S., Sorich M.J. Predicting response and toxicity to immune checkpoint inhibitors using routinely available blood and clinical markers. Br. J. Cancer. 2017;117:913–920. doi: 10.1038/bjc.2017.274. - DOI - PMC - PubMed

LinkOut - more resources