Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Affiliations

¹ Computational Pharmacology and Clinical Oncology (COMPO) Unit, Inria Sophia Antipolis-Méditerranée, Cancer Research Center of Marseille, Inserm UMR1068, CNRS UMR7258, Aix Marseille University UM105, 13385 Marseille, France.
² Multidisciplinary Oncology and Therapeutic Innovations Department, Assistance Publique-Hôpitaux de Marseille, Aix Marseille University, 13005 Marseille, France.
³ Thoracic Oncology Department, Aix Marseille University, CNRS, INSERM, CRCM, 13385 Marseille, France.
⁴ International Center of Thoracic Cancers, Gustave Roussy Cancer Campus, Université Paris-Saclay, 94805 Villejuif, France.

PMID: 34944830
PMCID: PMC8699503
DOI: 10.3390/cancers13246210

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Sébastien Benzekry et al. Cancers (Basel). 2021.

. 2021 Dec 9;13(24):6210.

doi: 10.3390/cancers13246210.

Authors

Affiliations

¹ Computational Pharmacology and Clinical Oncology (COMPO) Unit, Inria Sophia Antipolis-Méditerranée, Cancer Research Center of Marseille, Inserm UMR1068, CNRS UMR7258, Aix Marseille University UM105, 13385 Marseille, France.
² Multidisciplinary Oncology and Therapeutic Innovations Department, Assistance Publique-Hôpitaux de Marseille, Aix Marseille University, 13005 Marseille, France.
³ Thoracic Oncology Department, Aix Marseille University, CNRS, INSERM, CRCM, 13385 Marseille, France.
⁴ International Center of Thoracic Cancers, Gustave Roussy Cancer Campus, Université Paris-Saclay, 94805 Villejuif, France.

PMID: 34944830
PMCID: PMC8699503
DOI: 10.3390/cancers13246210

Abstract

Background: Immune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs.

Methods: Patients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response.

Results: Overall, 298 patients were enrolled. The overall response rate and DCR were 15.3% and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p < 0.0001; OR 1.8, p < 0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophil-to-lymphocyte ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 ± 0.04, sensitivity 0.58 ± 0.08; specificity 0.78 ± 0.06; positive predictive value 0.70 ± 0.08; negative predictive value 0.68 ± 0.06; AUC 0.74 ± 0.03.

Conclusion: Combination of simple clinical and biological data could accurately predict disease control rate at the individual level.

Keywords: blood counts; lung cancer; machine learning; prediction; response; survival.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
Exploratory data analysis. (A) Boxplots of continuous variables. (B) Barplots of categorical variables. BMI = body mass index, NLR = neutrophil-to-lymphocyte ratio, PLR = platelets-to-lymphocytes ratio, CR = complete response, PR = partial response, SD = stable disease and PD = progressive disease. Stars indicate statistical significance: **: p < 0.01, ***: p < 0.001, ****: p < 0.0001, n.s. = non-significant.

**Figure 2**
Variable selection. (A) Feature importance based on random forest classification and mean decrease in accuracy. (B) Accuracy score of incremental logistic regression models built on an increasing number of predictors (i.e., the first one contains only hemoglobin, the second hemoglobin and NLR, etc.). NLR = neutrophil-to-lymphocyte ratio. PLR = platelet-to-lymphocyte ratio. BMI = body mass index.

**Figure 3**
Machine learning algorithms predictive performances. (A) Receiver-operator curves for prediction on test sets from each fold of the outer cross-validation loop, for each model. AUC = area under the curve. (B) Precision (positive-predictive value)–recall (sensitivity) curves. (C) Main performance metrics for each algorithm. (D) Decision tree obtained after tuning and training. Each node shows: the predicted class (0 = PD, 1 = CR + PR + SD), the predicted probability of response and the percentage of total observations in the node.

See this image and copyright information in PMC

References

1. Planchard D., Popat S., Kerr K., Novello S., Smit E.F., Faivre-Finn C., Mok T.S., Reck M., Van Schil P.E., Hellmann M.D., et al. Metastatic non-small cell lung cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2018;29((Suppl. 4)):iv192–iv237. doi: 10.1093/annonc/mdy275. - DOI - PubMed
1. Reck M., Rodríguez-Abreu D., Robinson A.G., Hui R., Csőszi T., Fülöp A., Gottfried M., Peled N., Tafreshi A., Cuffe S., et al. Pembrolizumab versus Chemotherapy for PD-L1-Positive Non-Small-Cell Lung Cancer. N. Engl. J. Med. 2016;375:1823–1833. doi: 10.1056/NEJMoa1606774. - DOI - PubMed
1. Gooden M.J.M., de Bock G.H., Leffers N., Daemen T., Nijman H.W. The prognostic influence of tumour-infiltrating lymphocytes in cancer: A systematic review with meta-analysis. Br. J. Cancer. 2011;105:93–103. doi: 10.1038/bjc.2011.189. - DOI - PMC - PubMed
1. Balkwill F., Mantovani A. Inflammation and cancer: Back to Virchow? Lancet. 2001;357:539–545. doi: 10.1016/S0140-6736(00)04046-0. - DOI - PubMed
1. Hopkins A.M., Rowland A., Kichenadasse G., Wiese M.D., Gurney H., McKinnon R.A., Karapetis C.S., Sorich M.J. Predicting response and toxicity to immune checkpoint inhibitors using routinely available blood and clinical markers. Br. J. Cancer. 2017;117:913–920. doi: 10.1038/bjc.2017.274. - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Affiliations

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources