Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Feb 1;11(2):109-118.
doi: 10.1001/jamaoncol.2024.5356.

Deep Learning Model for Predicting Immunotherapy Response in Advanced Non-Small Cell Lung Cancer

Affiliations
Multicenter Study

Deep Learning Model for Predicting Immunotherapy Response in Advanced Non-Small Cell Lung Cancer

Mehrdad Rakaee et al. JAMA Oncol. .

Abstract

Importance: Only a small fraction of patients with advanced non-small cell lung cancer (NSCLC) respond to immune checkpoint inhibitor (ICI) treatment. For optimal personalized NSCLC care, it is imperative to identify patients who are most likely to benefit from immunotherapy.

Objective: To develop a supervised deep learning-based ICI response prediction method; evaluate its performance alongside other known predictive biomarkers; and assess its association with clinical outcomes in patients with advanced NSCLC.

Design, setting, and participants: This multicenter cohort study developed and independently validated a deep learning-based response stratification model for predicting ICI treatment outcome in patients with advanced NSCLC from whole slide hematoxylin and eosin-stained images. Images for model development and validation were obtained from 1 participating center in the US and 3 in the European Union (EU) from August 2014 to December 2022. Data analyses were performed from September 2022 to May 2024.

Exposure: Monotherapy with ICIs.

Main outcomes and measures: Model performance measured by clinical end points and objective response rate (ORR) differentiation power vs other predictive biomarkers, ie, programmed death-ligand 1 (PD-L1), tumor mutational burden (TMB), and tumor-infiltrating lymphocytes (TILs).

Results: A total of 295 581 image tiles from 958 patients (mean [SD] age, 66.0 [10.6] years; 456 [48%] females and 502 [52%] males) treated with ICI for NSCLC were included in the analysis. The US-based development cohort consisted of 614 patients with median (IQR) follow-up time of 54.5 (38.2-68.1) months, and the EU-based validation cohort, 344 patients with 43.3 (27.4-53.9) months of follow-up. The ORR to ICI was 26% in the developmental cohort and 28% in the validation cohort. The deep learning model's area under the receiver operating characteristic curve (AUC) for ORR was 0.75 (95% CI, 0.64-0.85) in the internal test set and 0.66 (95% CI, 0.60-0.72) in the validation cohort. In a multivariable analysis, the deep learning model's score was an independent predictor of ICI response in the validation cohort for both progression-free (hazard ratio, 0.56; 95% CI, 0.42-0.76; P < .001) and overall survival (hazard ratio, 0.53; 95% CI, 0.39-0.73; P < .001). The tuned deep learning model achieved a higher AUC than TMB, TILs, and PD-L1 in the internal set; in the validation cohort, it was superior to TILs and comparable with PD-L1 (AUC, 0.67; 95% CI, 0.60-0.74), with a 10-percentage point improvement in specificity. In the validation cohort, combining the deep learning model with PD-L1 scores achieved an AUC of 0.70 (95% CI, 0.63-0.76), outperforming either marker alone, with a response rate of 51% compared to 41% for PD-L1 (≥50%) alone.

Conclusions and relevance: The findings of this cohort study demonstrate a strong and independent deep learning-based feature associated with ICI response in patients with NSCLC across various cohorts. Clinical use of this deep learning model could refine treatment precision and better identify patients who are likely to benefit from ICI for treatment of advanced NSCLC.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Ricciuti reported personal fees from Amgen, AstraZeneca, Regeneron, and Bayer outside the submitted work. Dr Cortellini reported personal fees from Merck Sharp & Dohme, Bristol Myers Squibb, Roche, AstraZeneca, Regeneron, GlaxoSmithKline, and Sanofi outside the submitted work. Dr Di Federico reported personal fees from Novartis and Hanson-Wade outside the submitted work. Dr Hashemi reported institutional grants from AstraZeneca, Boehringer Ingelheim, Bristol Myers Squibb, Lilly Janssen, GlaxoSmithKline, Merck Sharp & Dohme, Novartis, Roche, and Takeda outside the submitted work. Dr Pinato reported consulting fees from ViiV Healthcare, Bayer, Roche, MiNa Therapeutics, Mursla Bio, H3B, AstraZeneca, Da Volterra, Exact Sciences, Ipsen, Avammune, Lift Biosciences, Starpharma, Boston Scientific, Eisai, Bristol Myers Squibb, Merck Sharp & Dohme, and GlaxoSmithKline outside the submitted work. Dr Helland reported research grants from AstraZeneca, Roche, Incyte, Eli Lilly Drug, Novartis, Ultimovacs, Illumina Nanopore, Johnson & Johnson, Pfizer, AstraZeneca, Sanofi, Roche, Takeda, Medicover, Bayer, Bristol Myers Squibb, Merck Sharp & Dohme, and Medicover Analyses, and other support from AdBoards, all outside the submitted work. Dr Sholl reported grants from Bristol Myers Squibb and AstraZeneca, and institutional grants from Genentech and Lilly outside the submitted work. Dr Awad reported personal fees from Merck, Mirati, Gritstone, and grants from Amgen, AstraZeneca, EMD Serono, Regeneron, Janssen, Affini-T, Novartis, Coherus BioSciences, Synthekine, Genentech, Bristol Myers Squibb, AbbVie, Lilly, Genentech, and Bristol Myers Squibb, outside the submitted work. Dr Kwiatkowski reported support from Genentech, AADI, and Revolution Medicines; and consulting fees from Genentech, AADI, Expertconnect, Guidepoint, Bridgebio, Slingshot Insights, William Blair, MEDACorp, and Radyus Research, all outside the submitted work. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Deep Learning Model and Immune Checkpoint Inhibitors (ICIs) by Clinical Outcome
A and B, Progression-free survival and overall survival in response to ICI, stratified by the Deep-IO model scores in the validation cohort. C, Distribution of Deep-IO probability scores across ORR subgroups in the validation cohort using the Mann-Whitney U-test. CR indicates complete response; HR, hazard ratio; mOS, median OS in months; mPFS, median PFS in months; ORR, objective response rate; PD, progressive disease; PR, partial response; and SD, stable disease.
Figure 2.
Figure 2.. Multivariable Analysis in the Validation Cohort
Cox proportional hazard models of significant independent predictive factors associated with progression-free survival and overall survival. Variables with P < .25 from univariate analyses were included in the Cox regression analysis. Number of events and C-index are indicated for each model. C-index indicates concordance index; ECOG, Eastern Cooperative Oncology Group performance status; ICI, immune checkpoint inhibitors; LUAD, lung adenocarcinoma; and LUSC, lung squamous cell carcinoma.
Figure 3.
Figure 3.. Performance of Immune Checkpoint Inhibitors (ICIs) Biomarkers vs the Deep-IO Model
Analysis of the performance power of individual biomarkers (Deep-IO, PD-L1, TMB, TILs) and the combined Deep-IO and PD-L1 in differentiating ICI objective response rate binary groups in the A, test and B, validation cohorts. C, Proportion of ICI responses in PD-L1 and Deep-IO subgroups within the validation cohort, with Deep-IO subgroups classified into tertiles as low (lower tertile), medium (middle tertile), and high (upper tertile). PD-L1 subgroups are categorized as low (<1%), medium (1%-49%), and high (≥50%). D, Combination of Deep-IO scores and PD-L1 expression subgroups in relation to the ICI response rate within the validation cohort. The color intensity of the squares represents the response rate, with darker colors indicating a higher response rate and lighter colors indicating a lower response rate (shown as percentages). AUC indicates area under the receiver operating characteristic curve; HR, hazard ratio; ORR, objective response rate; PD-L1, programmed death-ligand 1; TILs, tumor-infiltrating lymphocytes; TMB, tumor mutational burden; and TPS, tumor proportion score.

References

    1. Hellmann MD, Paz-Ares L, Bernabe Caro R, et al. . Nivolumab plus ipilimumab in advanced non-small-cell lung cancer. N Engl J Med. 2019;381(21):2020-2031. doi:10.1056/NEJMoa1910231 - DOI - PubMed
    1. Gogishvili M, Melkadze T, Makharadze T, et al. . Cemiplimab plus chemotherapy versus chemotherapy alone in non-small cell lung cancer: a randomized, controlled, double-blind phase 3 trial. Nat Med. 2022;28(11):2374-2380. doi:10.1038/s41591-022-01977-y - DOI - PMC - PubMed
    1. Hendriks LE, Kerr KM, Menis J, et al. ; ESMO Guidelines Committee . Non-oncogene-addicted metastatic non-small-cell lung cancer: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up. Ann Oncol. 2023;34(4):358-376. doi:10.1016/j.annonc.2022.12.013 - DOI - PubMed
    1. Pinato DJ, Shiner RJ, White SDT, et al. . Intra-tumoral heterogeneity in the expression of programmed-death (PD) ligands in isogeneic primary and metastatic lung cancer: Implications for immunotherapy. Oncoimmunology. 2016;5(9):e1213934. doi:10.1080/2162402X.2016.1213934 - DOI - PMC - PubMed
    1. Horn L, Spigel DR, Vokes EE, et al. . Nivolumab versus docetaxel in previously treated patients with advanced non-small-cell lung cancer: two-year outcomes from two randomized, open-label, phase III trials (CheckMate 017 and CheckMate 057). J Clin Oncol. 2017;35(35):3924-3933. doi:10.1200/JCO.2017.74.3062 - DOI - PMC - PubMed

Publication types

MeSH terms