External Test of a Deep Learning Algorithm for Pulmonary Nodule Malignancy Risk Stratification Using European Screening Data

Noa Antonissen¹, Kiran Vaidhya Venkadesh¹, Renate Dinnessen¹, Ernst Th Scholten¹, Zaigham Saghir^{2

3}, Mario Silva⁴, Ugo Pastorino⁵, Grigory Sidorenkov^{6

7}, Marjolein A Heuvelmans^{6

8}, Geertruida H de Bock⁶, Firdaus A A Mohamed Hoesein⁹, Pim A de Jong⁹, Harry J M Groen¹⁰, Rozemarijn Vliegenthart⁷, Hester A Gietema^{11

12}, Mathias Prokop¹, Cornelia Schaefer-Prokop^{1

13}, Colin Jacobs¹; NELSON-POP consortium

Collaborators, Affiliations

Affiliations

¹ Department of Medical Imaging, Diagnostic Image Analysis Group, Radboud University Medical Center, Route 767, Room 2.30, Radboudumc, Geert Grooteplein Zuid 10, 6525 GA Nijmegen, the Netherlands.
² Department of Medicine, Section of Pulmonary Medicine, Herlev-Gentofte Hospital, Hellerup, Denmark.
³ Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
⁴ Department of Medicine and Surgery, University of Parma, Parma, Italy.
⁵ Department of Thoracic Surgery, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
⁶ Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
⁷ Department of Radiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
⁸ Department of Respiratory Medicine, Amsterdam University Medical Center, Amsterdam, the Netherlands.
⁹ Department of Radiology, Utrecht University, University Medical Center Utrecht, Utrecht, the Netherlands.
¹⁰ Department of Pulmonary Disease, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
¹¹ Department of Radiology and Nuclear Medicine, Maastricht University, Maastricht University Medical Center, Maastricht, the Netherlands.
¹² GROW School for Oncology and Reproduction, Maastricht University, Maastricht, the Netherlands.
¹³ Department of Radiology, Meander Medical Center, Amersfoort, the Netherlands.

PMID: 40956165
DOI: 10.1148/radiol.250874

Multicenter Study

External Test of a Deep Learning Algorithm for Pulmonary Nodule Malignancy Risk Stratification Using European Screening Data

Noa Antonissen et al. Radiology. 2025 Sep.

. 2025 Sep;316(3):e250874.

doi: 10.1148/radiol.250874.

Authors

Affiliations

¹ Department of Medical Imaging, Diagnostic Image Analysis Group, Radboud University Medical Center, Route 767, Room 2.30, Radboudumc, Geert Grooteplein Zuid 10, 6525 GA Nijmegen, the Netherlands.
² Department of Medicine, Section of Pulmonary Medicine, Herlev-Gentofte Hospital, Hellerup, Denmark.
³ Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
⁴ Department of Medicine and Surgery, University of Parma, Parma, Italy.
⁵ Department of Thoracic Surgery, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
⁶ Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
⁷ Department of Radiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
⁸ Department of Respiratory Medicine, Amsterdam University Medical Center, Amsterdam, the Netherlands.
⁹ Department of Radiology, Utrecht University, University Medical Center Utrecht, Utrecht, the Netherlands.
¹⁰ Department of Pulmonary Disease, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
¹¹ Department of Radiology and Nuclear Medicine, Maastricht University, Maastricht University Medical Center, Maastricht, the Netherlands.
¹² GROW School for Oncology and Reproduction, Maastricht University, Maastricht, the Netherlands.
¹³ Department of Radiology, Meander Medical Center, Amersfoort, the Netherlands.

PMID: 40956165
DOI: 10.1148/radiol.250874

Abstract

Background Low-dose CT screening reduces lung cancer-related deaths but has high rates of false-positive findings. A deep learning (DL) algorithm could improve nodule risk stratification but requires robust external testing. Purpose To externally test a DL algorithm for nodule malignancy risk estimation using pooled data from three large European lung cancer screening trials. Materials and Methods In this retrospective study, a DL algorithm trained on National Lung Screening Trial data was externally tested using baseline CT scans from the Danish Lung Cancer Screening Trial, the Multicentric Italian Lung Detection trial, and the Dutch-Belgian Lung Cancer Screening Trial. Performance was assessed across the pooled cohort and two subsets: subset A, including indeterminate nodules (5-15 mm); and subset B, including cancers size-matched to benign nodules (1:2 ratio). Performance, including the area under the receiver operating characteristic curve (AUC), was compared with the Pan-Canadian Early Detection of Lung Cancer (PanCan) model. Results The pooled cohort included 4146 participants (median age, 58 years; 78% male participants; median smoking history, 38 pack-years) with 7614 benign and 180 malignant nodules. The DL algorithm achieved AUCs of 0.98, 0.96, and 0.94 for cancers diagnosed within 1 year, 2 years, and throughout screening, respectively, compared with 0.98, 0.94, and 0.93 (P = .19, .02, and .46, respectively) for the PanCan model. In subset A (129 malignant and 2086 benign nodules), DL significantly outperformed PanCan across the same cancer diagnosis timeframes (respective AUCs: 0.95, 0.94, and 0.90 vs 0.91, 0.88, and 0.86; all P < .05). At 100% sensitivity for cancers diagnosed within 1 year, DL classified 68.1% of benign cases as low risk versus 47.4% for the PanCan model, a 39.4% relative reduction in false-positive findings. In subset B (180 malignant and 360 benign nodules), the AUC of the DL algorithm versus the PanCan model was 0.79 versus 0.60 (P < .01), respectively. Conclusion The DL algorithm outperformed the PanCan model across multiple European screening datasets, demonstrating superior malignancy prediction while substantially reducing false-positive classifications for indeterminate nodules. © RSNA, 2025 Supplemental material is available for this article.

PubMed Disclaimer

Comment in

Reducing False Alarms in Lung Cancer Screening: The Promise of Deep Learning.
Mohajer B, Chernyak V. Mohajer B, et al. Radiology. 2025 Oct;317(1):e252917. doi: 10.1148/radiol.252917. Radiology. 2025. PMID: 41147921 No abstract available.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Atypon
Medical
- MedlinePlus Consumer Health Information
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

External Test of a Deep Learning Algorithm for Pulmonary Nodule Malignancy Risk Stratification Using European Screening Data

Collaborators

Affiliations

External Test of a Deep Learning Algorithm for Pulmonary Nodule Malignancy Risk Stratification Using European Screening Data

Authors

Collaborators

Affiliations

Abstract

Comment in

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical