Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 7;9(1):166.
doi: 10.1038/s41698-025-00940-7.

Systematic scoping review of external validation studies of AI pathology models for lung cancer diagnosis

Affiliations

Systematic scoping review of external validation studies of AI pathology models for lung cancer diagnosis

Soumya Arun et al. NPJ Precis Oncol. .

Abstract

Clinical adoption of digital pathology-based artificial intelligence models for diagnosing lung cancer has been limited, partly due to lack of robust external validation. This review provides an overview of such tools, their performance and external validation. We systematically searched for external validation studies in medical, engineering and grey literature databases from 1st January 2010 to 31st October 2024. 22 studies were included. Models performed various tasks, including classification of malignant versus non-malignant tissue, tumour growth pattern classification and subtyping of adeno- versus squamous cell carcinomas. Subtyping models were most common and performed highly, with average AUC values ranging from 0.746 to 0.999. Although most studies used restricted datasets, methodological issues relevant to the applicability of models in real-world settings included small and/or non-representative datasets, retrospective studies and case-control studies without further real-world validation. Ultimately, more rigorous external validation of models is warranted for increased clinical adoption.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Author J.O. has previously acted as a paid consultant for Hardian Health but declares no non-financial competing interests. All other authors declare no financial or non-financial competing interests.

Figures

Fig. 1
Fig. 1. PRISMA flow diagram.
Summary of studies included and excluded at each stage of the review.
Fig. 2
Fig. 2
Characteristics of included studies.
Fig. 3
Fig. 3. Results of the quality assessment conducted using QUADAS-AI-P.
a Results of the risk of bias assessment. For each QUADAS-AI-P domain, the blue, orange and grey sections of the bar indicate the percentage of studies judged to be at low, high or unclear risk of bias, respectively. b Results of the concerns regarding applicability assessment. For each QUADAS-AI-P domain, the blue, orange and grey sections of the bar indicate the percentage of studies considered to have low, high or unclear concerns regarding applicability. QUADAS-AI-P: QUality Assessment tool of Diagnostic Accuracy Studies tailored to Artificial Intelligence and digital Pathology.

References

    1. Kim, Y. J., Roh, E. H. & Park, S. A literature review of quality, costs, process-associated with digital pathology. J. Exerc. Rehabil.17, 11–14 (2021). - PMC - PubMed
    1. Gourd, E. Lung cancer control in the UK hit badly by COVID-19 pandemic. Lancet Oncol.21, 1559 (2020). - PMC - PubMed
    1. Cancer Research UK. Survival for Lung Cancer. https://www.cancerresearchuk.org/about-cancer/lung-cancer/survival (2022).
    1. Lam, S. et al. Current and future perspectives on computed tomography screening for lung cancer: a roadmap from 2023 to 2027 from the International Association for the study of lung cancer. J. Thorac. Oncol.19, 36–51 (2024). - PMC - PubMed
    1. The Royal College of Pathologists. Meeting Pathology Demand: Histopathology Workforce Census.https://www.rcpath.org/static/952a934d-2ec3-48c9-a8e6e00fcdca700f/Meetin... (2018).

LinkOut - more resources