Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 Mar;34(3):660-671.
doi: 10.1038/s41379-020-0640-y. Epub 2020 Aug 5.

Artificial intelligence assistance significantly improves Gleason grading of prostate biopsies by pathologists

Collaborators, Affiliations
Comparative Study

Artificial intelligence assistance significantly improves Gleason grading of prostate biopsies by pathologists

Wouter Bulten et al. Mod Pathol. 2021 Mar.

Abstract

The Gleason score is the most important prognostic marker for prostate cancer patients, but it suffers from significant observer variability. Artificial intelligence (AI) systems based on deep learning can achieve pathologist-level performance at Gleason grading. However, the performance of such systems can degrade in the presence of artifacts, foreign tissue, or other anomalies. Pathologists integrating their expertise with feedback from an AI system could result in a synergy that outperforms both the individual pathologist and the system. Despite the hype around AI assistance, existing literature on this topic within the pathology domain is limited. We investigated the value of AI assistance for grading prostate biopsies. A panel of 14 observers graded 160 biopsies with and without AI assistance. Using AI, the agreement of the panel with an expert reference standard increased significantly (quadratically weighted Cohen's kappa, 0.799 vs. 0.872; p = 0.019). On an external validation set of 87 cases, the panel showed a significant increase in agreement with a panel of international experts in prostate pathology (quadratically weighted Cohen's kappa, 0.733 vs. 0.786; p = 0.003). In both experiments, on a group-level, AI-assisted pathologists outperformed the unassisted pathologists and the standalone AI system. Our results show the potential of AI systems for Gleason grading, but more importantly, show the benefits of pathologist-AI synergy.

PubMed Disclaimer

Conflict of interest statement

WB reports grants from the Dutch Cancer Society, during the conduct of the study. JvdL reports personal fees from Philips, grants from Philips, personal fees from ContextVision, personal fees from AbbVie, grants from Sectra, outside the submitted work. GL reports grants from the Dutch Cancer Society, during the conduct of the study; grants from Philips Digital Pathology Solutions, personal fees from Novartis, outside the submitted work. MB, JAB, AB, AC, LE, ME, XF, KG, GP, PR, GS, PS, JT, HvB, RV, and CH-vdK have nothing to disclose.

Figures

Fig. 1
Fig. 1. Overview of the viewer used in the observer experiment.
Both the original biopsy (a) and the biopsy with the AI overlay (b) are presented to the pathologist. Each individual tumor gland is marked by the deep learning system in the overlay. The case-level grade group was supplied to the panel as part of their (separate) grading form.
Fig. 2
Fig. 2. Survey results on the AI feedback.
Panel members were asked to indicate how useful each part of the AI’s feedback was on a five-point scale from “Not useful” to “Very useful”.
Fig. 3
Fig. 3. Survey results on the grading process.
Panel members were asked to reflect on the grading process and answer questions on a five-point scale from “Strongly disagree” to “Strongly agree”.
Fig. 4
Fig. 4. Panel performance with and without AI assistance.
With AI assistance, the median performance of the group increased while the variability between panel members went down.
Fig. 5
Fig. 5. Individual performance of panel members shown for both the unassisted read (light blue) and assisted read (dark blue).
Results for the internal test set shown in (a) and external test set shown in (b). Lower performance in the unassisted read is indicated with a line in the light blue bars. Pathologists are sorted based on experience level and the kappa value of the unassisted read. The performance of the standalone AI system is shown in green. In the unassisted reads, the AI system outperforms the group. In the assisted reads, the median performance of the group is higher than of the standalone AI system.
Fig. 6
Fig. 6. Pairwise agreement for each panel member with the other panel members.
Each horizontal bar indicates the average agreement. The consensus reference standard was not used in this figure. The agreement in the assisted read (dark blue) is higher than in the unassisted read (light blue). Panel members are sorted based on their agreement in the unassisted read.
Fig. 7
Fig. 7. Pairwise correlations on reported total tumor volume between panel members.
While only a slight increase can be observed in the assisted read, the total variation dropped substantially.

References

    1. Epstein JI. An update of the Gleason grading system. J Urol. 2010;183:433–40. doi: 10.1016/j.juro.2009.10.046. - DOI - PubMed
    1. Allsbrook WC, Mangold KA, Johnson MH, Lane RB, Lane CG, Epstein JI. Interobserver reproducibility of Gleason grading of prostatic carcinoma: general pathologists. Hum Pathol. 2001;32:81–8. doi: 10.1053/hupa.2001.21135. - DOI - PubMed
    1. Egevad L, Ahmad AS, Algaba F, Berney DM, Boccon-Gibod L, Compérat E, et al. Standardization of Gleason grading among 337 European pathologists. Histopathology. 2013;62:247–56. doi: 10.1111/his.12008. - DOI - PubMed
    1. Allsbrook WC, Mangold KA, Johnson MH, Lane RB, Lane CG, Amin MB, et al. Interobserver reproducibility of Gleason grading of prostatic carcinoma: urologic pathologists. Hum Pathol. 2001;32:74–80. doi: 10.1053/hupa.2001.21134. - DOI - PubMed
    1. Nagpal K, Foote D, Liu Y, Po-Hsuan C, Wulczyn E, Tan F, et al. Development and validation of a deep learning algorithm for improving Gleason scoring of prostate cancer. npj Digital Med. 2018;2:48. doi: 10.1038/s41746-019-0112-2. - DOI - PMC - PubMed

Publication types