A generalised vision transformer-based self-supervised model for diagnosing and grading prostate cancer using histological images
- PMID: 40087478
- PMCID: PMC12643932
- DOI: 10.1038/s41391-025-00957-w
Abstract
Background: Gleason grading remains the gold standard for prostate cancer histological classification and prognosis, yet its subjectivity leads to grade variability between pathologists, potentially impacting clinical decision-making. Herein, we trained and validated a generalised AI-driven system for diagnosing prostate cancer using diverse datasets from tissue microarray (TMA) core and whole slide images (WSIs) with Haematoxylin and Eosin staining.
Methods: We analysed eight prostate cancer datasets, which included 12,711 histological images from 3648 patients, incorporating TMA core images and WSIs. The Macenko method was used to normalise colours for consistency across diverse images. Subsequently, we trained a multi-resolution (5x, 10x, 20x, and 40x) binary classifier to identify benign and malignant tissue. We then implemented a multi-class classifier for Gleason patterns (GP) sub-categorisation from malignant tissue. Finally, the models were externally validated on 11,132 histology images from 2176 patients to determine the International Society of Urological Pathology (ISUP) grade. Models were assessed using various classification metrics, and the agreement between the model's predictions and the ground truth was quantified using the quadratic weighted Cohen's Kappa (κ) score.
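As a rough illustration of the agreement metric named above, the following is a minimal sketch of quadratic weighted Cohen's kappa for ordinal labels. It is not the authors' code: the function name, the integer label encoding (0 to `n_classes - 1`), and the example labels are assumptions made for illustration only.

```python
from collections import Counter


def quadratic_weighted_kappa(y_true, y_pred, n_classes):
    """Quadratic weighted Cohen's kappa for integer labels in 0..n_classes-1.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    if n_classes < 2:
        raise ValueError("n_classes must be at least 2")

    n = len(y_true)
    observed = Counter(zip(y_true, y_pred))  # joint counts of (true, predicted)
    true_counts = Counter(y_true)            # marginal counts of true labels
    pred_counts = Counter(y_pred)            # marginal counts of predicted labels

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(n_classes):
        for j in range(n_classes):
            # Disagreement weight grows with the squared distance between grades.
            weight = (i - j) ** 2 / (n_classes - 1) ** 2
            weighted_observed += weight * observed[(i, j)] / n
            # Expected proportion under independence of the two raters.
            weighted_expected += weight * true_counts[i] * pred_counts[j] / (n * n)

    if weighted_expected == 0.0:
        # Both raters used a single identical class; kappa is undefined.
        return float("nan")
    return 1.0 - weighted_observed / weighted_expected


if __name__ == "__main__":
    # Made-up labels, purely to show the call; not data from the study.
    reference = [0, 1, 2, 3, 3, 2, 1, 0]
    predicted = [0, 1, 2, 3, 2, 2, 1, 1]
    print(quadratic_weighted_kappa(reference, predicted, n_classes=4))
```

Because the weights are quadratic, a prediction two grades away from the reference is penalised four times as heavily as one that is a single grade away, which suits ordinal scales such as grade groups.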
Results: Our multi-resolution binary classifier demonstrated robust performance in distinguishing malignant from benign tissue, with a κ score of 0.967 on internal validation. The model achieved κ scores ranging from 0.876 to 0.995 across four unseen testing datasets. The multi-class classifier also distinguished GP3, GP4, and GP5 with an overall κ score of 0.841. This model was further tested across four datasets, obtaining κ scores ranging from 0.774 to 0.888. The models' performance was compared against an independent pathologist's annotation on an external dataset, achieving a κ score of 0.752 for four classes.
Conclusion: The self-supervised ViT-based model effectively diagnoses and grades prostate cancer using histological images, distinguishing benign and malignant tissues and classifying malignancies by aggressiveness. External validation highlights its robustness and clinical applicability in digital pathology.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: AKC, PWT, and AWH are cofounders of Pandani Solutions Pty Ltd, which is developing automated AI-based histopathology assessments.
Ethics approval: This study analysed publicly accessible datasets. Ethical approval and informed consent were waived, as all information in the datasets was completely de-identified and did not involve direct interaction with human participants.
