Auditing the inference processes of medical-image classifiers by leveraging generative AI and the expertise of physicians
- PMID: 38155295
- DOI: 10.1038/s41551-023-01160-9
Auditing the inference processes of medical-image classifiers by leveraging generative AI and the expertise of physicians
Abstract
The inferences of most machine-learning models powering medical artificial intelligence are difficult to interpret. Here we report a general framework for model auditing that combines insights from medical experts with a highly expressive form of explainable artificial intelligence. Specifically, we leveraged the expertise of dermatologists for the clinical task of differentiating melanomas from melanoma 'lookalikes' on the basis of dermoscopic and clinical images of the skin, and the power of generative models to render 'counterfactual' images to understand the 'reasoning' processes of five medical-image classifiers. By altering image attributes to produce analogous images that elicit a different prediction by the classifiers, and by asking physicians to identify medically meaningful features in the images, the counterfactual images revealed that the classifiers rely both on features used by human dermatologists, such as lesional pigmentation patterns, and on undesirable features, such as background skin texture and colour balance. The framework can be applied to any specialized medical domain to make the powerful inference processes of machine-learning models medically understandable.
© 2023. The Author(s), under exclusive licence to Springer Nature Limited.
Conflict of interest statement
Competing interests: R.D. reports fees from L’Oreal, Frazier Healthcare Partners, Pfizer, DWA and VisualDx for consulting; stock options from MDAcne and Revea for advisory board; and research funding from UCB. The other authors declare no competing interests.
Similar articles
-
Dissection of medical AI reasoning processes via physician and generative-AI collaboration.medRxiv [Preprint]. 2023 May 16:2023.05.12.23289878. doi: 10.1101/2023.05.12.23289878. medRxiv. 2023. PMID: 37292705 Free PMC article. Preprint.
-
A promising AI based super resolution image reconstruction technique for early diagnosis of skin cancer.Sci Rep. 2025 Feb 11;15(1):5084. doi: 10.1038/s41598-025-89693-8. Sci Rep. 2025. PMID: 39934265 Free PMC article.
-
Artificial Intelligence and Its Effect on Dermatologists' Accuracy in Dermoscopic Melanoma Image Classification: Web-Based Survey Study.J Med Internet Res. 2020 Sep 11;22(9):e18091. doi: 10.2196/18091. J Med Internet Res. 2020. PMID: 32915161 Free PMC article.
-
[Computer-assisted skin cancer diagnosis : Is it time for artificial intelligence in clinical practice?].Hautarzt. 2020 Sep;71(9):669-676. doi: 10.1007/s00105-020-04662-8. Hautarzt. 2020. PMID: 32747996 Review. German.
-
Computational neural network in melanocytic lesions diagnosis: artificial intelligence to improve diagnosis in dermatology?Eur J Dermatol. 2019 Apr 1;29(S1):4-7. doi: 10.1684/ejd.2019.3538. Eur J Dermatol. 2019. PMID: 31017580 Review.
Cited by
-
Explainable AI for computational pathology identifies model limitations and tissue biomarkers.ArXiv [Preprint]. 2024 Nov 18:arXiv:2409.03080v2. ArXiv. 2024. PMID: 39279830 Free PMC article. Preprint.
-
Visual interpretability of image-based classification models by generative latent space disentanglement applied to in vitro fertilization.Nat Commun. 2024 Aug 27;15(1):7390. doi: 10.1038/s41467-024-51136-9. Nat Commun. 2024. PMID: 39191720 Free PMC article.
-
DREAM: A framework for discovering mechanisms underlying AI prediction of protected attributes.medRxiv [Preprint]. 2025 Jul 21:2024.04.09.24305289. doi: 10.1101/2024.04.09.24305289. medRxiv. 2025. PMID: 40778150 Free PMC article. Preprint.
-
Digital twins as global learning health and disease models for preventive and personalized medicine.Genome Med. 2025 Feb 7;17(1):11. doi: 10.1186/s13073-025-01435-7. Genome Med. 2025. PMID: 39920778 Free PMC article. Review.
-
Machine learning methods for histopathological image analysis: Updates in 2024.Comput Struct Biotechnol J. 2024 Dec 30;27:383-400. doi: 10.1016/j.csbj.2024.12.033. eCollection 2025. Comput Struct Biotechnol J. 2024. PMID: 39897057 Free PMC article. Review.
References
-
- DeGrave, A. J., Janizek, J. D. & Lee, S.-I. AI for radiographic COVID-19 detection selects shortcuts over signal. Nat. Mach. Intell. 3, 610–619 (2021). - DOI
-
- Singh, N. et al. Agreement between saliency maps and human-labeled regions of interest: applications to skin disease classification. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 3172–3181 (IEEE, 2020).
MeSH terms
Grants and funding
- R01 AG061132/AG/NIA NIH HHS/United States
- DBI-1759487/National Science Foundation (NSF)
- R35 GM 128638/U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
- 5T32 AR007422-38/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
LinkOut - more resources
Full Text Sources
Medical