GANterfactual-Counterfactual Explanations for Medical Non-experts Using Generative Adversarial Learning

Silvan Mertes et al. Front Artif Intell. 2022 Apr 8;5:825565. doi: 10.3389/frai.2022.825565. eCollection 2022.

Abstract

With the ongoing rise of machine learning, the need for methods that explain decisions made by artificial intelligence systems is becoming increasingly important. Especially for image classification tasks, many state-of-the-art tools for explaining such classifiers rely on visual highlighting of important areas of the input data. In contrast, counterfactual explanation systems try to enable counterfactual reasoning by modifying the input image in such a way that the classifier would have made a different prediction. By doing so, users of counterfactual explanation systems are provided with a completely different kind of explanatory information. However, methods for generating realistic counterfactual explanations for image classifiers are still rare. Especially in medical contexts, where relevant information is often textural and structural, high-quality counterfactual images have the potential to give meaningful insights into decision processes. In this work, we present GANterfactual, an approach to generate such counterfactual image explanations based on adversarial image-to-image translation techniques. Additionally, we conduct a user study to evaluate our approach in an exemplary medical use case. Our results show that, in the chosen medical use case, counterfactual explanations lead to significantly better results regarding mental models, explanation satisfaction, trust, emotions, and self-efficacy than two state-of-the-art systems that work with saliency maps, namely LIME and LRP.

Keywords: counterfactual explanations; explainable AI; generative adversarial networks; image-to-image translation; interpretable machine learning; machine learning.
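
The core idea summarized in the abstract is to translate an input image into one that the classifier assigns to the opposite class while keeping it visually close to the original; the translated image then serves as the counterfactual explanation. The following minimal sketch (PyTorch, with untrained toy stand-in networks; all names are illustrative and this is not the authors' released code) shows how such a counterfactual would be queried once a classifier and a translation generator have been trained:

```python
import torch
import torch.nn as nn

# Toy stand-in networks so the snippet runs; in the paper these are full CNNs
# trained on chest x-rays (classifier) and an image-to-image generator.
classifier = nn.Sequential(
    nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 2)
)
generator = nn.Sequential(
    nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
    nn.Conv2d(8, 1, 3, padding=1), nn.Tanh()
)

x = torch.rand(1, 1, 256, 256)                       # one input image
original_class = classifier(x).argmax(dim=1).item()
counterfactual = generator(x)                        # modified ("counterfactual") image
flipped_class = classifier(counterfactual).argmax(dim=1).item()

# With a trained generator, the counterfactual should receive the opposite
# prediction while staying visually close to the original image.
print(f"original prediction: {original_class}, counterfactual prediction: {flipped_class}")
```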

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Schematic overview of our approach. A CycleGAN architecture is extended with the classifier to be explained. Both generators of the CycleGAN incorporate the classifier's decisions on the generated data into their loss functions.
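
A minimal sketch of this idea, assuming a standard CycleGAN training loop in PyTorch: the usual adversarial objective of the A-to-B generator is augmented with a classification term that rewards translated images which the (frozen) classifier assigns to the opposite class. The weight `lambda_cls` and all function names are illustrative assumptions rather than the authors' implementation, and the cycle-consistency and identity losses of the full CycleGAN objective are omitted for brevity.

```python
import torch
import torch.nn.functional as F

def generator_loss(G_AB, D_B, classifier, real_A, target_class_B, lambda_cls=1.0):
    """Loss for the A->B generator: adversarial term plus a counterfactual
    classification term (cycle-consistency and identity terms not shown)."""
    fake_B = G_AB(real_A)

    # Least-squares adversarial term: the translated image should fool the
    # domain-B discriminator.
    d_out = D_B(fake_B)
    adv = F.mse_loss(d_out, torch.ones_like(d_out))

    # Counterfactual term: the frozen classifier should assign the translated
    # image to the target (opposite) class.
    logits = classifier(fake_B)
    target = torch.full((fake_B.size(0),), target_class_B,
                        dtype=torch.long, device=fake_B.device)
    cls = F.cross_entropy(logits, target)

    return adv + lambda_cls * cls
```

The analogous term is added to the B-to-A generator with the other class as target, so both translation directions are steered by the classifier being explained.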
Figure 2
Example images from the dataset used. The top row shows images labeled as Normal, while the bottom row shows images labeled as Lung Opacity, indicating lungs affected by pneumonia.
Figure 3
Examples of counterfactual images produced with our proposed approach. In each pair, the left image shows the original image and the right image shows the corresponding counterfactual explanation. The red boxes were added manually to point the reader to the regions that were altered the most. The original images in the top row were classified as Normal, while the original images in the bottom row were classified as Pneumonia. The counterfactual images shown were all classified as the opposite of their respective counterparts.
Figure 4
Computational evaluation of counterfactual image generation performance. The confusion matrices show, for each subset (Normal, Pneumonia, Total) of the RSNA dataset, the number of samples that the classifier predicted to be the respective class before (y-axis) and after (x-axis) the samples had been translated by either the original CycleGAN or by our approach.
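
In concrete terms, each matrix in Figure 4 tallies how the classifier's prediction for an image changes once the image has been translated. The sketch below (NumPy, with made-up example predictions; this is not the authors' evaluation script) illustrates the bookkeeping:

```python
import numpy as np

def translation_confusion(preds_before, preds_after, n_classes=2):
    """Rows: class predicted before translation (y-axis);
    columns: class predicted after translation (x-axis)."""
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for before, after in zip(preds_before, preds_after):
        cm[before, after] += 1
    return cm

# Example with made-up predictions (0 = Normal, 1 = Pneumonia):
preds_before = [0, 0, 1, 1, 1]
preds_after = [1, 1, 0, 0, 1]
print(translation_confusion(preds_before, preds_after))
```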
Figure 5
An example x-ray image classified as Pneumonia, as well as the different XAI visualizations used in our study when the slider is fully on the right side. Best viewed in color.
Figure 6
A simplified schematic of our prediction task.
Figure 7
Results of the explanation satisfaction and trust questionnaires. Error bars represent the 95% Confidence Interval (CI).
Figure 8
Results of the prediction task and the task reflection questions. Error bars represent the 95% Confidence Interval (CI).
Figure 9
Results of the emotion questionnaires. Participants in the counterfactual condition felt significantly less angry and more relaxed compared to the LRP saliency map condition. For LIME, no significant differences were found. Error bars represent the 95% CI.
Figure 10
Significant differences between the counterfactual condition and the saliency map conditions (LRP and LIME) regarding self-efficacy and the participants' general confidence in their predictions of the AI. Error bars represent the 95% CI.
Figure 11
Confidence of the participants in correct and incorrect predictions. The significant difference between the counterfactual condition and the saliency map conditions stems from the confidence in correct predictions, not in the incorrect ones. Error bars represent the 95% CI.

References

    1. Ahsan M. M., Gupta K. D., Islam M., Sen S., Rahman M. L., Hossain M. (2020). Study of different deep learning approach with explainable AI for screening patients with COVID-19 symptoms: using CT scan and chest x-ray image dataset. CoRR. Available online at: https://arxiv.org/abs/2007.12525
    2. Alqaraawi A., Schuessler M., Weiß P., Costanza E., Berthouze N. (2020). Evaluating saliency map explanations for convolutional neural networks: a user study, in IUI '20: 25th International Conference on Intelligent User Interfaces (Cagliari), 275–285. doi: 10.1145/3377325.3377519
    3. Anderson A., Dodge J., Sadarangani A., Juozapaitis Z., Newman E., Irvine J., et al. (2019). Explaining reinforcement learning to mere mortals: an empirical study, in Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19 (Macao), 1328–1334. doi: 10.24963/ijcai.2019/184
    4. Arrieta A. B., Díaz-Rodríguez N., Del Ser J., Bennetot A., Tabik S., Barbado A., et al. (2020). Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inform. Fus. 58, 82–115. doi: 10.1016/j.inffus.2019.12.012
    5. Bach S., Binder A., Montavon G., Klauschen F., Müller K.-R., Samek W. (2015). On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10:e0130140. doi: 10.1371/journal.pone.0130140