Review

Radiol Artif Intell. 2020 May 27;2(3):e190043. doi: 10.1148/ryai.2020190043.

On the Interpretability of Artificial Intelligence in Radiology: Challenges and Opportunities

Mauricio Reyes et al.

Abstract

As artificial intelligence (AI) systems begin to make their way into clinical radiology practice, it is crucial to ensure that they function correctly and that they gain the trust of experts. Toward this goal, approaches that make AI "interpretable" have gained attention as a way to enhance understanding of a machine learning algorithm despite its complexity. This article provides insights into the current state of the art of interpretability methods for radiology AI. This review discusses radiologists' opinions on the topic and suggests trends and challenges that must be addressed to effectively integrate interpretability methods into clinical practice. Supplemental material is available for this article. © RSNA, 2020. See also the commentary by Gastounioti and Kontos in this issue.


Conflict of interest statement

Disclosures of Conflicts of Interest: M.R. disclosed no relevant relationships. R.M. disclosed no relevant relationships. S.P. Activities related to the present article: disclosed no relevant relationships. Activities not related to the present article: disclosed no relevant relationships. Other relationships: pending patent with Universidade do Minho. C.A.S. disclosed no relevant relationships. F.M.D. disclosed no relevant relationships. H.v.T.K. disclosed no relevant relationships. R.M.S. Activities related to the present article: disclosed no relevant relationships. Activities not related to the present article: institution receives grants from CRADA with Ping An and GPU card donation from Nvidia. Other relationships: license royalties to author and institution from Ping An, ScanMed, Philips, Imbio, Zebra Medical and iCAD; patent royalties to author and institution from Ping An, ScanMed, Philips, and iCAD. R.W. disclosed no relevant relationships.

Figures

Figure 1: Examples of interpretability methods used on medical images. (a) Guided backpropagation and gradient-weighted class activation mapping (Grad-CAM) applied to MRI to interpret which areas of a brain image were used by a deep learning model classifying the input image as a high-grade glioma. Pixel importance is color-coded (red = high importance, blue = low importance). (Adapted and reprinted, with permission, from reference 7.) (b) Regression concept vectors used to assess the relevance of selected features describing curvature, tortuosity, and dilatation of retinal arteries and veins in retinal images analyzed by a deep convolutional neural network. Examples of a correctly and a wrongly classified image are shown, supporting the interpretation that the network is more sensitive to the curvature and dilatation concepts when classifying normal images, and more sensitive to tortuosity for disease images. (Adapted and reprinted, with permission, from reference 6.) Avg = average, cti = cumulative tortuosity index, Pn, Ppre, Pplus = network probabilities for the normal, pre, and pre-plus classes.
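To make the Grad-CAM idea in Figure 1a concrete, the following is a minimal sketch in PyTorch. The torchvision ResNet-18 without pretrained weights, the choice of layer4 as the target convolutional block, and the random input tensor are illustrative placeholders, not the model or data used in the referenced study.

```python
import torch
import torch.nn.functional as F
from torchvision import models

model = models.resnet18(weights=None)        # stand-in for a trained classifier
model.eval()

feats = {}
# Forward hook records the feature maps of the last convolutional block.
model.layer4.register_forward_hook(lambda m, i, o: feats.update(value=o))

x = torch.randn(1, 3, 224, 224)              # placeholder input image
scores = model(x)                            # forward pass: class scores
k = scores.argmax(dim=1).item()              # class being explained

# Backward pass for class k only: gradient of the class score wrt the feature maps.
grads = torch.autograd.grad(scores[0, k], feats["value"])[0]

# Grad-CAM: weight each feature map by its global-average-pooled gradient,
# sum over channels, and keep only positive evidence (ReLU).
weights = grads.mean(dim=(2, 3), keepdim=True)
cam = F.relu((weights * feats["value"]).sum(dim=1, keepdim=True))
cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)   # heat map in [0, 1]
```

Guided backpropagation, shown alongside Grad-CAM in Figure 1a, additionally suppresses negative gradients at every rectified linear unit during the backward pass; that step is omitted here for brevity.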
Figure 2: Gradient-based saliency maps for image classification. (a) Basic concept of neuron activation: a neuron is activated via a weighted combination of its inputs followed by an activation function, g. (b) Gradient-based methods rely on a forward and a backward pass. Given an input image x, a class k is maximally activated by a forward pass through all layers of the network. All positive forward activations are recorded for later use during the backward pass. To visualize the contribution of image pixels to class k, all output activations are set to zero except that of the studied class k, and then (c) backpropagation uses the chain rule to compute gradients from the output back to the input of the network. ReLU = rectified linear unit, tanh = hyperbolic tangent.
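As a minimal illustration of the forward/backward scheme in Figure 2, the sketch below computes a plain input-gradient saliency map. Again, the ResNet-18 model and the random input are placeholders, and the recording of positive activations used by guided backpropagation is not included.

```python
import torch
from torchvision import models

model = models.resnet18(weights=None)                 # stand-in for a trained network
model.eval()

x = torch.randn(1, 3, 224, 224, requires_grad=True)   # placeholder input image x
scores = model(x)                                     # forward pass: class scores
k = scores.argmax(dim=1).item()                       # class k being explained

# Backward pass: differentiate only the score of class k (all other classes are
# implicitly zeroed) and apply the chain rule back to the input pixels.
scores[0, k].backward()

# Saliency map: per-pixel gradient magnitude, taking the maximum over channels.
saliency = x.grad.detach().abs().max(dim=1)[0]        # shape (1, 224, 224)
```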
Figure 3: A, The local interpretable model-agnostic explanations (LIME) method approximates a complex model f (eg, a neural network) with a simplified model g (eg, a linear model) around the input case I being interpreted. B, Perturbed instances (Ip)1,...,n are produced, and, C, predictions f(Ip)1,...,n = p1,...,n are obtained. D, The similarity πI(Ip)1,...,n between the input image I and each perturbed instance (Ip)1,...,n is measured, and these values are used as weights to fit a simpler (eg, linear) model g in a weighted fashion. The size of the red crosses and blue circles illustrates the weights. E, An explanation, ϵ(I), is generated by minimizing the disagreement between f and g (ie, how well g approximates f) while keeping the complexity of model g, as measured by Ω(g), low. Note: perturbations can be of any type; in this example, image regions are blacked out. The similarity metric πI and the model g can be selected by the user.
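The following sketch outlines the LIME procedure of Figure 3 for a single image, using a regular patch grid as the interpretable components and black-out perturbations. The predict_prob callable, the grid size, the exponential kernel width, and the ridge penalty are illustrative assumptions rather than the settings of the original method; the image is assumed to be a NumPy array (height × width, optionally × channels).

```python
import numpy as np

rng = np.random.default_rng(0)

def lime_explain(image, predict_prob, grid=8, n_samples=200, kernel_width=0.25, alpha=1.0):
    """LIME-style local explanation: predict_prob is a black-box function
    returning the probability of the class being explained (placeholder)."""
    h, w = image.shape[:2]
    ph, pw = h // grid, w // grid
    d = grid * grid                              # number of interpretable patches

    # B: random binary masks indicating which patches are kept in each perturbation.
    z = rng.integers(0, 2, size=(n_samples, d))
    z[0] = 1                                     # include the unperturbed image

    preds = np.empty(n_samples)
    for i, zi in enumerate(z):
        perturbed = image.copy()
        for j, keep in enumerate(zi):
            if not keep:                         # black out dropped patches
                r, c = divmod(j, grid)
                perturbed[r * ph:(r + 1) * ph, c * pw:(c + 1) * pw] = 0
        preds[i] = predict_prob(perturbed)       # C: black-box predictions f(Ip)

    # D: similarity of each perturbation to the original image (based on the
    # fraction of patches removed), turned into kernel weights pi_I.
    dist = 1.0 - z.mean(axis=1)
    weights = np.exp(-(dist ** 2) / kernel_width ** 2)

    # E: weighted ridge regression g fitted to mimic f locally; its coefficients
    # are the per-patch explanation.
    W = np.diag(weights)
    coef = np.linalg.solve(z.T @ W @ z + alpha * np.eye(d), z.T @ W @ preds)
    return coef.reshape(grid, grid)              # importance of each patch
```

A positive coefficient indicates that keeping the corresponding patch increases the predicted probability, mirroring the weighted linear explanation g in Figure 3.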
Figure 4: A, Testing with concept activation vectors (TCAV) requires a set of samples characterizing the concept (eg, "honeycomb pattern") and a set of "nonconcept" examples that are not related to the concept being studied; B, a testing dataset of the class k of interest (eg, idiopathic pulmonary fibrosis); and, C, a complex model f (eg, a neural network) that one wishes to interpret and that has been trained to classify these classes. D, A linear model is built from the concept and nonconcept samples by using model f to generate classification labels for them. E, From the resulting linear model separating concept from nonconcept examples (dotted line in D), its main perpendicular direction vcl (red arrow in D) can be obtained and used to assess the sensitivity of model f to concept C at layer l by quantifying changes to the activations of model f along the vcl direction.
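A compact TCAV-style sketch corresponding to Figure 4 is given below. The ResNet-18 model, the probed layer, the class index, and the random concept, nonconcept, and test batches are placeholders, and a logistic-regression separator stands in for whichever linear classifier is used in practice.

```python
import numpy as np
import torch
from torchvision import models
from sklearn.linear_model import LogisticRegression

model = models.resnet18(weights=None)            # stand-in for a trained classifier f
model.eval()
layer = model.layer3                             # layer l whose activations are probed

acts = {}
layer.register_forward_hook(lambda m, i, o: acts.update(value=o))

def layer_activations(images):
    """Flattened activations of `layer` for a batch of images."""
    with torch.no_grad():
        model(images)
    return acts["value"].flatten(1).numpy()

# A / D: fit a linear classifier separating concept from nonconcept activations;
# its normalized weight vector is the concept activation vector vcl.
concept_imgs = torch.randn(20, 3, 224, 224)      # placeholder "concept" crops
nonconcept_imgs = torch.randn(20, 3, 224, 224)   # placeholder unrelated examples
X = np.concatenate([layer_activations(concept_imgs), layer_activations(nonconcept_imgs)])
y = np.array([1] * 20 + [0] * 20)
cav = LogisticRegression(max_iter=1000).fit(X, y).coef_[0]
cav = torch.tensor(cav / np.linalg.norm(cav), dtype=torch.float32)

# B / E: for each test image of class k, compute the directional derivative of the
# class-k score along the CAV; the TCAV score is the fraction that is positive.
test_imgs = torch.randn(10, 3, 224, 224)         # placeholder images of class k
k = 0                                            # placeholder class index
positive = 0
for img in test_imgs:
    out = model(img.unsqueeze(0))
    a = acts["value"]                            # layer-l activations (in the graph)
    grad = torch.autograd.grad(out[0, k], a)[0].flatten()
    positive += int(torch.dot(grad, cav) > 0)
tcav_score = positive / len(test_imgs)
```

The resulting score estimates how often moving the layer-l activations along the concept direction vcl increases the score of class k, which is the sensitivity notion described in the caption.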
Figure 5: Different modalities for model interpretation, illustrated with an artificial intelligence (AI) system that predicts a condition from a patient's chest radiograph. From top to bottom, interpretability information is added to the decision: (1) no interpretability information, (2) output probabilities added, (3) visual saliency information added, describing the areas of the image driving the prediction, (4) matched real cases used during training of the AI solution that influenced the prediction (ie, influence functions) added, and (5) a computer-generated semantic explanation added.

References

    1. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med 2019;25(1):44–56.
    2. Szegedy C, Zaremba W, Sutskever I, et al. Intriguing properties of neural networks. ArXiv 1312.6199 [preprint] http://adsabs.harvard.edu/abs/2013arXiv1312.6199S. Posted December 2013. Accessed January 6, 2020.
    3. Lipton ZC. The mythos of model interpretability. ArXiv 1606.03490v3 [preprint] http://arxiv.org/abs/1606.03490. Posted June 10, 2016. Accessed January 6, 2020.
    4. Van Lent M, Fisher W, Mancuso M. An explainable artificial intelligence system for small-unit tactical behavior. In: Proceedings of the Nineteenth National Conference on Artificial Intelligence. San Jose, Calif, July 25–29, 2004. Palo Alto, Calif: Association for the Advancement of Artificial Intelligence, 2004; 900–907.
    5. Gunning D. Explainable artificial intelligence (XAI). Defense Advanced Research Projects Agency (DARPA) Web site. https://www.darpa.mil/attachments/XAIProgramUpdate.pdf. Updated November 2017. Accessed January 6, 2020.