J Biomed Inform. 2023 Aug;144:104438. doi: 10.1016/j.jbi.2023.104438. Epub 2023 Jul 4.

WindowSHAP: An efficient framework for explaining time-series classifiers based on Shapley values


Amin Nayebi et al. J Biomed Inform. 2023 Aug.

Abstract

Unpacking and comprehending how black-box machine learning algorithms (such as deep learning models) make decisions has been a persistent challenge for researchers and end-users. Explaining time-series predictive models is useful in high-stakes clinical applications for understanding the behavior of prediction models, e.g., determining how different variables and time points influence the clinical outcome. However, existing approaches for explaining such models are frequently specific to particular architectures or to data whose features lack a time-varying component. In this paper, we introduce WindowSHAP, a model-agnostic framework for explaining time-series classifiers using Shapley values. WindowSHAP is designed both to mitigate the computational complexity of calculating Shapley values for long time-series data and to improve the quality of explanations. The framework is based on partitioning a sequence into time windows. Under this framework, we present three distinct algorithms, Stationary, Sliding, and Dynamic WindowSHAP, each evaluated against the baseline approaches KernelSHAP and TimeSHAP using perturbation and sequence analysis metrics. We applied our framework to clinical time-series data from both a specialized clinical domain (Traumatic Brain Injury, TBI) and a broad clinical domain (critical care medicine). The experimental results demonstrate that, on the two quantitative metrics, our framework is superior at explaining clinical time-series classifiers while also reducing the computational burden. We show that for time-series data with 120 time steps (hours), merging 10 adjacent time points can reduce the CPU time of WindowSHAP by 80% compared to KernelSHAP. We also show that our Dynamic WindowSHAP algorithm focuses more on the most important time steps and provides more understandable explanations. As a result, WindowSHAP not only accelerates the calculation of Shapley values for time-series data but also delivers more understandable, higher-quality explanations.
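The windowing idea described above can be made concrete with a small sketch. The code below is an illustrative reconstruction, not the authors' implementation: it partitions a sequence into stationary (fixed-length) windows, treats each window as a single Shapley player, and computes exact Shapley values by enumerating coalitions of windows, filling absent windows from a background sequence. The function names and the additive toy model `f` are assumptions for illustration.

```python
import numpy as np
from itertools import combinations
from math import factorial

def stationary_windows(T, w):
    """Partition time indices 0..T-1 into contiguous, non-overlapping
    windows of length w (the last window may be shorter)."""
    return [list(range(s, min(s + w, T))) for s in range(0, T, w)]

def window_shapley(f, x, background, windows):
    """Exact Shapley values with one player per window.  The 2^n coalition
    enumeration runs over the n windows rather than the T time steps,
    which is what makes grouping adjacent time points cheaper."""
    n = len(windows)

    def value(subset):
        # Present windows keep their values from x; absent windows are
        # replaced by the background sequence.
        z = background.copy()
        for j in subset:
            z[windows[j]] = x[windows[j]]
        return f(z)

    phi = np.zeros(n)
    for j in range(n):
        others = [k for k in range(n) if k != j]
        for r in range(n):
            for S in combinations(others, r):
                weight = factorial(r) * factorial(n - r - 1) / factorial(n)
                phi[j] += weight * (value(S + (j,)) - value(S))
    return phi

# Toy usage: for an additive model, each window's Shapley value is
# simply the sum of its own time steps.
x = np.arange(1.0, 7.0)               # sequence of 6 time steps
bg = np.zeros_like(x)                 # background sequence
wins = stationary_windows(len(x), 2)  # [[0, 1], [2, 3], [4, 5]]
phi = window_shapley(lambda z: z.sum(), x, bg, wins)  # [3., 7., 11.]
```

By the efficiency property, the window-level Shapley values always sum to `f(x) - f(background)`, regardless of how the time axis is partitioned.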

Keywords: Explainable artificial intelligence; Model interpretation; Shapley value; Time-series data.


Conflict of interest statement

Declaration of Competing Interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Amin Nayebi reports financial support was provided by National Science Foundation. Brandon Foreman reports financial support was provided by National Institutes of Health.

Figures

Figure 1.
Conceptual demonstration of KernelSHAP vs. WindowSHAP for a classification model on an individual instance, predicting whether there is an anomaly in a synthetically generated sequence. The top plot shows the sequence and its Shapley values derived from KernelSHAP, while the bottom depicts the Shapley values from WindowSHAP. Whereas KernelSHAP spreads Shapley values across the entire sequence, our approach concentrates on the most important part of the sequence, avoiding the calculation of a Shapley value for every single time step.
Figure 2.
A visualization of time-axis partitioning in the Stationary WindowSHAP algorithm. The windows are non-overlapping, contiguous, and of equal length, except that the last window may be shorter.
Figure 3.
Demonstration of Sliding WindowSHAP: (a) depicts a single iteration in which the entire sequence is divided into two time intervals, inside and outside of the time window. (b) shows the final windowing result after all iterations have been completed and a Shapley value has been produced for each time window.
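The inside/outside split in panel (a) can be sketched directly: with only two players (the window and its complement), the exact Shapley value of the window is the average of just two marginal contributions. The helper below is an illustrative sketch under that two-player reading; the names and the background-replacement convention are assumptions, not the paper's code.

```python
import numpy as np

def sliding_window_contrib(f, x, background, start, end):
    """Two-player Shapley value of the window [start, end): the sequence
    is split into the window and everything outside it, so the exact
    Shapley value averages only two marginal contributions."""
    def mix(window_from_x):
        # Build a sequence whose window comes from one source and whose
        # remainder comes from the other.
        z = (background if window_from_x else x).copy()
        src = x if window_from_x else background
        z[start:end] = src[start:end]
        return z

    joins_empty = f(mix(True)) - f(background)  # window joins the empty coalition
    joins_rest = f(x) - f(mix(False))           # window joins its complement
    return 0.5 * (joins_empty + joins_rest)

# Toy usage with an additive model: the window [2, 4) contributes the
# sum of its own time steps.
x = np.arange(1.0, 7.0)
bg = np.zeros_like(x)
c = sliding_window_contrib(lambda z: float(z.sum()), x, bg, 2, 4)  # 7.0
```

Sliding the window along the sequence, as in panel (b), then yields one such contribution per window position.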
Figure 4.
Demonstration of the Dynamic WindowSHAP algorithm for a sequence. The algorithm stops at the fourth iteration because all Shapley values for the time windows are less than the threshold δ.
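The stopping rule in the caption (refine until every window's Shapley value falls below δ) suggests a simple adaptive loop. The sketch below illustrates that idea only and is not the paper's exact algorithm: the hypothetical `importance(a, b)` stands in for the absolute Shapley value of window [a, b), and windows above the threshold are bisected so that fine temporal resolution is spent only where it matters.

```python
def dynamic_windows(importance, T, delta, min_len=1):
    """Repeatedly bisect any window whose importance exceeds delta;
    stop once every window is below the threshold (or at min_len)."""
    windows = [(0, T)]
    changed = True
    while changed:
        changed = False
        nxt = []
        for a, b in windows:
            if importance(a, b) > delta and b - a > min_len:
                m = (a + b) // 2
                nxt.extend([(a, m), (m, b)])  # split the important window
                changed = True
            else:
                nxt.append((a, b))            # keep the unimportant window coarse
        windows = nxt
    return windows

# Toy importance: only windows covering time step 3 matter, so the
# partition is refined around t = 3 and left coarse elsewhere.
spike = lambda a, b: 1.0 if a <= 3 < b else 0.0
parts = dynamic_windows(spike, 8, 0.5)  # [(0, 2), (2, 3), (3, 4), (4, 8)]
```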
Figure 5.
Evaluation metrics for all explanation algorithms. Each row of plots shows the results for one of the prediction models. The x-axis in all plots represents the percentile p used in the metric definitions; the y-axis represents the change in the quality metric after perturbing the most crucial time points. Error bars show the mean ± standard error of the mean of the binary loss function.
Figure 6.
Visualization of RAM usage and CPU time of the different algorithms under the WindowSHAP framework. Columns (a), (b), and (c) represent the Stationary, Sliding, and Dynamic WindowSHAP algorithms, respectively.
Figure 7.
Heatmaps depicting the importance of all time steps for the most important features of a single patient record from the MIMIC-III dataset. The top 15 variables on the y-axis are ranked by importance; darker colors indicate larger absolute Shapley values.
Figure 8.
Explanations of the heart rate variable for a patient in the MIMIC-III mortality prediction model. The left and right plots show the visual explanations of WindowSHAP and KernelSHAP, respectively.
