Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023;15(3):267-280.
doi: 10.1007/s41060-022-00324-1. Epub 2022 Apr 30.

Explainability of the COVID-19 epidemiological model with nonnegative tensor factorization

Affiliations

Explainability of the COVID-19 epidemiological model with nonnegative tensor factorization

Thirunavukarasu Balasubramaniam et al. Int J Data Sci Anal. 2023.

Abstract

The world is witnessing the devastating effects of the COVID-19 pandemic. Each country responded to contain the spread of the virus in the early stages through diverse response measures. Interpreting these responses and their patterns globally is essential to inform future responses to COVID-19 variants and future pandemics. A stochastic epidemiological model (SEM) is a well-established mathematical tool that helps to analyse the spread of infectious diseases through communities and the effects of various response measures. However, interpreting the outcome of these models is complex and often requires manual effort. In this paper, we propose a novel method to provide the explainability of an epidemiological model. We represent the output of SEM as a tensor model. We then apply nonnegative tensor factorization (NTF) to identify patterns of global response behaviours of countries and cluster the countries based on these patterns. We interpret the patterns and clusters to understand the global response behaviour of countries in the early stages of the pandemic. Our experimental results demonstrate the advantage of clustering using NTF and provide useful insights into the characteristics of country clusters.

Keywords: COVID-19; Clustering; Explainable AI; Nonnegative tensor factorization; Pattern mining; Stochastic epidemiological modelling.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Overall architecture of the SEM with multiple components [34]. A feedback mechanism also incorporates the community influence in the process
Fig. 2
Fig. 2
Example of traditional consolidated matrix representation. Here multiple matrices representing country × parameter (i.e. C×P) over multiple particles are consolidated by element average to derive a single matrix Y
Fig. 3
Fig. 3
Example of tensor representation. Instead of consolidation, multiple matrices encoding country × parameter over multiple particles are represented as a three-dimensional tensor, where particle is the third dimension
Fig. 4
Fig. 4
Overall process of NTF for clustering and interpretation. A 3-D tensor is factorized into 3 factor matrices C,P, and D representing the latent features of countries, parameters, and particles, respectively, after learning dependencies among the modes. Each factor matrix can become input to clustering the countries and interpreting the parameter and particle patterns
Fig. 5
Fig. 5
Determining the six clusters containing similar countries based on the factor matrix C
Fig. 6
Fig. 6
Interpretation of country clusters using the factor matrix P which represents the parameter patterns. The darker the colour in each column, the higher the value of the feature, indicating the characteristics of that column
Fig. 7
Fig. 7
Countries grouped together in all three datasets. Each colour indicates the set of countries that travel together from T1 to T2 to T3. Some clusters are not included in the figure as they do not travel together with any other countries for all three datasets
Fig. 8
Fig. 8
Characteristics of clusters w.r.t. parameters using NTF: a T1(March time-period); b T2 (April time-period); and c T3 (June time-period)
Fig. 9
Fig. 9
Characteristics of clusters w.r.t. parameters using NMF: a T1(March time-period); b T2 (April time-period); and c T3 (June time-period)
Fig. 10
Fig. 10
Characteristics of clusters w.r.t. particles: a T1 (March period); b T2 (April period); and c T3 (June period)
Fig. 11
Fig. 11
Cluster to cluster similarity based on characteristics. Similarity score of 0.5 is set as threshold. For example, the connection between C1 of T1 to C3 of T2 means that C1 of T1 is similar to C3 of T2
Fig. 12
Fig. 12
Cluster to cluster similarity based on characteristics. Similarity score of 0.8 is set as threshold

Similar articles

References

    1. Abbas HSM, Xu X, Sun C. China health technology and stringency containment measures during COVID-19 pandemic: a discussion of first and second wave of covid-19. Heal. Technol. 2021;11(2):405–410. doi: 10.1007/s12553-021-00519-9. - DOI - PMC - PubMed
    1. Alimadadi, A., Aryal, S., Manandhar, I., et al.: Artificial intelligence and machine learning to fight covid-19 (2020) - PMC - PubMed
    1. Allen, L.J.: An introduction to stochastic epidemic models. In: Mathematical Epidemiology. Springer, pp. 81–130 (2008)
    1. Balasubramaniam, T., Nayak, R., Bashar, M.A.: Understanding the spatio-temporal topic dynamics of covid-19 using nonnegative tensor factorization: a case study. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, pp 1218–1225 (2020)
    1. Balasubramaniam T, Nayak R, Yuen C. Efficient nonnegative tensor factorization via saturating coordinate descent. ACM Trans. Knowl. Discov. Data. 2020;14(4):1–28. doi: 10.1145/3385654. - DOI

LinkOut - more resources