Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 10;380(2214):20210117.
doi: 10.1098/rsta.2021.0117. Epub 2021 Nov 22.

On some fundamental challenges in monitoring epidemics

Affiliations

On some fundamental challenges in monitoring epidemics

Vaiva Vasiliauskaite et al. Philos Trans A Math Phys Eng Sci. .

Abstract

Epidemic models often reflect characteristic features of infectious spreading processes by coupled nonlinear differential equations considering different states of health (such as susceptible, infectious or recovered). This compartmental modelling approach, however, delivers an incomplete picture of the dynamics of epidemics, as it neglects stochastic and network effects, and the role of the measurement process, on which the estimation of epidemiological parameters and incidence values relies. In order to study the related issues, we combine established epidemiological spreading models with a measurement model of the testing process, considering the problems of false positives and false negatives as well as biased sampling. Studying a model-generated ground truth in conjunction with simulated observation processes (virtual measurements) allows one to gain insights into the fundamental limitations of purely data-driven methods when assessing the epidemic situation. We conclude that epidemic monitoring, simulation, and forecasting are wicked problems, as applying a conventional data-driven approach to a complex system with nonlinear dynamics, network effects and uncertainty can be misleading. Nevertheless, some of the errors can be corrected for, using scientific knowledge of the spreading dynamics and the measurement process. We conclude that such corrections should generally be part of epidemic monitoring, modelling and forecasting efforts. This article is part of the theme issue 'Data science approaches to infectious disease surveillance'.

Keywords: complex systems; computer simulation; data science; epidemic modelling; network theory; statistics.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Estimation of the true proportion I/N of Infectious (green solid line) based on the fraction (3.3) of positive tests (blue dotted line) and the correction formula (3.11) (red dashed-dotted line). The corrected estimate tends to be very close to the true value when the parameters α, β, p and q of the test method are known exactly. The monitoring parameters are: α=0.05, β=0.15, p=0.8, q=0.06. The curves displayed here are for SEIRD dynamics with N=100000, a=1/5, b=3/14, c=1/28, d=1/28. (Online version in colour.)
Figure 2.
Figure 2.
Estimation of the true proportion I/N of Infectious (green solid line) based on the fraction (3.3) of positive tests (blue dotted line) and the correction formula (3.11) (other lines according to the colour scales on the right). In these figures, we assume that only three of the four parameters α=0.05, β=0.15, p=0.8 and q=0.06 are known exactly, while the estimate of the fourth one (represented with a hat on top) is uncertain and, therefore, varied. Specifically, in (3.11) we have replaced α by α^ in the top left figure, β by β^ in the top right figure, p by p^ in the bottom left figure and q by q^ in the bottom right figure. All figures show results for a SEIRD dynamics with N=100000, a=1/5, b=3/14, c=1/28, d=1/28. (Online version in colour.)
Figure 3.
Figure 3.
Ensemble of stochastic trajectories (SP-KMC sampling) on an ensemble of 103 Barabási-Albert networks with N=100000, μ=3 and γ=1. (a) Accurate epidemic reconstruction assuming that the parameters α=0.05, β=0.15, p=0.8, q=0.06 of the test method are exactly known. (b) Inaccurate reconstruction for somewhat incorrect estimates p^=0.75 and q^=0.1, while the other parameters are assumed to be the same. The assumed parameters of the SEIRD dynamics are: a=1/5, b=3/14, c=1/28, d=1/28. Q0.01, Q0.25, Q0.75 and Q0.99 represent 1, 25, 75 and 99 per cent quantiles. The median values and error quantile bands are based on 103 simulations. (Online version in colour.)
Figure 4.
Figure 4.
I^(t) computed from (4.3), assuming uncertainty in monitoring parameters: α^Beta(10,90), β^Beta(5,95), p^Beta(4,16), q^Beta(5,95). We used the distributions whose mean is ‘guessed’ correctly, i.e.α^=α. The true number I(t) of Infectious was obtained by simulating SEIRD dynamics with parameters a=1/5, b=3/14, c=1/28, d=1/28 on an ER network with N=105 and average degree 1.5. (Online version in colour.)
Figure 5.
Figure 5.
Posterior distribution P(I|np) when a network topology prior is used (in red) and when it is not (in blue). Monitoring parameters were also considered as priors, with p^Beta(4,16), q^Beta(5,95), α^Beta(5,95), β^Beta(10,90). For the ground truth parameters, we selected α=0.05, β=0.1, p=0.2, q=0.05. The ground truth number of Infectious I=43448 was obtained from an epidemic trajectory generated on a network sampled from an ensemble of Barabási–Albert (BA) networks with μ=3, γ=1, giving a scale-free network with an average degree k=3. In scenario ‘BA prior with correct μ=3, γ=1’, we used a prior which assumes the correct network ensemble—Barabási-Albert networks with N=100000, μ=3 and γ=1. In ‘BA prior with μ=6, γ=1’, we used a network ensemble prior of BA networks with μ=6 and γ=1, which over-estimates the average node degree k. In ‘BA prior with μ=3, γ=2’, we used a prior for BA networks with μ=3 and γ=2, and in ‘ER prior with π=3×105’ we used a prior for Erdös–Rényi networks with π=3×105, giving networks with k=3, but the degree distribution P(k) is Poisson rather than a power law. In all figures, the green line shows the ‘ground truth’, against which the biased testing is performed. We observe that knowledge about the degree distribution of contacts helps to estimate the true ground truth. To estimate network priors within t[60,65], we have used 1000 networks from the ensemble. In all cases, M=1000 Monte Carlo samples were used to estimate (4.3). (Online version in colour.)

Similar articles

Cited by

References

    1. van Riel D, de Wit E. 2020. Next-generation vaccine platforms for COVID-19. Nat. Mater. 19, 810-812. (10.1038/s41563-020-0746-0) - DOI - PubMed
    1. Arshadi AK et al. 2020. Artificial intelligence for COVID-19 drug discovery and vaccine development. Front. Artif. Intell. 3, 65. (10.3389/frai.2020.00065) - DOI - PMC - PubMed
    1. Vaishya R, Javaid M, Khan IH, Haleem A. 2020. Artificial intelligence (AI) applications for COVID-19 pandemic. Diabetes Metab. Syndr.: Clin. Res. Rev. 14, 337-339. (10.1016/j.dsx.2020.04.012) - DOI - PMC - PubMed
    1. Bharadwaj KK et al. 2021. Computational intelligence in vaccine design against COVID-19. In Computational intelligence methods in COVID-19: surveillance, prevention, prediction and diagnosis, pp. 311–329. Berlin, Germany: Springer. (10.1007/978-981-15-8534-0_16) - DOI
    1. Eltorai AE, Fox H, McGurrin E, Guang S. 2016. Microchips in medicine: current and future applications. BioMed Res. Int. 2016, 1743472. (10.1155/2016/1743472) - DOI - PMC - PubMed