Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 15;41(13):2317-2337.
doi: 10.1002/sim.9357. Epub 2022 Feb 27.

Extending the susceptible-exposed-infected-removed (SEIR) model to handle the false negative rate and symptom-based administration of COVID-19 diagnostic tests: SEIR-fansy

Affiliations

Extending the susceptible-exposed-infected-removed (SEIR) model to handle the false negative rate and symptom-based administration of COVID-19 diagnostic tests: SEIR-fansy

Ritwik Bhaduri et al. Stat Med. .

Abstract

False negative rates of severe acute respiratory coronavirus 2 diagnostic tests, together with selection bias due to prioritized testing can result in inaccurate modeling of COVID-19 transmission dynamics based on reported "case" counts. We propose an extension of the widely used Susceptible-Exposed-Infected-Removed (SEIR) model that accounts for misclassification error and selection bias, and derive an analytic expression for the basic reproduction number R0 as a function of false negative rates of the diagnostic tests and selection probabilities for getting tested. Analyzing data from the first two waves of the pandemic in India, we show that correcting for misclassification and selection leads to more accurate prediction in a test sample. We provide estimates of undetected infections and deaths between April 1, 2020 and August 31, 2021. At the end of the first wave in India, the estimated under-reporting factor for cases was at 11.1 (95% CI: 10.7,11.5) and for deaths at 3.58 (95% CI: 3.5,3.66) as of February 1, 2021, while they change to 19.2 (95% CI: 17.9, 19.9) and 4.55 (95% CI: 4.32, 4.68) as of July 1, 2021. Equivalently, 9.0% (95% CI: 8.7%, 9.3%) and 5.2% (95% CI: 5.0%, 5.6%) of total estimated infections were reported on these two dates, while 27.9% (95% CI: 27.3%, 28.6%) and 22% (95% CI: 21.4%, 23.1%) of estimated total deaths were reported. Extensive simulation studies demonstrate the effect of misclassification and selection on estimation of R0 and prediction of future infections. A R-package SEIRfansy is developed for broader dissemination.

Keywords: R package SEIRfansy; compartmental models; infection fatality rate; reproduction number; selection bias; sensitivity; undetected infections.

PubMed Disclaimer

Conflict of interest statement

The author declares that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

Figures

FIGURE 1
FIGURE 1
Compartmental model incorporating false negative test results. There are total 10 compartments in this model. The S and E compartments corresponds to Susceptible (who have not been infected till now) and Exposed (who are infected with the virus but are still not infectious). There are three infectious compartments, namely, U (Untested infectious), P (Tested Positive), and F (Tested False Negative). Finally the four compartments RU, RR, DU, and DR denote the Recovered Unreported, Recovered Reported, Deceased Unreported, and Deceased Reported, respectively. β corresponds to the rate of transmission by false negative (F) individuals, while for U and P, it is multiplied by scaling factors αu and αp. Other parameters include De which corresponds to the incubation period, f which is the false negative rate (= 1‐sensitivity) and r which denotes the rate of ascertainment. Dr, β1.Dr, and Dr/β2 correspond to recovery period for P, U, and F, respectively, while μc, δ1, μc, and μc/δ2 denote the death rates for P, U, and F, respectively
FIGURE 2
FIGURE 2
Flowchart showing the estimation process of misclassification model using a Metropolis‐Hastings Markov chain Monte Carlo algorithm
FIGURE 3
FIGURE 3
Estimates of R0 in India across phases for (A) wave 1 and (B) wave 2. The mean and 95% credible intervals (in parentheses) are provided under the multinomial‐2‐parameter model. The reproduction numbers are estimated for the training periods corresponding to each of the two waves: April 1, 2020 to January 31, 2021 for the first wave and February 1, 2021 to June 30, 2021 for the second wave
FIGURE 4
FIGURE 4
COVID‐19 cases in India for wave 1 with estimated number of reported, false negative, and untested cases. We have taken April 1, 2020 to January 31, 2021 as the first wave training period and in the first wave, we have not taken any testing period. (A) Total active COVID cases in India from April 1, 2020 to January 31, 2021 including reported active cases, false negatives active, and untested active cases. (B) Proportion of reported active cases among Active COVID cases in India (C) Total cumulative cases in India from April 1, 2020 to January 31, 2021 including reported cumulative cases, cumulative false negatives and untested cumulative cases. (D) Proportion of reported cases among total cumulative COVID cases in India . (E) Total deaths in India from April 1, 2020 to January 31, 2021 including reported and unreported deaths. (F) Proportion of reported deaths among total deaths in India. The dotted curves in subfigures A, C, and E represent the observed data
FIGURE 5
FIGURE 5
COVID‐19 cases in India for wave 2 with estimated number of reported, false negative, and untested cases. In the wave 2, we have taken February 1, 2021 to June 30, 2021 as the training period, while, July 1, 2021 to August 31, 2021 was taken to be the test period. (A) Total active COVID cases in India from February 1, 2021 to August 31, 2021 including reported active cases, false negatives active, and untested active cases. (B) Proportion of reported active cases among active COVID cases in India (C) Total cumulative cases in India from February 1, 2021 to August 31, 2021 including reported cumulative cases, cumulative false negatives, and untested cumulative cases. (D) Proportion of reported cases among total cumulative COVID cases in India . (E) Total deaths in India from February 1, 2021 to August 31, 2021 including reported and unreported deaths. (F) Proportion of reported deaths among total deaths in India. The dotted curves in subfigures A, C, and E represent the observed data
FIGURE 6
FIGURE 6
Effect of misclassification on estimates for India (A) Estimates of Total Active Cases for wave 1 with f = 0, 0.15 and 0.3 (B) Estimates of Reported Active Cases for wave 1 with f = 0, 0.15 and 0.3 with the observed data (C) Estimates of Total Active Cases for wave 2 with f = 0, 0.15 and 0.3 (D) Estimates of Reported Active Cases for wave 2 with f = 0, 0.15, and 0.3 with the observed data. April 1, 2020 to January 31, 2021 was taken as the first wave training period and there was no test period for the first wave. In the wave 2, February 1, 2021 to June 30, 2021 was taken as the training period, while July 1, 2021 to August 31, 2021 was taken to be the test period for the second wave
FIGURE 7
FIGURE 7
(A) Effect of misclassification on Total Active Cases; (B) Effect of selection on Total Active Cases; and (C) Effect of selection on Reported Active Cases: simulations were carried out in order to assess the effect of misclassification, selection on Reported Active and Total Active Cases. For misclassification, the data was generated for a period of 101 days which was divided into five time periods: days 110, 1131, 3250, 5164, 65101. The values of βt across the five periods were set at 0.8, 0.65, 0.4, 0.3, 0.3 and the corresponding values of rt were set at 0.1, 0.2, 0.15, 0.15, 0.2. For selection model, additional parameters were set as p0=(106,105,1106105) and p1=(0.02,0.18,0.8). As before, the data are generated for a period of 101 days with five periods 110, 1131, 3250, 5164 and 65101 while the values of βt used to generate the data were 0.6,0.4,0.3,0.25, and 0.2 for the five periods, respectively. Predictions are based on the multinomial‐2‐parameter model, where the probability of being tested is assumed to be independent of symptoms with f=0.3 (the simulation truth)

Update of

Similar articles

Cited by

References

    1. Johns Hopkins University . COVID‐19 dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU); 2020. https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6. Accessed April 7, 2020.
    1. Sharfstein JM, Becker SJ, Mello MM. Diagnostic testing for the novel coronavirus. Jama. 2020;323(15):1437‐1438. doi:10.1001/jama.2020.3864 - DOI - PubMed
    1. Ai T, Yang Z, Hou H, et al. Correlation of chest CT and RT‐PCR testing in coronavirus disease 2019 (COVID‐19) in China: a report of 1014 cases. Radiology. 2020;200642. doi:10.1148/radiol.2020200642 - DOI - PMC - PubMed
    1. West CP, Montori VM, Sampathkumar P. COVID‐19 testing: the threat of false‐negative results. Mayo Clinic Proceedings. Vol 95. 2020(6):1127‐1129. - PMC - PubMed
    1. Yang Y, Yang M, Yuan J, et al. Laboratory diagnosis and monitoring the viral shedding of SARS‐CoV‐2 infection. Innovation. 2020;1(3):100061. doi:10.1016/j.xinn.2020.100061 - DOI - PMC - PubMed

Publication types