. 2021 Sep 7;17(9):e1009374.

doi: 10.1371/journal.pcbi.1009374. eCollection 2021 Sep.

Using test positivity and reported case rates to estimate state-level COVID-19 prevalence and seroprevalence in the United States

Weihsueh A Chiu¹, Martial L Ndeffo-Mbah^{1

2}

Affiliations

¹ Department of Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, Texas, United States of America.
² Department of Epidemiology and Biostatistics, School of Public Health, Texas A&M University, College Station, Texas, United States of America.

PMID: 34491990
PMCID: PMC8448371
DOI: 10.1371/journal.pcbi.1009374

Using test positivity and reported case rates to estimate state-level COVID-19 prevalence and seroprevalence in the United States

Weihsueh A Chiu et al. PLoS Comput Biol. 2021.

. 2021 Sep 7;17(9):e1009374.

doi: 10.1371/journal.pcbi.1009374. eCollection 2021 Sep.

Authors

Weihsueh A Chiu¹, Martial L Ndeffo-Mbah^{1

2}

Affiliations

¹ Department of Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, Texas, United States of America.
² Department of Epidemiology and Biostatistics, School of Public Health, Texas A&M University, College Station, Texas, United States of America.

PMID: 34491990
PMCID: PMC8448371
DOI: 10.1371/journal.pcbi.1009374

Abstract

Accurate estimates of infection prevalence and seroprevalence are essential for evaluating and informing public health responses and vaccination coverage needed to address the ongoing spread of COVID-19 in each United States (U.S.) state. However, reliable, timely data based on representative population sampling are unavailable, and reported case and test positivity rates are highly biased. A simple data-driven Bayesian semi-empirical modeling framework was developed and used to evaluate state-level prevalence and seroprevalence of COVID-19 using daily reported cases and test positivity ratios. The model was calibrated to and validated using published state-wide seroprevalence data, and further compared against two independent data-driven mathematical models. The prevalence of undiagnosed COVID-19 infections is found to be well-approximated by a geometrically weighted average of the positivity rate and the reported case rate. Our model accurately fits state-level seroprevalence data from across the U.S. Prevalence estimates of our semi-empirical model compare favorably to those from two data-driven epidemiological models. As of December 31, 2020, we estimate nation-wide a prevalence of 1.4% [Credible Interval (CrI): 1.0%-1.9%] and a seroprevalence of 13.2% [CrI: 12.3%-14.2%], with state-level prevalence ranging from 0.2% [CrI: 0.1%-0.3%] in Hawaii to 2.8% [CrI: 1.8%-4.1%] in Tennessee, and seroprevalence from 1.5% [CrI: 1.2%-2.0%] in Vermont to 23% [CrI: 20%-28%] in New York. Cumulatively, reported cases correspond to only one third of actual infections. The use of this simple and easy-to-communicate approach to estimating COVID-19 prevalence and seroprevalence will improve the ability to make public health decisions that effectively respond to the ongoing COVID-19 pandemic.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Conceptual model for relationship between test positivity, prevalence of infection, and testing rate.**
**(A)** Compartmental representation of how the relationships between new infections, undiagnosed and diagnosed prevalence (I_U and I_D) and seroprevalence (SP_U and SP_D) are modeled for each state, given a bias with power n. All observational inputs are the past τ-day averages of number of positive tests N_+,τ(t) and number of tests performed N_test,τ(t), the corresponding test positivity rate P_+,τ(t) and reported case rate C_+,τ(t), and the state population size N. For diagnosed prevalence and seroprevalence, the observational input is the daily reported cases N_+,τ, and the model parameters are the recovery time after diagnosis T_rec and the time from infection to seropositivity T_inf. For undiagnosed prevalence and seroprevalence, our model assumes the test positivity rate is correlated to delayed undiagnosed disease prevalence with a bias parameter b(t) modeled as a negative power function of the testing rate b(t) = [N_test,τ(t)/N]^–n (Eq 2). The additional parameters consist of the power parameter n and the initial (missed) seroprevalence SP_o. The effective rate parameter 1/T_eff is time-dependent, and accounts for both T_inf and ongoing diagnoses so as to not “double count.” Prevalence and seroprevalence are evaluated with a lag time t_lag, assumed equal to half the averaging time τ/2. In (B), the diagonal lines represent different values of the bias parameter. In **(C),** the relationship between testing rate and bias parameter represented by Eq (4) is illustrated. Here the shaded region represents different powers n ranging from 0.1 (lower bound bias) to 0.9 (upper bound bias), the solid line represents n = ½.

Fig 2. Calibration results of our semi-empirical model for COVID-19 antibody seroprevalence (posterior median and 95% credible intervals for primary random effects model; posterior median only for geometric mean n = ½ model) for each state with state-wide seroprevalence data (reported point estimates and 95% confidence intervals shown).
Open circles represent validation data not used for model calibration; remaining symbols represent calibration data.

Fig 3. Validation of COVID-19 infection prevalence estimates (posterior median for both primary random effects model and simpler geometric mean n = ½ model) for each state in comparison to posterior median estimates and 95% credible intervals from two data-driven epidemiologic models: an extended-SEIR model calibrated to reported cases and confirmed deaths through July 22, 2020 [23] and a semi-mechanistic model calibrated to confirmed deaths through July 20, 2020 by Imperial College [37]).

**Fig 4**
Map of estimated undiagnosed (A) and total (B) prevalence and transmission trends and overall seroprevalence (C) as of December 31, 2020, based on data through January 7, 2021. Values based on primary random effects model. Results for the simpler geometric mean model are provided in **Fig F** in S1 Text. The maps were generated using the R package usmap https://cran.r-project.org/web/packages/usmap/index.html (GPL-3), which uses shape files from the U.S. Census Bureau (the link provided in documentation is here: https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html).

See this image and copyright information in PMC

Update of

Using Test Positivity and Reported Case Rates to Estimate State-Level COVID-19 Prevalence and Seroprevalence in the United States.
Chiu WA, Ndeffo-Mbah ML. Chiu WA, et al. medRxiv [Preprint]. 2020 Dec 26:2020.10.07.20208504. doi: 10.1101/2020.10.07.20208504. medRxiv. 2020. Update in: PLoS Comput Biol. 2021 Sep 7;17(9):e1009374. doi: 10.1371/journal.pcbi.1009374. PMID: 33398306 Free PMC article. Updated. Preprint.

References

1. National Academies of Sciences and Medicine E. Evaluating Data Types: A Guide for Decision Makers using Data to Understand the Extent and Spread of COVID-19 [Internet]. Washington, DC: The National Academies Press; 2020. Available from: https://www.nap.edu/catalog/25826/evaluating-data-types-a-guide-for-deci...
1. Havers FP, Reed C, Lim T, Montgomery JM, Klena JD, Hall AJ, et al. Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States, March 23-May 12, 2020. JAMA Intern Med [Internet]. 2020. [cited 2020 Aug 28]; Available from: doi: 10.1001/jamainternmed.2020.4130 - DOI - PMC - PubMed
1. Menachemi N, Yiannoutsos CT, Dixon BE, Duszynski TJ, Fadel WF, Wools-Kaloustian KK, et al. Population Point Prevalence of SARS-CoV-2 Infection Based on a Statewide Random Sample—Indiana, April 25–29, 2020. MMWR Morb Mortal Wkly Rep [Internet]. 2020. Jul 24 [cited 2020 Aug 28];69(29):960–4. Available from: http://www.cdc.gov/mmwr/volumes/69/wr/mm6929e1.htm?s_cid=mm6929e1_w doi: 10.15585/mmwr.mm6929e1 - DOI - PMC - PubMed
1. Rosenberg ES, Tesoriero JM, Rosenthal EM, Chung R, Barranco MA, Styer LM, et al. Cumulative incidence and diagnosis of SARS-CoV-2 infection in New York. Ann Epidemiol. 2020Aug1;48:23–29.e4. doi: 10.1016/j.annepidem.2020.06.004 - DOI - PMC - PubMed
1. Anand S, Montez-Rath M, Han J, Bozeman J, Kerschmann R, Beyer P, et al. Prevalence of SARS-CoV-2 antibodies in a large nationwide sample of patients on dialysis in the USA: a cross-sectional study. Lancet [Internet]. 2020. Oct 24 [cited 2020 Dec 14];396(10259):1335–44. Available from: doi: 10.1016/S0140-6736(20)32009-2 - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

P30 ES029067/ES/NIEHS NIH HHS/United States

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Consumer Health Information
- MedlinePlus Health Information
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Using test positivity and reported case rates to estimate state-level COVID-19 prevalence and seroprevalence in the United States

Affiliations

Using test positivity and reported case rates to estimate state-level COVID-19 prevalence and seroprevalence in the United States

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Update of

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Research Materials