This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2023 Mar 11:arXiv:2009.02654v3.

Semi-parametric modeling of SARS-CoV-2 transmission using tests, cases, deaths, and seroprevalence data

Damon Bayer¹, Isaac H Goldstein¹, Jonathan Fintzi², Keith Lumbard³, Emily Ricotta⁴, Sarah Warner⁵, Lindsay M Busch⁶, Jeffrey R Strich⁵, Daniel S Chertow⁵, Daniel M Parker⁷, Bernadette Boden-Albala⁷, Alissa Dratch⁸, Richard Chhuon⁸, Nichole Quick⁹, Matthew Zahn⁸, Volodymyr M Minin¹

Affiliations

¹ Department of Statistics, University of California, Irvine, California, U.S.A.
² Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Rockville, Maryland, U.S.A.
³ Clinical Monitoring Research Program Directorate, Frederick National Laboratory for Cancer Research, Frederick, Maryland, U.S.A.
⁴ Epidemiology Unit, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, U.S.A.
⁵ Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, Maryland, U.S.A.
⁶ Division of Infectious Diseases, Emory University School of Medicine, Atlanta, Georgia, U.S.A.
⁷ Susan and Henry Samueli College of Health Sciences, University of California, Irvine, California, U.S.A.
⁸ Orange County Health Care Agency, Santa Ana, California, U.S.A.
⁹ KCS Health Center, Buena Park, California, U.S.A.

PMID: 32908946
PMCID: PMC7480029

Semi-parametric modeling of SARS-CoV-2 transmission using tests, cases, deaths, and seroprevalence data

Damon Bayer et al. ArXiv. 2023.

[Preprint]. 2023 Mar 11:arXiv:2009.02654v3.

Authors

Affiliations

¹ Department of Statistics, University of California, Irvine, California, U.S.A.
² Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Rockville, Maryland, U.S.A.
³ Clinical Monitoring Research Program Directorate, Frederick National Laboratory for Cancer Research, Frederick, Maryland, U.S.A.
⁴ Epidemiology Unit, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, U.S.A.
⁵ Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, Maryland, U.S.A.
⁶ Division of Infectious Diseases, Emory University School of Medicine, Atlanta, Georgia, U.S.A.
⁷ Susan and Henry Samueli College of Health Sciences, University of California, Irvine, California, U.S.A.
⁸ Orange County Health Care Agency, Santa Ana, California, U.S.A.
⁹ KCS Health Center, Buena Park, California, U.S.A.

PMID: 32908946
PMCID: PMC7480029

Abstract

Mechanistic models fit to streaming surveillance data are critical to understanding the transmission dynamics of an outbreak as it unfolds in real-time. However, transmission model parameter estimation can be imprecise, and sometimes even impossible, because surveillance data are noisy and not informative about all aspects of the mechanistic model. To partially overcome this obstacle, Bayesian models have been proposed to integrate multiple surveillance data streams. We devised a modeling framework for integrating SARS-CoV-2 diagnostics test and mortality time series data, as well as seroprevalence data from cross-sectional studies, and tested the importance of individual data streams for both inference and forecasting. Importantly, our model for incidence data accounts for changes in the total number of tests performed. We model the transmission rate, infection-to-fatality ratio, and a parameter controlling a functional relationship between the true case incidence and the fraction of positive tests as time-varying quantities and estimate changes of these parameters nonparametrically. We compare our base model against modified versions which do not use diagnostics test counts or seroprevalence data to demonstrate the utility of including these often unused data streams. We apply our Bayesian data integration method to COVID-19 surveillance data collected in Orange County, California between March 2020 and February 2021 and find that 32-72% of the Orange County residents experienced SARS-CoV-2 infection by mid-January, 2021. Despite this high number of infections, our results suggest that the abrupt end of the winter surge in January 2021 was due to both behavioral changes and a high level of accumulated natural immunity.

PubMed Disclaimer

Figures

**Figure 1:**
COVID-19 surveillance data from Orange County, CA. The figure shows weekly counts of tests, cases (positive tests), reported deaths due to COVID-19, as well as testing positivity.

**Figure 2:**
Model diagram depicting possible progressions between infection states. The model compartments are as follows: susceptible (S), infected, but not yet infectious (E), infectious (I), recovered (R), and deceased (D).

**Figure 3:**
Posterior distributions of the time-varying basic reproductive number R₀, effective reproductive number R_e, infection-to-fatality ratio (IFR), proportion in the proportional log-odds model of the beta-binomial observational model for cases α, weekly latent:case ratio, and cumulative latent:case ratio. Solid blue lines show point-wise posterior medians, while shaded areas denote 50%, 80%, and 95% Bayesian credible intervals.

**Figure 4:**
Posterior inference for the effective reproduction number from the full model and epidemia fit to the Orange County data.

**Figure 5:**
Latent and observed cumulative death (left) and incidence (center) trajectories and latent prevalence trajectories (right) in Orange County, CA (population 3.2 million). Solid blue lines show point-wise posterior medians, while shaded areas denote 50%, 80%, and 95% Bayesian credible intervals. Black circles denote observed data. Note that the posterior predictive distributions are of latent deaths and cases are not forecasts of their observed counterparts. Forecasts are plotted in Figure 6.

**Figure 6:**
Forecast distributions for observed deaths (left column) and testing positivity (right column). Solid blue lines show point-wise posterior medians, while shaded areas denote 50%, 80%, and 95% Bayesian credible intervals. Observed values are presented as black circles.

**Figure 7:**
Comparison of Continuous Rank Probability Score for models fit to the Orange County data. Lower is better.

See this image and copyright information in PMC

References

1. Anderson S. C., Edwards A. M., Yerlanov M., Mulberry N., Stockdale J. E., Iyaniwura S. A., Falcao R. C., Otterstatter M. C., Irvine M. A., Janjua N. Z., Coombs D., and Colijn C. (2020), “Quantifying the impact of COVID-19 control measures using a Bayesian model of physical distancing,” PLOS Computational Biology, 16, 1–15. - PMC - PubMed
1. Andrieu C., Doucet A., and Holenstein R. (2010), “Particle Markov chain Monte Carlo methods,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 72, 269–342.
1. Bhargava A., Fukushima E. A., Levine M., Zhao W., Tanveer F., Szpunar S. M., and Saravolatz L. (2020), “Predictors for severe COVID-19 infection,” Clinical Infectious Diseases, 71, 1962–1968. - PMC - PubMed
1. Bosse N. I., Gruson H., Cori A., van Leeuwen E., Funk S., and Abbott S. (2022), “Evaluating forecasts with scoringutils in R,” arXiv preprint arXiv:2205.07090.
1. Bretó C., He D., Ionides E., and King A. (2009), “Time series analysis via mechanistic models,” The Annals of Applied Statistics, 3, 319–348.

Publication types

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- PubMed Central
- eScholarship, University of California - Access Free Full Text
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

Semi-parametric modeling of SARS-CoV-2 transmission using tests, cases, deaths, and seroprevalence data

Affiliations

Semi-parametric modeling of SARS-CoV-2 transmission using tests, cases, deaths, and seroprevalence data

Authors

Affiliations

Abstract

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous