. 2021 Jul 16;373(6552):eabh0635.

doi: 10.1126/science.abh0635. Epub 2021 Jun 3.

Estimating epidemiologic dynamics from cross-sectional viral load distributions

James A Hay^#^{1

2

3}, Lee Kennedy-Shaffer^#^{1

2

4}, Sanjat Kanjilal^{5

6}, Niall J Lennon⁷, Stacey B Gabriel⁷, Marc Lipsitch^{8

2

3}, Michael J Mina^{1

2

3

9}

Affiliations

¹ Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA. jhay@hsph.harvard.edu lkennedyshaffer@vassar.edu mmina@hsph.harvard.edu.
² Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
³ Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁴ Department of Mathematics and Statistics, Vassar College, Poughkeepsie, NY, USA.
⁵ Department of Population Medicine, Harvard Pilgrim Health Care Institute, Boston, MA, USA.
⁶ Department of Infectious Diseases, Brigham and Women's Hospital, Boston, MA, USA.
⁷ Broad Institute of MIT and Harvard, Cambridge, MA, USA.
⁸ Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁹ Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA.

^# Contributed equally.

PMID: 34083451
PMCID: PMC8527857
DOI: 10.1126/science.abh0635

Estimating epidemiologic dynamics from cross-sectional viral load distributions

James A Hay et al. Science. 2021.

. 2021 Jul 16;373(6552):eabh0635.

doi: 10.1126/science.abh0635. Epub 2021 Jun 3.

Authors

James A Hay^#^{1

2

3}, Lee Kennedy-Shaffer^#^{1

2

4}, Sanjat Kanjilal^{5

6}, Niall J Lennon⁷, Stacey B Gabriel⁷, Marc Lipsitch^{8

2

3}, Michael J Mina^{1

2

3

9}

Affiliations

¹ Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA. jhay@hsph.harvard.edu lkennedyshaffer@vassar.edu mmina@hsph.harvard.edu.
² Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
³ Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁴ Department of Mathematics and Statistics, Vassar College, Poughkeepsie, NY, USA.
⁵ Department of Population Medicine, Harvard Pilgrim Health Care Institute, Boston, MA, USA.
⁶ Department of Infectious Diseases, Brigham and Women's Hospital, Boston, MA, USA.
⁷ Broad Institute of MIT and Harvard, Cambridge, MA, USA.
⁸ Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁹ Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA.

^# Contributed equally.

PMID: 34083451
PMCID: PMC8527857
DOI: 10.1126/science.abh0635

Abstract

Estimating an epidemic's trajectory is crucial for developing public health responses to infectious diseases, but case data used for such estimation are confounded by variable testing practices. We show that the population distribution of viral loads observed under random or symptom-based surveillance-in the form of cycle threshold (Ct) values obtained from reverse transcription quantitative polymerase chain reaction testing-changes during an epidemic. Thus, Ct values from even limited numbers of random samples can provide improved estimates of an epidemic's trajectory. Combining data from multiple such samples improves the precision and robustness of this estimation. We apply our methods to Ct values from surveillance conducted during the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic in a variety of settings and offer alternative approaches for real-time estimates of epidemic trajectories for outbreak management and response.

PubMed Disclaimer

Figures

None — **Ct values reflect the epidemic trajectory and can be used to estimate incidence.** (A and B) Whether an epidemic has rising or falling incidence will be reflected in the distribution of times since infection (A), which in turn affects the distribution of Ct values in a surveillance sample (B). (C) These values can be used to assess whether the epidemic is rising or falling and estimate the incidence curve.

**Fig. 1.. The Ct value distribution reflects epidemiological dynamics over the course of an outbreak.**
(A) Per capita daily incidence (histogram) and daily growth rate (blue line) of new infections in a simulated epidemic using an SEIR model. (B) Median days since infection versus daily growth rate of new infections by epidemic day. Labeled points here, and in (E) to (G), show five time points in the simulated epidemic. (C) Observed Ct value by day for 500 randomly sampled infected individuals. (D) Viral kinetics model (increasing Ct value after peak and subsequent plateau near the limit of detection), demonstrating the time course of Ct values (x axis; line shows mean, and ribbon shows 95% quantile range) against days since infection (y axis). Note that the y axis is arranged to align with (E). (E) Distribution of days since infection (violin plots and histograms) for randomly selected individuals over the course of the epidemic. Median and first and third quartiles are shown as green lines and points, respectively. (F) Skewness of observed Ct value distribution versus daily growth rate of new infections by epidemic day. (G) Distribution of observed Ct values (violin plots and histograms) among sampled infected individuals by epidemic day. Median and first and third quartile are shown as purple lines and points, respectively. (H) Time-varying effective reproductive number, R_t, derived from the SEIR simulation, plotted against median and skewness of observed Ct value distribution.

**Fig. 2.. Single cross-sectional distributions of observed Ct values can be used to reconstruct epidemic trajectories in a Massachusetts long-term care facility.**
(A) Estimated prevalence [faint orange lines show posterior samples, solid orange line shows posterior median, and orange ribbon shows 95% credible intervals (CrIs)] and incidence (red line shows posterior median and red ribbon shows 95% CrI) from the standard compartmental (SEEIRR) model fit to point prevalence at three sampling times (error bars show 95% binomial confidence intervals). (B) Model-predicted Ct distributions (blue) fitted to the observed Ct values (gray bars) from each of three cross-sectional samples. Shown are the posterior median (black line) and 95% CrI for the expected Ct distribution (dark blue ribbon) and 95% prediction intervals based on simulated observations (light blue ribbon). Note that prediction intervals are much wider than CrIs because they result from simulating observations with a small sample size. (C) Each panel shows results from fitting the Ct-based SEIR model separately to three cross sections of virologic data. Shown are random posterior samples (red lines) and the maximum posterior probability (MAP) trajectory (black line) for the incidence curve. (D) Fitted median (blue point) and 95% CrI (blue error bars) for the proportion of samples testing positive based on the Ct model compared with the observed proportion tested positive (gray cross). (E) Thirty-five–day (green) and 1-day (pink) average growth rates from the Ct model estimates in (C) at three time points (violin plots) compared with growth-rate estimates from the SEEIRR model in (A) (lines and shaded ribbons).

Fig. 3.. Inferring epidemic trajectory from cross-sectional surveillance samples with observed Ct values yields nearly unbiased estimates of the time-varying effective reproductive number, R_t, whereas changing testing rates lead to biased estimation using reported case counts.
(A) Number of positive tests per day by sampling time in epidemic and testing scheme for reported case counts (top row) and surveillance Ct sampling (bottom row), from a simulated SEIR epidemic. Analysis times corresponding to (B) are shown by the dashed vertical lines. (B) R_t estimates from 100 simulations for each epidemic sampling time, testing scheme, and estimation method. Each point is the posterior median from a single simulation. R_t estimates for reported case counts use *EpiNow2* estimation and for surveillance Ct samples use the Ct-based likelihood for one or multiple cross sections fitted to an SEIR model. The semitransparent points at the right of the plots are those surveillance samples fit to an SEIR model using only a binary result of testing, assuming PCR positivity reflects the infectious compartment. True model-based R_t on the sampling day is indicated by the black star and dashed horizontal line, whereas an R_t of 1, indicating a flat outbreak, is indicated by the solid horizontal line.

**Fig. 4.. Cross-sectional distributions of observed Ct values can estimate the complex statewide epidemic trajectory from hospital-based surveillance at Brigham and Women’s Hospital in Massachusetts.**
(A) Daily confirmed new cases in Massachusetts (gray bars) and estimated time-varying effective reproductive number, R_t. (B) Estimated R_t from the case counts versus median and skewness of observed Ct value distribution by weekly sampling times. (C) Distribution (violin plots and points) and smoothed median (blue line) of observed Ct values by sampling week. Red box highlights data used to inform estimates in (D). (D) Posterior median (yellow arrow) and distribution (blue shaded area) of estimated daily growth rate of incident infections from an SEIR model fit to a single cross section of observed Ct value data from the week commencing 14 June 2020. Shading density is proportional to posterior density. Fits to all single weekly cross sections are shown in fig. S14. (E) Posterior distribution of relative probability of infection by date from a GP model fit to all observed Ct values (ribbons show 95% and 50% CrIs; line shows posterior median). Note that the y axis shows relative rather than absolute probability of infection, as the underlying incidence curve must sum to one: Only positive samples were included in the estimation, and all samples were therefore assumed to have been from infections. (F) Comparison of estimated daily growth rate of incident infections from the GP model (blue line and shaded ribbons show posterior median and 95% CrI) to that from R_t estimation using observed case counts (red and green line and shaded ribbons show posterior median and 95% CrI) by date. Note that estimates of infection incidence are made for dates before the first observed sample date of 15 April 2020, as far back as 1 March 2020, but the x axis is truncated at 1 April 2020 (fig. S19).

See this image and copyright information in PMC

Update of

Estimating epidemiologic dynamics from cross-sectional viral load distributions.
Hay JA, Kennedy-Shaffer L, Kanjilal S, Lennon NJ, Gabriel SB, Lipsitch M, Mina MJ. Hay JA, et al. medRxiv [Preprint]. 2021 Feb 13:2020.10.08.20204222. doi: 10.1101/2020.10.08.20204222. medRxiv. 2021. Update in: Science. 2021 Jul 16;373(6552):eabh0635. doi: 10.1126/science.abh0635. PMID: 33594381 Free PMC article. Updated. Preprint.

Comment in

Using viral load to model disease dynamics.
Lopman BA, McQuade ETR. Lopman BA, et al. Science. 2021 Jul 16;373(6552):280-281. doi: 10.1126/science.abj4185. Science. 2021. PMID: 34437141 No abstract available.

References

1. Fineberg HV, Wilson ME, Epidemic science in real time. Science 324, 987 (2009). doi: 10.1126/science.1176297 - DOI - PubMed
1. World Health Organization, Public health surveillance for COVID-19: Interim guidance (2020); www.who.int/publications/i/item/who-2019-nCoV-surveillanceguidance-2020.8.
1. Jombart T et al. Inferring the number of COVID-19 cases from recently reported deaths. Wellcome Open Res. 5, 78 (2020). doi: 10.12688/wellcomeopenres.15786.1 - DOI - PMC - PubMed
1. Lipsitch M, Swerdlow DL, Finelli L, Defining the epidemiology of COVID-19—Studies needed. N. Engl. J. Med 382,1194–1196 (2020). doi: 10.1056/NEJMp2002125 - DOI - PubMed
1. Betensky RA, Feng Y, Accounting for incomplete testing in the estimation of epidemic parameters. Inf. J. Epidemiol 49, 1419–1426 (2020). doi: 10.1093/ije/dyaa116 - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Estimating epidemiologic dynamics from cross-sectional viral load distributions

Affiliations

Estimating epidemiologic dynamics from cross-sectional viral load distributions

Authors

Affiliations

Abstract

Figures

Update of

Comment in

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Miscellaneous