Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 5;10(4):1418-1432.
doi: 10.1016/j.idm.2025.07.017. eCollection 2025 Dec.

A Bayesian modelling framework with model comparison for epidemics with super-spreading

Affiliations

A Bayesian modelling framework with model comparison for epidemics with super-spreading

Hannah Craddock et al. Infect Dis Model. .

Abstract

The transmission dynamics of an epidemic are rarely homogeneous. Super-spreading events and super-spreading individuals are two types of heterogeneous transmissibility. Inference of super-spreading is commonly carried out on secondary case data, the expected distribution of which is known as the offspring distribution. However, this data is seldom available. Here we introduce a multi-model framework fit to incidence time-series, data that is much more readily available. The framework consists of five discrete-time, stochastic, branching-process models of epidemics spread through a susceptible population. The framework includes a baseline model of homogeneous transmission, a unimodal and a bimodal model for super-spreading events, as well as a unimodal and a bimodal model for super-spreading individuals. Bayesian statistics is used to infer model parameters using Markov Chain Monte-Carlo methods. Model comparison is conducted by computing Bayes factors, with importance sampling used to estimate the marginal likelihood of each model. This estimator is selected for its consistency and lower variance compared to alternatives. Application to simulated data from each model identifies the correct model for the majority of simulations and accurately infers the true parameters, such as the basic reproduction number. We also apply our methods to incidence data from the 2003 SARS outbreak and the Covid-19 pandemic caused by SARS-CoV-2. Model selection consistently identifies the same model and mechanism for a given disease, even when using different time series. Our estimates are consistent with previous studies based on secondary case data. Quantifying the contribution of super-spreading to disease transmission has important implications for infectious disease management and control. Our modelling framework is disease-agnostic and implemented as an R package, with potential to be a valuable tool for public health.

Keywords: Bayesian modelling; Infectious disease epidemiology; Model comparison; Super-spreading; Transmission heterogenity.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Typical simulation from the five models of epidemic transmission. The time series plots show incidence data I[1:T] simulated from the Baseline, SSE, SSI, SSEB and SSIB models for R0 = 2.0, duration T = 50.
Fig. 2
Fig. 2
Results of inference of R0 across the five models. Posterior estimates in each model fit to 5000 simulated datasets from the model itself for R0 in the range [0.9, 4.0]. Dots indicate the mean of the estimated posteriors, bars represent the 95 % CI of the posterior, and the black diagonal line represents the true value used for simulation.
Fig. 3
Fig. 3
Incidence data from New Zealand's Southland district during the SARS-CoV-2 outbreak in 2020, before and after the wedding-related super-spreading event on March 21st.
Fig. 4
Fig. 4
Reported cases from the Waitemata district of Auckland, New Zealand during a wave of SARS-CoV-2 cases in August 2021 resulting in a Level 4 lockdown.
Fig. 5
Fig. 5
Bar plots of the posterior model probabilities of the five models applied to the two waves of SARS-CoV-2, New Zealand in 2020 and 2021.
Fig. 6
Fig. 6
Incidence data and bar plots of the posterior model probabilities of the five models applied to the two waves of SARS outbreaks in 2003.

Similar articles

References

    1. Adam D., Gostic K., Tsang T., Wu P., Lim W.W., Yeung A., Wong J., Lau E., Du Z., Chen D., Ho L.-M., Martín-Sánchez M., Cauchemez S., Cobey S., Leung G., Cowling B. Time-varying transmission heterogeneity of SARS and COVID-19 in Hong Kong. Research Square. 2022
    1. Adam D.C., Wu P., Wong J.Y., Lau E.H., Tsang T.K., Cauchemez S., Leung G.M., Cowling B.J. Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nature Medicine. 2020;26(11):1714–1719. - PubMed
    1. Adelson R. Compound poisson distributions. Journal of the Operational Research Society. 1966;17(1):73–75.
    1. Ando T. CRC Press; 2010. Bayesian model selection and statistical modeling.
    1. Brainard J., Jones N.R., Harrison F.C., Hammer C.C., Lake I.R. Super-spreaders of novel coronaviruses that cause SARS, MERS and COVID-19: A systematic review. Annals of Epidemiology. 2023;82:66–76. - PMC - PubMed

LinkOut - more resources