Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 22;20(6):e3001685.
doi: 10.1371/journal.pbio.3001685. eCollection 2022 Jun.

An open-access database of infectious disease transmission trees to explore superspreader epidemiology

Affiliations

An open-access database of infectious disease transmission trees to explore superspreader epidemiology

Juliana C Taube et al. PLoS Biol. .

Abstract

Historically, emerging and reemerging infectious diseases have caused large, deadly, and expensive multinational outbreaks. Often outbreak investigations aim to identify who infected whom by reconstructing the outbreak transmission tree, which visualizes transmission between individuals as a network with nodes representing individuals and branches representing transmission from person to person. We compiled a database, called OutbreakTrees, of 382 published, standardized transmission trees consisting of 16 directly transmitted diseases ranging in size from 2 to 286 cases. For each tree and disease, we calculated several key statistics, such as tree size, average number of secondary infections, the dispersion parameter, and the proportion of cases considered superspreaders, and examined how these statistics varied over the course of each outbreak and under different assumptions about the completeness of outbreak investigations. We demonstrated the potential utility of the database through 2 short analyses addressing questions about superspreader epidemiology for a variety of diseases, including Coronavirus Disease 2019 (COVID-19). First, we found that our transmission trees were consistent with theory predicting that intermediate dispersion parameters give rise to the highest proportion of cases causing superspreading events. Additionally, we investigated patterns in how superspreaders are infected. Across trees with more than 1 superspreader, we found preliminary support for the theory that superspreaders generate other superspreaders. In sum, our findings put the role of superspreading in COVID-19 transmission in perspective with that of other diseases and suggest an approach to further research regarding the generation of superspreaders. These data have been made openly available to encourage reuse and further scientific inquiry.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. We compiled infectious disease transmission trees from the literature along with reported attribute information.
Shown here are example trees in the database. (A) Ebola spread in different contexts [8]. (B) Measles spread in different locations [9]. (C) COVID-19 spread among age classes [10]. Primary sources for transmission trees are available in OutbreakTrees and listed in the Supporting information. OutbreakTrees may be accessed online at http://outbreaktrees.ecology.uga.edu. COVID-19, Coronavirus Disease 2019.
Fig 2
Fig 2. Characteristics of transmission trees in OutbreakTrees.
(A) Tree size varies from 2 to 286 with a median of 3 and most trees represent outbreaks taking place in the past 20 years (only trees with 10 or more cases shown in date plot due to large number of small COVID-19 trees from 2020). (B) The largest trees are from H1N1 and SARS outbreaks, while the highest proportion of trees in the database are from outbreaks of COVID-19, followed by adenovirus and Ebola. Tree size axes in both plots are shown on a log10 scale to better illustrate variation in medium-sized trees. All trees are used in this analysis. The data to reproduce this figure can be found at https://doi.org/10.5061/dryad.nk98sf7w7. COVID-19, Coronavirus Disease 2019; MERS, Middle East Respiratory Syndrome; SARS, Severe Acute Respiratory Syndrome.
Fig 3
Fig 3. The time dependence of R, k, and the proportion of cases causing superspreading events.
(A) R decreased significantly between the first and second halves of transmission trees. (B) k did not differ significantly between the first and second halves of transmission trees. Y-axis is on a log10 scale for visual aid. (C) The proportion of cases causing superspreading events decreased significantly between the first and second halves of transmission trees. (D) Decrease in R shown for each tree by disease. R was below 1 in the second half of all trees; red line denotes R = 1. The Wilcoxon rank test was used for all significance tests (*: p≤0.05, **: p≤0.01, ***: p≤0.001, ****: p≤0.0001), and results are shown in red stars. Trees were assumed to be complete and only trees with 20 or more cases and at least 2 generations of spread were used in these analyses. Results assuming tree incompleteness are shown in S3 Fig. The data to reproduce this figure can be found at https://doi.org/10.5061/dryad.nk98sf7w7. COVID-19, Coronavirus Disease 2019; MERS, Middle East Respiratory Syndrome; SARS, Severe Acute Respiratory Syndrome.
Fig 4
Fig 4. The importance and expected frequency of superspreading across diseases.
(A) The highest proportion of cases causing superspreading events is observed at intermediate dispersion parameters, as predicted by theory [3]. (B) Dispersion parameter (k) of a negative binomial distribution fit to the offspring distribution of trees by disease (for diseases with at least 3 trees). Lower dispersion parameters are indicative of greater variation in number of secondary infections. Vertical line and value printed in each facet shows the median k and standard error for each disease. X-axes are on a log10 scale in both plots for visual aid. Trees were assumed to be complete and only trees with 20 or more cases and at least 2 generations of spread were used in these analyses. Other size cutoffs are shown in S4 and S5 Figs and results assuming tree incompleteness are shown in S6 Fig. The data to reproduce this figure can be found at https://doi.org/10.5061/dryad.nk98sf7w7. COVID-19, Coronavirus Disease 2019; MERS, Middle East Respiratory Syndrome; SARS, Severe Acute Respiratory Syndrome.
Fig 5
Fig 5. In two-thirds of transmission trees, superspreaders infect superspreaders more often than would be expected by chance.
The expected number of superspreader-superspreader dyads was calculated by s(s1)St for each tree, where s is the number of superspreaders in the tree, t is the number of terminal nodes (nodes that do not cause onward transmission), and S is tree size. Ratios larger than 1 indicate more superspreader-superspreader dyads were observed than would be expected by chance. This analysis was limited to trees with more than 1 superspreader, 20 or more cases, and 2 or more generations of spread. We assumed tree completeness here, but results assuming incompleteness are shown in S7 Fig. The data to reproduce this figure can be found at https://doi.org/10.5061/dryad.nk98sf7w7. COVID-19, Coronavirus Disease 2019; MERS, Middle East Respiratory Syndrome; SARS, Severe Acute Respiratory Syndrome.

Similar articles

Cited by

References

    1. Teunis P, Heijne JC, Sukhrie F, van Eijkeren J, Koopmans M, Kretzschmar M. Infectious disease transmission as a forensic problem: Who infected whom? J R Soc Interface. 2013;10(81):20120955. doi: 10.1098/rsif.2012.0955 - DOI - PMC - PubMed
    1. Campbell F, Didelot X, Fitzjohn R, Ferguson N, Cori A, Jombart T. outbreaker2: A modular platform for outbreak reconstruction. BMC Bioinformatics. 2018;19(Suppl 11)(363):17–24. doi: 10.1186/s12859-018-2330-z - DOI - PMC - PubMed
    1. Lloyd-Smith JO, Schreiber SJ, Kopp PE, Getz WM. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438:355–9. doi: 10.1038/nature04153 - DOI - PMC - PubMed
    1. Althouse BM, Wenger EA, Miller JC, Scarpino SV, Allard A, Hébert-Dufresne L, et al.. Superspreading events in the transmission dynamics of SARS-CoV-2: Opportunities for interventions and control. PLoS Biol. 2020;18(11):e3000897. doi: 10.1371/journal.pbio.3000897 - DOI - PMC - PubMed
    1. Beldomenico PM. Do superspreaders generate new superspreaders? A hypothesis to explain the propagation pattern of COVID-19. Int J Infect Dis. 2020;96:461–3. doi: 10.1016/j.ijid.2020.05.025 - DOI - PMC - PubMed

Publication types