Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;8(11):e1002768.
doi: 10.1371/journal.pcbi.1002768. Epub 2012 Nov 15.

A Bayesian inference framework to reconstruct transmission trees using epidemiological and genetic data

Affiliations

A Bayesian inference framework to reconstruct transmission trees using epidemiological and genetic data

Marco J Morelli et al. PLoS Comput Biol. 2012.

Abstract

The accurate identification of the route of transmission taken by an infectious agent through a host population is critical to understanding its epidemiology and informing measures for its control. However, reconstruction of transmission routes during an epidemic is often an underdetermined problem: data about the location and timings of infections can be incomplete, inaccurate, and compatible with a large number of different transmission scenarios. For fast-evolving pathogens like RNA viruses, inference can be strengthened by using genetic data, nowadays easily and affordably generated. However, significant statistical challenges remain to be overcome in the full integration of these different data types if transmission trees are to be reliably estimated. We present here a framework leading to a bayesian inference scheme that combines genetic and epidemiological data, able to reconstruct most likely transmission patterns and infection dates. After testing our approach with simulated data, we apply the method to two UK epidemics of Foot-and-Mouth Disease Virus (FMDV): the 2007 outbreak, and a subset of the large 2001 epidemic. In the first case, we are able to confirm the role of a specific premise as the link between the two phases of the epidemics, while transmissions more densely clustered in space and time remain harder to resolve. When we consider data collected from the 2001 epidemic during a time of national emergency, our inference scheme robustly infers transmission chains, and uncovers the presence of undetected premises, thus providing a useful tool for epidemiological studies in real time. The generation of genetic data is becoming routine in epidemiological investigations, but the development of analytical tools maximizing the value of these data remains a priority. Our method, while applied here in the context of FMDV, is general and with slight modification can be used in any situation where both spatiotemporal and genetic data are available.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Dynamical model of pathogen transmission between a source premise and a receptor premise .
Premises are considered confined at fixed locations in space. Variables covered by the grey rectangles are observable. A premise formula image is infected at time formula image, becomes infectious after a latent period formula image, is observed at time formula image, when a viral sequence formula image is obtained, and is removed from susceptible population (i.e. loses its ability to infect other premises) at time formula image. When an infected premise is reported, the duration period from infectiousness to detection, formula image, is assessed by experts based on symptom observation. This assessment is called formula image.
Figure 2
Figure 2. Estimation output for the simulated outbreak.
Top left: true transmissions (circles) and posterior probabilities of transmissions (dot sizes are proportional to probabilities). Top right: tree with the highest posterior probability (solid arrows); Only transmission formula image is not consistent with the true tree (the true transmission is formula image dashed arrow). Bottom: posterior distributions (histograms) of mean latency duration (formula image; left) and mean transmission distance (formula image; right); dashed lines: true values; dotted-dashed curves: prior distributions; solid lines: posterior medians; dotted lines: posterior quantiles 0.025 and 0.975.
Figure 3
Figure 3. Estimation output for the 2007 UK outbreak.
Top left: posterior probabilities of transmissions (dot sizes proportional to probabilities). Top right: tree with the highest posterior probability mapped in space (black arrows). Bottom: posterior distributions (histograms) of mean latency duration (formula image; left) and mean transmission distance (formula image; right); dotted-dashed curves: prior distributions; solid lines: posterior medians; dotted lines: posterior quantiles 0.025 and 0.975.
Figure 4
Figure 4. Estimation output for the 2001 UK outbreak (Darlington cluster).
Top left: posterior probabilities of transmissions (dot sizes proportional to probabilities). Top right: tree with the highest posterior probability mapped in space (black arrows). Bottom: posterior distributions (histograms) of mean latency duration (formula image; left) and mean transmission distance (formula image; right); dotted-dashed curves: prior distributions; solid lines: posterior medians; dotted lines: posterior quantiles 0.025 and 0.975.
Figure 5
Figure 5. Direct acyclic graph illustrating the dependencies in the model.
Bold letters are used to represent sets of variables, with one variable per farm, e.g. formula image. For a full description of the symbols, see Table 2.

References

    1. Ferguson NM, Donnelly CA, Anderson RM (2001) Transmission intensity and impact of control policies on the foot and mouth epidemic in great britain. Nature 413: 542–548. - PubMed
    1. Keeling MJ, Woolhouse MEJ, May RM, Davies G, Grenfell BT (2003) Modelling vaccination strategies against foot-and-mouth disease. Nature 421: 136–142. - PubMed
    1. Cauchemez S, Ferguson NM (2011) Methods to infer transmission risk factors in complex outbreak data. J R Soc Interface 9: 456–469. - PMC - PubMed
    1. Haydon DT, Chase-Topping M, Shaw DJ, Matthews L, Friar JK, et al. (2003) The construction and analysis of epidemic trees with reference to the 2001 uk foot–and–mouth outbreak. Proc R Soc B 270: 121–127. - PMC - PubMed
    1. Metzker ML (2010) Sequencing technologies - the next generation. Nat Rev Genet 11: 31–46. - PubMed

Publication types