Bayesian Inference of Pathogen Phylogeography using the Structured Coalescent Model
- PMID: 40258093
- PMCID: PMC12040344
- DOI: 10.1371/journal.pcbi.1012995
Bayesian Inference of Pathogen Phylogeography using the Structured Coalescent Model
Abstract
Over the past decade, pathogen genome sequencing has become well established as a powerful approach to study infectious disease epidemiology. In particular, when multiple genomes are available from several geographical locations, comparing them is informative about the relative size of the local pathogen populations as well as past migration rates and events between locations. The structured coalescent model has a long history of being used as the underlying process for such phylogeographic analysis. However, the computational cost of using this model does not scale well to the large number of genomes frequently analysed in pathogen genomic epidemiology studies. Several approximations of the structured coalescent model have been proposed, but their effects are difficult to predict. Here we show how the exact structured coalescent model can be used to analyse a precomputed dated phylogeny, in order to perform Bayesian inference on the past migration history, the effective population sizes in each location, and the directed migration rates from any location to another. We describe an efficient reversible jump Markov Chain Monte Carlo scheme which is implemented in a new R package StructCoalescent. We use simulations to demonstrate the scalability and correctness of our method and to compare it with existing software. We also applied our new method to several state-of-the-art datasets on the population structure of real pathogens to showcase the relevance of our method to current data scales and research questions.
Copyright: © 2025 Roberts et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures








Similar articles
-
The Structured Coalescent and Its Approximations.Mol Biol Evol. 2017 Nov 1;34(11):2970-2981. doi: 10.1093/molbev/msx186. Mol Biol Evol. 2017. PMID: 28666382 Free PMC article.
-
New Routes to Phylogeography: A Bayesian Structured Coalescent Approximation.PLoS Genet. 2015 Aug 12;11(8):e1005421. doi: 10.1371/journal.pgen.1005421. eCollection 2015 Aug. PLoS Genet. 2015. PMID: 26267488 Free PMC article.
-
An Efficient Coalescent Epoch Model for Bayesian Phylogenetic Inference.Syst Biol. 2022 Oct 12;71(6):1549-1560. doi: 10.1093/sysbio/syac015. Syst Biol. 2022. PMID: 35212733 Free PMC article.
-
StarBeast3: Adaptive Parallelized Bayesian Inference under the Multispecies Coalescent.Syst Biol. 2022 Jun 16;71(4):901-916. doi: 10.1093/sysbio/syac010. Syst Biol. 2022. PMID: 35176772 Free PMC article.
-
Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent.PLoS Genet. 2016 May 4;12(5):e1006006. doi: 10.1371/journal.pgen.1006006. eCollection 2016 May. PLoS Genet. 2016. PMID: 27144273 Free PMC article.
Cited by
-
Bayesian phylodynamic inference of population dynamics with dormancy.bioRxiv [Preprint]. 2025 Jan 22:2025.01.19.633741. doi: 10.1101/2025.01.19.633741. bioRxiv. 2025. Update in: Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2501394122. doi: 10.1073/pnas.2501394122. PMID: 39896623 Free PMC article. Updated. Preprint.
-
Bayesian phylodynamic inference of population dynamics with dormancy.Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2501394122. doi: 10.1073/pnas.2501394122. Epub 2025 May 2. Proc Natl Acad Sci U S A. 2025. PMID: 40314983 Free PMC article.
References
-
- Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, et al. GenBank. Nucleic Acids Res. 2017;45(D1):D37–42. doi: 10.1093/nar/gkw1070 - DOI - PMC - PubMed
-
- Shu Y, McCauley J. GISAID: global initiative on sharing all influenza data—from vision to reality. Euro Surveill. 2017;22(13):30494. doi: 10.2807/1560-7917.ES.2017.22.13.30494 - DOI - PMC - PubMed
-
- Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.Org website and their applications. Wellcome Open Res. 2018;3:1–20. doi: 10.12688/wellcomeopenres.14826.1 - DOI - PMC - PubMed
-
- Didelot X, Bowden R, Wilson DJ, Peto TEA, Crook DW. Transforming clinical microbiology with bacterial genome sequencing. Nat Rev Genet. 2012;13:601–12. doi: 10.1038/nrg3226 - DOI - PMC - PubMed
-
- Houldcroft CJ, Beale MA, Breuer J. Clinical and biological insights from viral genome sequencing. Nat Rev Microbiol. 2017;15(3):183–92. doi: 10.1038/nrmicro.2016.182 - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous