Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 2;39(9):btad571.
doi: 10.1093/bioinformatics/btad571.

P-DOR, an easy-to-use pipeline to reconstruct bacterial outbreaks using genomics

Affiliations

P-DOR, an easy-to-use pipeline to reconstruct bacterial outbreaks using genomics

Gherard Batisti Biffignandi et al. Bioinformatics. .

Abstract

Summary: Bacterial Healthcare-Associated Infections (HAIs) are a major threat worldwide, which can be counteracted by establishing effective infection control measures, guided by constant surveillance and timely epidemiological investigations. Genomics is crucial in modern epidemiology but lacks standard methods and user-friendly software, accessible to users without a strong bioinformatics proficiency. To overcome these issues we developed P-DOR, a novel tool for rapid bacterial outbreak characterization. P-DOR accepts genome assemblies as input, it automatically selects a background of publicly available genomes using k-mer distances and adds it to the analysis dataset before inferring a Single-Nucleotide Polymorphism (SNP)-based phylogeny. Epidemiological clusters are identified considering the phylogenetic tree topology and SNP distances. By analyzing the SNP-distance distribution, the user can gauge the correct threshold. Patient metadata can be inputted as well, to provide a spatio-temporal representation of the outbreak. The entire pipeline is fast and scalable and can be also run on low-end computers.

Availability and implementation: P-DOR is implemented in Python3 and R and can be installed using conda environments. It is available from GitHub https://github.com/SteMIDIfactory/P-DOR under the GPL-3.0 license.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
The P-DOR outputs of the test analysis. (A) Phylogenetic tree of the Analysis Dataset (AD). The first column shows epidemiological clusters of strains with SNP distances below the threshold set by the user. In addition, a heatmap representing the detection of resistance and virulence determinants is shown next to the tree for a better representation of the epidemic event. (B) Distribution of SNP distances calculated between all permutations of genome pairs in the AD. (C) Timeline depicting the movements of the patients during the hospitalization. Points indicate outbreak genomes and are shaped according to the isolation source of the corresponding strain. Samples are linked if their genetic distance in terms of SNPs is below the threshold.

Similar articles

Cited by

References

    1. Balloux F, Brønstad Brynildsrud O, van Dorp L. et al. From theory to practice: translating whole-genome sequencing (WGS) into the clinic. Trends Microbiol 2018;26:1035–48. - PMC - PubMed
    1. Campbell F, Didelot X, Fitzjohn R. et al. outbreaker2: a modular platform for outbreak reconstruction. BMC Bioinformatics 2018;19:363. - PMC - PubMed
    1. Dallman TJ, Ashton PM, Byrne L. et al. Applying phylogenomics to understand the emergence of Shiga-toxin-producing O157:H7 strains causing severe human disease in the UK. Microb Genom 2015;1:e000029. - PMC - PubMed
    1. David S, Reuter S, Harris SR. et al.; ESGEM Study Group. Epidemic of carbapenem-resistant Klebsiella pneumoniae in Europe is driven by nosocomial spread. Nat Microbiol 2019;4:1919–29. - PMC - PubMed
    1. Davis JJ, Wattam AW, Aziz RK. et al. The PATRIC bioinformatics resource center: expanding data and analysis capabilities. Nucleic Acids Res 2020;48:D606–12. - PMC - PubMed

Publication types