Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 May 2:rs.3.rs-4271356.
doi: 10.21203/rs.3.rs-4271356/v1.

Simultaneous detection of pathogens and antimicrobial resistance genes with the open source, cloud-based, CZ ID pipeline

Affiliations

Simultaneous detection of pathogens and antimicrobial resistance genes with the open source, cloud-based, CZ ID pipeline

Charles Langelier et al. Res Sq. .

Update in

Abstract

Antimicrobial resistant (AMR) pathogens represent urgent threats to human health, and their surveillance is of paramount importance. Metagenomic next generation sequencing (mNGS) has revolutionized such efforts, but remains challenging due to the lack of open-access bioinformatics tools capable of simultaneously analyzing both microbial and AMR gene sequences. To address this need, we developed the CZ ID AMR module, an open-access, cloud-based workflow designed to integrate detection of both microbes and AMR genes in mNGS and whole-genome sequencing (WGS) data. It leverages the Comprehensive Antibiotic Resistance Database and associated Resistance Gene Identifier software, and works synergistically with the CZ ID short-read mNGS module to enable broad detection of both microbes and AMR genes. We highlight diverse applications of the AMR module through analysis of both publicly available and newly generated mNGS and WGS data from four clinical cohort studies and an environmental surveillance project. Through genomic investigations of bacterial sepsis and pneumonia cases, hospital outbreaks, and wastewater surveillance data, we gain a deeper understanding of infectious agents and their resistomes, highlighting the value of integrating microbial identification and AMR profiling for both research and public health. We leverage additional functionalities of the CZ ID mNGS platform to couple resistome profiling with the assessment of phylogenetic relationships between nosocomial pathogens, and further demonstrate the potential to capture the longitudinal dynamics of pathogen and AMR genes in hospital acquired bacterial infections. In sum, the new AMR module advances the capabilities of the open-access CZ ID microbial bioinformatics platform by integrating pathogen detection and AMR profiling from mNGS and WGS data. Its development represents a critical step toward democratizing pathogen genomic analysis and supporting collaborative efforts to combat the growing threat of AMR.

PubMed Disclaimer

Conflict of interest statement

Declarations Competing interests The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. High-level flow diagram highlighting the integrated AMR and mNGS modules within the CZ ID pipeline.
A more detailed diagram is provided in Figure S1.
Figure 2
Figure 2. Examples of CZ ID web tool sample reports.
(A) The report in the AMR module with a filter of Number of Reads >= 5 and Reads/Contig % coverage >= 10% applied to the AMR genes. (B) The report in the mNGS module showing the list of detected species and the coverage visualization for one species. Details about report metrics are discussed in the main text and CZ ID help center https://help.czid.org/.
Figure 3
Figure 3. Combining pathogen detection and AMR gene profiling of mNGS and WGS data to investigate Klebsiella pneumoniae transfusion-related sepsis.
(A) Abundance and genome coverage of Klebsiella pneumonia from direct mNGS of plasma or serum samples versus WGS of cultured bacterial isolates. (B) AMRgenes detected in each sample. *denotes AMR gene(s) for which resistance originates due to pointmutations (as opposed to presence/absence of the gene); these were detected by the “protein variant model” in CARD and the gene name shown is a representative reference gene containing the mutations known to lead to resistance. Legend: NT rPM = reads mapping to pathogen in the NCBI NT database per million reads sequenced. Contig = contiguous sequence. Strict/Perfect/Nudged refers to RGI’s alignment stringency threshold. “pt1” = patient 1, “pt2” = patient 2. “pre” = pre-transfusion, “post” = post-transfusion.
Figure 4
Figure 4. Outbreak investigation pairing WGS of methicillin susceptible Staphylococcus aureus isolates and mNGS of surveillance skin swabs from babies in a neonatal intensive care unit.
(A) Unsupervised clustering of AMR gene profiles from WGS data reveals a cluster of related isolates indicated by the dashed-line box. (B) Matrix of single nucleotide polymorphism (SNP) distances between each sequenced isolate confirms the genetic relatedness of this cluster, which is highlighted by a dashed-line box.
Figure 5
Figure 5. Bacterial genera and AMR genes detected by mNGS of skin swabs from babies in a neonatal intensive care unit.
(A) mNGS of swab samples demonstrated a diversity of genera in both samples from patients within an outbreak cluster of genetically related S. aureus, as well as in those from patients outside of the cluster. (B) mNGS analysis revealed a greater number and type of AMR gene families versus those identified by WGS of S. aureus isolated in culture from the swabs. Selected AMR gene families of high public health concern are highlighted in red with the specific genes detected in parenthesis.
Figure 6
Figure 6. Co-detection of microbes and AMR genes in patients with critical bacterial infections using the CZ ID mNGS and AMR modules.
(A) Relative abundance (reads per million, rpM) of the eight most abundant taxa in the lower respiratory tract detected by RNA mNGS of tracheal aspirate from a patient with Serratia marcescens pneumonia. The dominant microbe is highlighted in blue. (B) AMR genes and their species prediction by the AMR module. Columns indicate the species these AMR genes and their variants are found in according to CARD Resistomes & Variants database, and those found in the dominant species as in (A) are colored in blue. AMR genes that are further associated with the dominant species by the pathogen-of-origin analysis are colored in purple. (C) Relative abundance (rpM) of the eight most abundant taxa detected by plasma DNA mNGS in a patient with sepsis due to MRSA bloodstream infection. The dominant microbe is highlighted in blue. (D) AMR genes and their species prediction by the AMR module. Columns indicate the species these AMR genes and their variants are found in according to CARD Resistomes & Variants database, and those found in the dominant species as in (C) are colored in blue. AMR genes that are further associated with the dominant species by the pathogen-of-origin analysis are colored in purple.
Figure 7
Figure 7. Longitudinal profiling of pathogen and AMR gene abundance in a patient hospitalized for severe Respiratory Syncytial Virus (RSV) infection who developed Pseudomonas aeruginosa Ventilator Associated Pneumonia (VAP).
(A) Relative abundance in reads per million (rpM) of RSV and P. aeruginosa detected by the CZ ID mNGS pipeline. (B) AMR genes detected in the lower respiratory tract microbiome at each time point. Perfect or strict AMR alignments from contigs are highlighted in yellow, while those nudged are orange. Short read alignments are in red. AMR genes mapping to Pseudomonas aeruginosa or any Pseudomonas species are highlighted in blue and purple, respectively. *Sample from Day 12 did not have enough sequencing reads but was plotted to maintain even scaling on the x-axis.
Figure 8
Figure 8. AMR surveillance from environmental water samples.
AMR gene families identified from global surveillance of surface or wastewater samples from Boston, USA and Vellore, India. AMR genes found by contigs that passed Perfect or Strict cutoff are included in heatmap. Gene families of high public health concern are highlighted in red.

References

    1. Antimicrobial Resistance Collaborators. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399, 629–655 (2022). - PMC - PubMed
    1. Review on Antimicrobial Resistance. Tackling Drug-Resistant Infections Globally: Final Report and Recommendations. (2016).
    1. global health issues to track in 2021. https://www.who.int/news-room/spotlight/10-global-health-issues-to-track...2021.
    1. Baker K. S. et al. Evidence review and recommendations for the implementation of genomics for antimicrobial resistance surveillance: reports from an international expert group. Lancet Microbe 4, e1035–e1039 (2023). - PubMed
    1. Anjum M. F., Zankari E. & Hasman H. Molecular Methods for Detection of Antimicrobial Resistance. Microbiol Spectr 5, (2017). - PMC - PubMed

Publication types

LinkOut - more resources