Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 4:12:e16695.
doi: 10.7717/peerj.16695. eCollection 2024.

Metagenomic assembly is the main bottleneck in the identification of mobile genetic elements

Affiliations

Metagenomic assembly is the main bottleneck in the identification of mobile genetic elements

Jesse J Kerkvliet et al. PeerJ. .

Abstract

Antimicrobial resistance genes (ARG) are commonly found on acquired mobile genetic elements (MGEs) such as plasmids or transposons. Understanding the spread of resistance genes associated with mobile elements (mARGs) across different hosts and environments requires linking ARGs to the existing mobile reservoir within bacterial communities. However, reconstructing mARGs in metagenomic data from diverse ecosystems poses computational challenges, including genome fragment reconstruction (assembly), high-throughput annotation of MGEs, and identification of their association with ARGs. Recently, several bioinformatics tools have been developed to identify assembled fragments of plasmids, phages, and insertion sequence (IS) elements in metagenomic data. These methods can help in understanding the dissemination of mARGs. To streamline the process of identifying mARGs in multiple samples, we combined these tools in an automated high-throughput open-source pipeline, MetaMobilePicker, that identifies ARGs associated with plasmids, IS elements and phages, starting from short metagenomic sequencing reads. This pipeline was used to identify these three elements on a simplified simulated metagenome dataset, comprising whole genome sequences from seven clinically relevant bacterial species containing 55 ARGs, nine plasmids and five phages. The results demonstrated moderate precision for the identification of plasmids (0.57) and phages (0.71), and moderate sensitivity of identification of IS elements (0.58) and ARGs (0.70). In this study, we aim to assess the main causes of this moderate performance of the MGE prediction tools in a comprehensive manner. We conducted a systematic benchmark, considering metagenomic read coverage, contig length cutoffs and investigating the performance of the classification algorithms. Our analysis revealed that the metagenomic assembly process is the primary bottleneck when linking ARGs to identified MGEs in short-read metagenomics sequencing experiments rather than ARGs and MGEs identification by the different tools.

Keywords: Bacteria; Bacterial; Computational biology; Drug resistance; Metagenomics; Plasmid.

PubMed Disclaimer

Conflict of interest statement

The authors declare there are no competing interests.

Figures

Figure 1
Figure 1. Workflow of the MetaMobilePicker pipeline.
Colors indicate the different steps/modules in the pipeline. Light blue: preprocessing and assembly. Blue: MGE identification. Dark blue: ARG annotation. Orange: output construction. Software tools used are indicated in brackets. AMR: Antimicrobial resistance. MGE: Mobile Genetic Element. IS: Insertion Sequence. The pipeline is available at http://metamobilepicker.nl. For software references and versions see Table S1.
Figure 2
Figure 2. Annotation plot of bacterial reference genomes in validation dataset.
Blue rectangles: insertion sequences. Orange arrows: antimicrobial resistance genes. (A) Klebsiella pneumoniae chromosome and three plasmids. (B) Acinetobacter baumannii chromosome and plasmid. (C) Enterococcus faecalis chromosome and plasmid. (D) Mycobacterium tuberculosis chromosome. (E) Salmonella enterica chromosome and plasmid. (F) Staphylococcus aureus chromosome. (G) Escherichia coli chromosome and plasmid.
Figure 3
Figure 3. Percentage of reads covering the reference genomes per sampled depth, averaged per species.
Solid lines indicate bacterial origin, dotted lines indicate phage origin.
Figure 4
Figure 4. Circular graph displaying the mapping of antibiotic resistance genes (ARGs) (inner lines) on assembled contigs (light blue inner circle) and reference genomes (dark blue inner circle).
Second circle displays sequence origin and classification. Purple: (true positive) chromosomes. Green: (true positive) plasmids. Blue: (true positive) phages. Red: False positive plasmids. Yellow: false negative plasmids. Orange lines in the outer ring denote the location of the ARGs. Outer lines denote ARGs present more than once in the reference genomes.
Figure 5
Figure 5. Number of false positive plasmid predictions per replicon.

Similar articles

Cited by

References

    1. Albertsen M. Long-read metagenomics paves the way toward a complete microbial tree of life. Nature Methods. 2023;20(1):30–31. doi: 10.1038/s41592-022-01726-6. - DOI - PubMed
    1. Berbers B, Saltykova A, Garcia-Graells C, Philipp P, Arella F, Marchal K, Winand R, Vanneste K, Roosens NHC, De Keersmaecker SCJ. Combining short and long read sequencing to characterize antimicrobial resistance genes on plasmids applied to an unauthorized genetically modified bacillus. Scientific Reports. 2020;10(1):4310. doi: 10.1038/s41598-020-61158-0. - DOI - PMC - PubMed
    1. Bernard G, Pathmanathan JS, Lannes R, Lopez P, Bapteste E. Microbial dark matter investigations: how microbial studies transform biological knowledge and empirically sketch a logic of scientific discovery. Genome Biology and Evolution. 2018;10(3):707–715. doi: 10.1093/gbe/evy031. - DOI - PMC - PubMed
    1. Bindari YR, Moore RJ, Van TTH, Hilliar M, Wu S-B, Walkden-Brown SW, Gerber PF. Microbial communities of poultry house dust, excreta and litter are partially representative of microbiota of chicken caecum and ileum. PLOS ONE. 2021;16(8):e0255633. doi: 10.1371/journal.pone.0255633. - DOI - PMC - PubMed
    1. Borowiak M, Fischer J, Hammerl JA, Hendriksen RS, Szabo I, Malorny B. Identification of a novel transposon-associated phosphoethanolamine transferase gene, Mcr-5, conferring colistin resistance in d-Tartrate fermenting Salmonella enterica Subsp. Enterica Serovar Paratyphi B. Journal of Antimicrobial Chemotherapy. 2017;72(12):3317–3324. doi: 10.1093/jac/dkx327. - DOI - PubMed

Substances

LinkOut - more resources