Enhancing Long-Read-Based Strain-Aware Metagenome Assembly
- PMID: 35646097
- PMCID: PMC9136235
- DOI: 10.3389/fgene.2022.868280
Enhancing Long-Read-Based Strain-Aware Metagenome Assembly
Abstract
Microbial communities are usually highly diverse and often involve multiple strains from the participating species due to the rapid evolution of microorganisms. In such a complex microecosystem, different strains may show different biological functions. While reconstruction of individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has largely remained unresolved so far. Next-generation sequencing has been routinely used in metagenome assembly but there have been struggles to generate strain-specific genome sequences due to the short-read length. This explains why long-read sequencing technologies have recently provided unprecedented opportunities to carry out haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, as two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that either the MetaBooster or the MetaBooster-HiFi pipeline drastically outperforms the state-of-the-art de novo metagenome assemblers, in terms of all relevant metagenome assembly criteria, involving genome fraction, contig length, and error rates.
Keywords: genome assembly; haplotype; long reads; metagenome; strain.
Copyright © 2022 Luo, Kang and Schönhuth.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures

References
-
- Baaijens J. A., Stougie L., Schönhuth A. (2020). Strain-aware Assembly of Genomes from Mixed Samples Using Flow Variation Graphs. RECOMB, 221–222. 10.1007/978-3-030-45257-5_14 - DOI
-
- Bonanno L., Loukiadis E., Mariani-Kurkdjian P., Oswald E., Garnier L., Michel V., et al. (2015). Diversity of Shiga Toxin-Producing escherichia Coli (Stec) O26: H11 Strains Examined via Stx Subtypes and Insertion Sites of Stx and Espk Bacteriophages. Appl. Environ. Microbiol. 81, 3712–3721. 10.1128/aem.00077-15 - DOI - PMC - PubMed
-
- Burger R. (2012). Ehec o104: H4 in germany 2011: Large outbreak of bloody diarrhea and haemolytic uraemic syndrome by shiga toxin-producing e. coli via contaminated food
LinkOut - more resources
Full Text Sources