Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun;19(6):671-674.
doi: 10.1038/s41592-022-01478-3. Epub 2022 May 9.

Metagenome assembly of high-fidelity long reads with hifiasm-meta

Affiliations

Metagenome assembly of high-fidelity long reads with hifiasm-meta

Xiaowen Feng et al. Nat Methods. 2022 Jun.

Abstract

De novo assembly of metagenome samples is a common approach to the study of microbial communities. Current metagenome assemblers developed for short sequence reads or noisy long reads were not optimized for accurate long reads. We thus developed hifiasm-meta, a metagenome assembler that exploits the high accuracy of recent data. Evaluated on seven empirical datasets, hifiasm-meta reconstructed tens to hundreds of complete circular bacterial genomes per dataset, consistently outperforming other metagenome assemblers.

PubMed Disclaimer

Conflict of interest statement

Competing Interests Statement

D.P. is an employee of Pacific Biosciences. H.L. is a consultant of Integrated DNA Technologies and on the Scientific Advisory Boards of Sentieon and Innozeen. The remaining authors declare no competing interests.

Figures

Extended Data Figure 1.
Extended Data Figure 1.
The hifiasm-meta assembly graph of the sheepA dataset. Short disconnected contigs are not shown.
Extended Data Figure 2.
Extended Data Figure 2.
Yak QV score correlated with contig coverage. Plots shwoing >1Mb contigs in sheepB assemblies. Contig coverage was estimated by jgi_summarize_bam_contig_depths from metabat2, and alignment was done with minimap2 -a -k 19 -w 10 -I 10G -g 5000 -r 2000 –lj-min-ratio 0.5 -A 2 -B 5 -O 5,56 -E 4,1 -z 400,50. Hifiasm-meta assembled 37 contigs without k-mer errors. HiCanu and metaFlye each assembled 4 contigs without k-mer errors.
Figure 1.
Figure 1.
Metagenome assemblies of empirical datasets. (a) Quality score of contigs longer than 300kb from the hifiasm-meta assembly of sheepA. The quality score of a MAG is defined as “completeness −5 × contamination” based on CheckM reports. (b) CheckM evaluation. A MAG is “near-complete” if its CheckM completeness is ≥90% and its contamination level ≤5%, is “high-quality” if completeness ≥70% and contamination ≤10%, or is “medium-quality” if its quality score is ≥50%. “HumanPooled” represents the co-assembly of all four human gut samples. (c) Composition of long contigs in the hifiasm-meta co-assembly of four human gut samples. Each bar represents a contig of ≥1Mb in length. It gives the number of reads used in the contig. Each color corresponds to a human gut sample. A cross at the top of a bar indicates the contig being circular. (d) Phylogeny of human gut MAGs from the co-assembly. A colored clade corresponds to a phylum inferred by GTDB-Tk,. A MAG is omnivore/vegan-specific if 90% of reads in the MAG come from omnivore/vegan samples, respectively. Two adjacent leaves coming in the same genus have the same shade on the “Genus” outer ring.

References

    1. Lapidus AL & Korobeynikov AI. Metagenomic data assembly - the way of decoding unknown microorganisms. Front Microbiol 12, 613791 (2021). - PMC - PubMed
    1. Almeida A. et al. A new genomic blueprint of the human gut microbiota. Nature 568, 499–504 (2019). - PMC - PubMed
    1. Chen L-X, Anantharaman K, Shaiber A, Eren AM. & Banfield JF. Accurate and complete genomes from metagenomes. Genome Res 30, 315–333 (2020). - PMC - PubMed
    1. Kang DD. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019). - PMC - PubMed
    1. Kolmogorov M. et al. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat. Methods 17, 1103–1110 (2020). - PMC - PubMed

Publication types