Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun;38(6):701-707.
doi: 10.1038/s41587-020-0422-6. Epub 2020 Feb 10.

Complete, closed bacterial genomes from microbiomes using nanopore sequencing

Affiliations

Complete, closed bacterial genomes from microbiomes using nanopore sequencing

Eli L Moss et al. Nat Biotechnol. 2020 Jun.

Abstract

Microbial genomes can be assembled from short-read sequencing data, but the assembly contiguity of these metagenome-assembled genomes is constrained by repeat elements. Correct assignment of genomic positions of repeats is crucial for understanding the effect of genome structure on genome function. We applied nanopore sequencing and our workflow, named Lathe, which incorporates long-read assembly and short-read error correction, to assemble closed bacterial genomes from complex microbiomes. We validated our approach with a synthetic mixture of 12 bacterial species. Seven genomes were completely assembled into single contigs and three genomes were assembled into four or fewer contigs. Next, we used our methods to analyze metagenomics data from 13 human stool samples. We assembled 20 circular genomes, including genomes of Prevotella copri and a candidate Cibiobacter sp. Despite the decreased nucleotide accuracy compared with alternative sequencing and assembly approaches, our methods improved assembly contiguity, allowing for investigation of the role of repeat elements in microbial function and adaptation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Taxonomic read composition, per-organism read-length distributions and genome assemblies in a defined 12-species bacterial mixture.
a, Relative read counts are shown for the expected equal composition of bacterial cells and the observed composition, with correction for relative genome size. b, Read-length distributions per organism. Individual organisms demonstrate varying read-length distributions in some cases. c, Circos plots demonstrate the relative assembly contiguity of the nanopore versus short-read assembly approaches. Nanopore sequencing and assembly (colored outer ring) outperforms short-read assembly (black inner ring), producing complete genome assemblies (small black inner dots) in seven of 12 cases, with a further three assembled in four contigs or fewer. Numbers indicate genome size in megabases. Note that complete assemblies may contain one apparent break due to differing linearization breakpoints in reference and assembly sequences.
Fig. 2
Fig. 2. Per-organism assembly contiguity, diversity and taxonomic read composition in two healthy human stool microbiomes.
a, Species-level Shannon diversity is shown for the sequence datasets obtained. Higher diversity is found in libraries prepared with the present DNA extraction method. Relative species-level abundances are shown for a conventional workflow consisting of bead-beating extraction and short-read sequencing, as well as the present workflow consisting of HMW DNA extraction and long-read sequencing. b, Contiguity is expressed as per-bin N50 divided by per-bin length (the total length of sequences assigned to the bin). As bin assembly approaches completion, the quantity N50 divided by bin length approaches one, regardless of genome size. Nanopore sequencing and assembly (blue, purple) demonstrates higher assembly contiguity than read-cloud (gold) and short-read (green) approaches. For all organisms achieving assembly N50 of at least 500 kbp or a complete genome draft by any approach, genome draft quality and contiguity are shown for long reads, read clouds and short reads. Shapes indicate draft quality. Asterisk marks a genome later annotated as putative Cibiobacter.
Fig. 3
Fig. 3. Circos diagrams of closed, circular genomes of P. copri and Cibiobacter sp.
a, P. copri from P2-A. b, Cibiobacter sp. from P1. In both plots, the outermost ring represents the complete, closed and circularized genome of the given organism. The middle and inner rings represent contigs from the corresponding read-cloud and short-read assemblies, respectively, that were mapped to the nanopore assembly. The inner track in each case displays annotated, predicted mobile genetic elements such as insertion sequences (ISs), transposases and prophage.

References

    1. Bowers RM, et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat. Biotechnol. 2017;35:725–731. doi: 10.1038/nbt.3893. - DOI - PMC - PubMed
    1. Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165. doi: 10.7717/peerj.1165. - DOI - PMC - PubMed
    1. Forster SC, et al. A human gut bacterial genome and culture collection for improved metagenomic analyses. Nat. Biotechnol. 2019;37:186–192. doi: 10.1038/s41587-018-0009-7. - DOI - PMC - PubMed
    1. Nayfach S, Shi ZJ, Seshadri R, Pollard KS, Kyrpides N. Novel insights from uncultivated genomes of the global human gut microbiome. Nature. 2019;568:505–510. doi: 10.1038/s41586-019-1058-x. - DOI - PMC - PubMed
    1. Bishara A, et al. High-quality genome sequences of uncultured microbes by assembly of read clouds. Nat. Biotechnol. 2018;36:1067–1075. doi: 10.1038/nbt.4266. - DOI - PMC - PubMed

Publication types