Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan;16(1):458-471.
doi: 10.1038/s41596-020-00424-x. Epub 2020 Dec 4.

Improved high-molecular-weight DNA extraction, nanopore sequencing and metagenomic assembly from the human gut microbiome

Affiliations

Improved high-molecular-weight DNA extraction, nanopore sequencing and metagenomic assembly from the human gut microbiome

Dylan G Maghini et al. Nat Protoc. 2021 Jan.

Abstract

Short-read metagenomic sequencing and de novo genome assembly of the human gut microbiome can yield draft bacterial genomes without isolation and culture. However, bacterial genomes assembled from short-read sequencing are often fragmented. Furthermore, these metagenome-assembled genomes often exclude repeated genomic elements, such as mobile genetic elements, compromising our understanding of the contribution of these elements to important bacterial phenotypes. Although long-read sequencing has been applied successfully to the assembly of contiguous bacterial isolate genomes, extraction of DNA of sufficient molecular weight, purity and quantity for metagenomic sequencing from stool samples can be challenging. Here, we present a protocol for the extraction of microgram quantities of high-molecular-weight DNA from human stool samples that are suitable for downstream long-read sequencing applications. We also present Lathe ( www.github.com/bhattlab/lathe ), a computational workflow for long-read basecalling, assembly, consensus refinement with long reads or Illumina short reads and genome circularization. Altogether, this protocol can yield high-quality contiguous or circular bacterial genomes from a complex human gut sample in approximately 10 d, with 2 d of hands-on bench and computational effort.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare no competing interests.

Figures

Fig. 1 |
Fig. 1 |. High-molecular-weight DNA extraction workflow.
Frozen stool is best divided into aliquots by using a biopsy punch with a plunger, taking extreme caution with the blade. After dividing the stool into aliquots, the DNA extraction workflow proceeds with sample suspension, enzymatic and chemical lysis and DNA purification. The Qiagen Genomic-tip is a column-based purification method designed to minimize shearing of DNA. Allow all reagents to flow through the Genomic-tip column by gravity; do not use a syringe or vacuum to pull liquid through, as this will destroy the column. After DNA has been extracted, large fragments can then be selected for using SPRI beads prepared in a custom buffer. Extracted DNA should be assessed for concentration, contamination and size distribution by using a Qubit fluorometer, NanoDrop spectrophotometer and Agilent TapeStation, respectively (not depicted). After DNA has been extracted and meets quality thresholds, it can be carried through library preparation protocols for nanopore sequencing. EtOH, ethanol.
Fig. 2 |
Fig. 2 |. Post-sequencing bioinformatic workflow.
After sequencing, the computational assembly workflow (Step 25) is used to perform basecalling of raw nanopore signal before performing assembly by using either Flye or Canu assemblers. The workflow then provides optional polishing steps by aligning short or long reads back to the assembly and correcting single nucleotide errors and indels (green) with consensus bases (orange) from the aligned reads. Finally, the workflow is used to perform circularization steps through self-alignment and trimming or assembly of endpoint contigs.

References

    1. Pasolli E et al. Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle. Cell 176, 649–662.e20 (2019). - PMC - PubMed
    1. Almeida A et al. A new genomic blueprint of the human gut microbiota. Nature 568, 499–504 (2019). - PMC - PubMed
    1. Nayfach S, Shi ZJ, Seshadri R, Pollard KS & Kyrpides NC New insights from uncultivated genomes of the global human gut microbiome. Nature 568, 505–510 (2019). - PMC - PubMed
    1. Almeida A et al. A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome. Nat. Biotechnol Forthcoming; (2020).
    1. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P & Tyson GW CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015). - PMC - PubMed

Publication types