Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 2;5(1):34.
doi: 10.1186/s13742-016-0140-7.

INC-Seq: accurate single molecule reads using nanopore sequencing

Affiliations

INC-Seq: accurate single molecule reads using nanopore sequencing

Chenhao Li et al. Gigascience. .

Abstract

Background: Nanopore sequencing provides a rapid, cheap and portable real-time sequencing platform with the potential to revolutionize genomics. However, several applications are limited by relatively high single-read error rates (>10 %), including RNA-seq, haplotype sequencing and 16S sequencing.

Results: We developed the Intramolecular-ligated Nanopore Consensus Sequencing (INC-Seq) as a strategy for obtaining long and accurate nanopore reads, starting with low input DNA. Applying INC-Seq for 16S rRNA-based bacterial profiling generated full-length amplicon sequences with a median accuracy >97 %.

Conclusions: INC-Seq reads enabled accurate species-level classification, identification of species at 0.1 % abundance and robust quantification of relative abundances, providing a cheap and effective approach for pathogen detection and microbiome profiling on the MinION system.

Keywords: Barcode sequencing; Consensus algorithms; Nanopore sequencing; Rolling circle amplification.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Overview of the INC-Seq workflow. a Template molecules are circularized in optimized conditions, and the remaining linear molecules are removed. The circular products are amplified with RCA and sequenced on the MinION platform. b A subsequence from the raw nanopore reads is used as an anchor to scan the entire read for the location of repeating units. The repeating units flanked by adjacent anchor starting points are aligned, and a consensus sequence is constructed. c In INC-Seq library preparation, chimeras are expected through intermolecular ligation and template switching. Chimeras from template switching are likely to be detected by the anchor mapping protocol. Chimeras from intermolecular ligation were observed to be rare under the experimental conditions used in INC-Seq
Fig. 2
Fig. 2
INC-Seq evaluation with simulated reads. a 100 representative reference sequences were selected from a customized database. Simulated reads were generated for each reference sequence, as well as artificial RCA products (that is, INC-Seq libraries). Synthetic INC-Seq reads significantly improved the proportion of correctly mapped reads over raw ONT 2D reads at both species and reference level (p-value <10-9; one-sided paired Wilxcon test). b Species-level classification F1 score (that is, the harmonic mean of precision and recall) using simulated raw, CANU corrected and INC-Seq reads from 100 reference sequences and with different identity thresholds for classification (each curve represents the average across 10 replicates)
Fig. 3
Fig. 3
INC-Seq produces long and accurate reads. a The ratio between the length of INC-Seq corrected reads and the reference sequence is tightly distributed around 1 (only reads with length from 600-800 bp are shown here). b, c INC-Seq boosts overall read accuracy and significantly reduces mismatch, insertion and deletion error rates (one-sided Wilcoxon test p-value <10-15 in all cases). d Accuracy of INC-Seq sequences increases with the number of segments used for consensus construction
Fig. 4
Fig. 4
Species detection using INC-Seq. a Ten species were selected to construct an artificial community. Some species have highly similar 16S sequences (for example, S. aureus and S. epidermidis share 99 % identity). b Two separate INC-Seq runs produce consistent results that are well correlated with defined abundances (Pearson ρ = 0.83, p-value = 0.003; Spearman ρ = 0.98, p-value <10-15)

References

    1. Ashton PM, Nair S, Dallman T, Rubino S, Rabsch W, Mwaigwisya S, et al. MinION nanopore sequencing identifies the position and structure of a bacterial antibiotic resistance island. Nat Biotechnol. 2014;33:296–300. doi: 10.1038/nbt.3103. - DOI - PubMed
    1. Quick J, Loman NJ, Duraffour S, Simpson JT, Severi E, Cowley L, et al. Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016;530:228–32. doi: 10.1038/nature16996. - DOI - PMC - PubMed
    1. Bolisetty MT, Rajadinakaran G, Graveley BR. Determining exon connectivity in complex mRNAs by nanopore sequencing. Genome Biol. 2015;16:204. doi: 10.1186/s13059-015-0777-z. - DOI - PMC - PubMed
    1. Ammar R, Paton TA, Torti D, Shlien A, Bader GD. Long read nanopore sequencing for detection of HLA and CYP2D6 variants and haplotypes. F1000Res. 2015;4:17. - PMC - PubMed
    1. Benítez-Páez A, Portune KJ, Sanz Y. Species-level resolution of 16S rRNA gene amplicons sequenced through the MinIONTM portable nanopore sequencer. Gigascience. 2016;5:4. - PMC - PubMed

Publication types