Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 7:5:11996.
doi: 10.1038/srep11996.

Scaffolding of a bacterial genome using MinION nanopore sequencing

Affiliations

Scaffolding of a bacterial genome using MinION nanopore sequencing

E Karlsson et al. Sci Rep. .

Abstract

Second generation sequencing has revolutionized genomic studies. However, most genomes contain repeated DNA elements that are longer than the read lengths achievable with typical sequencers, so the genomic order of several generated contigs cannot be easily resolved. A new generation of sequencers offering substantially longer reads is emerging, notably the Pacific Biosciences (PacBio) RS II system and the MinION system, released in early 2014 by Oxford Nanopore Technologies through an early access program. The latter has highly advantageous portability and sequences samples by measuring changes in ionic current when single-stranded DNA molecules are translocated through nanopores. We show that the MinION system produces long reads with high mapability that can be used for scaffolding bacterial genomes, despite currently producing substantially higher error rates than PacBio reads. With further development we anticipate that MinION will be useful not only for assembling genomes, but also for rapid detection of organisms, potentially in the field.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Quality of MinION (R7.3) and PacBio sequencing reads.
(a) Length distribution of the reads. MinION reads are divided into three length categories that are coloured separately. Note that the high number of MinION reads of about 3.5 kb originate from the ligation control fragment. (b) Mapability of PacBio and MinION reads divided into the same length categories as in (a). Read alignment length is the fraction of the reads covered in the BLAST alignment against the reference genome. (c) Mean frequencies of deletion, insertion and substitution errors per nucleotide per read for MinION sequence reads in four genomic regions: whole genome (32% GC), high GC-content regions (47.8% GC), A/T and G/C monomers at least 5 bp long. (d) Consensus accuracy versus average read coverage of the genome. Different coverages were obtained by subsampling the reads from a single MinION run.
Figure 2
Figure 2. Overview of the FSC996 reference genome and the MinION data.
The outermost track represents the reference genome with genomic positions (in kb). The blue lines provide a visual representation of every MinION read longer than 2 kb used in the analysis and its genomic alignment position (tile layers limited to 100). The black lines show the contigs generated from the assembly of Illumina Hiseq reads and their alignment positions. The inner circle shows a heat map of the GC content of the reference genome, with the brightest colour representing 20% GC and the darkest 40%.

References

    1. Ross M. G. et al. Characterizing and measuring bias in sequence data. Genome Biol 14, R51, 10.1186/gb-2013-14-5-r51 (2013). - DOI - PMC - PubMed
    1. Loman N. J. et al. Performance comparison of benchtop high-throughput sequencing platforms. Nat Biotechnol 30, 434–439, 10.1038/nbt.2198 (2012). - DOI - PubMed
    1. Sjödin A. et al. Genome characterisation of the genus Francisella reveals insight into similar evolutionary paths in pathogens of mammals and fish. BMC Genomics 13, 268, 10.1186/1471-2164-13-268 (2012). - DOI - PMC - PubMed
    1. Eid J. et al. Real-time DNA sequencing from single polymerase molecules. Science 323, 133–138, 10.1126/science.1162986 (2009). - DOI - PubMed
    1. Travers K. J., Chin C. S., Rank D. R., Eid J. S. & Turner S. W. A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucleic Acids Res 38, e159, 10.1093/nar/gkq543 (2010). - DOI - PMC - PubMed

Publication types