Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May 26;58(6):e00060-20.
doi: 10.1128/JCM.00060-20. Print 2020 May 26.

A Sample-to-Report Solution for Taxonomic Identification of Cultured Bacteria in the Clinical Setting Based on Nanopore Sequencing

Affiliations

A Sample-to-Report Solution for Taxonomic Identification of Cultured Bacteria in the Clinical Setting Based on Nanopore Sequencing

Stefan Moritz Neuenschwander et al. J Clin Microbiol. .

Abstract

Amplicon sequencing of the 16S rRNA gene is commonly used for the identification of bacterial isolates in diagnostic laboratories and mostly relies on the Sanger sequencing method. The latter, however, suffers from a number of limitations, with the most significant being the inability to resolve mixed amplicons when closely related species are coamplified from a mixed culture. This often leads to either increased turnaround time or absence of usable sequence data. Short-read next-generation sequencing (NGS) technologies could solve the mixed amplicon issue but would lack both cost efficiency at low throughput and fast turnaround times. Nanopore sequencing developed by Oxford Nanopore Technologies (ONT) could solve those issues by enabling a flexible number of samples per run and an adjustable sequencing time. Here, we report on the development of a standardized laboratory workflow combined with a fully automated analysis pipeline LORCAN (long read consensus analysis), which together provide a sample-to-report solution for amplicon sequencing and taxonomic identification of the resulting consensus sequences. Validation of the approach was conducted on a panel of reference strains and on clinical samples consisting of single or mixed rRNA amplicons associated with various bacterial genera by direct comparison to the corresponding Sanger sequences. Additionally, simulated read and amplicon mixtures were used to assess LORCAN's behavior when dealing with samples with known cross-contamination levels. We demonstrate that by combining ONT amplicon sequencing results with LORCAN, the accuracy of Sanger sequencing can be closely matched (>99.6% sequence identity) and that mixed samples can be resolved at the single-base resolution level. The presented approach has the potential to significantly improve the flexibility, reliability, and availability of amplicon sequencing in diagnostic settings.

Keywords: 16S RNA gene; bioinformatics; clinical methods; diagnostics; nanopore; sequencing; taxonomy.

PubMed Disclaimer

Figures

FIG 1
FIG 1
(A) Overview of the wet laboratory workflow. Steps of the LORCAN analysis (B) and corresponding sections of the generated report (C). (Step 1) Demultiplexing and adapter trimming. (Step 2) Read filtering by size. (Step 3) Mapping to a reference database. (Step 4) Read extraction, binning by species, and remapping. (Step 5) Consensus calling. (Step 6) Selection of the closest references by BLAST. (Step 7) Taxonomic tree building.
FIG 2
FIG 2
Taxonomic analysis of amplicon mixtures by LORCAN. Amplicons from Staphylococcus aureus, Enterococcus faecalis, and Pseudomonas aeruginosa mixed after PCR amplification (A) and mixed in silico from reads obtained from pure amplicons (B). Standard deviations indicate the variability across three independent replicate samples. None of the observed ratios was significantly different from the expected ratios (chi-square test for expected probabilities; P > 0.99). (C) In silico mixtures of Mycobacterium gordonae and Mycobacterium avium.
FIG 3
FIG 3
Influence of reference database completeness on consensus sequence accuracy. Each consensus sequence was compared to a consensus sequence produced with a perfectly matching reference sequence. Additionally, each consensus sequence was identified by BLAST similarity search against the full reference database. The uneven spacing of the data points reflects the database composition after subsetting. Missing values are a result of insufficient numbers of reads mapping to the reference database. (A) Filled circles indicate correct taxonomic identification of the ATCC strains. The low identities and unsuccessful identification of Eggerthella lenta are a result of a low-level contamination in combination with unsuccessful mapping of the Eggerthella reads. (B) The diameter of the circles is proportional to the number of reads mapped and further used in the consensus generation step (obtained from the LORCAN output). Additional detail is provided in Table S3 and Fig. S10 in the supplemental material.
FIG 4
FIG 4
Cost estimate based on current list prices in Switzerland (currency CHF, December 2019). Prices for Illumina and Nanopore sequencing include reagents and consumables; prices for Sanger sequencing correspond to the rates at a large local service provider. The lines of MiniSeq and MiSeq v3 are confounded in the figure. Detail is provided in Table S4 in the supplemental material.

References

    1. Maruvada P, Leone V, Kaplan LM, Chang EB. 2017. The human microbiome and obesity: moving beyond associations. Cell Host Microbe 22:589–599. doi:10.1016/j.chom.2017.10.005. - DOI - PubMed
    1. Durban A, Abellan JJ, Jimenez-Hernandez N, Ponce M, Ponce J, Sala T, D’Auria G, Latorre A, Moya A. 2011. Assessing gut microbial diversity from feces and rectal mucosa. Microb Ecol 61:123–133. doi:10.1007/s00248-010-9738-y. - DOI - PubMed
    1. Janda JM, Abbott SL. 2007. 16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: pluses, perils, and pitfalls. J Clin Microbiol 45:2761–2764. doi:10.1128/JCM.01228-07. - DOI - PMC - PubMed
    1. Srinivasan R, Karaoz U, Volegova M, MacKichan J, Kato-Maeda M, Miller S, Nadarajan R, Brodie EL, Lynch SV. 2015. Use of 16S rRNA gene for identification of a broad range of clinically relevant bacterial pathogens. PLoS One 10:e0117617. doi:10.1371/journal.pone.0117617. - DOI - PMC - PubMed
    1. Zhang J, Fang Y, Hou JY, Ren HJ, Jiang R, Roos P, Dovichi NJ. 1995. Use of non-cross-linked polyacrylamide for four-color DNA sequencing by capillary electrophoresis separation of fragments up to 640 bases in length in two hours. Anal Chem 67:4589–4593. doi:10.1021/ac00120a026. - DOI - PubMed

Publication types

Substances