Accurate and reproducible whole-genome genotyping for bacterial genomic surveillance with Nanopore sequencing data
- PMID: 40511924
- PMCID: PMC12239720
- DOI: 10.1128/jcm.00369-25
Accurate and reproducible whole-genome genotyping for bacterial genomic surveillance with Nanopore sequencing data
Abstract
Despite recent advances in error rate reduction, until recently, Oxford Nanopore Technologies (ONT) sequences lacked the accuracy required for fine-scale bacterial genomic analysis. Here, recent software improvements of ONT and the ONT-core-genome multilocus sequence typing (cgMLST)-Polisher within the SeqSphere+ software were evaluated. We used short-read (Illumina) and long-read ONT sequences of 80 multidrug-resistant organisms (MDROs) for benchmarking. Illumina reads were de novo assembled using SKESA. For ONT, Dorado Super Accurate (SUP) model v.4.3 or v.5.0 basecalled reads were assembled with Flye and then polished with Medaka v.1.12 m4.3 or Medaka v.2.0 bacterial methylation model. In addition, the ONT-cgMLST-Polisher was run over all assemblies. The "ground truth" (GT) hybrid assemblies were created using Hybracter v.0.10.0. Sixteen isolates from four species out of the original 80 isolates were sent to six laboratories for a ring trial. The 80 MDROs basecalled with SUP m4.3 had an average cgMLST allele distance (AD) to the GT of 4.94 with Medaka v.1.12 and 1.78 with Medaka v.2.0, respectively. After further polishing the Medaka v.2.0 data with the ONT-cgMLST-Polisher, the AD dropped to 0.09. Using data basecalled with SUP m5.0 with Medaka v.2.0 further reduced the AD significantly to 0.04. While the ring trial data basecalled with Dorado SUP m4.3 showed more variability and insufficient results for some samples, model 5.0 data resulted in average ADs of 0.36 and 0.17 without and with the ONT-cgMLST-Polisher, respectively. In conclusion, recent ONT Dorado and Medaka models combined with the ONT-cgMLST-Polisher improved ONT sequencing accuracy and made it sufficiently reproducible for genomic surveillance of bacteria.IMPORTANCEONT sequencing methodology is especially attractive for small and medium-sized laboratories due to its relatively low capital investment and price per sample consumable costs. However, until recently, it lacked accuracy and reproducibility for bacterial genomic genotyping. Here, we present an evaluation of the most recent ONT bioinformatic (basecalling and polishing of consensus) improvements and a new ONT-cgMLST-Polisher tool. We demonstrate that by applying those procedures, ONT whole-genome genotyping-based surveillance of bacteria is finally accurate and reproducible enough for routine application even in small laboratories.
Keywords: Nanopore sequencing; error rate; polishing; ring trial.
Conflict of interest statement
The authors declare no conflict of interest.
References
-
- Sheppard AE, Stoesser N, Wilson DJ, Sebra R, Kasarskis A, Anson LW, Giess A, Pankhurst LJ, Vaughan A, Grim CJ, Cox HL, Yeh AJ, Sifri CD, Walker AS, Peto TE, Crook DW, Mathers AJ, the Modernising Medical Microbiology (MMM) Informatics Group . 2016. Nested Russian doll-like genetic mobility drives rapid dissemination of the carbapenem resistance gene bla KPC . Antimicrob Agents Chemother 60:3767–3778. doi: 10.1128/AAC.00464-16 - DOI - PMC - PubMed
-
- Sereika M, Kirkegaard RH, Karst SM, Michaelsen TY, Sørensen EA, Wollenberg RD, Albertsen M. 2022. Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing. Nat Methods 19:823–826. doi: 10.1038/s41592-022-01539-7 - DOI - PMC - PubMed
-
- Landman F, Jamin C, de Haan A, Witteveen S, Bos J, van der Heide HGJ, Schouls LM, Hendrickx APA, Dutch CPE/MRSA surveillance study group . 2024. Genomic surveillance of multidrug-resistant organisms based on long-read sequencing. Genome Med 16:137. doi: 10.1186/s13073-024-01412-6 - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous
