Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep;8(9):mgen000849.
doi: 10.1099/mgen.0.000849.

Ribosomal MLST nucleotide identity (rMLST-NI), a rapid bacterial species identification method: application to Klebsiella and Raoultella genomic species validation

Affiliations

Ribosomal MLST nucleotide identity (rMLST-NI), a rapid bacterial species identification method: application to Klebsiella and Raoultella genomic species validation

James E Bray et al. Microb Genom. 2022 Sep.

Abstract

Bacterial genomics is making an increasing contribution to the fields of medicine and public health microbiology. Consequently, accurate species identification of bacterial genomes is an important task, particularly as the number of genomes stored in online databases increases rapidly and new species are frequently discovered. Existing database entries require regular re-evaluation to ensure that species annotations are consistent with the latest species definitions. We have developed an automated method for bacterial species identification that is an extension of ribosomal multilocus sequence typing (rMLST). The method calculates an 'rMLST nucleotide identity' (rMLST-NI) based on the nucleotides present in the protein-encoding ribosomal genes derived from bacterial genomes. rMLST-NI was used to validate the species annotations of 11839 publicly available Klebsiella and Raoultella genomes based on a comparison with a library of type strain genomes. rMLST-NI was compared with two whole-genome average nucleotide identity methods (OrthoANIu and FastANI) and the k-mer based Kleborate software. The results of the four methods agreed across a dataset of 11839 bacterial genomes and identified a small number of entries (n=89) with species annotations that required updating. The rMLST-NI method was 3.5 times faster than Kleborate, 4.5 times faster than FastANI and 1600 times faster than OrthoANIu. rMLST-NI represents a fast and generic method for species identification using type strains as a reference.

Keywords: Klebsiella; Raoultella; bacterial genomes; bacterial species identification; nucleotide identity; rMLST.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
Flowchart describing the stages required to scan a query genome against a library of rMLST alleles and calculate the rMLST nucleotide identity (rMLST-NI) and profile coverage values. These stages are implemented in the MLSS.pl program (Multilocus Sequence Search).
Fig. 2.
Fig. 2.
The rMLST nucleotide identity (rMLST-NI) for a query genome against each profile is calculated by extracting blast match information from each genome/allelic pairwise comparison (number of identical bases matched and the number of aligned bases). The rMLST-NI is defined as the total number of identical bases across all alleles in the profile divided by the sum of all aligned profile allele bases (expressed as a percentage). The profile coverage is similarly defined using the total number of aligned bases across all allelic matches. [··] indicates profile alleles and match information that have been removed. rMLST profiles typically contain allele indices for 53 loci. Allele indices and lengths shown are for illustrative purposes and do not represent the actual values.
Fig. 3.
Fig. 3.
Summary of 89 inconsistent species annotations where the NCBI Assembly database has a different species than the consensus result obtained from the four automated approaches used in this study (OrthoANIu, FastANI, rMLST-NI and Kleborate). The left-hand side represents the original species annotations from NCBI Assembly and the right-hand side represents the predicted species.
Fig. 4.
Fig. 4.
Phylogenetic tree of 30 Klebsiella/Raoultella isolates including 17 type strains and 13 non-type strain isolates where the NCBI Assembly species annotation was not consistent with the automated species identification scan results. Each isolate is annotated with (1) database identifier, (2) isolate name, (3) species (T indicates type strain), (4) species in source database (shown in square brackets if different). This tree was constructed in mega X (www.megasoftware.net) using the maximum-likelihood method and Tamura–Nei model (1000 replicates) from a multiple sequence alignment of the concatenated alleles of 51 non-paralogous rMLST loci generated by BIGSdb.
Fig. 5.
Fig. 5.
rMLST nucleotide identity webserver results for scanning UCI 27, (NCBI Assembly entry GCA_000534255.1, Klebsiella aerogenes ). The top five species are displayed in the results table and are colour coded according to the traffic light system of thresholds observed from the analysis of scan results from the 11750 dataset.

Similar articles

Cited by

References

    1. Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A. 2009;106:19126–19131. doi: 10.1073/pnas.0906412106. - DOI - PMC - PubMed
    1. Chun J, Rainey FA. Integrating genomics into the taxonomy and systematics of the Bacteria and Archaea. Int J Syst Evol Microbiol. 2014;64:316–324. doi: 10.1099/ijs.0.054171-0. - DOI - PubMed
    1. Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9:5114. doi: 10.1038/s41467-018-07641-9. - DOI - PMC - PubMed
    1. Chun J, Oren A, Ventosa A, Christensen H, Arahal DR, et al. Proposed minimal standards for the use of genome data for the taxonomy of prokaryotes. Int J Syst Evol Microbiol. 2018;68:461–466. doi: 10.1099/ijsem.0.002516. - DOI - PubMed
    1. Mende DR, Sunagawa S, Zeller G, Bork P. Accurate and universal delineation of prokaryotic species. Nat Methods. 2013;10:881–884. doi: 10.1038/nmeth.2575. - DOI - PubMed

Publication types

LinkOut - more resources