Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 17:17:647.
doi: 10.1186/s12864-016-2991-9.

Gene content dissimilarity for subclassification of highly similar microbial strains

Affiliations

Gene content dissimilarity for subclassification of highly similar microbial strains

Qichao Tu et al. BMC Genomics. .

Abstract

Background: Identification and classification of highly similar microbial strains is a challenging issue in microbiology, ecology and evolutionary biology. Among various available approaches, gene content analysis is also at the core of microbial taxonomy. However, no threshold has been determined for grouping microorgnisms to different taxonomic levels, and it is still not clear that to what extent genomic fluidity should occur to form a microbial taxonomic group.

Results: By taking advantage of the eggNOG database for orthologous groups, we calculated gene content dissimilarity among different microbial strains based on the orthologous gene profiles and tested the possibility of applying gene content dissimilarity as a quantitative index in classifying microbial taxonomic groups, as well as its potential application in subclassification of highly similar microbial strains. Evaluation of gene content dissimilarity to completed microbial genomes at different taxonomic levels suggested that cutoffs of 0.2 and 0.4 can be respectively used for species and family delineation, and that 0.2 gene content dissimilarity cutoff approximately corresponded to 98 % 16S rRNA gene identity and 94 % ANI for microbial species delineation. Furthermore, application of gene content dissimilarity to highly similar microbial strains suggested it as an effective approach in classifying closely related microorganisms into subgroups.

Conclusions: This approach is especially useful in identifying pathogens from commensals in clinical microbiology. It also provides novel insights into how genomic fluidity is linked with microbial taxonomy.

Keywords: Gene content dissimilarity; Genomic fluidity; Highly similar strains; Microbial subclassification.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The flowchart of applying gene content dissimilarity for microbial delineation and classification. Three main steps were included. First, orthologous gene profiles were obtained for all selected microbial genomes by searching against the eggNOG database. Second, pairwise gene content dissimilarity as measured by Bray-Curtis dissimilarity was calculated for all pairs of microbial strains. Third, microbial strains were clustered into different groups
Fig. 2
Fig. 2
Distribution of gene content dissimilarity for the retrieved microbial genomes at different taxonomic levels, including species (a), genus (b), family (c), and order (d). Cutoffs of 0.2 and 0.4 were recommended for microbial species and family delineation, respectively
Fig. 3
Fig. 3
Comparison of 16S rRNA gene identity, ANI, and gene content dissimilarity in microbial species delineation. A cutoff of 0.2 corresponded to 98 % 16S rRNA gene identity and 94 % ANI in species delineation. A total of 5008 intra-species and 8642 intra-genus comparisons were plotted. Red dots falling in the Q1 quadrant were mostly several clostridium strains, for which misclassification may have occurred. Red dots represented intra-species comparisons, and blue dots indicated intra-genus comparisons
Fig. 4
Fig. 4
Application of gene content dissimilarity in classifying microbial strains belonging to Enterobacteriaceae. a PCoA clustering of all selected microbial strains belonging to Enterobacteriaceae. b PCoA clustering of highly similar microbial strains including E. coli and Shigella. A clear separation of Shigella and E. coli O157:H7 from other E. coli strains could be observed
Fig. 5
Fig. 5
Application of gene content dissimilarity in classifying Streptococcus strains. Clear separation of different species into different groups could be observed. Highly similar strains belonging to S. mitis, S. oralis, and S. pneumoniae were also well separated

Similar articles

Cited by

References

    1. Rodriguez-R LM, Konstantinidis KT. Bypassing cultivation to identify bacterial species. Microbe. 2014;9(3):111–8.
    1. Gevers D, Cohan FM, Lawrence JG, Spratt BG, Coenye T, Feil EJ, Stackebrandt E, de Peer YV, Vandamme P, Thompson FL, et al. Re-evaluating prokaryotic species. Nat Rev Micro. 2005;3(9):733–9. doi: 10.1038/nrmicro1236. - DOI - PubMed
    1. Achtman M, Wagner M. Microbial diversity and the genetic nature of microbial species. Nat Rev Microbiol. 2008;6(6):431–40. - PubMed
    1. Janda JM, Abbott SL. 16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: pluses, perils, and pitfalls. J Clin Microbiol. 2007;45(9):2761–4. doi: 10.1128/JCM.01228-07. - DOI - PMC - PubMed
    1. STACKEBRANDT E, GOEBEL BM. Taxonomic note: a place for DNA-DNA reassociation and 16S rRNA sequence analysis in the present species definition in bacteriology. Int J Syst Evol Microbiol. 1994;44(4):846–9. doi: 10.1099/00207713-44-4-846. - DOI

Publication types

Substances

LinkOut - more resources