Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan 19:6:1573.
doi: 10.3389/fmicb.2015.01573. eCollection 2015.

Phylogenetic Analyses of Shigella and Enteroinvasive Escherichia coli for the Identification of Molecular Epidemiological Markers: Whole-Genome Comparative Analysis Does Not Support Distinct Genera Designation

Affiliations

Phylogenetic Analyses of Shigella and Enteroinvasive Escherichia coli for the Identification of Molecular Epidemiological Markers: Whole-Genome Comparative Analysis Does Not Support Distinct Genera Designation

Emily A Pettengill et al. Front Microbiol. .

Erratum in

Abstract

As a leading cause of bacterial dysentery, Shigella represents a significant threat to public health and food safety. Related, but often overlooked, enteroinvasive Escherichia coli (EIEC) can also cause dysentery. Current typing methods have limited ability to identify and differentiate between these pathogens despite the need for rapid and accurate identification of pathogens for clinical treatment and outbreak response. We present a comprehensive phylogeny of Shigella and EIEC using whole genome sequencing of 169 samples, constituting unparalleled strain diversity, and observe a lack of monophyly between Shigella and EIEC and among Shigella taxonomic groups. The evolutionary relationships in the phylogeny are supported by analyses of population structure and hierarchical clustering patterns of translated gene homolog abundance. Lastly, we identified a panel of 404 single nucleotide polymorphism (SNP) markers specific to each phylogenetic cluster for more accurate identification of Shigella and EIEC. Our findings show that Shigella and EIEC are not distinct evolutionary groups within the E. coli genus and, thus, EIEC as a group is not the ancestor to Shigella. The multiple analyses presented provide evidence for reconsidering the taxonomic placement of Shigella. The SNP markers offer more discriminatory power to molecular epidemiological typing methods involving these bacterial pathogens.

Keywords: Shigella; classification; enteroinvasive E. coli (EIEC); epidemiological markers; phylogeny; whole genome sequencing.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
A maximum-likelihood (ML) phylogeny of Shigella, enteroinvasive Escherichia coli (EIEC) and non-invasive E. coli strains based on 7,062 core SNPs using kSNP (Gardner and Hall, 2013). The ML tree was generated using GARLI v. 2.0.1019 under the GTR + I + Γ model and other default settings (Zwickl, 2006). Trees were visualized with Figtree v. 1.3.1 (Rambaut and Drummond, 2009). The best tree was chosen from 1,000 runs of the data set and bootstrap values (1,000 iterations) are reported above each node. Bootstrap values <80% are not shown. A tree that includes the Salmonella outgroup can be found in Supplementary Figure S1.
FIGURE 2
FIGURE 2
DISTRUCT diagrams showing clustering of Shigella, EIEC and non-enteroinvasive E. coli genomes derived from the STRUCTURE analyses with the core SNP matrix: (A) six genetic groups were determined to be the best fitting number of groups by STRUCTURE HARVESTER program and (B) the 11 groups identified from the phylogeny. Each bar represents a single genome and the color represents the proportion of SNPs that represent a cluster. Color assignment was random and does not coordinate with phylogenetic clusters in Figure 1.
FIGURE 3
FIGURE 3
Hierarchical clustering and heat map illustrating the differences in predicted protein homologs between genomes. Manhattan distances were calculated from a pairwise abundance matrix of 3,777 predicted protein homologs that were identified using the default BLASTP bidirectional best hit approach (75% amino acid sequence coverage, 1e-05 E-value and 60% sequence identity) within the program GET_HOMOLOGUES (Contreras-Moreira and Vinuesa, 2013). Only genes shared by at least two samples were included. Blue cells on the heat map indicate that genomes share more similar genes. The dendrogram on y-axis indicates hierarchical clustering of the abundance matrix using the average linkage method and Manhattan distances with bootstrap probabilities (BP, only values of ≥80 shown in black) and approximately unbiased p-values (AU, only values of ≥95 shown in red) from 10,000 replicates. The phylogenetic group of each genome from Figure 1 is represented as a colored bar in between the dendrogram and the heat map.

References

    1. Bankevich A., Nurk S., Antipov D., Gurevich A. A., Dvorkin M., Kulikov A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19 455–477. 10.1089/cmb.2012.0021 - DOI - PMC - PubMed
    1. Binet R., Deer D. M., Uhlfelder S. J. (2014). Rapid detection of Shigella and enteroinvasive Escherichia coli in produce enrichments by a conventional multiplex PCR assay. Food Microbiol. 40 48–54. 10.1016/j.fm.2013.12.001 - DOI - PubMed
    1. Center for Food Safety and Applied Nutrition (2001). Laboratory Methods - Bacteriological Analytical Manual (BAM). Available at: http://www.fda.gov/Food/FoodScienceResearch/LaboratoryMethods/ucm2006949...
    1. Chaudhuri R. R., Henderson I. R. (2012). Infection, genetics and evolution. Infect. Genet. Evol. 12 214–226. 10.1016/j.meegid.2012.01.005 - DOI - PubMed
    1. Clements A., Young J. C., Constantinou N., Frankel G. (2012). Infection strategies of enteric pathogenic Escherichia coli. Gut Microbes 3 71–87. 10.4161/gmic.19182 - DOI - PMC - PubMed