Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jan 14;5(1):e8700.
doi: 10.1371/journal.pone.0008700.

Derivation of Escherichia coli O157:H7 from its O55:H7 precursor

Affiliations

Derivation of Escherichia coli O157:H7 from its O55:H7 precursor

Zhemin Zhou et al. PLoS One. .

Abstract

There are 29 E. coli genome sequences available, mostly related to studies of species diversity or mode of pathogenicity, including two genomes of the well-known O157:H7 clone. However, there have been no genome studies of closely related clones aimed at exposing the details of evolutionary change. Here we sequenced the genome of an O55:H7 strain, closely related to the major pathogenic O157:H7 clone, with published genome sequences, and undertook comparative genomic and proteomic analysis. We were able to allocate most differences between the genomes to individual mutations, recombination events, or lateral gene transfer events, in specific lineages. Major differences include a type II secretion system present only in the O55:H7 chromosome, fewer type III secretion system effectors in O55:H7, and 19 phage genomes or phagelike elements in O55:H7 compared to 23 in O157:H7, with only three common to both. Many other changes were found in both O55:H7 and O157:H7 lineages, but in general there has been more change in the O157:H7 lineages. For example, we found 50% more synonymous mutational substitutions in O157:H7 compared to O55:H7. The two strains also diverged at the proteomic level. Mutational synonymous SNPs were used to estimate a divergence time of 400 years using a new clock rate, in contrast to 14,000 to 70,000 years using the traditional clock rates. The same approaches were applied to three closely related extraintestinal pathogenic E. coli genomes, and similar levels of mutation and recombination were found. This study revealed for the first time the full range of events involved in the evolution of the O157:H7 clone from its O55:H7 ancestor, and suggested that O157:H7 arose quite recently. Our findings also suggest that E. coli has a much lower frequency of recombination relative to mutation than was observed in a comparable study of a Vibrio cholerae lineage.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Alignment of the genomes of CB9615, Sakai and EDL933.
Scales are Mbp. The grey and yellow shaded regions represent segments present in all strains, with inverted segments shaded in yellow. Purple and cyan boxes within the grey or yellow regions represent the indels of phages or phage-like elements (insertions and deletions respectively) defined in Table S9. Red and green boxes represent other major indels (insertions and deletions respectively) that involve the changes of the gene numbers in Table S10, with the indel numbers also shown as defined in Table S9. The orange boxes indicate the E. coli O157:H7 O-antigen segment gained by recombination. Figure S1 is a greatly expanded version of this figure showing individual genes.
Figure 2
Figure 2. Tree showing the relationships of CB9615 and 2 O157:H7 strains.
The tree topography is taken from the alignment of 26 completed genomes (Figure 3) and Whittam . For each lineage the number of mutations (including small indels), recombination events and insertion or deletion events (large indels) are shown in a grid, as specified with the key. Mutations are shown as intergenic, other non-coding, non-synonymous or synonymous SNPs (igSNPs, ncSNPs, nsSNP, sSNPs), small insertions and small deletions or indels if not differentiable. Large indels are separated into insertions or deletions where possible. Events allocated to the divergence between CB9615 and O157:H7, or between Sakai and EDL933, respectively, but not to either lineage, are shown in the grids between the two lineages. The branch point estimates for group B including strain 493-89), G5101 and F6141, and clusters 1 (Strains 14359 and 87-14) and 2 (86-24) are marked with dotted lines on the O157:H7 lineage, and TB182A on the O55:H7 lineage. The distribution of SNPs along that lineage is based on reanalysis of data from Zhang et al. and Leopold et al. .
Figure 3
Figure 3. Maximum likelihood phylogenetic tree of 26 Escherichia coli and Shigella strains.
The phylogenetic tree of the Escherichia core genome genes was constructed from the concatenated alignments of the 2034 genes in the core genome of the E. coli/Shigella genomes. The closely related species, E. fergusonii (CU928158), was chosen to root the tree.
Figure 4
Figure 4. Tree showing the relationships of UTI89, S88 and APEC 01 ExPEC strains.
The tree topography is for mutational SNPs as allocated by the virtual outgroup analysis (Tables S7 and S8). For each lineage the number of mutations and recombination events is shown in grids as for Figure 1.

Similar articles

Cited by

References

    1. Gordon DM, FitzGibbon F. The distribution of enteric bacteria from Australian mammals: host and geographical effects. Microbiology. 1999;145 ( Pt 10):2663–2671. - PubMed
    1. Souza V, Rocha M, Valera A, Eguiarte LE. Genetic structure of natural populations of Escherichia coli in wild hosts on different continents. Appl Environ Microbiol. 1999;65:3373–3385. - PMC - PubMed
    1. Kaper JB, Nataro JP, Mobley HL. Pathogenic Escherichia coli. Nat Rev Microbiol. 2004;2:123–140. - PubMed
    1. Rasko DA, Rosovitz MJ, Myers GS, Mongodin EF, Fricke WF, et al. The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J Bacteriol. 2008;190:6881–6893. - PMC - PubMed
    1. Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, et al. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet. 2009;5:e1000344. - PMC - PubMed

Publication types