Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Sep;3(9):1614-21.
doi: 10.1371/journal.pgen.0030153.

Identification of DNA motifs implicated in maintenance of bacterial core genomes by predictive modeling

Affiliations

Identification of DNA motifs implicated in maintenance of bacterial core genomes by predictive modeling

David Halpern et al. PLoS Genet. 2007 Sep.

Abstract

Bacterial biodiversity at the species level, in terms of gene acquisition or loss, is so immense that it raises the question of how essential chromosomal regions are spared from uncontrolled rearrangements. Protection of the genome likely depends on specific DNA motifs that impose limits on the regions that undergo recombination. Although most such motifs remain unidentified, they are theoretically predictable based on their genomic distribution properties. We examined the distribution of the "crossover hotspot instigator," or Chi, in Escherichia coli, and found that its exceptional distribution is restricted to the core genome common to three strains. We then formulated a set of criteria that were incorporated in a statistical model to search core genomes for motifs potentially involved in genome stability in other species. Our strategy led us to identify and biologically validate two distinct heptamers that possess Chi properties, one in Staphylococcus aureus, and the other in several streptococci. This strategy paves the way for wide-scale discovery of other important functional noncoding motifs that distinguish core genomes from the strain-variable regions.

PubMed Disclaimer

Conflict of interest statement

Competing interests. The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The E. coli Chi site, 5′-GCTGGTGG-3′, Is Significantly Overrepresented and Skewed on E. coli Backbone DNA
Statistical scores of overrepresentation and skew are plotted for all octamers whose overrepresentation scores are higher than 5 (corresponding to a p-value < 10−7). Chi and one of the motifs of the KOPS family [6] are indicated.
Figure 2
Figure 2. Motif C, 5′-GAAGCGG-3′, Is the S. aureus Chi Site
(A) Overrepresentation and skew of heptamers on the S. aureus backbone. Plots are represented as in Figure 1. Only heptamers with an overrepresentation score higher than 5 (corresponding to a p-value < 10−7) are shown. The four best candidates (motifs A to D) are indicated. (B) Experimental validation that motif C has Chi activity in S. aureus. Motif C was cloned in plasmid pRC and evaluated for HMW accumulation, an indicator of Chi activity [27]. Total genomic DNA was hybridized with a probe specific to the RC plasmid. Plasmid monomer (mn), dimer (dm), and open circle (oc) forms, and HMW are indicated by arrows. Motif C (lane 2, 5′-GAAGCGG-3′) is HMW-positive, but it gives a negative signal when cloned in the reverse orientation (lane 3) or when it carries a mismatch (lanes 4–6).
Figure 3
Figure 3. Chi Sequence Conservation through Evolution
The tree shows phylogenetic relationships of the eight bacterial species in which Chi is known, based on 16S RNA, and was constructed using the tree-builder command of the Ribosomal Database Project (https://rdp.cme.msu.edu/). E. coli, H. influenzae, L. lactis, and B. subtilis Chi sites were previously identified [5,25,26]. Chi sites of S. aureus and the three streptococci were identified by predictive modeling and confirmed experimentally (this work). Partial conservation of Chi correlates with phylogenetic conservation among bacteria.

Similar articles

Cited by

References

    1. Hayashi T, Makino K, Ohnishi M, Kurokawa K, Ishii K, et al. Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12. DNA Res. 2001;8:11–22. - PubMed
    1. Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R. The microbial pan-genome. Curr Opin Genet Dev. 2005;15:589–594. - PubMed
    1. Smith HO, Gwinn ML, Salzberg SL. DNA uptake signal sequences in naturally transformable bacteria. Res Microbiol. 1999;150:603–616. - PubMed
    1. Lam ST, Stahl MM, McMilin KD, Stahl FW. Rec-mediated recombinational hot spot activity in bacteriophage lambda. II. A mutation which causes hot spot activity. Genetics. 1974;77:425–433. - PMC - PubMed
    1. Smith GR, Kunes SM, Schultz DW, Taylor A, Triman KL. Structure of chi hotspots of generalized recombination. Cell. 1981;24:429–436. - PubMed

Publication types