Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 7;10(7):e0132618.
doi: 10.1371/journal.pone.0132618. eCollection 2015.

GntR Family of Bacterial Transcription Factors and Their DNA Binding Motifs: Structure, Positioning and Co-Evolution

Affiliations

GntR Family of Bacterial Transcription Factors and Their DNA Binding Motifs: Structure, Positioning and Co-Evolution

Inna A Suvorova et al. PLoS One. .

Abstract

The GNTR family of transcription factors (TFs) is a large group of proteins present in diverse bacteria and regulating various biological processes. Here we use the comparative genomics approach to reconstruct regulons and identify binding motifs of regulators from three subfamilies of the GNTR family, FADR, HUTC, and YTRA. Using these data, we attempt to predict DNA-protein contacts by analyzing correlations between binding motifs in DNA and amino acid sequences of TFs. We identify pairs of positions with high correlation between amino acids and nucleotides for FADR, HUTC, and YTRA subfamilies and show that the most predicted DNA-protein interactions are quite similar in all subfamilies and conform well to the experimentally identified contacts formed by FadR from E. coli and AraR from B. subtilis. The most frequent predicted contacts in the analyzed subfamilies are Arg-G, Asn-A, Asp-C. We also analyze the divergon structure and preferred site positions relative to regulated genes in the FADR and HUTC subfamilies. A single site in a divergon usually regulates both operons and is approximately in the middle of the intergenic area. Double sites are either involved in the co-operative regulation of both operons and then are in the center of the intergenic area, or each site in the pair independently regulates its own operon and tends to be near it. We also identify additional candidate TF-binding boxes near palindromic binding sites of TFs from the FADR, HUTC, and YTRA subfamilies, which may play role in the binding of additional TF-subunits.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Heat map of correlations between amino acids and nucleotides for FadR-subfamily TFs and their binding sites.
Sequence logos of HTH DNA-binding domains and corresponding binding sites are shown on the top and to the left of the heat map, respectively. The total height of the symbols in each position equals the positional information content, whereas the height of individual symbols is proportional to the positional amino acid or nucleotide frequency. The correlation scores are color ramped from yellow to red for amino acid-nucleotide pairs with statistical significance greater than an automatically defined threshold (with red assigned for the most correlated pair). The violet-black palette is used for other pairs.
Fig 2
Fig 2. Heat map of correlations between amino acids and nucleotides for HutC-subfamily TFs and their binding sites.
Notation as in Fig 1.
Fig 3
Fig 3. Heat map of correlations between amino acids and nucleotides for YtrA-subfamily TFs and their binding sites.
Notation as in Fig 1.
Fig 4
Fig 4. Distances between regulated genes and TF-binding sites in divergons with single sites.
A—operons with a TF gene; B—operons with structural genes only; C—the control group (includes divergons without TF genes). The vertical axis is the distance between the site center and the start codon. The horizontal axis is the intergenic distance. Each dot corresponds to one site. The regression lines are shown. Blue color denotes the FadR subfamily; red color, the HutC subfamily.
Fig 5
Fig 5. Distances between double sites in divergons.
A—divergons with a TF gene; B—the control group. The vertical axis is the inter-site distance. The horizontal axis is the intergenic distance. Notation as in Fig 4.
Fig 6
Fig 6. Two groups of divergons with double sites.
A—the first group (constant inter-site distance); B—the second group (increasing inter-site distance). For details, see the text. The vertical axis is the inter-site distance. The horizontal axis is the intergenic distance. Notation as in Fig 4.
Fig 7
Fig 7. Distances between regulated genes and proximal TF-binding sites in divergons with common double sites.
A—operons with a TF gene; B—operons with structural genes only. The vertical axis is the distance between the site center and the start codon. The horizontal axis is the intergenic distance. Notation as in Fig 4.
Fig 8
Fig 8. Distances between regulated genes and proximal TF-binding sites in divergons with separate double sites.
A—operons with a TF gene; B—operons with structural genes only; C—the control group. The vertical axis is the distance between the site center and the start codon. The horizontal axis is the intergenic distance. Notation as in Fig 4.
Fig 9
Fig 9. Positioning of additional boxes and control pseudoboxes.
True binding half-sites are shown in blue; additional boxes, in violet; pseudoboxes, in red arrows. For details, see the text.
Fig 10
Fig 10. Distribution of Snear and Srandom in the FadR, HutC and YtrA subfamilies.
The vertical axis—the number of S values falling in the given interval. The horizontal axis—intervals of S values. Blue color denotes Snear values; red color—Srandom values. FadR subfamily data is shown in continuous lines; HutC subfamily, in dotted lines; YtrA subfamily, in dashed lines.

Similar articles

Cited by

References

    1. Ofran Y, Mysore V, Rost B. Prediction of DNA-binding residues from sequence. Bioinformatics. 2007. July 1; 23(13):i347–53. - PubMed
    1. Rodionov DA. Comparative genomic reconstruction of transcriptional regulatory networks in bacteria. Chem Rev. 2007. August;107(8):3467–97. - PMC - PubMed
    1. Pérez-Rueda E, Collado-Vides J. The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12. Nucleic Acids Res. 2000. April 15;28(8):1838–47. - PMC - PubMed
    1. Lustig B, Jernigan RL. Consistencies of individual DNA base amino acid interactions in structures and sequences. Nucleic Acids Res. 1995. November 25;23(22):4707–11. - PMC - PubMed
    1. Morozov AV, Siggia ED. Connecting protein structure with predictions of regulatory sites. Proc Natl Acad Sci U S A. 2007. April 24;104(17):7068–73. - PMC - PubMed

Publication types

MeSH terms