Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jun;39(11):4553-63.
doi: 10.1093/nar/gkr070. Epub 2011 Feb 18.

Using a structural and logics systems approach to infer bHLH-DNA binding specificity determinants

Affiliations

Using a structural and logics systems approach to infer bHLH-DNA binding specificity determinants

Federico De Masi et al. Nucleic Acids Res. 2011 Jun.

Abstract

Numerous efforts are underway to determine gene regulatory networks that describe physical relationships between transcription factors (TFs) and their target DNA sequences. Members of paralogous TF families typically recognize similar DNA sequences. Knowledge of the molecular determinants of protein-DNA recognition by paralogous TFs is of central importance for understanding how small differences in DNA specificities can dictate target gene selection. Previously, we determined the in vitro DNA binding specificities of 19 Caenorhabditis elegans basic helix-loop-helix (bHLH) dimers using protein binding microarrays. These TFs bind E-box (CANNTG) and E-box-like sequences. Here, we combine these data with logics, bHLH-DNA co-crystal structures and computational modeling to infer which bHLH monomer can interact with which CAN E-box half-site and we identify a critical residue in the protein that dictates this specificity. Validation experiments using mutant bHLH proteins provide support for our inferences. Our study provides insights into the mechanisms of DNA recognition by bHLH dimers as well as a blueprint for system-level studies of the DNA binding determinants of other TF families in different model organisms and humans.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Deduction of interactions between bHLH monomers and CAN E-box half-sites. HLH-2/HLH-3 heterodimers are shown as an example.
Figure 2.
Figure 2.
Analysis of bHLH residue/DNA-contact frequency. (a) Canonical structure of a bHLH monomer contacting the CANNTG E-box. The E-box nucleotides on each DNA strand are indicated in blue, the bHLH basic region is shown in green and bHLH helices 1 and 2 are indicated in red and cyan, respectively. Residues 1, 15 and 50 indicate the start of the basic region, helices 1 and 2, respectively. (b) Plot indicating the frequency of contacts between bHLH residues and the bases in or flanking the E-box based on seven co-crystal structures. (c) As in (b) but indicating contacts between bHLH residues and the DNA backbone. P, phosphate; dR, deoxyribose. bHLH residue consensus numbering scheme based on Atchley et al. (33).
Figure 3.
Figure 3.
bHLH domain amino acids correlate with E-box half-site recognition (a) Amino acid residues at each position in C. elegans bHLH proteins that are involved in DNA contacts. The final column shows the list of recognized half-sites for each bHLH (7). ‘A’ and ‘B’ (e.g. HLH-26A and HLH-26B) refer to the first and second bHLH domain of two within the same protein. (b) Diagram illustrating the distribution of bHLH proteins according to their half-site specification. (c) Schematic representation of the rules inferred from (a) and illustrated in (b). Filled circles represent residues at position 13; purple rectangles represent half-sites and open circles represent specific sequence determinants for linking residue 13 to a particular half-site. (d) Graphical representation of the average ΔΔGint per mutation for the seven bHLH structures analyzed. Mutations for which ΔΔGint is larger than 1.0 kcal/mol (indicated by the red line) may have considerably diminished, or even abolished, the binding capabilities of the mutant. Position 13 is split between the average ΔΔGint for the scaffold of bHLH proteins that normally do or do not have an arginine in that position.
Figure 4.
Figure 4.
Prediction and experimental validation of HLH-1 Leu13 mutant E-box half-site specificities. (a) Classification of the three HLH-1 mutant proteins based on the E-box half-site specificity diagram in Figure 3b. (b) Predictions of the half-site specificities for each of the HLH-1 Leu13 mutants. (c) Summary of the statistical analysis of the observed half-site specificities determined from PBM data for each of the HLH-1 mutants. (d) Boxplot representation of the PBM-derived E-box specificities of HLH-1 wild-type and mutant proteins. Significantly bound E-boxes are colored in black (see ‘Materials and Methods’ section). Eight-mers that do not contain an E-box are marked as ‘other’. For each box, the central horizontal bar shows the median of the distribution, the box’s edges mark the 25th and 75th percentile and the whiskers represent the most extreme points of the distribution, which were not determined as being outliers. The horizontal bar of ES value of 0.4 shows our significance ES threshold, as previously determined (7).

References

    1. Walhout AJM. Unraveling transcription regulatory networks by protein-DNA and protein-protein interaction mapping. Genome Res. 2006;16:1445–1454. - PubMed
    1. Grove CA, Walhout AJM. Transcription factor functionality and transcription regulatory networks. Mol. Biosyst. 2008;4:309–314. - PMC - PubMed
    1. Reece-Hoyes JS, Deplancke B, Shingles J, Grove CA, Hope IA, Walhout AJM. A compendium of C. elegans regulatory transcription factors: a resource for mapping transcription regulatory networks. Genome Biol. 2005;6:R110. - PMC - PubMed
    1. Kummerfeld SK, Teichmann SA. DBD: a transcription factor prediction database. Nucleic Acids Res. 2006;34:D74–D81. - PMC - PubMed
    1. Badis G, Berger MF, Philippakis AA, Talukder S, Gehrke AR, Jaeger SA, Chan ET, Metzler G, Vedenko A, Chen X, et al. Diversity and complexity in DNA recognition by transcription factors. Science. 2009;324:1720–1723. - PMC - PubMed

Publication types

MeSH terms

Substances