Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May;21(5):373-84.
doi: 10.1089/cmb.2014.0032. Epub 2014 Apr 1.

Informational requirements for transcriptional regulation

Affiliations

Informational requirements for transcriptional regulation

Patrick K O'Neill et al. J Comput Biol. 2014 May.

Abstract

Transcription factors (TFs) regulate transcription by binding to specific sites in promoter regions. Information theory provides a useful mathematical framework to analyze the binding motifs associated with TFs but imposes several assumptions that limit their applicability to specific regulatory scenarios. Explicit simulations of the co-evolution of TFs and their binding motifs allow the study of the evolution of regulatory networks with a high degree of realism. In this work we analyze the impact of differential regulatory demands on the information content of TF-binding motifs by means of evolutionary simulations. We generalize a predictive index based on information theory, and we validate its applicability to regulatory scenarios in which the TF binds significantly to the genomic background. Our results show a logarithmic dependence of the evolved information content on the occupancy of target sites and indicate that TFs may actively exploit pseudo-sites to modulate their occupancy of target sites. In regulatory networks with differentially regulated targets, we observe that information content in TF-binding motifs is dictated primarily by the fraction of total probability mass that the TF assigns to its target sites, and we provide a predictive index to estimate the amount of information associated with arbitrarily complex regulatory systems. We observe that complex regulatory patterns can exert additional demands on evolved information content, but, given a total occupancy for target sites, we do not find conclusive evidence that this effect is because of the range of required binding affinities.

PubMed Disclaimer

Figures

<b>FIG. 1.</b>
FIG. 1.
Schematic representation of the evolutionary simulator of transcriptional regulatory motifs (ESTReMo). Diagram summarizes the internal organization of an individual organism and its connection with the fitness model. TF, transcription factor.
<b>FIG. 2.</b>
FIG. 2.
Trace indicators for a complete evolutionary simulation. All values correspond to the fittest organism in the population at any given generation. (A) Organism fitness. Sequence logos illustrate the step-wise evolution of the associated TF-binding motif as new sites are recognized. (B) Free energy of binding for each site in the motif. (C) Information content Rsequence (corrected and uncorrected for small-sample error) of the evolved TF-binding motif. The predicted Rfrequency for the system is superimposed. (D) Contributions to the partition function from the sampled background (blue) and the promoter regions (green).
<b>FIG. 3.</b>
FIG. 3.
Evolution of uniform regulatory systems. Plot shows evolved Rsequence values for simulations on randomly generated backgrounds (solid circles) and on the Escherichia coli genome (squares) as a function of the fraction of total probability mass (α) assigned to target sites. Error bars depict the 95% confidence interval for the mean. Corresponding values of Rfrequency, formula image, formula image, and R are shown.
<b>FIG. 4.</b>
FIG. 4.
Evolution of bimodal regulatory systems. The plot shows evolved Rsequence values for different evolutionary simulation scenarios with half of the sites set to higher occupancy (αhigh) than the other half (αlow). Error bars depict the 95% confidence interval for the mean. For each Rsequence value, the corresponding R values for αhigh, αlow, and αeff values are shown.
<b>FIG. 5.</b>
FIG. 5.
Evolution of stepwise regulatory ladders. The plot shows evolved Rsequence values for different evolutionary simulation scenarios with sites gradually increasing from low (αlow) to high (αhigh) occupancy. Error bars depict the 95% confidence interval for the mean. For each Rsequence value, the corresponding R values for αhigh, αlow, and αeff values are shown.
<b>FIG. 6.</b>
FIG. 6.
Evolution of randomized regulatory scenarios. The plot shows 198 evolved Rsequence values for different evolutionary simulation scenarios. The full dataset is shown instead of summary statistics because of the uniqueness of α values. For each Rsequence value, the corresponding R values for αhigh and αlow are shown.

References

    1. Alon U.2007. Network motifs: theory and experimental approaches. Nat. Rev. Genet. 8, 450–461 - PubMed
    1. Badis G., Berger M.F., Philippakis A.A., et al. . 2009. Diversity and complexity in DNA recognition by transcription factors. Science 324, 1720–1723 - PMC - PubMed
    1. Berg J., Willmann S., and Lässig M.2004. Adaptive evolution of transcription factor binding sites. BMC Evol. Biol. 4, 42. - PMC - PubMed
    1. Berg O.G., and von Hippel P.H.1987. Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J. Mol. Biol. 193, 723–750 - PubMed
    1. Browning D.F., and Busby S.J.W.2004. The regulation of bacterial transcription initiation. Nat. Rev. Microbiol. 2, 57–65 - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources