Position specific variation in the rate of evolution in transcription factor binding sites
- PMID: 12946282
- PMCID: PMC212491
- DOI: 10.1186/1471-2148-3-19
Position specific variation in the rate of evolution in transcription factor binding sites
Abstract
Background: The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution.
Results: Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms.
Conclusion: As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA.
Figures





Similar articles
-
Integrating genomic data to predict transcription factor binding.Genome Inform. 2005;16(1):83-94. Genome Inform. 2005. PMID: 16362910
-
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9. PLoS Comput Biol. 2005. PMID: 16477324 Free PMC article.
-
Diversification at transcription factor binding sites within a species and the implications for environmental adaptation.Mol Biol Evol. 2011 Dec;28(12):3331-44. doi: 10.1093/molbev/msr167. Epub 2011 Jun 20. Mol Biol Evol. 2011. PMID: 21693437
-
Computational prediction of transcription-factor binding site locations.Genome Biol. 2003;5(1):201. doi: 10.1186/gb-2003-5-1-201. Epub 2003 Dec 23. Genome Biol. 2003. PMID: 14709165 Free PMC article. Review.
-
Regulatory variation within and between species.Annu Rev Genomics Hum Genet. 2011;12:327-46. doi: 10.1146/annurev-genom-082908-150139. Annu Rev Genomics Hum Genet. 2011. PMID: 21721942 Review.
Cited by
-
Incorporating evolution of transcription factor binding sites into annotated alignments.J Biosci. 2007 Aug;32(5):841-50. doi: 10.1007/s12038-007-0084-2. J Biosci. 2007. PMID: 17914226
-
Hypervariable noncoding sequences in Saccharomyces cerevisiae.Genetics. 2005 Aug;170(4):1575-87. doi: 10.1534/genetics.105.042283. Epub 2005 Jun 14. Genetics. 2005. PMID: 15956668 Free PMC article.
-
Abundance, arrangement, and function of sequence motifs in the chicken promoters.BMC Genomics. 2014 Oct 15;15(1):900. doi: 10.1186/1471-2164-15-900. BMC Genomics. 2014. PMID: 25318583 Free PMC article.
-
Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm.PLoS Biol. 2008 Feb;6(2):e27. doi: 10.1371/journal.pbio.0060027. PLoS Biol. 2008. PMID: 18271625 Free PMC article.
-
The population genetics of cooperative gene regulation.BMC Evol Biol. 2012 Sep 6;12:173. doi: 10.1186/1471-2148-12-173. BMC Evol Biol. 2012. PMID: 22954408 Free PMC article.
References
-
- Lawrence CE, Altschul SF, Boguski MS, Liu JS, Neuwald AF, Wootton JC. Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science. 1993;262:208–214. - PubMed
-
- Bailey TL, Elkan C. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology. AAAI Press, Menlo Park, California; 1994. Fitting a mixture model by expectation maximization to discover motifs in biopolymers; pp. 28–36. - PubMed
-
- Eskin E, Pevzner PA. Finding composite regulatory patterns in DNA sequences. Bioinformatics. 2002;18:S354–363. - PubMed
-
- Liu XS, Brutlag DL, Liu JS. An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments. Nat Biotechnol. 2002;20:835–839. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials