Position dependencies in transcription factor binding sites
- PMID: 17308339
- DOI: 10.1093/bioinformatics/btm055
Position dependencies in transcription factor binding sites
Abstract
Motivation: Most of the available tools for transcription factor binding site prediction are based on methods which assume no sequence dependence between the binding site base positions. Our primary objective was to investigate the statistical basis for either a claim of dependence or independence, to determine whether such a claim is generally true, and to use the resulting data to develop improved scoring functions for binding-site prediction.
Results: Using three statistical tests, we analyzed the number of binding sites showing dependent positions. We analyzed transcription factor-DNA crystal structures for evidence of position dependence. Our final conclusions were that some factors show evidence of dependencies whereas others do not. We observed that the conformational energy (Z-score) of the transcription factor-DNA complexes was lower (better) for sequences that showed dependency than for those that did not (P < 0.02). We suggest that where evidence exists for dependencies, these should be modeled to improve binding-site predictions. However, when no significant dependency is found, this correction should be omitted. This may be done by converting any existing scoring function which assumes independence into a form which includes a dependency correction. We present an example of such an algorithm and its implementation as a web tool.
Availability: http://promoterplot.fmi.ch/cgi-bin/dep.html
Similar articles
-
Informative priors based on transcription factor structural class improve de novo motif discovery.Bioinformatics. 2006 Jul 15;22(14):e384-92. doi: 10.1093/bioinformatics/btl251. Bioinformatics. 2006. PMID: 16873497
-
Context-specific independence mixture modeling for positional weight matrices.Bioinformatics. 2006 Jul 15;22(14):e166-73. doi: 10.1093/bioinformatics/btl249. Bioinformatics. 2006. PMID: 16873468
-
ClusterDraw web server: a tool to identify and visualize clusters of binding motifs for transcription factors.Bioinformatics. 2007 Apr 15;23(8):1032-4. doi: 10.1093/bioinformatics/btm047. Epub 2007 Feb 18. Bioinformatics. 2007. PMID: 17308342
-
Eukaryotic transcription factor binding sites--modeling and integrative search methods.Bioinformatics. 2008 Jun 1;24(11):1325-31. doi: 10.1093/bioinformatics/btn198. Epub 2008 Apr 21. Bioinformatics. 2008. PMID: 18426806 Review.
-
Structure-based ab initio prediction of transcription factor-binding sites.Methods Mol Biol. 2009;541:23-41. doi: 10.1007/978-1-59745-243-4_2. Methods Mol Biol. 2009. PMID: 19381536 Review.
Cited by
-
PhysBinder: Improving the prediction of transcription factor binding sites by flexible inclusion of biophysical properties.Nucleic Acids Res. 2013 Jul;41(Web Server issue):W531-4. doi: 10.1093/nar/gkt288. Epub 2013 Apr 24. Nucleic Acids Res. 2013. PMID: 23620286 Free PMC article.
-
Nonparametric Bayes Modeling of Multivariate Categorical Data.J Am Stat Assoc. 2012 Jan 1;104(487):1042-1051. doi: 10.1198/jasa.2009.tm08439. J Am Stat Assoc. 2012. PMID: 23606777 Free PMC article.
-
CisMiner: genome-wide in-silico cis-regulatory module prediction by fuzzy itemset mining.PLoS One. 2014 Sep 30;9(9):e108065. doi: 10.1371/journal.pone.0108065. eCollection 2014. PLoS One. 2014. PMID: 25268582 Free PMC article.
-
Varying levels of complexity in transcription factor binding motifs.Nucleic Acids Res. 2015 Oct 15;43(18):e119. doi: 10.1093/nar/gkv577. Epub 2015 Jun 26. Nucleic Acids Res. 2015. PMID: 26116565 Free PMC article.
-
Position Weight Matrix or Acyclic Probabilistic Finite Automaton: Which model to use? A decision rule inferred for the prediction of transcription factor binding sites.Genet Mol Biol. 2024 Jan 19;46(4):e20230048. doi: 10.1590/1678-4685-GMB-2023-0048. eCollection 2024. Genet Mol Biol. 2024. PMID: 38285430 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources