An approach to predict transcription factor DNA binding site specificity based upon gene and transcription factor functional categorization
- PMID: 17623704
- DOI: 10.1093/bioinformatics/btm348
An approach to predict transcription factor DNA binding site specificity based upon gene and transcription factor functional categorization
Abstract
Motivation: To understand transcription regulatory mechanisms, it is indispensable to investigate transcription factor (TF) DNA binding preferences. We noted that the generally acknowledged information of functional annotations of TFs as well as that of their target genes should provide useful hints in determining TF DNA binding preferences.
Results: In this contribution, we developed an integrative method based on the Nearest Neighbor Algorithm, to predict DNA binding preferences through integrating both the functional/structural information of TFs and the interaction between TFs and their targets. The accuracy of cross-validation tests on the dataset consisting of 3430 positive samples and 7000 negative samples reaches 87.0% for 10-fold cross-validation and 87.9% for jackknife cross-validation test, which is a much better result than that in our previous work. The prediction result indicates that the improved method we developed could be a powerful approach to infer the TF DNA preference in silico.
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
Informative priors based on transcription factor structural class improve de novo motif discovery.Bioinformatics. 2006 Jul 15;22(14):e384-92. doi: 10.1093/bioinformatics/btl251. Bioinformatics. 2006. PMID: 16873497
-
Sequence features of DNA binding sites reveal structural class of associated transcription factor.Bioinformatics. 2006 Jan 15;22(2):157-63. doi: 10.1093/bioinformatics/bti731. Epub 2005 Nov 2. Bioinformatics. 2006. PMID: 16267080
-
Position dependencies in transcription factor binding sites.Bioinformatics. 2007 Apr 15;23(8):933-41. doi: 10.1093/bioinformatics/btm055. Epub 2007 Feb 18. Bioinformatics. 2007. PMID: 17308339
-
Eukaryotic transcription factor binding sites--modeling and integrative search methods.Bioinformatics. 2008 Jun 1;24(11):1325-31. doi: 10.1093/bioinformatics/btn198. Epub 2008 Apr 21. Bioinformatics. 2008. PMID: 18426806 Review.
-
Structure-based ab initio prediction of transcription factor-binding sites.Methods Mol Biol. 2009;541:23-41. doi: 10.1007/978-1-59745-243-4_2. Methods Mol Biol. 2009. PMID: 19381536 Review.
Cited by
-
Predicting the binding preference of transcription factors to individual DNA k-mers.Bioinformatics. 2009 Apr 15;25(8):1012-8. doi: 10.1093/bioinformatics/btn645. Epub 2008 Dec 16. Bioinformatics. 2009. PMID: 19088121 Free PMC article.
-
A novel method for improved accuracy of transcription factor binding site prediction.Nucleic Acids Res. 2018 Jul 6;46(12):e72. doi: 10.1093/nar/gky237. Nucleic Acids Res. 2018. PMID: 29617876 Free PMC article.
-
Prediction of nucleosome positioning based on transcription factor binding sites.PLoS One. 2010 Sep 1;5(9):e12495. doi: 10.1371/journal.pone.0012495. PLoS One. 2010. PMID: 20824131 Free PMC article.
-
Prediction of compounds' biological function (metabolic pathways) based on functional group composition.Mol Divers. 2008 May;12(2):131-7. doi: 10.1007/s11030-008-9085-9. Epub 2008 Aug 14. Mol Divers. 2008. PMID: 18704735
-
An overview of the prediction of protein DNA-binding sites.Int J Mol Sci. 2015 Mar 6;16(3):5194-215. doi: 10.3390/ijms16035194. Int J Mol Sci. 2015. PMID: 25756377 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous