VOMBAT: prediction of transcription factor binding sites using variable order Bayesian trees
- PMID: 16845064
- PMCID: PMC1538886
- DOI: 10.1093/nar/gkl212
VOMBAT: prediction of transcription factor binding sites using variable order Bayesian trees
Abstract
Variable order Markov models and variable order Bayesian trees have been proposed for the recognition of transcription factor binding sites, and it could be demonstrated that they outperform traditional models, such as position weight matrices, Markov models and Bayesian trees. We develop a web server for the recognition of DNA binding sites based on variable order Markov models and variable order Bayesian trees offering the following functionality: (i) given datasets with annotated binding sites and genomic background sequences, variable order Markov models and variable order Bayesian trees can be trained; (ii) given a set of trained models, putative DNA binding sites can be predicted in a given set of genomic sequences and (iii) given a dataset with annotated binding sites and a dataset with genomic background sequences, cross-validation experiments for different model combinations with different parameter settings can be performed. Several of the offered services are computationally demanding, such as genome-wide predictions of DNA binding sites in mammalian genomes or sets of 10(4)-fold cross-validation experiments for different model combinations based on problem-specific data sets. In order to execute these jobs, and in order to serve multiple users at the same time, the web server is attached to a Linux cluster with 150 processors. VOMBAT is available at http://pdw-24.ipk-gatersleben.de:8080/VOMBAT/.
Figures
Similar articles
-
Recognition of cis-regulatory elements with vombat.J Bioinform Comput Biol. 2007 Apr;5(2B):561-77. doi: 10.1142/s0219720007002886. J Bioinform Comput Biol. 2007. PMID: 17636862
-
Identification of transcription factor binding sites with variable-order Bayesian networks.Bioinformatics. 2005 Jun 1;21(11):2657-66. doi: 10.1093/bioinformatics/bti410. Epub 2005 Mar 29. Bioinformatics. 2005. PMID: 15797905
-
CREME: Cis-Regulatory Module Explorer for the human genome.Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W253-6. doi: 10.1093/nar/gkh385. Nucleic Acids Res. 2004. PMID: 15215390 Free PMC article.
-
Stubb: a program for discovery and analysis of cis-regulatory modules.Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W555-9. doi: 10.1093/nar/gkl224. Nucleic Acids Res. 2006. PMID: 16845069 Free PMC article.
-
Hidden Markov model and its applications in motif findings.Methods Mol Biol. 2010;620:405-16. doi: 10.1007/978-1-60761-580-4_13. Methods Mol Biol. 2010. PMID: 20652513 Review.
Cited by
-
A new approach to bias correction in RNA-Seq.Bioinformatics. 2012 Apr 1;28(7):921-8. doi: 10.1093/bioinformatics/bts055. Epub 2012 Jan 28. Bioinformatics. 2012. PMID: 22285831 Free PMC article.
-
NURBS: a database of experimental and predicted nuclear receptor binding sites of mouse.Bioinformatics. 2013 Jan 15;29(2):295-7. doi: 10.1093/bioinformatics/bts693. Epub 2012 Nov 29. Bioinformatics. 2013. PMID: 23196988 Free PMC article.
-
The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes.BMC Bioinformatics. 2024 Dec 2;25(1):371. doi: 10.1186/s12859-024-05995-0. BMC Bioinformatics. 2024. PMID: 39623329 Free PMC article.
-
Employees recruitment: A prescriptive analytics approach via machine learning and mathematical programming.Decis Support Syst. 2020 Jul;134:113290. doi: 10.1016/j.dss.2020.113290. Epub 2020 Apr 3. Decis Support Syst. 2020. PMID: 32501316 Free PMC article.
-
Analysis of Genomic Sequence Motifs for Deciphering Transcription Factor Binding and Transcriptional Regulation in Eukaryotic Cells.Front Genet. 2016 Feb 23;7:24. doi: 10.3389/fgene.2016.00024. eCollection 2016. Front Genet. 2016. PMID: 26941778 Free PMC article. Review.
References
-
- Fickett J., Hatzigeorgiou A. Eukaryotic promoter recognition. Genome Res. 1997;7:861–878. - PubMed
-
- Salzberg S. A method for identifying splice sites, translational start sites in eukaryotic mRNA. CABIOS. 1997;13:365–376. - PubMed
-
- Zhang M.Q., Marr T.G. A weight array method for splicing signal analysis. Comput. Appl. Biosci. 1993;9:499–509. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources