Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
- PMID: 15454410
- DOI: 10.1093/bioinformatics/bti047
Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
Abstract
Motivation: Although a great deal of research has been undertaken in the area of promoter prediction, prediction techniques are still not fully developed. Many algorithms tend to exhibit poor specificity, generating many false positives, or poor sensitivity. The neural network prediction program NNPP2.2 is one such example.
Results: To improve the NNPP2.2 prediction technique, the distance between the transcription start site (TSS) associated with the promoter and the translation start site (TLS) of the subsequent gene coding region has been studied for Escherichia coli K12 bacteria. An empirical probability distribution that is consistent for all E.coli promoters has been established. This information is combined with the results from NNPP2.2 to create a new technique called TLS-NNPP, which improves the specificity of promoter prediction. The technique is shown to be effective using E.coli DNA sequences, however, it is applicable to any organism for which a set of promoters has been experimentally defined.
Availability: The data used in this project and the prediction results for the tested sequences can be obtained from http://www.uow.edu.au/~yanxia/E_Coli_paper/SBurden_Results.xls
Contact: alh98@uow.edu.au.
Similar articles
-
Improved prediction of bacterial transcription start sites.Bioinformatics. 2006 Jan 15;22(2):142-8. doi: 10.1093/bioinformatics/bti771. Epub 2005 Nov 15. Bioinformatics. 2006. PMID: 16287942
-
E. coli promoter prediction using feed-forward neural networks.Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2025-7. doi: 10.1109/IEMBS.2006.260365. Conf Proc IEEE Eng Med Biol Soc. 2006. PMID: 17946085
-
A Bayesian network approach to operon prediction.Bioinformatics. 2003 Jul 1;19(10):1227-35. doi: 10.1093/bioinformatics/btg147. Bioinformatics. 2003. PMID: 12835266
-
Escherichia coli promoter sequences: analysis and prediction.Methods Enzymol. 1996;273:30-42. doi: 10.1016/s0076-6879(96)73004-5. Methods Enzymol. 1996. PMID: 8791597 Review. No abstract available.
-
The relative value of operon predictions.Brief Bioinform. 2008 Sep;9(5):367-75. doi: 10.1093/bib/bbn019. Epub 2008 Apr 17. Brief Bioinform. 2008. PMID: 18420711 Review.
Cited by
-
Image-based promoter prediction: a promoter prediction method based on evolutionarily generated patterns.Sci Rep. 2018 Dec 6;8(1):17695. doi: 10.1038/s41598-018-36308-0. Sci Rep. 2018. PMID: 30523308 Free PMC article.
-
Bioinformatics resources for the study of gene regulation in bacteria.J Bacteriol. 2009 Jan;191(1):23-31. doi: 10.1128/JB.01017-08. Epub 2008 Oct 31. J Bacteriol. 2009. PMID: 18978060 Free PMC article. Review. No abstract available.
-
Eukaryotic and prokaryotic promoter prediction using hybrid approach.Theory Biosci. 2011 Jun;130(2):91-100. doi: 10.1007/s12064-010-0114-8. Epub 2010 Nov 3. Theory Biosci. 2011. PMID: 21046474
-
Technical considerations in using DNA microarrays to define regulons.Methods. 2009 Jan;47(1):63-72. doi: 10.1016/j.ymeth.2008.10.017. Epub 2008 Oct 26. Methods. 2009. PMID: 18955146 Free PMC article.
-
Recent computational approaches to understand gene regulation: mining gene regulation in silico.Curr Genomics. 2007 Apr;8(2):79-91. doi: 10.2174/138920207780368150. Curr Genomics. 2007. PMID: 18660846 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources