Toward Algorithms for Automation of Postgenomic Data Analyses: Bacillus subtilis Promoter Prediction with Artificial Neural Network
- PMID: 31573385
- DOI: 10.1089/omi.2019.0041
Toward Algorithms for Automation of Postgenomic Data Analyses: Bacillus subtilis Promoter Prediction with Artificial Neural Network
Abstract
In the present postgenomic era, the capacity to generate big data has far exceeded the capacity to analyze, contextualize, and make sense of the data in clinical, biological, and ecological applications. There is a great unmet need for automation and algorithms to aid in analyses of big data, in biology in particular. In this context, it is noteworthy that computational methods used to analyze the regulation of bacterial gene expression have in the past focused mainly on Escherichia coli promoters due to the large amount of data available. The challenge and prospects of automation in prediction and recognition of bacteria sequences as promoters have not been properly addressed due to the promoter size and degenerate pattern. We report here an original neural network approach for recognition and prediction of Bacillus subtilis promoters. The artificial neural network used as input 767 B. subtilis promoter sequences, while also aiming at identifying the architecture, provides the most optimal prediction. Two multilayer perceptron neural network architectures offered the highest accuracy: one with five, and another with seven neurons in the hidden layer. Each architecture achieved an accuracy of 98.57% and 97.69%, respectively. The results collectively indicate the promise of the application of neural network approaches to the B. subtilis promoter recognition problem, while also suggesting the broader potential of algorithms for automation of data analyses in the postgenomic era.
Keywords: Bacillus subtilis; algorithms; artificial neural network; automation; computational biology; genome annotation.
Similar articles
-
E. coli promoter prediction using feed-forward neural networks.Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2025-7. doi: 10.1109/IEMBS.2006.260365. Conf Proc IEEE Eng Med Biol Soc. 2006. PMID: 17946085
-
Inferring regulatory elements from a whole genome. An analysis of Helicobacter pylori sigma(80) family of promoter signals.J Mol Biol. 2000 Mar 24;297(2):335-53. doi: 10.1006/jmbi.2000.3576. J Mol Biol. 2000. PMID: 10715205
-
Prediction and analysis of prokaryotic promoters based on sequence features.Biosystems. 2020 Nov;197:104218. doi: 10.1016/j.biosystems.2020.104218. Epub 2020 Aug 2. Biosystems. 2020. PMID: 32755610
-
Biologically inspired intelligent decision making: a commentary on the use of artificial neural networks in bioinformatics.Bioengineered. 2014 Mar-Apr;5(2):80-95. doi: 10.4161/bioe.26997. Epub 2013 Dec 16. Bioengineered. 2014. PMID: 24335433 Free PMC article. Review.
-
Bacillus subtilis and Escherichia coli essential genes and minimal cell factories after one decade of genome engineering.Microbiology (Reading). 2014 Nov;160(Pt 11):2341-2351. doi: 10.1099/mic.0.079376-0. Epub 2014 Aug 4. Microbiology (Reading). 2014. PMID: 25092907 Review.
Cited by
-
An overview and future prospects of recombinant protein production in Bacillus subtilis.Appl Microbiol Biotechnol. 2021 Sep;105(18):6607-6626. doi: 10.1007/s00253-021-11533-2. Epub 2021 Sep 1. Appl Microbiol Biotechnol. 2021. PMID: 34468804 Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous