Prokaryotic and eukaryotic promoters identification based on residual network transfer learning
- PMID: 35279747
- DOI: 10.1007/s00449-022-02716-w
Abstract
Promoters are relevant to research on many diseases, such as coronary heart disease, diabetes and tumors, and identifying promoters is a fundamental task. Deep learning is widely used for promoter sequence recognition. Although deep models are fast and accurate, they depend on large amounts of high-quality data. To reduce this dependence in promoter prediction, we applied transfer learning to a typical deep network based on residual connections, the deep residual network (ResNet). We represented promoter sequences with binary one-hot encoding and used ResNet to extract feature representations from organisms with a large amount of promoter data. We then transferred the learned structural parameters to target organisms with insufficient promoter data to improve ResNet's generalization performance on those organisms. We evaluated the approach on promoter datasets from four organisms (Bacillus subtilis, Escherichia coli, Saccharomyces cerevisiae and Drosophila melanogaster). After deep transfer, the AUCs of ResNet's promoter prediction were 0.8537 in prokaryotes and 0.8633 in eukaryotes, increases of 0.1513 and 0.1376, respectively.
Keywords: Deep learning; Promoter prediction; ResNet; Transfer learning.
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
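The binary one-hot encoding mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the channel order or say how ambiguous bases are handled, so the A, C, G, T ordering, the all-zero vector for any other symbol, and the function name `one_hot_encode` are assumptions made here for illustration, not the authors' implementation.

```python
from typing import List

# Assumed channel order; the abstract does not specify one.
BASES = "ACGT"


def one_hot_encode(sequence: str) -> List[List[int]]:
    """Encode a DNA string as one 4-element binary vector per base.

    Assumption: symbols outside A/C/G/T (for example N) become all zeros.
    """
    encoded = []
    for base in sequence.upper():
        row = [0] * len(BASES)
        idx = BASES.find(base)
        if idx >= 0:
            # Set the channel that corresponds to this base.
            row[idx] = 1
        encoded.append(row)
    return encoded


if __name__ == "__main__":
    # Print each base of a short hypothetical fragment next to its vector.
    for base, vector in zip("TATAAT", one_hot_encode("TATAAT")):
        print(base, vector)
```

A sequence of length L becomes an L x 4 binary matrix, which is the kind of fixed-width input a convolutional network such as ResNet can consume.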