Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features
- PMID: 17624492
- DOI: 10.1007/s00726-007-0568-2
Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features
Abstract
DNA-binding proteins play a pivotal role in gene regulation. It is vitally important to develop an automated and efficient method for timely identification of novel DNA-binding proteins. In this study, we proposed a method based on alone the primary sequences of proteins to predict the DNA-binding proteins. DNA-binding proteins were encoded by autocross-covariance transform, pseudo-amino acid composition, dipeptide composition, respectively and also the different combinations of the three encoded methods; further, these feature matrices were applied to support vector machine classifiers to predict the DNA-binding proteins. All modules were trained and validated by the jackknife cross-validation test. Through comparing the performance of these substituted modules, the best result was obtained from pseudo-amino acid composition with the overall accuracy of 96.6% and the sensitivity of 90.7%. The results suggest that it can efficiently predict the novel DNA-binding proteins only using the primary sequences.
Similar articles
-
Combing ontologies and dipeptide composition for predicting DNA-binding proteins.Amino Acids. 2008 May;34(4):635-41. doi: 10.1007/s00726-007-0016-3. Epub 2008 Jan 4. Amino Acids. 2008. PMID: 18175049
-
Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization.Amino Acids. 2008 May;34(4):653-60. doi: 10.1007/s00726-007-0018-1. Epub 2008 Jan 4. Amino Acids. 2008. PMID: 18175047
-
DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.J Theor Biol. 2018 Sep 7;452:22-34. doi: 10.1016/j.jtbi.2018.05.006. Epub 2018 May 16. J Theor Biol. 2018. PMID: 29753757
-
An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins.Amino Acids. 2009 Feb;36(2):167-75. doi: 10.1007/s00726-008-0044-7. Epub 2008 Feb 21. Amino Acids. 2009. PMID: 18288459
-
Identify DNA-binding proteins with optimal Chou's amino acid composition.Protein Pept Lett. 2012 Apr;19(4):398-405. doi: 10.2174/092986612799789404. Protein Pept Lett. 2012. PMID: 22316304
Cited by
-
Improved detection of DNA-binding proteins via compression technology on PSSM information.PLoS One. 2017 Sep 29;12(9):e0185587. doi: 10.1371/journal.pone.0185587. eCollection 2017. PLoS One. 2017. PMID: 28961273 Free PMC article.
-
Some illuminating remarks on molecular genetics and genomics as well as drug development.Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1. Mol Genet Genomics. 2020. PMID: 31894399 Review.
-
enDNA-Prot: identification of DNA-binding proteins by applying ensemble learning.Biomed Res Int. 2014;2014:294279. doi: 10.1155/2014/294279. Epub 2014 May 26. Biomed Res Int. 2014. PMID: 24977146 Free PMC article.
-
Predicting and analyzing DNA-binding domains using a systematic approach to identifying a set of informative physicochemical and biochemical properties.BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S47. doi: 10.1186/1471-2105-12-S1-S47. BMC Bioinformatics. 2011. PMID: 21342579 Free PMC article.
-
Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.Avicenna J Med Biotechnol. 2019 Jan-Mar;11(1):104-111. Avicenna J Med Biotechnol. 2019. PMID: 30800250 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources