Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes
- PMID: 17628605
- DOI: 10.1016/j.jtbi.2007.06.001
Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes
Abstract
With the rapid increment of protein sequence data, it is indispensable to develop automated and reliable predictive methods for protein function annotation. One approach for facilitating protein function prediction is to classify proteins into functional families from primary sequence. Being the most important group of all proteins, the accurate prediction for enzyme family classes and subfamily classes is closely related to their biological functions. In this paper, for the prediction of enzyme subfamily classes, the Chou's amphiphilic pseudo-amino acid composition [Chou, K.C., 2005. Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes. Bioinformatics 21, 10-19] has been adopted to represent the protein samples for training the 'one-versus-rest' support vector machine. As a demonstration, the jackknife test was performed on the dataset that contains 2640 oxidoreductase sequences classified into 16 subfamily classes [Chou, K.C., Elrod, D.W., 2003. Prediction of enzyme family classes. J. Proteome Res. 2, 183-190]. The overall accuracy thus obtained was 80.87%. The significant enhancement in the accuracy indicates that the current method might play a complementary role to the exiting methods.
Similar articles
-
Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition.J Theor Biol. 2007 Sep 21;248(2):377-81. doi: 10.1016/j.jtbi.2007.05.019. Epub 2007 May 18. J Theor Biol. 2007. PMID: 17572445
-
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23. Amino Acids. 2009. PMID: 18726140
-
Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes.Amino Acids. 2007 Nov;33(4):623-9. doi: 10.1007/s00726-007-0496-1. Epub 2007 Feb 19. Amino Acids. 2007. PMID: 17308864
-
A Brief Review on Software Tools in Generating Chou's Pseudo-factor Representations for All Types of Biological Sequences.Protein Pept Lett. 2018;25(9):822-829. doi: 10.2174/0929866525666180905111124. Protein Pept Lett. 2018. PMID: 30182829 Review.
-
An overview on predicting the subcellular location of a protein.In Silico Biol. 2002;2(3):291-303. In Silico Biol. 2002. PMID: 12542414 Review.
Cited by
-
Predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model.PLoS One. 2012;7(11):e49040. doi: 10.1371/journal.pone.0049040. Epub 2012 Nov 26. PLoS One. 2012. PMID: 23189138 Free PMC article.
-
A survey of computational intelligence techniques in protein function prediction.Int J Proteomics. 2014;2014:845479. doi: 10.1155/2014/845479. Epub 2014 Dec 11. Int J Proteomics. 2014. PMID: 25574395 Free PMC article. Review.
-
Bridging protein local structures and protein functions.Amino Acids. 2008 Oct;35(3):627-50. doi: 10.1007/s00726-008-0088-8. Epub 2008 Apr 18. Amino Acids. 2008. PMID: 18421562 Free PMC article. Review.
-
iSNO-AAPair: incorporating amino acid pairwise coupling into PseAAC for predicting cysteine S-nitrosylation sites in proteins.PeerJ. 2013 Oct 3;1:e171. doi: 10.7717/peerj.171. eCollection 2013. PeerJ. 2013. PMID: 24109555 Free PMC article.
-
iACP: a sequence-based tool for identifying anticancer peptides.Oncotarget. 2016 Mar 29;7(13):16895-909. doi: 10.18632/oncotarget.7815. Oncotarget. 2016. PMID: 26942877 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials