HIV-1 protease cleavage site prediction based on amino acid property
- PMID: 18496789
- DOI: 10.1002/jcc.21024
HIV-1 protease cleavage site prediction based on amino acid property
Abstract
Knowledge of the polyprotein cleavage sites by HIV protease will refine our understanding of its specificity, and the information thus acquired is useful for designing specific and efficient HIV protease inhibitors. Recently, several works have approached the HIV-1 protease specificity problem by applying a number of classifier creation and combination methods. The pace in searching for the proper inhibitors of HIV protease will be greatly expedited if one can find an accurate, robust, and rapid method for predicting the cleavage sites in proteins by HIV protease. In this article, we selected HIV-1 protease as the subject of the study. 299 oligopeptides were chosen for the training set, while the other 63 oligopeptides were taken as a test set. The peptides are represented by features constructed by AAIndex (Kawashima et al., Nucleic Acids Res 1999, 27, 368; Kawashima and Kanehisa, Nucleic Acids Res 2000, 28, 374). The mRMR method (Maximum Relevance, Minimum Redundancy; Ding and Peng, Proc Second IEEE Comput Syst Bioinformatics Conf 2003, 523; Peng et al., IEEE Trans Pattern Anal Mach Intell 2005, 27, 1226) combining with incremental feature selection (IFS) and feature forward search (FFS) are applied to find the two important cleavage sites and to select 364 important biochemistry features by jackknife test. Using KNN (K-nearest neighbors) to combine the selected features, the prediction model obtains high accuracy rate of 91.3% for Jackknife cross-validation test and 87.3% for independent-set test. It is expected that our feature selection scheme can be referred to as a useful assistant technique for finding effective inhibitors of HIV protease, especially for the scientists in this field.
Copyright 2008 Wiley Periodicals, Inc.
Similar articles
-
Support Vector Machines for predicting HIV protease cleavage sites in protein.J Comput Chem. 2002 Jan 30;23(2):267-74. doi: 10.1002/jcc.10017. J Comput Chem. 2002. PMID: 11924738
-
Prediction of human immunodeficiency virus protease cleavage sites in proteins.Anal Biochem. 1996 Jan 1;233(1):1-14. doi: 10.1006/abio.1996.0001. Anal Biochem. 1996. PMID: 8789141 Review.
-
GalNAc-transferase specificity prediction based on feature selection method.Peptides. 2009 Feb;30(2):359-64. doi: 10.1016/j.peptides.2008.09.020. Epub 2008 Oct 8. Peptides. 2009. PMID: 18955094
-
Predicting human immunodeficiency virus protease cleavage sites in proteins by a discriminant function method.Proteins. 1996 Jan;24(1):51-72. doi: 10.1002/(SICI)1097-0134(199601)24:1<51::AID-PROT4>3.0.CO;2-R. Proteins. 1996. PMID: 8628733
-
Bioinformatic approaches for modeling the substrate specificity of HIV-1 protease: an overview.Expert Rev Mol Diagn. 2007 Jul;7(4):435-51. doi: 10.1586/14737159.7.4.435. Expert Rev Mol Diagn. 2007. PMID: 17620050 Review.
Cited by
-
Prediction of nucleosome positioning based on transcription factor binding sites.PLoS One. 2010 Sep 1;5(9):e12495. doi: 10.1371/journal.pone.0012495. PLoS One. 2010. PMID: 20824131 Free PMC article.
-
A consistency-based feature selection method allied with linear SVMs for HIV-1 protease cleavage site prediction.PLoS One. 2013 Aug 23;8(8):e63145. doi: 10.1371/journal.pone.0063145. eCollection 2013. PLoS One. 2013. PMID: 24058397 Free PMC article.
-
Prediction of RNA-binding proteins by voting systems.J Biomed Biotechnol. 2011;2011:506205. doi: 10.1155/2011/506205. Epub 2011 Jul 26. J Biomed Biotechnol. 2011. PMID: 21826121 Free PMC article.
-
AVCpred: an integrated web server for prediction and design of antiviral compounds.Chem Biol Drug Des. 2017 Jan;89(1):74-83. doi: 10.1111/cbdd.12834. Epub 2016 Sep 9. Chem Biol Drug Des. 2017. PMID: 27490990 Free PMC article.
-
Feature Selection Combined with Neural Network Structure Optimization for HIV-1 Protease Cleavage Site Prediction.Biomed Res Int. 2015;2015:263586. doi: 10.1155/2015/263586. Epub 2015 Apr 15. Biomed Res Int. 2015. PMID: 25961009 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources