An overview on predicting the subcellular location of a protein
- PMID: 12542414
An overview on predicting the subcellular location of a protein
Abstract
The present paper overviews the issue on predicting the subcellular location of a protein. Five measures of extracting information from the global sequence based on the Bayes discriminant algorithm are reviewed. 1) The auto-correlation functions of amino acid indices along the sequence; 2) The quasi-sequence-order approach; 3) the pseudo-amino acid composition; 4) the unified attribute vector in Hilbert space, 5) Zp parameters extracted from the Zp curve. The actual performance of the predictive accuracy is closely related to the degree of similarity between the training and testing sets or to the average degree of pairwise similarity in dataset in a cross-validated study. Many scholars considered that the current higher predictive accuracy still cannot ensure that some available algorithms are effective in practice prediction for the higher pairwise sequence identity of the datasets, but some of them declared that construction of the dataset used for developing software should base on the reality determined by the Mother Nature that some subcellular locations really contain only a minor number of proteins of which some even have a high percentage of sequence similarity. Owing to the complexity of the problem itself, some very sophisticated and special programs are needed for both constructing dataset and improving the prediction. Anyhow finding the target information in mature protein sequence and properly cooperating it with sorting signals in prediction may further improve the overall predictive accuracy and make the prediction into practice.
Similar articles
-
Prediction of protein subcellular locations by incorporating quasi-sequence-order effect.Biochem Biophys Res Commun. 2000 Nov 19;278(2):477-83. doi: 10.1006/bbrc.2000.3815. Biochem Biophys Res Commun. 2000. PMID: 11097861
-
Prediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition.Biopolymers. 2001 Apr 15;58(5):491-9. doi: 10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I. Biopolymers. 2001. PMID: 11241220
-
Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21. Biochem Biophys Res Commun. 2006. PMID: 16808903
-
Predicting multisite protein subcellular locations: progress and challenges.Expert Rev Proteomics. 2013 Jun;10(3):227-37. doi: 10.1586/epr.13.16. Expert Rev Proteomics. 2013. PMID: 23777214 Review.
-
Recent progress in predicting protein sub-subcellular locations.Expert Rev Proteomics. 2011 Jun;8(3):391-404. doi: 10.1586/epr.11.20. Expert Rev Proteomics. 2011. PMID: 21679119 Review.
Cited by
-
Probing of the nuclear import and export signals and subcellular transport mechanism of varicella-zoster virus tegument protein open reading frame 10.Med Microbiol Immunol. 2012 Feb;201(1):103-11. doi: 10.1007/s00430-011-0211-4. Epub 2011 Jul 14. Med Microbiol Immunol. 2012. PMID: 21755366
-
Cloning, expression, purification, antiserum preparation and its characteristics of the truncated UL6 protein of herpes simplex virus 1.Mol Biol Rep. 2014 Sep;41(9):5997-6002. doi: 10.1007/s11033-014-3477-y. Epub 2014 Jun 29. Mol Biol Rep. 2014. PMID: 24973881
-
Expression, Purification, and Antiserum Production of the Truncated UL31 Protein of Herpes Simplex Virus 1.Iran J Biotechnol. 2019 Jan 11;17(1):e1609. doi: 10.21859/ijb.1609. eCollection 2019 Jan. Iran J Biotechnol. 2019. PMID: 31457039 Free PMC article.
-
pSLIP: SVM based protein subcellular localization prediction using multiple physicochemical properties.BMC Bioinformatics. 2005 Jun 17;6:152. doi: 10.1186/1471-2105-6-152. BMC Bioinformatics. 2005. PMID: 15963230 Free PMC article.
-
Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction.BMC Bioinformatics. 2006 Nov 7;7:491. doi: 10.1186/1471-2105-7-491. BMC Bioinformatics. 2006. PMID: 17090318 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources