Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition
- PMID: 14635197
- DOI: 10.1002/jcb.10719
Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition
Erratum in
- J Cell Biochem. 2004 Apr 1;91(5):1085
Abstract
Given a protein sequence, how to identify its subcellular location? With the rapid increase in newly found protein sequences entering into databanks, the problem has become more and more important because the function of a protein is closely correlated with its localization. To practically deal with the challenge, a dataset has been established that allows the identification performed among the following 14 subcellular locations: (1) cell wall, (2) centriole, (3) chloroplast, (4) cytoplasm, (5) cytoskeleton, (6) endoplasmic reticulum, (7) extracellular, (8) Golgi apparatus, (9) lysosome, (10) mitochondria, (11) nucleus, (12) peroxisome, (13) plasma membrane, and (14) vacuole. Compared with the datasets constructed by the previous investigators, the current one represents the largest in the scope of localizations covered, and hence many proteins which were totally out of picture in the previous treatments, can now be investigated. Meanwhile, to enhance the potential and flexibility in taking into account the sequence-order effect, the series-mode pseudo-amino-acid-composition has been introduced as a representation for a protein. High success rates are obtained by the re-substitution test, jackknife test, and independent dataset test, respectively. It is anticipated that the current automated method can be developed to a high throughput tool for practical usage in both basic research and pharmaceutical industry.
Copyright 2003 Wiley-Liss, Inc.
Similar articles
-
Predicting protein subcellular location by fusing multiple classifiers.J Cell Biochem. 2006 Oct 1;99(2):517-27. doi: 10.1002/jcb.20879. J Cell Biochem. 2006. PMID: 16639720
-
Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21. Biochem Biophys Res Commun. 2006. PMID: 16808903
-
Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19. Amino Acids. 2007. PMID: 17235453
-
An overview on predicting the subcellular location of a protein.In Silico Biol. 2002;2(3):291-303. In Silico Biol. 2002. PMID: 12542414 Review.
-
Recent progress in predicting protein sub-subcellular locations.Expert Rev Proteomics. 2011 Jun;8(3):391-404. doi: 10.1586/epr.11.20. Expert Rev Proteomics. 2011. PMID: 21679119 Review.
Cited by
-
SUBA: the Arabidopsis Subcellular Database.Nucleic Acids Res. 2007 Jan;35(Database issue):D213-8. doi: 10.1093/nar/gkl863. Epub 2006 Oct 28. Nucleic Acids Res. 2007. PMID: 17071959 Free PMC article.
-
Going from where to why--interpretable prediction of protein subcellular localization.Bioinformatics. 2010 May 1;26(9):1232-8. doi: 10.1093/bioinformatics/btq115. Epub 2010 Mar 17. Bioinformatics. 2010. PMID: 20299325 Free PMC article.
-
Graphical representation and mathematical characterization of protein sequences and applications to viral proteins.Adv Protein Chem Struct Biol. 2011;83:1-42. doi: 10.1016/B978-0-12-381262-9.00001-X. Adv Protein Chem Struct Biol. 2011. PMID: 21570664 Free PMC article. Review.
-
Large-scale automated analysis of location patterns in randomly tagged 3T3 cells.Ann Biomed Eng. 2007 Jun;35(6):1081-7. doi: 10.1007/s10439-007-9254-5. Epub 2007 Feb 7. Ann Biomed Eng. 2007. PMID: 17285363 Free PMC article.
-
iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.Oncotarget. 2016 Oct 25;7(43):69783-69793. doi: 10.18632/oncotarget.11975. Oncotarget. 2016. PMID: 27626500 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources