A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0
- PMID: 20368981
- PMCID: PMC2848569
- DOI: 10.1371/journal.pone.0009931
A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0
Abstract
Information of subcellular locations of proteins is important for in-depth studies of cell biology. It is very useful for proteomics, system biology and drug development as well. However, most existing methods for predicting protein subcellular location can only cover 5 to 12 location sites. Also, they are limited to deal with single-location proteins and hence failed to work for multiplex proteins, which can simultaneously exist at, or move between, two or more location sites. Actually, multiplex proteins of this kind usually posses some important biological functions worthy of our special notice. A new predictor called "Euk-mPLoc 2.0" is developed by hybridizing the gene ontology information, functional domain information, and sequential evolutionary information through three different modes of pseudo amino acid composition. It can be used to identify eukaryotic proteins among the following 22 locations: (1) acrosome, (2) cell wall, (3) centriole, (4) chloroplast, (5) cyanelle, (6) cytoplasm, (7) cytoskeleton, (8) endoplasmic reticulum, (9) endosome, (10) extracell, (11) Golgi apparatus, (12) hydrogenosome, (13) lysosome, (14) melanosome, (15) microsome (16) mitochondria, (17) nucleus, (18) peroxisome, (19) plasma membrane, (20) plastid, (21) spindle pole body, and (22) vacuole. Compared with the existing methods for predicting eukaryotic protein subcellular localization, the new predictor is much more powerful and flexible, particularly in dealing with proteins with multiple locations and proteins without available accession numbers. For a newly-constructed stringent benchmark dataset which contains both single- and multiple-location proteins and in which none of proteins has pairwise sequence identity to any other in a same location, the overall jackknife success rate achieved by Euk-mPLoc 2.0 is more than 24% higher than those by any of the existing predictors. As a user-friendly web-server, Euk-mPLoc 2.0 is freely accessible at http://www.csbio.sjtu.edu.cn/bioinf/euk-multi-2/. For a query protein sequence of 400 amino acids, it will take about 15 seconds for the web-server to yield the predicted result; the longer the sequence is, the more time it may usually need. It is anticipated that the novel approach and the powerful predictor as presented in this paper will have a significant impact to Molecular Cell Biology, System Biology, Proteomics, Bioinformatics, and Drug Development.
Conflict of interest statement
Figures



Similar articles
-
iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins.PLoS One. 2011 Mar 30;6(3):e18258. doi: 10.1371/journal.pone.0018258. PLoS One. 2011. PMID: 21483473 Free PMC article.
-
A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.PLoS One. 2012;7(5):e36317. doi: 10.1371/journal.pone.0036317. Epub 2012 May 22. PLoS One. 2012. PMID: 22629314 Free PMC article.
-
Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization.PLoS One. 2010 Jun 28;5(6):e11335. doi: 10.1371/journal.pone.0011335. PLoS One. 2010. PMID: 20596258 Free PMC article.
-
Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19. Amino Acids. 2007. PMID: 17235453
-
pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset.Curr Pharm Des. 2018;24(34):4013-4022. doi: 10.2174/1381612824666181119145030. Curr Pharm Des. 2018. PMID: 30451108 Review.
Cited by
-
FAM46 proteins are novel eukaryotic non-canonical poly(A) polymerases.Nucleic Acids Res. 2016 May 5;44(8):3534-48. doi: 10.1093/nar/gkw222. Epub 2016 Apr 7. Nucleic Acids Res. 2016. PMID: 27060136 Free PMC article.
-
An ensemble classifier for eukaryotic protein subcellular location prediction using gene ontology categories and amino acid hydrophobicity.PLoS One. 2012;7(1):e31057. doi: 10.1371/journal.pone.0031057. Epub 2012 Jan 30. PLoS One. 2012. PMID: 22303481 Free PMC article.
-
Identification of conformational B-cell Epitopes in an antigen from its primary sequence.Immunome Res. 2010 Oct 20;6:6. doi: 10.1186/1745-7580-6-6. Immunome Res. 2010. PMID: 20961417 Free PMC article.
-
Proteomic enzyme analysis of the marine fungus Paradendryphiella salina reveals alginate lyase as a minimal adaptation strategy for brown algae degradation.Sci Rep. 2019 Aug 26;9(1):12338. doi: 10.1038/s41598-019-48823-9. Sci Rep. 2019. PMID: 31451726 Free PMC article.
-
Genome-wide identification and characterization of superoxide dismutases in four oyster species reveals functional differentiation in response to biotic and abiotic stress.BMC Genomics. 2022 May 18;23(1):378. doi: 10.1186/s12864-022-08610-9. BMC Genomics. 2022. PMID: 35585505 Free PMC article.
References
-
- Nakashima H, Nishikawa K. Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies. J Mol Biol. 1994;238:54–61. - PubMed
-
- Cedano J, Aloy P, P'erez-Pons JA, Querol E. Relation between amino acid composition and cellular location of proteins. J Mol Biol. 1997;266:594–600. - PubMed
-
- Chou KC, Elrod DW. Protein subcellular location prediction. Protein Engineering. 1999;12:107–118. - PubMed
-
- Emanuelsson O, Nielsen H, Brunak S, von Heijne G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. Journal of Molecular Biology. 2000;300:1005–1016. - PubMed
-
- Zhou GP, Doctor K. Subcellular location prediction of apoptosis proteins. PROTEINS: Structure, Function, and Genetics. 2003;50:44–48. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials