Plus ça change - evolutionary sequence divergence predicts protein subcellular localization signals
- PMID: 24438075
- PMCID: PMC3906766
- DOI: 10.1186/1471-2164-15-46
Plus ça change - evolutionary sequence divergence predicts protein subcellular localization signals
Abstract
Background: Protein subcellular localization is a central problem in understanding cell biology and has been the focus of intense research. In order to predict localization from amino acid sequence a myriad of features have been tried: including amino acid composition, sequence similarity, the presence of certain motifs or domains, and many others. Surprisingly, sequence conservation of sorting motifs has not yet been employed, despite its extensive use for tasks such as the prediction of transcription factor binding sites.
Results: Here, we flip the problem around, and present a proof of concept for the idea that the lack of sequence conservation can be a novel feature for localization prediction. We show that for yeast, mammal and plant datasets, evolutionary sequence divergence alone has significant power to identify sequences with N-terminal sorting sequences. Moreover sequence divergence is nearly as effective when computed on automatically defined ortholog sets as on hand curated ones. Unfortunately, sequence divergence did not necessarily increase classification performance when combined with some traditional sequence features such as amino acid composition. However a post-hoc analysis of the proteins in which sequence divergence changes the prediction yielded some proteins with atypical (i.e. not MPP-cleaved) matrix targeting signals as well as a few misannotations.
Conclusion: We report the results of the first quantitative study of the effectiveness of evolutionary sequence divergence as a feature for protein subcellular localization prediction. We show that divergence is indeed useful for prediction, but it is not trivial to improve overall accuracy simply by adding this feature to classical sequence features. Nevertheless we argue that sequence divergence is a promising feature and show anecdotal examples in which it succeeds where other features fail.
Figures







Similar articles
-
MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites.Mol Cell Proteomics. 2015 Apr;14(4):1113-26. doi: 10.1074/mcp.M114.043083. Epub 2015 Feb 10. Mol Cell Proteomics. 2015. PMID: 25670805 Free PMC article.
-
Prediction of the subcellular localization of eukaryotic proteins using sequence signals and composition.Proteomics. 2004 Jun;4(6):1591-6. doi: 10.1002/pmic.200300769. Proteomics. 2004. PMID: 15174129
-
Prediction of protein subcellular localization.Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018. Proteins. 2006. PMID: 16752418
-
Evolution, structure and function of mitochondrial carriers: a review with new insights.Plant J. 2011 Apr;66(1):161-81. doi: 10.1111/j.1365-313X.2011.04516.x. Plant J. 2011. PMID: 21443630 Review.
-
Conservation and divergence in plant microRNAs.Plant Mol Biol. 2012 Sep;80(1):3-16. doi: 10.1007/s11103-011-9829-2. Epub 2011 Oct 14. Plant Mol Biol. 2012. PMID: 21996939 Review.
Cited by
-
MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites.Mol Cell Proteomics. 2015 Apr;14(4):1113-26. doi: 10.1074/mcp.M114.043083. Epub 2015 Feb 10. Mol Cell Proteomics. 2015. PMID: 25670805 Free PMC article.
-
Cytosolic and Nuclear Co-localization of Betalain Biosynthetic Enzymes in Tobacco Suggests that Betalains Are Synthesized in the Cytoplasm and/or Nucleus of Betalainic Plant Cells.Front Plant Sci. 2017 May 18;8:831. doi: 10.3389/fpls.2017.00831. eCollection 2017. Front Plant Sci. 2017. PMID: 28572813 Free PMC article.
-
Transcriptome analysis of Leucojum aestivum and identification of genes involved in norbelladine biosynthesis.Planta. 2022 Jan 3;255(2):30. doi: 10.1007/s00425-021-03741-x. Planta. 2022. PMID: 34981205
-
A Brief History of Protein Sorting Prediction.Protein J. 2019 Jun;38(3):200-216. doi: 10.1007/s10930-019-09838-3. Protein J. 2019. PMID: 31119599 Free PMC article. Review.
-
NommPred: Prediction of Mitochondrial and Mitochondrion-Related Organelle Proteins of Nonmodel Organisms.Evol Bioinform Online. 2018 Dec 23;14:1176934318819835. doi: 10.1177/1176934318819835. eCollection 2018. Evol Bioinform Online. 2018. PMID: 30626996 Free PMC article.
References
-
- Huh WK, Falvo JV, Gerke LG, Carroll AS, Howson RW, Weissman JS, O’Shea EK. Global analysis of protein localization in budding yeast. Nature. 2003;15(6959):689–691. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases