Computational identification and analysis of protein short linear motifs
- PMID: 20515727
- DOI: 10.2741/3647
Computational identification and analysis of protein short linear motifs
Abstract
Short linear motifs (SLiMs) in proteins can act as targets for proteolytic cleavage, sites of post-translational modification, determinants of sub-cellular localization, and mediators of protein-protein interactions. Computational discovery of SLiMs involves assembling a group of proteins postulated to share a potential motif, masking out residues less likely to contain such a motif, down-weighting shared motifs arising through common evolutionary descent, and calculation of statistical probabilities allowing for the multiple testing of all possible motifs. Much of the challenge for motif discovery lies in the assembly and masking of datasets of proteins likely to share motifs, since the motifs are typically short (between 3 and 10 amino acids in length), so that potential signals can be easily swamped by the noise of stochastically recurring motifs. Focusing on disordered regions of proteins, where SLiMs are predominantly found, and masking out non-conserved residues can reduce the level of noise but more work is required to improve the quality of high-throughput experimental datasets (e.g. of physical protein interactions) as input for computational discovery.
Similar articles
-
Discovering short linear protein motif based on selective training of profile hidden Markov models.J Theor Biol. 2015 Jul 21;377:75-84. doi: 10.1016/j.jtbi.2015.03.010. Epub 2015 Mar 17. J Theor Biol. 2015. PMID: 25791288
-
SLiMFinder: a probabilistic method for identifying over-represented, convergently evolved, short linear motifs in proteins.PLoS One. 2007 Oct 3;2(10):e967. doi: 10.1371/journal.pone.0000967. PLoS One. 2007. PMID: 17912346 Free PMC article.
-
Computational prediction of short linear motifs from protein sequences.Methods Mol Biol. 2015;1268:89-141. doi: 10.1007/978-1-4939-2285-7_6. Methods Mol Biol. 2015. PMID: 25555723 Review.
-
Bioinformatics Approaches for Predicting Disordered Protein Motifs.Adv Exp Med Biol. 2015;870:291-318. doi: 10.1007/978-3-319-20164-1_9. Adv Exp Med Biol. 2015. PMID: 26387106 Review.
-
Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery.Bioinformatics. 2009 Feb 15;25(4):443-50. doi: 10.1093/bioinformatics/btn664. Epub 2009 Jan 9. Bioinformatics. 2009. PMID: 19136552
Cited by
-
QSLiMFinder: improved short linear motif prediction using specific query protein data.Bioinformatics. 2015 Jul 15;31(14):2284-93. doi: 10.1093/bioinformatics/btv155. Epub 2015 Mar 19. Bioinformatics. 2015. PMID: 25792551 Free PMC article.
-
The structural and functional signatures of proteins that undergo multiple events of post-translational modification.Protein Sci. 2014 Aug;23(8):1077-93. doi: 10.1002/pro.2494. Epub 2014 Jun 11. Protein Sci. 2014. PMID: 24888500 Free PMC article.
-
Fast and accurate discovery of degenerate linear motifs in protein sequences.PLoS One. 2014 Sep 10;9(9):e106081. doi: 10.1371/journal.pone.0106081. eCollection 2014. PLoS One. 2014. PMID: 25207816 Free PMC article.
-
Short Linear Motifs in Colorectal Cancer Interactome and Tumorigenesis.Cells. 2022 Nov 23;11(23):3739. doi: 10.3390/cells11233739. Cells. 2022. PMID: 36496998 Free PMC article. Review.
-
Proteome-wide discovery of evolutionary conserved sequences in disordered regions.Sci Signal. 2012 Mar 13;5(215):rs1. doi: 10.1126/scisignal.2002515. Sci Signal. 2012. PMID: 22416277 Free PMC article.