Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery
- PMID: 19136552
- DOI: 10.1093/bioinformatics/btn664
Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery
Abstract
Motivation: Short linear motifs (SLiMs) are important mediators of protein-protein interactions. Their short and degenerate nature presents a challenge for computational discovery. We sought to improve SLiM discovery by incorporating evolutionary information, since SLiMs are more conserved than surrounding residues.
Results: We have developed a new method that assesses the evolutionary signal of a residue in its sequence and structural context. Under-conserved residues are masked out prior to SLiM discovery, allowing incorporation into the existing statistical model employed by SLiMFinder. The method shows considerable robustness in terms of both the conservation score used for individual residues and the size of the sequence neighbourhood. Optimal parameters significantly improve return of known functional motifs from benchmarking data, raising the return of significant validated SLiMs from typical human interaction datasets from 20% to 60%, while retaining the high level of stringency needed for application to real biological data. The success of this regime indicates that it could be of general benefit to computational annotation and prediction of protein function at the sequence level.
Availability: All data and tools in this article are available at http://bioware.ucd.ie/~slimdisc/slimfinder/conmasking/.
Similar articles
-
The SLiMDisc server: short, linear motif discovery in proteins.Nucleic Acids Res. 2007 Jul;35(Web Server issue):W455-9. doi: 10.1093/nar/gkm400. Epub 2007 Jun 18. Nucleic Acids Res. 2007. PMID: 17576682 Free PMC article.
-
D-SLIMMER: domain-SLiM interaction motifs miner for sequence based protein-protein interaction data.J Proteome Res. 2011 Dec 2;10(12):5285-95. doi: 10.1021/pr200312e. Epub 2011 Nov 1. J Proteome Res. 2011. PMID: 22004555
-
Discovery of stable and significant binding motif pairs from PDB complexes and protein interaction datasets.Bioinformatics. 2005 Feb 1;21(3):314-24. doi: 10.1093/bioinformatics/bti019. Epub 2004 Sep 16. Bioinformatics. 2005. PMID: 15374856
-
Computational prediction of short linear motifs from protein sequences.Methods Mol Biol. 2015;1268:89-141. doi: 10.1007/978-1-4939-2285-7_6. Methods Mol Biol. 2015. PMID: 25555723 Review.
-
Discovering sequence motifs.Methods Mol Biol. 2008;452:231-51. doi: 10.1007/978-1-60327-159-2_12. Methods Mol Biol. 2008. PMID: 18566768 Review.
Cited by
-
Human Protein Reference Database and Human Proteinpedia as resources for phosphoproteome analysis.Mol Biosyst. 2012 Feb;8(2):453-63. doi: 10.1039/c1mb05340j. Epub 2011 Dec 8. Mol Biosyst. 2012. PMID: 22159132 Free PMC article. Review.
-
SLiMSearch 2.0: biological context for short linear motifs in proteins.Nucleic Acids Res. 2011 Jul;39(Web Server issue):W56-60. doi: 10.1093/nar/gkr402. Epub 2011 May 26. Nucleic Acids Res. 2011. PMID: 21622654 Free PMC article.
-
Evidence for the concerted evolution between short linear protein motifs and their flanking regions.PLoS One. 2009 Jul 8;4(7):e6052. doi: 10.1371/journal.pone.0006052. PLoS One. 2009. PMID: 19584925 Free PMC article.
-
The molecular basis for cellular function of intrinsically disordered protein regions.Nat Rev Mol Cell Biol. 2024 Mar;25(3):187-211. doi: 10.1038/s41580-023-00673-0. Epub 2023 Nov 13. Nat Rev Mol Cell Biol. 2024. PMID: 37957331 Free PMC article. Review.
-
PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.Protein Sci. 2025 Jan;34(1):e70004. doi: 10.1002/pro.70004. Protein Sci. 2025. PMID: 39720898
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials