A computational strategy for the prediction of functional linear peptide motifs in proteins
- PMID: 17977881
- DOI: 10.1093/bioinformatics/btm524
A computational strategy for the prediction of functional linear peptide motifs in proteins
Abstract
Motivation: Short linear peptide motifs mediate protein-protein interaction, cell compartment targeting and represent the sites of post-translational modification. The identification of functional motifs by conventional sequence searches, however, is hampered by the short length of the motifs resulting in a large number of hits of which only a small portion is functional.
Results: We have developed a procedure for the identification of functional motifs, which scores pattern conservation in homologous sequences by taking explicitly into account the sequence similarity to the query sequence. For a further improvement of this method, sequence filters have been optimized to mask those sequence regions containing little or no linear motifs. The performance of this approach was verified by measuring its ability to identify 576 experimentally validated motifs among a total of 15 563 instances in a set of 415 protein sequences. Compared to a random selection procedure, the joint application of sequence filters and the novel scoring scheme resulted in a 9-fold enrichment of validated functional motifs on the first rank. In addition, only half as many hits need to be investigated to recover 75% of the functional instances in our dataset. Therefore, this motif-scoring approach should be helpful to guide experiments because it allows focusing on those short linear peptide motifs that have a high probability to be functional.
Similar articles
-
Predicting protein-peptide interactions via a network-based motif sampler.Bioinformatics. 2004 Aug 4;20 Suppl 1:i274-82. doi: 10.1093/bioinformatics/bth922. Bioinformatics. 2004. PMID: 15262809
-
An integrative approach for predicting interactions of protein regions.Bioinformatics. 2008 Aug 15;24(16):i35-41. doi: 10.1093/bioinformatics/btn290. Bioinformatics. 2008. PMID: 18689837
-
Prediction of short linear protein binding regions.J Mol Biol. 2012 Jan 6;415(1):193-204. doi: 10.1016/j.jmb.2011.10.025. Epub 2011 Oct 21. J Mol Biol. 2012. PMID: 22079048
-
Predicting protein function from sequence and structural data.Curr Opin Struct Biol. 2005 Jun;15(3):275-84. doi: 10.1016/j.sbi.2005.04.003. Curr Opin Struct Biol. 2005. PMID: 15963890 Review.
-
Computational prediction of protein-protein interactions.Methods Mol Biol. 2004;261:445-68. doi: 10.1385/1-59259-762-9:445. Methods Mol Biol. 2004. PMID: 15064475 Review.
Cited by
-
A structure filter for the Eukaryotic Linear Motif Resource.BMC Bioinformatics. 2009 Oct 24;10:351. doi: 10.1186/1471-2105-10-351. BMC Bioinformatics. 2009. PMID: 19852836 Free PMC article.
-
Resources to Discover and Use Short Linear Motifs in Viral Proteins.Trends Biotechnol. 2020 Jan;38(1):113-127. doi: 10.1016/j.tibtech.2019.07.004. Epub 2019 Aug 16. Trends Biotechnol. 2020. PMID: 31427097 Free PMC article. Review.
-
Proteome-wide assessment of human interactome as a source of capturing domain-motif and domain-domain interactions.J Cell Commun Signal. 2024 Jan 19;18(1):e12014. doi: 10.1002/ccs3.12014. eCollection 2024 Mar. J Cell Commun Signal. 2024. PMID: 38545252 Free PMC article.
-
ELM: the status of the 2010 eukaryotic linear motif resource.Nucleic Acids Res. 2010 Jan;38(Database issue):D167-80. doi: 10.1093/nar/gkp1016. Epub 2009 Nov 17. Nucleic Acids Res. 2010. PMID: 19920119 Free PMC article.
-
seeMotif: exploring and visualizing sequence motifs in 3D structures.Nucleic Acids Res. 2009 Jul;37(Web Server issue):W552-8. doi: 10.1093/nar/gkp439. Epub 2009 May 28. Nucleic Acids Res. 2009. PMID: 19477961 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources