Discovery of phosphorylation motif mixtures in phosphoproteomics data
- PMID: 18996944
- PMCID: PMC2638929
- DOI: 10.1093/bioinformatics/btn569
Discovery of phosphorylation motif mixtures in phosphoproteomics data
Abstract
Motivation: Modification of proteins via phosphorylation is a primary mechanism for signal transduction in cells. Phosphorylation sites on proteins are determined in part through particular patterns, or motifs, present in the amino acid sequence.
Results: We describe an algorithm that simultaneously discovers multiple motifs in a set of peptides that were phosphorylated by several different kinases. Such sets of peptides are routinely produced in proteomics experiments.Our motif-finding algorithm uses the principle of minimum description length to determine a mixture of sequence motifs that distinguish a foreground set of phosphopeptides from a background set of unphosphorylated peptides. We show that our algorithm outperforms existing motif-finding algorithms on synthetic datasets consisting of mixtures of known phosphorylation sites. We also derive a motif specificity score that quantifies whether or not the phosphoproteins containing an instance of a motif have a significant number of known interactions. Application of our motif-finding algorithm to recently published human and mouse proteomic studies recovers several known phosphorylation motifs and reveals a number of novel motifs that are enriched for interactions with a particular kinase or phosphatase. Our tools provide a new approach for uncovering the sequence specificities of uncharacterized kinases or phosphatases.
Figures


Similar articles
-
Discovery of protein phosphorylation motifs through exploratory data analysis.PLoS One. 2011;6(5):e20025. doi: 10.1371/journal.pone.0020025. Epub 2011 May 25. PLoS One. 2011. PMID: 21647451 Free PMC article.
-
MMFPh: a maximal motif finder for phosphoproteomics datasets.Bioinformatics. 2012 Jun 15;28(12):1562-70. doi: 10.1093/bioinformatics/bts195. Epub 2012 Apr 23. Bioinformatics. 2012. PMID: 22531218 Free PMC article.
-
Phosphoproteomic analysis of ethylene-regulated protein phosphorylation in etiolated seedlings of Arabidopsis mutant ein2 using two-dimensional separations coupled with a hybrid quadrupole time-of-flight mass spectrometer.Proteomics. 2009 Mar;9(6):1646-61. doi: 10.1002/pmic.200800420. Proteomics. 2009. PMID: 19253305
-
From Phosphosites to Kinases.Methods Mol Biol. 2016;1355:307-21. doi: 10.1007/978-1-4939-3049-4_21. Methods Mol Biol. 2016. PMID: 26584935 Review.
-
Systems Analysis for Interpretation of Phosphoproteomics Data.Methods Mol Biol. 2016;1355:341-60. doi: 10.1007/978-1-4939-3049-4_23. Methods Mol Biol. 2016. PMID: 26584937 Review.
Cited by
-
PTMphinder: an R package for PTM site localization and motif extraction from proteomic datasets.PeerJ. 2019 Jun 4;7:e7046. doi: 10.7717/peerj.7046. eCollection 2019. PeerJ. 2019. PMID: 31198645 Free PMC article.
-
MoMo: discovery of statistically significant post-translational modification motifs.Bioinformatics. 2019 Aug 15;35(16):2774-2782. doi: 10.1093/bioinformatics/bty1058. Bioinformatics. 2019. PMID: 30596994 Free PMC article.
-
PhosSA: Fast and accurate phosphorylation site assignment algorithm for mass spectrometry data.Proteome Sci. 2013 Nov 7;11(Suppl 1):S14. doi: 10.1186/1477-5956-11-S1-S14. Epub 2013 Nov 7. Proteome Sci. 2013. PMID: 24565028 Free PMC article.
-
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses.PLoS One. 2012;7(7):e40694. doi: 10.1371/journal.pone.0040694. Epub 2012 Jul 23. PLoS One. 2012. PMID: 22844408 Free PMC article.
-
seqlm: an MDL based method for identifying differentially methylated regions in high density methylation array data.Bioinformatics. 2016 Sep 1;32(17):2604-10. doi: 10.1093/bioinformatics/btw304. Epub 2016 May 13. Bioinformatics. 2016. PMID: 27187204 Free PMC article.
References
-
- Amanchy R, et al. A curated compendium of phosphorylation motifs. Nat. Biotechnol. 2007;25:285. - PubMed
-
- Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1994;2:28–36. - PubMed
-
- Bailey TL, Elkan C. The value of prior knowledge in discovering motifs with MEME. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1995;3:21–29. - PubMed
-
- Balla S, et al. Minimotif Miner: a tool for investigating protein function. Nat. Methods. 2006;3:175–177. - PubMed
-
- Blom N, et al. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics. 2004;4:1633–1649. - PubMed