MS2PIP: a tool for MS/MS peak intensity prediction
- PMID: 24078703
- PMCID: PMC5994937
- DOI: 10.1093/bioinformatics/btt544
MS2PIP: a tool for MS/MS peak intensity prediction
Abstract
Motivation: Tandem mass spectrometry provides the means to match mass spectrometry signal observations with the chemical entities that generated them. The technology produces signal spectra that contain information about the chemical dissociation pattern of a peptide that was forced to fragment using methods like collision-induced dissociation. The ability to predict these MS(2) signals and to understand this fragmentation process is important for sensitive high-throughput proteomics research.
Results: We present a new tool called MS(2)PIP for predicting the intensity of the most important fragment ion signal peaks from a peptide sequence. MS(2)PIP pre-processes a large dataset with confident peptide-to-spectrum matches to facilitate data-driven model induction using a random forest regression learning algorithm. The intensity predictions of MS(2)PIP were evaluated on several independent evaluation sets and found to correlate significantly better with the observed fragment-ion intensities as compared with the current state-of-the-art PeptideART tool.
Availability: MS(2)PIP code is available for both training and predicting at http://compomics.com/.
Figures
References
-
- Arnold RJ, et al. Pacific Symposium on Biocomputing. 2006. A machine learning approach to predicting peptide fragmentation spectra; pp. 219–230. - PubMed
-
- Barton SJ, Whittaker JC. Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors. Mass Spectrom. Rev. 2009;28:177–187. - PubMed
-
- Breiman L. Random Forests. Machine Learning. 2001;45:5–32.
-
- Craig R, Beavis RC. TANDEM: matching proteins with tandem mass spectra. Bioinformatics. 2004;20:1466–1467. - PubMed
-
- Degroeve S, et al. A reproducibility-based evaluation procedure for quantifying the differences between MS/MS peak intensity normalization methods. Proteomics. 2011;11:1172–1180. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
