Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec 15;29(24):3199-203.
doi: 10.1093/bioinformatics/btt544. Epub 2013 Sep 27.

MS2PIP: a tool for MS/MS peak intensity prediction

Affiliations

MS2PIP: a tool for MS/MS peak intensity prediction

Sven Degroeve et al. Bioinformatics. .

Abstract

Motivation: Tandem mass spectrometry provides the means to match mass spectrometry signal observations with the chemical entities that generated them. The technology produces signal spectra that contain information about the chemical dissociation pattern of a peptide that was forced to fragment using methods like collision-induced dissociation. The ability to predict these MS(2) signals and to understand this fragmentation process is important for sensitive high-throughput proteomics research.

Results: We present a new tool called MS(2)PIP for predicting the intensity of the most important fragment ion signal peaks from a peptide sequence. MS(2)PIP pre-processes a large dataset with confident peptide-to-spectrum matches to facilitate data-driven model induction using a random forest regression learning algorithm. The intensity predictions of MS(2)PIP were evaluated on several independent evaluation sets and found to correlate significantly better with the observed fragment-ion intensities as compared with the current state-of-the-art PeptideART tool.

Availability: MS(2)PIP code is available for both training and predicting at http://compomics.com/.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
The distribution of the oob R2 prediction performance results for the regression tasks Dclf, with f{b,y}
Fig. 2.
Fig. 2.
The distribution of the PCC values computed from the b and y ion types (set1) for the evaluation datasets lab1, lab2, lab3, sample1 and sample2

References

    1. Arnold RJ, et al. Pacific Symposium on Biocomputing. 2006. A machine learning approach to predicting peptide fragmentation spectra; pp. 219–230. - PubMed
    1. Barton SJ, Whittaker JC. Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors. Mass Spectrom. Rev. 2009;28:177–187. - PubMed
    1. Breiman L. Random Forests. Machine Learning. 2001;45:5–32.
    1. Craig R, Beavis RC. TANDEM: matching proteins with tandem mass spectra. Bioinformatics. 2004;20:1466–1467. - PubMed
    1. Degroeve S, et al. A reproducibility-based evaluation procedure for quantifying the differences between MS/MS peak intensity normalization methods. Proteomics. 2011;11:1172–1180. - PubMed

Publication types