Artificial neural networks for the prediction of peptide drift time in ion mobility mass spectrometry

Bing Wang¹, Steve Valentine, Manolo Plasencia, Sriram Raghuraman, Xiang Zhang

PMID: 20380738
PMCID: PMC2874804
DOI: 10.1186/1471-2105-11-182

Bing Wang et al. BMC Bioinformatics. 2010 Apr 11;11:182. doi: 10.1186/1471-2105-11-182.

Affiliation

¹ Department of Electronics and Information Engineering, Anhui University of Technology, Ma'anshan, 243002, China. wangbing@ustc.edu

Abstract

Background: Ion mobility-mass spectrometry (IMMS) is increasingly used in proteomics. IMMS combines the features of ion mobility spectrometry (IMS) and mass spectrometry (MS), separating and detecting peptide ions on a millisecond time scale. IMS separates peptide ions by drift time, which is determined by the collision cross-section of each peptide ion under given experimental conditions. A peptide ion's collision cross-section depends on the size and shape of the ion, which result from the peptide's amino acid sequence and its modifications. This inherent relationship between drift time and peptide sequence indicates that the drift time of a peptide ion can be used to infer peptide sequence and can therefore be used for peptide identification.

Results: This paper describes an artificial neural network (ANN) regression model for predicting peptide ion drift time in IMMS. Each peptide was represented by three descriptors: molecular weight, sequence length and a two-dimensional sequence index. An ANN predictor consisting of four input nodes, three hidden nodes and one output node was constructed for drift time prediction. For model training and testing, a 10-fold cross-validation strategy was applied to three datasets, each containing peptide ions of a different charge state: dataset one contains 212 singly-charged peptide ions, dataset two contains 306 doubly-charged peptide ions, and dataset three contains 77 triply-charged peptide ions. The proposed method achieved prediction accuracies of 94.4%, 93.6% and 74.2% for singly-, doubly- and triply-charged peptide ions, respectively.
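The 10-fold cross-validation described above can be sketched in Python as follows. This is a minimal illustration of the splitting step only, not the authors' code: the function names, the shuffling seed and the round-robin fold assignment are assumptions, and only the dataset sizes (212, 306 and 77) come from the abstract.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Split range(n_samples) into k shuffled folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at position i,
    # so fold sizes differ by at most one.
    return [indices[i::k] for i in range(k)]


def train_test_splits(n_samples, k=10, seed=0):
    """Yield (train, test) index lists; each fold is the test set exactly once."""
    folds = k_fold_indices(n_samples, k, seed)
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Dataset sizes reported in the abstract (one dataset per charge state).
for name, n in [("singly-charged", 212), ("doubly-charged", 306), ("triply-charged", 77)]:
    test_sizes = [len(test) for _, test in train_test_splits(n)]
    print(name, test_sizes)
```

With only 77 triply-charged peptide ions, each test fold holds seven or eight peptides, so the estimate for that dataset rests on far fewer examples per fold than the other two.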

Conclusions: An ANN-based method has been developed for predicting the drift time of peptide ions in IMMS. The results demonstrate the effectiveness and efficiency of the prediction model. Combined with current database search approaches, this work can enhance the confidence of protein identification.

Figures

**Figure 1**
**Box plots of peptide molecular weight (A), sequence length (B) and drift time distribution (C) in the three datasets**. The central mark is the median, the edges of the box are the 25th and 75th percentiles, and the whiskers extend to the most extreme data points that are not outliers. Points marked with crosses are outliers: values larger than Q3 + 1.5 × (Q3 - Q1) or smaller than Q1 - 1.5 × (Q3 - Q1), where Q1 and Q3 are the 25th and 75th percentiles, respectively.
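The outlier rule in this caption can be sketched in Python as follows. The quartile convention (`statistics.quantiles` with `method="inclusive"`) is an assumption, since plotting tools compute percentiles in slightly different ways, and the sample values are made up for illustration.

```python
import statistics


def tukey_outliers(values):
    """Return values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR]."""
    # quantiles(..., n=4) returns the three cut points Q1, median, Q3.
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical values, not taken from the paper's datasets.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
print(tukey_outliers(sample))
```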

**Figure 2**
**The fraction of peptides vs. prediction accuracy variation threshold during the model construction process using the training dataset**. The diagram shows the fraction of peptides that can be predicted at different accuracy variation levels.

**Figure 3**
**Relationship between the observed and predicted drift times for the three peptide datasets**. Subfigures A, B and C are the regression results of our proposed ANN model for singly-, doubly-, and triply-charged peptide ions, respectively. The linear function in each subfigure is obtained by fitting the predicted results to the observed drift times, and the line is the corresponding fitted line. The correlation coefficients between observed and predicted peptide ion drift times are also shown in each subfigure.
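A linear fit and correlation coefficient of the kind this caption describes can be computed with ordinary least squares, as in the sketch below. It assumes a Pearson correlation coefficient and a simple least-squares line with observed drift time on the x-axis; the caption does not state which fitting routine was used, and the numbers in the example are invented.

```python
import math


def linear_fit(observed, predicted):
    """Least-squares line predicted = slope * observed + intercept, plus Pearson r."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(observed) / n
    mean_y = sum(predicted) / n
    sxx = sum((x - mean_x) ** 2 for x in observed)
    syy = sum((y - mean_y) ** 2 for y in predicted)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(observed, predicted))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# Hypothetical drift times, for illustration only.
slope, intercept, r = linear_fit([10.0, 12.0, 15.0, 19.0], [10.4, 11.7, 15.3, 18.6])
print(f"slope={slope:.3f} intercept={intercept:.3f} r={r:.3f}")
```

A slope near 1 and an intercept near 0 would indicate that predictions track the observed values without systematic bias; the correlation coefficient alone does not show that.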

**Figure 4**
**The fraction of predicted peptides vs. prediction accuracy variation threshold on the testing data**.
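The curves in Figures 2 and 4 plot, for each threshold, the fraction of peptides predicted within that threshold. The sketch below shows one way to compute such a curve. It assumes that "prediction accuracy variation" means the relative error |predicted - observed| / observed, which this page does not define, and the drift times are hypothetical.

```python
def fraction_within(observed, predicted, threshold):
    """Fraction of peptides whose relative error is at most `threshold`."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(
        1
        for obs, pred in zip(observed, predicted)
        if abs(pred - obs) / obs <= threshold  # assumed error definition
    )
    return hits / len(observed)


# Hypothetical observed and predicted drift times.
observed = [10.0, 20.0, 30.0, 40.0]
predicted = [10.2, 19.0, 36.0, 40.0]
for threshold in (0.01, 0.03, 0.10, 0.25):
    print(threshold, fraction_within(observed, predicted, threshold))
```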

**Figure 5**
**A typical three-layer neural network architecture**.
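The forward pass of a three-layer network with the 4-3-1 layout given in the abstract can be sketched as below. The sigmoid hidden activation, the linear output node, the random weights and the input values are all assumptions for illustration; none of them are the trained parameters or confirmed settings of the published model.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(x, w_hidden, b_hidden, w_out, b_out):
    """Input layer -> sigmoid hidden layer -> single linear output node."""
    hidden = [
        sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w_hidden, b_hidden)
    ]
    return sum(w * h for w, h in zip(w_out, hidden)) + b_out


rng = random.Random(0)
# 3 hidden nodes, each with 4 input weights (placeholder values, not trained).
w_hidden = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
b_hidden = [0.0, 0.0, 0.0]
w_out = [rng.uniform(-1, 1) for _ in range(3)]
b_out = 0.0

# Hypothetical normalized inputs, assumed to be molecular weight, sequence
# length and the two components of the sequence index.
x = [0.5, 0.2, 0.1, 0.9]
print(forward(x, w_hidden, b_hidden, w_out, b_out))
```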
