Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 May;10(5):M110.000455.
doi: 10.1074/mcp.M110.000455. Epub 2011 Feb 14.

DeltAMT: a statistical algorithm for fast detection of protein modifications from LC-MS/MS data

Affiliations

DeltAMT: a statistical algorithm for fast detection of protein modifications from LC-MS/MS data

Yan Fu et al. Mol Cell Proteomics. 2011 May.

Abstract

Identification of proteins and their modifications via liquid chromatography-tandem mass spectrometry is an important task for the field of proteomics. However, because of the complexity of tandem mass spectra, the majority of the spectra cannot be identified. The presence of unanticipated protein modifications is among the major reasons for the low spectral identification rate. The conventional database search approach to protein identification has inherent difficulties in comprehensive detection of protein modifications. In recent years, increasing efforts have been devoted to developing unrestrictive approaches to modification identification, but they often suffer from their lack of speed. This paper presents a statistical algorithm named DeltAMT (Delta Accurate Mass and Time) for fast detection of abundant protein modifications from tandem mass spectra with high-accuracy precursor masses. The algorithm is based on the fact that the modified and unmodified versions of a peptide are usually present simultaneously in a sample and their spectra are correlated with each other in precursor masses and retention times. By representing each pair of spectra as a delta mass and time vector, bivariate Gaussian mixture models are used to detect modification-related spectral pairs. Unlike previous approaches to unrestrictive modification identification that mainly rely upon the fragment information and the mass dimension in liquid chromatography-tandem mass spectrometry, the proposed algorithm makes the most of precursor information. Thus, it is highly efficient while being accurate and sensitive. On two published data sets, the algorithm effectively detected various modifications and other interesting events, yielding deep insights into the data. Based on these discoveries, the spectral identification rates were significantly increased and many modified peptides were identified.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Flowchart of the DeltAMT algorithm (dotted lines indicate optional steps).
Fig. 2.
Fig. 2.
An example of the distribution of Δ obtained from the ISB standard protein mix data set. In this example, three distribution components were automatically detected by the DeltAMT algorithm. One of them is the random distribution marked by the large dashed square, and the other two are modification-induced distributions marked by the small solid squares. The two modifications are oxidation (right) and the (calcium - sodium) subtractive pseudo-modification (left). These two modifications are well discriminated from each other by the mass and time dimensions.
Fig. 3.
Fig. 3.
An example of the distribution of Δm. Within each potential modification mass interval, random Δm is assumed to come from a Gaussian distribution. Those mass intervals that contain Δm values of unexpectedly high frequencies (peaks marked by stars) are selected and subjected to two-dimensional (mass and time) distribution fitting for modification detection. In this example, the highest peak corresponds to a real modification (acetaldehyde), and the other three are random signals.
Fig. 4.
Fig. 4.
Scatter-histogram of observed Δ data points around the nominal mass value of 38 Da for the ISB standard protein mix data set. The dense data cluster in the small square, which was automatically located by the DeltAMT algorithm, was induced by the calcium adduct formation.
Fig. 5.
Fig. 5.
Modification mass shifts detected by the MS-Alignment algorithm and the corresponding numbers of identified spectra. Those mass shifts with more than three identified spectra are annotated with the nominal mass shift values and modification names.

References

    1. Aebersold R., Mann M. (2003) Mass spectrometry-based proteomics. Nature 422, 198–207 - PubMed
    1. Mann M., Jensen O. N. (2003) Proteomic analysis of post-translational modifications. Nat. Biotechnol. 21, 255–261 - PubMed
    1. Witze E. S., Old W. M., Resing K. A., Ahn N. G. (2007) Mapping protein post-translational modifications with mass spectrometry. Nat. Methods 4, 798–806 - PubMed
    1. Eng J. K., McCormack A. L., Yates J. R., III (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass. Spectrom. 5, 976–989 - PubMed
    1. Perkins D. N., Pappin D. J., Creasy D. M., Cottrell J. S. (1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551–3567 - PubMed

Publication types

LinkOut - more resources