Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jan 20;106(3):761-6.
doi: 10.1073/pnas.0811739106. Epub 2009 Jan 9.

PTMap--a sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites

Affiliations

PTMap--a sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites

Yue Chen et al. Proc Natl Acad Sci U S A. .

Abstract

We present sequence alignment software, called PTMap, for the accurate identification of full-spectrum protein post-translational modifications (PTMs) and polymorphisms. The software incorporates several features to improve searching speed and accuracy, including peak selection, adjustment of inaccurate mass shifts, and precise localization of PTM sites. PTMap also automates rules, based mainly on unmatched peaks, for manual verification of identified peptides. To evaluate the quality of sequence alignment, we developed a scoring system that takes into account both matched and unmatched peaks in the mass spectrum. Incorporation of these features dramatically increased both accuracy and sensitivity of the peptide- and PTM-identifications. To our knowledge, PTMap is the first algorithm that emphasizes unmatched peaks to eliminate false positives. The superior performance and reliability of PTMap were demonstrated by confident identification of PTMs on 156 peptides from four proteins and validated by MS/MS of the synthetic peptides. Our results demonstrate that PTMap is a powerful algorithm capable of identification of all possible protein PTMs with high confidence.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Flow chart of the PTMap algorithm.
Fig. 2.
Fig. 2.
Evaluation of the PTMap score distribution. (A) PTMap score distributions (SUnmatched = 4.0:10.0 in high and low mass ranges respectively) of unmodified peptides identified from normal and scrambled protein sequences of the four selected proteins—histone H4, SGK1, HMG2 and BSA—using PTMap (155 peptides generated from normal database and 15 peptides generated from scrambled database); (B) PTMap score distributions (SUnmatched = 4.0:10.0) of both unmodified and modified peptides identified from normal and scrambled of the same four protein sequences using PTMap (1,478 peptides generated from normal database and 917 peptides generated from scrambled database); (C) Correlation of Mascot scores with PTMap scores (SUnmatched = 4.0:10.0) for identification of unmodified peptides. The highest Mascot score and PTMap score of each peptide were used. Those peptides identified with Mascot only (PTMap score = 0) were found to be false positives by manual verification methods (21).
Fig. 3.
Fig. 3.
Evaluation of the two PTMap strategies: peak selection and automatic mass shift adjustment. (A) The number of unmodified peptides (PTMap score cutoff = 0.5) identified with or without incorporation of the peak-selection function in PTMap; (B) The distribution of the mass changes (ΔMafter − ΔMbefore) made by PTMap after automatic mass-shift adjustment for all identified spectra bearing PTMs (PTMap score cutoff = 1.0); (C) The number of modified peptides identified with or without automatic mass-shift adjustment strategy in PTMap (PTMap score cutoff = 1.0);(D) scatter plot showing the distribution of the mass errors of the spectra that identified unmodified peptides in the four proteins (PTMap score cutoff = 0.5)
Fig. 4.
Fig. 4.
Evaluation of the PTMap strategy for exclusive site localization. (A) A strategy for precise mapping of a PTM site that will distinguish two peptide isoforms that are modified on adjacent sites; (B) The number of modified peptides identified by PTMap before and after the implementation of exclusive site localization.

References

    1. Witze ES, Old WM, Resing KA, Ahn NG. Mapping protein post-translational modifications with mass spectrometry. Nat Methods. 2007;4:798–806. - PubMed
    1. Jensen ON. Modification-specific proteomics: Characterization of post-translational modifications by mass spectrometry. Curr Opin Chem Biol. 2004;8:33–41. - PubMed
    1. Sadygov RG, Cociorva D, Yates JR., 3rd Large-scale database searching using tandem mass spectra: Looking up the answer in the back of the book. Nat Methods. 2004;1:195–202. - PubMed
    1. Eng JK, McCormack AL, Yates JR., 3rd An Approach to Correlate Tandem Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database. J Am Soc Mass Spectrom. 1994;5:976–989. - PubMed
    1. Yates JR, 3rd, Eng JK, McCormack AL. Mining genomes: Correlating tandem mass spectra of modified and unmodified peptides to sequences in nucleotide databases. Anal Chem. 1995;67:3202–3210. - PubMed

Publication types