Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Dec;7(12):2386-98.
doi: 10.1074/mcp.M800021-MCP200. Epub 2008 Jul 24.

In-depth analysis of tandem mass spectrometry data from disparate instrument types

Affiliations

In-depth analysis of tandem mass spectrometry data from disparate instrument types

Robert J Chalkley et al. Mol Cell Proteomics. 2008 Dec.

Abstract

Mass spectrometric analyses of protein digests produce large numbers of fragmentation spectra that are not identified by routine database searching strategies. Some of these spectra could be identified by development of improved search engines. However, many of these spectra represent fragmentation of peptide components bearing modifications that are not routinely considered in database searches. Here we present new software within Protein Prospector that allows comprehensive analysis of data sets by analyzing the data at increasing levels of depth. Analysis of published data sets is presented to illustrate that the software is not biased to any instrument types. The results show that these data sets contain many modified peptides. As well as searching for known modification types, Protein Prospector permits the detection and identification of unexpected or novel modifications by searching for any mass shift within a user-specified mass range to any chosen amino acid(s). Several modifications never previously reported in proteomics data were identified in these standard data sets using this mass modification searching approach.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.
Fig. 1.
Receiver-operating characteristic curves for Protein Prospector (PP) searches of the different instrument data. The predicted number of incorrect answers is derived by doubling the number of matches to the decoy part of the database in the concatenated database search. The data points for the Sequest results are derived from the average number of peptides reported in the publication for each instrument with the number of predicted incorrect results indicated assuming a 2.5% FDR as reported in the publication.
F<sc>ig</sc>. 2.
Fig. 2.
Venn diagram showing the levels of overlap between Protein Prospector (PP) and Inspect results, both without any confidence level threshold and while applying thresholds of E-value < 0. 1 and p < 0.1.
F<sc>ig</sc>. 3.
Fig. 3.
Duplication of peak lists for different charge states causes false positive identifications of the wrong charge state peak list to homologous peptides to the correct answer. Both of the matches presented are derived from the same mass spectrum. The match in a is assuming a 2+ precursor charge state; the match in b is assuming a 3+ precursor charge state.
F<sc>ig</sc>. 4.
Fig. 4.
Different charge state assignments to the same precursor can also lead to false positive matches in searches with defined variable mass modifications. a and b show the matches to 2+ and 3+ precursor charge state peak lists of the spectrum that was acquired at 34.1304 min.
F<sc>ig</sc>. 5.
Fig. 5.
Histogram of mass modifications reported by Protein Prospector when analyzing the QSTAR standard data set.
F<sc>ig</sc>. 6.
Fig. 6.
Identification of the peak list 1975. A precursor of m/z 854.931 was fragmented, and Protein Prospector reported a confident assignment to the peptide VPTPNVSVVDLTCR with a modification of +209 Da on the cysteine residue (a). Inspect reported a match to a homologous peptide, VPTPDVSVVDTVKLAK, with a modification of a loss of 71 Da from the C-terminal lysine (b). Plots in the top right corners of each panel show mass errors for fragment ion assignments. The intensity of the precursor ion in these figures has been artificially reduced to allow easier visualization of the fragment ions.
F<sc>ig</sc>. 7.
Fig. 7.
Identification of the peak list 2465 to the peptide VLDALDSIK with a mass modification of +26 Da on the peptide N terminus. This modification corresponds to an acetaldehyde Schiff base modification to the N-terminal amine group.
F<sc>ig</sc>. 8.
Fig. 8.
Identification of peak list 1480 as VVDLMVHMASK with a modification of 41 Da somewhere on the peptide that is subsequently lost as a neutral group and therefore is not present on any of the fragment ions. The plot in the top right corner shows the mass errors for fragment ion assignments.

References

    1. Deutsch, E. W., Lam, H., and Aebersold, R. ( 2008) Data analysis and bioinformatics tools for tandem mass spectrometry in proteomics. Physiol. Genomics 33, 18–25 - PubMed
    1. Perkins, D. N., Pappin, D. J., Creasy, D. M., and Cottrell, J. S. ( 1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551–3567 - PubMed
    1. Craig, R., and Beavis, R. C. ( 2004) TANDEM: matching proteins with tandem mass spectra. Bioinformatics 20, 1466–1467 - PubMed
    1. Klimek, J., Eddes, J. S., Hohmann, L., Jackson, J., Peterson, A., Letarte, S., Gafken, P. R., Katz, J. E., Mallick, P., Lee, H., Schmidt, A., Ossola, R., Eng, J. K., Aebersold, R., and Martin, D. B. ( 2008) The standard protein mix database: a diverse data set to assist in the production of improved peptide and protein identification software tools. J. Proteome Res. 7, 96–103 - PMC - PubMed
    1. Elias, J. E., and Gygi, S. P. ( 2007) Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods 4, 207–214 - PubMed

Publication types