Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2011 Nov;10(11):R111.009522.
doi: 10.1074/mcp.R111.009522. Epub 2011 Aug 29.

A face in the crowd: recognizing peptides through database search

Affiliations
Review

A face in the crowd: recognizing peptides through database search

Jimmy K Eng et al. Mol Cell Proteomics. 2011 Nov.

Abstract

Peptide identification via tandem mass spectrometry sequence database searching is a key method in the array of tools available to the proteomics researcher. The ability to rapidly and sensitively acquire tandem mass spectrometry data and perform peptide and protein identifications has become a commonly used proteomics analysis technique because of advances in both instrumentation and software. Although many different tandem mass spectrometry database search tools are currently available from both academic and commercial sources, these algorithms share similar core elements while maintaining distinctive features. This review revisits the mechanism of sequence database searching and discusses how various parameter settings impact the underlying search.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
A snapshot of MS/MS data acquisition. A schematic showing the relationship between MS scans (blue) and MS/MS scans (red) in a typical tandem MS experiment. In this example, a precursor ion in each MS scan is isolated and fragmented, resulting in the MS/MS spectra that alternate every scan with the MS spectra. Typical data acquisition would acquire multiple MS/MS scans for each MS scan. In addition to the fragmentation pattern from each MS/MS spectrum used in a database search, the precursor ion's m/z, charge state, and mass accuracy of the measured precursor m/z are obtained from the MS spectrum.
Fig. 2.
Fig. 2.
Data acquisition and database search. For a given experimental MS/MS spectrum, protein sequences from a database are in silico digested and peptides of the right mass are selected. Theoretical fragment ions from each candidate peptide are calculated and used to generate a similarity or probability score by comparing the theoretical fragment ion masses against the experimental spectrum. Each candidate peptide is scored against the experimental spectrum and the best matching peptides and their scores are reported.

Similar articles

Cited by

References

    1. Mann M., Wilm M. (1994) Error-tolerant identification of peptides in sequence databases by peptide sequence tags. Anal. Chem. 66, 4390–4399 - PubMed
    1. Tanner S., Shu H., Frank A., Wang L. C., Zandi E., Mumby M., Pevzner P. A., Bafna V. (2005) InsPecT: identification of posttranslationally modified peptides from tandem mass spectra. Anal. Chem. 77, 4626–4639 - PubMed
    1. Liu C., Yan B., Song Y., Xu Y., Cai L. (2006) Peptide sequence tag-based blind identification of post-translational modifications with point process model. Bioinformatics 22, e307–313 - PubMed
    1. Tabb D. L., Ma Z. Q., Martin D. B., Ham A. J., Chambers M. C. (2008) DirecTag: accurate sequence tags from peptide MS/MS through statistical scoring. J. Proteome Res. 7, 3838–3846 - PMC - PubMed
    1. Eng J. K., McCormack A. L., Yates J. R., 3rd (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectr. 5, 976–989 - PubMed

Publication types

LinkOut - more resources