Protein identification by tandem mass spectrometry and sequence database searching
- PMID: 17185772
- DOI: 10.1385/1-59745-275-0:87
Abstract
The shotgun proteomics strategy, based on digesting proteins into peptides and sequencing them using tandem mass spectrometry (MS/MS), has become widely adopted. The identification of peptides from acquired MS/MS spectra is most often performed using the database search approach. We provide a detailed description of the peptide identification process and review the most commonly used database search programs. Appropriate choices of the search parameters and the sequence database are important for successful application of this method, and we provide general guidelines for carrying out efficient analysis of MS/MS data. We also discuss various reasons why database search tools fail to assign the correct sequence to many MS/MS spectra, and draw attention to the problem of false-positive identifications that can significantly diminish the value of published data. To assist in the evaluation of peptide assignments to MS/MS spectra, we review the scoring schemes implemented in the most frequently used database search tools. We also describe statistical approaches and computational tools for validating peptide assignments to MS/MS spectra, including the concept of expectation values, reversed database searching, and the empirical Bayesian analysis of PeptideProphet. Finally, the process of inferring the identities of the sample proteins given the list of peptide identifications is outlined, and the limitations of shotgun proteomics with regard to discrimination between protein isoforms are discussed.
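The reversed database searching mentioned above can be illustrated with a minimal sketch. It assumes each peptide-spectrum match carries a score where higher is better and a flag marking whether it matched a reversed (decoy) sequence; the function names, the toy scores, and the decoys-over-targets estimate are illustrative assumptions, not the chapter's own implementation or the behavior of any particular search tool.

```python
from typing import List, Tuple


def reverse_sequence(seq: str) -> str:
    """Build a decoy entry by reversing a protein sequence (hypothetical helper)."""
    return seq[::-1]


def estimate_fdr(psms: List[Tuple[float, bool]], threshold: float) -> float:
    """Estimate the false discovery rate among target matches at a score threshold.

    psms: (score, is_decoy) pairs; higher score is assumed to be better.
    Assumption: decoy hits above the threshold approximate the number of
    incorrect target hits, so FDR is estimated as decoys / targets.
    """
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    if targets == 0:
        return 0.0  # no accepted target matches, nothing to estimate
    return decoys / targets


# Toy data: invented scores, for illustration only.
example_psms = [
    (5.0, False),
    (4.5, False),
    (4.0, True),
    (3.5, False),
    (3.0, False),
    (2.0, True),
]

decoy = reverse_sequence("PEPTIDEK")
fdr_loose = estimate_fdr(example_psms, 3.0)   # 4 targets, 1 decoy accepted
fdr_strict = estimate_fdr(example_psms, 4.2)  # 2 targets, 0 decoys accepted
```

Raising the threshold trades sensitivity for a lower estimated error rate, which is the kind of trade-off that validation of peptide assignments has to manage.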