Proteomics-grade de novo sequencing approach
- PMID: 16335984
- DOI: 10.1021/pr050288x
Proteomics-grade de novo sequencing approach
Abstract
The conventional approach in modern proteomics to identify proteins from limited information provided by molecular and fragment masses of their enzymatic degradation products carries an inherent risk of both false positive and false negative identifications. For reliable identification of even known proteins, complete de novo sequencing of their peptides is desired. The main problems of conventional sequencing based on tandem mass spectrometry are incomplete backbone fragmentation and the frequent overlap of fragment masses. In this work, the first proteomics-grade de novo approach is presented, where the above problems are alleviated by the use of complementary fragmentation techniques CAD and ECD. Implementation of a high-current, large-area dispenser cathode as a source of low-energy electrons provided efficient ECD of doubly charged peptides, the most abundant species (65-80%), in a typical trypsin-based proteomics experiment. A new linear de novo algorithm is developed combining efficiency and speed, processing on a conventional 3 GHz PC, 1000 MS/MS data sets in 60 s. More than 6% of all MS/MS data for doubly charged peptides yielded complete sequences, and another 13% gave nearly complete sequences with a maximum gap of two amino acid residues. These figures are comparable with the typical success rates (5-15%) of database identification. For peptides reliably found in the database (Mowse score > or = 34), the agreement with de novo-derived full sequences was >95%. Full sequences were derived in 67% of the cases when full sequence information was present in MS/MS spectra. Thus the new de novo sequencing approach reached the same level of efficiency and reliability as conventional database-identification strategies.
Similar articles
-
De novo sequencing methods in proteomics.Methods Mol Biol. 2010;604:105-21. doi: 10.1007/978-1-60761-444-9_8. Methods Mol Biol. 2010. PMID: 20013367
-
High-throughput identification of proteins and unanticipated sequence modifications using a mass-based alignment algorithm for MS/MS de novo sequencing results.Anal Chem. 2004 Apr 15;76(8):2220-30. doi: 10.1021/ac035258x. Anal Chem. 2004. PMID: 15080731
-
pNovo: de novo peptide sequencing and identification using HCD spectra.J Proteome Res. 2010 May 7;9(5):2713-24. doi: 10.1021/pr100182k. J Proteome Res. 2010. PMID: 20329752
-
De novo sequencing of peptides by MS/MS.Proteomics. 2010 Feb;10(4):634-49. doi: 10.1002/pmic.200900459. Proteomics. 2010. PMID: 19953542 Review.
-
Algorithms for the de novo sequencing of peptides from tandem mass spectra.Expert Rev Proteomics. 2011 Oct;8(5):645-57. doi: 10.1586/epr.11.54. Expert Rev Proteomics. 2011. PMID: 21999834 Review.
Cited by
-
Lessons in de novo peptide sequencing by tandem mass spectrometry.Mass Spectrom Rev. 2015 Jan-Feb;34(1):43-63. doi: 10.1002/mas.21406. Mass Spectrom Rev. 2015. PMID: 25667941 Free PMC article. Review.
-
Sequencing-grade de novo analysis of MS/MS triplets (CID/HCD/ETD) from overlapping peptides.J Proteome Res. 2013 Jun 7;12(6):2846-57. doi: 10.1021/pr400173d. Epub 2013 May 30. J Proteome Res. 2013. PMID: 23679345 Free PMC article.
-
Proteotranscriptomic Analysis and Discovery of the Profile and Diversity of Toxin-like Proteins in Centipede.Mol Cell Proteomics. 2018 Apr;17(4):709-720. doi: 10.1074/mcp.RA117.000431. Epub 2018 Jan 16. Mol Cell Proteomics. 2018. PMID: 29339413 Free PMC article.
-
Analysis of tandem mass spectra by FTMS for improved large-scale proteomics with superior protein quantification.Anal Chem. 2010 Jan 1;82(1):316-22. doi: 10.1021/ac902005s. Anal Chem. 2010. PMID: 19938823 Free PMC article.
-
Improved Protein and PTM Characterization with a Practical Electron-Based Fragmentation on Q-TOF Instruments.J Am Soc Mass Spectrom. 2021 Aug 4;32(8):2081-2091. doi: 10.1021/jasms.0c00482. Epub 2021 Apr 29. J Am Soc Mass Spectrom. 2021. PMID: 33914527 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous