Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 1;20(1):498-505.
doi: 10.1021/acs.jproteome.0c00544. Epub 2020 Dec 17.

Fast Deisotoping Algorithm and Its Implementation in the MSFragger Search Engine

Affiliations

Fast Deisotoping Algorithm and Its Implementation in the MSFragger Search Engine

Guo Ci Teo et al. J Proteome Res. .

Abstract

Deisotoping, or the process of removing peaks in a mass spectrum resulting from the incorporation of naturally occurring heavy isotopes, has long been used to reduce complexity and improve the effectiveness of spectral annotation methods in proteomics. We have previously described MSFragger, an ultrafast search engine for proteomics, that did not utilize deisotoping in processing input spectra. Here, we present a new, high-speed parallelized deisotoping algorithm, based on elements of several existing methods, that we have incorporated into the MSFragger search engine. Applying deisotoping with MSFragger reveals substantial improvements to database search speed and performance, particularly for complex methods like open or nonspecific searches. Finally, we evaluate our deisotoping method on data from several instrument types and vendors, revealing a wide range in performance and offering an updated perspective on deisotoping in the modern proteomics environment.

Keywords: MSFragger; deisotoping; nonspecific search; open search; preprocessing; proteomics; spectrum processing.

PubMed Disclaimer

Conflict of interest statement

Competing Interests Statement

The authors declare no competing financial interests.

Figures

Figure 1.
Figure 1.
Left: An overview of the algorithm. Right: Schematic representation of the deisotoping algorithm. The original raw spectrum is denoised by removing low intensity peaks. Then a list of possible peak clusters is generated based solely on the m/z distances. This list of clusters is then filtered down to a list of high confidence clusters by comparison with an averagine distribution and categorized by monoisotopic peak. High confidence clusters are then used to decharge fragments. Fragments with no associated clusters are decharged based on parameter set by the user.
Figure 2.
Figure 2.
Change in the number of PSMs obtained after 1% FDR filtering (blue) and search time (green) for various search methods with deisotoping relative to the same search without deisotoping. Blue bars show relative increase in PSMs obtained from closed, open, mass offset glycopeptide, and nonspecific neuropeptidome searches from left to right. Green bars show decrease in search time with deisotoping for the same searches.
Figure 3.
Figure 3.
Change in number of PSMs obtained after 1% FDR filtering and search time for open searches on data from various instruments with deisotoping relative to the same searches without deisotoping. Blue bars show relative increase in PSMs obtained from searches of Orbitrap Q Exactive (Chick et al.), Orbitrap Fusion Lumos (Espadas et al.), timsTOF (Meier et al.), and Sciex 6600+ (Ammar et al.) data from left to right. Green bars show decrease in search time required with deisotoping for the same searches.

Similar articles

Cited by

References

    1. Kong AT; Leprevost FV; Avtonomov DM; Mellacheruvu D; Nesvizhskii AI, MSFragger: Ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nature Methods 2017, 14 (5), 513–520. - PMC - PubMed
    1. Solntsev SK; Shortreed MR; Frey BL; Smith LM, Enhanced Global Post-translational Modification Discovery with MetaMorpheus. Journal of Proteome Research 2018, 17 (5), 1844–1851. - PubMed
    1. Chick JM; Kolippakkam D; Nusinow DP; Zhai B; Rad R; Huttlin EL; Gygi SP, A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides. Nature Biotechnology 2015, 33 (7), 743–749. - PMC - PubMed
    1. Chi H; Liu C; Yang H; Zeng WF; Wu L; Zhou WJ; Wang RM; Niu XN; Ding YH; Zhang Y; Wang ZW; Chen ZL; Sun RX; Liu T; Tan GM; Dong MQ; Xu P; Zhang PH; He SM, Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine. Nature Biotechnology 2018, 36 (11), 1059–1066. - PubMed
    1. Peng J; Zhang H; Niu H; Wu R. a., Peptidomic analyses: The progress in enrichment and identification of endogenous peptides. TrAC 2020, 125, 115835–115835.

Publication types