A learned embedding for efficient joint analysis of millions of mass spectra
- PMID: 35637305
- PMCID: PMC9189069
- DOI: 10.1038/s41592-022-01496-1
A learned embedding for efficient joint analysis of millions of mass spectra
Abstract
Computational methods that aim to exploit publicly available mass spectrometry repositories rely primarily on unsupervised clustering of spectra. Here we trained a deep neural network in a supervised fashion on the basis of previous assignments of peptides to spectra. The network, called 'GLEAMS', learns to embed spectra in a low-dimensional space in which spectra generated by the same peptide are close to one another. We applied GLEAMS for large-scale spectrum clustering, detecting groups of unidentified, proximal spectra representing the same peptide. We used these clusters to explore the dark proteome of repeatedly observed yet consistently unidentified mass spectra.
© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.
Figures
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
