A universal language for finding mass spectrometry data patterns
- PMID: 40355727
- PMCID: PMC12334354
- DOI: 10.1038/s41592-025-02660-z
A universal language for finding mass spectrometry data patterns
Erratum in
-
Author Correction: A universal language for finding mass spectrometry data patterns.Nat Methods. 2025 Sep;22(9):1995. doi: 10.1038/s41592-025-02785-1. Nat Methods. 2025. PMID: 40781363 Free PMC article. No abstract available.
Abstract
Despite being information rich, the vast majority of untargeted mass spectrometry data are underutilized; most analytes are not used for downstream interpretation or reanalysis after publication. The inability to dive into these rich raw mass spectrometry datasets is due to the limited flexibility and scalability of existing software tools. Here we introduce a new language, the Mass Spectrometry Query Language (MassQL), and an accompanying software ecosystem that addresses these issues by enabling the community to directly query mass spectrometry data with an expressive set of user-defined mass spectrometry patterns. Illustrated by real-world examples, MassQL provides a data-driven definition of chemical diversity by enabling the reanalysis of all public untargeted metabolomics data, empowering scientists across many disciplines to make new discoveries. MassQL has been widely implemented in multiple open-source and commercial mass spectrometry analysis tools, which enhances the ability, interoperability and reproducibility of mining of mass spectrometry data for the research community.
© 2025. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
Competing interests: P.C.D. is an advisor to Cybele and a co-founder and scientific advisor to Ometa and Enveda with prior approval by UC San Diego. M.W. is a co-founder of Ometa Labs LLC. R.S., S.H. and T.P. are co-founders of mzio GmbH. T.R.N. is an advisor of Brightseed Bio. J.J.J.v.d.H. is a member of the Scientific Advisory Board of NAICONS Srl., Milano, Italy, and is consulting for Corteva Agriscience, Indianapolis, IN, USA. O.K. and T.S. are officers in OpenMS Inc., a non-profit foundation that manages the international coordination of OpenMS development. The other authors declare no competing interests.
Figures
References
-
- Stein, S. E. & Scott, D. R. Optimization and testing of mass spectral library search algorithms for compound identification. J. Am. Soc. Mass. Spectrom.5, 859–866 (1994). - PubMed
-
- Baars, O., Morel, F. M. M. & Perlman, D. H. ChelomEx: isotope-assisted discovery of metal chelates in complex media using high-resolution LC–MS. Anal. Chem.86, 11298–11305 (2014). - PubMed
-
- Huber, F. et al. matchms—processing and similarity evaluation of mass spectrometry data. J. Open Source Softw.5, 2411 (2020).
-
- Matsuda, F. Regular expressions of MS/MS spectra for partial annotation of metabolite features. Metabolomics12, 113 (2016).
MeSH terms
Grants and funding
- R35 GM155026/GM/NIGMS NIH HHS/United States
- R01 GM107550/GM/NIGMS NIH HHS/United States
- R01 GM155383/GM/NIGMS NIH HHS/United States
- U24 DK141185/DK/NIDDK NIH HHS/United States
- R35 GM128690/GM/NIGMS NIH HHS/United States
- R21 AI156669/AI/NIAID NIH HHS/United States
- R35 GM146934/GM/NIGMS NIH HHS/United States
- R01 GM125943/GM/NIGMS NIH HHS/United States
- R03 OD034493/OD/NIH HHS/United States
- R15 AI137996/AI/NIAID NIH HHS/United States
- U2C DK119886/DK/NIDDK NIH HHS/United States
- U24 DK133658/DK/NIDDK NIH HHS/United States
