Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 5;97(30):16110-16122.
doi: 10.1021/acs.analchem.4c05577. Epub 2025 Jul 22.

Multilaboratory Untargeted Mass Spectrometry Metabolomics Collaboration to Identify Bottlenecks and Comprehensively Annotate A Single Dataset

Affiliations

Multilaboratory Untargeted Mass Spectrometry Metabolomics Collaboration to Identify Bottlenecks and Comprehensively Annotate A Single Dataset

Joelle Houriet et al. Anal Chem. .

Abstract

Annotation is the process of assigning features in mass spectrometry metabolomics data sets to putative chemical structures or "analytes." The purpose of this study was to identify challenges in the annotation of untargeted mass spectrometry metabolomics datasets and suggest strategies to overcome them. Toward this goal, we analyzed an extract of the plant ashwagandha (Withania somnifera) using liquid chromatography-mass spectrometry on two different platforms (an Orbitrap and Q-ToF) with various acquisition modes. The resulting 12 datasets were shared with ten teams that had established expertise in metabolomics data interpretation. Each team annotated at least one positive ion dataset using their own approaches. Eight teams selected the positive ion mode data-dependent acquisition (DDA) data collected on the Orbitrap platform, so the results reported for that dataset were chosen for an in-depth comparison. We compiled and cross-checked the annotations of this dataset from each laboratory to arrive at a "consensus annotation," which included 142 putative analytes, of which 13 were confirmed by comparison with standards. Each team only reported a subset (24 to 57%) of the analytes in the consensus list. Correct assignment of ion species (clusters and fragments) in MS spectra was a major bottleneck. In many cases, in-source redundant features were mistakenly considered to be independent analytes, causing annotation errors and resulting in overestimation of sample complexity. Our results suggest that better tools/approaches are needed to effectively assign feature identity, group related mass features, and query published spectral and taxonomic data when assigning putative analyte structures.

PubMed Disclaimer

Figures

1
1
Annotation Agreement Scores assess the feature-level agreement between the 8 teams for the Orbitrap data-dependent analysis (DDA) positive ionization dataset. Assignment Match (j) is defined as the number of teams that agreed on the assignment for a given Element. Four Elements of the annotations were considered: the “feature reported” element (defined by m/z and retention time), the “ion species description” element (ion species, [M + H]+, [M + Na]+, etc.), the “chemical class” element (defined by the NP classifier), and the “identity” element (two-dimensional identity). The Annotation Agreement Score is calculated from eq and described in more detail in SI-01 Figure S18. As an example, for Assignment Match 2 and the “Chemical class” element, an Annotation Agreement Score of 58 indicates that 58 features were assigned to the same chemical class by two teams.
2
2
Histogram of the ion species description proposed by the 8 teams for the features reported in the Merged Annotation Table compared to the assignments in the final Consensus Annotation Table. The results indicate that teams were biased toward [M + H]+ ions in their original assignments (black bars) and that reassignment (blue bars) caused a shift toward other ion species (see SI-01 Figures S25–S26).
3
3
Consensus Annotation Score analysis (SI-01 Figure S21) assesses the overlap between the lists provided by the 8 teams and the Consensus Annotation Table. A total of 142 analytes (unique chemical entities) were annotated in the Consensus Annotation Table after grouping all of the features. “Analyte” refers to the number of analytes reported by a given team that overlaps with those included in the Consensus Annotation Table, while “Ion species,” “Chemical class,” and “Identity” refer to the number of analytes with these elements that match the elements in the Consensus Annotation Table. For example, in Panel A, Team 1 reported 47 (33%) of the 142 analytes included in the Consensus Annotation Table, and 32, 41, and 23 of these analytes, respectively, were reported to have the same ion species, chemical class, and identity assigned in the Consensus Annotation Table. Panel A shows the analyses for all analytes, and panel B for the 13 confirmed by comparison with standards.
4
4
LC-MS data and annotation of withanone (confidence level 1A). (A) Structure of withanone, (B) MS spectrum of the pure standard, (C) Ion species description illustrated with a bar plot that represents the peak area of the features associated with one analyte as a function of mass-to-charge ratio (m/z), (D) MS spectrum of withanone in the Withania somnifera extract, and (E) annotation of its features and their detection by the 8 teams. [M+X]+ means the ion species, and ID means its identity (“x” false annotation, “√” correct annotation, “c” correct chemical class but false annotation; blue fill color means correspondence with the consensus annotation, red fill color means no correspondence). The features in red font were not reported by any teams but were detected in our manual interpretation and by comparison with standards, and the ones in blue font did not have any MS/MS spectra. See SI-01 Figure S35 for the fragmentation spectra.

Similar articles

References

    1. Pezzatti J., Boccard J., Codesido S., Gagnebin Y., Joshi A., Picard D., Gonzalez-Ruiz V., Rudaz S.. Implementation of liquid chromatography-high resolution mass spectrometry methods for untargeted metabolomic analyses of biological samples: A tutorial. Anal. Chim. Acta. 2020;1105:28–44. doi: 10.1016/j.aca.2019.12.062. - DOI - PubMed
    1. Wolfender J.-L., Nuzillard J.-M., van der Hooft J. J. J., Renault J.-H., Bertrand S.. Accelerating Metabolite Identification in Natural Product Research: Toward an Ideal Combination of Liquid Chromatography–High-Resolution Tandem Mass Spectrometry and NMR Profiling, in Silico Databases, and Chemometrics. Anal. Chem. 2019;91(1):704–742. doi: 10.1021/acs.analchem.8b05112. - DOI - PubMed
    1. Theodoridis G., Gika H., Raftery D., Goodacre R., Plumb R. S., Wilson I. D.. Ensuring Fact-Based Metabolite Identification in Liquid Chromatography-Mass Spectrometry-Based Metabolomics. Anal. Chem. 2023;95:3909. doi: 10.1021/acs.analchem.2c05192. - DOI - PMC - PubMed
    1. Misra B. B.. New software tools, databases, and resources in metabolomics: updates from 2020. Metabolomics. 2021;17(5):49. doi: 10.1007/s11306-021-01796-1. - DOI - PMC - PubMed
    1. Sumner L. W., Amberg A., Barrett D., Beale M. H., Beger R., Daykin C. A., Fan T. W. M., Fiehn O., Goodacre R., Griffin J. L.. et al. Proposed minimum reporting standards for chemical analysis. Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI) Metabolomics. 2007;3(3):211–221. doi: 10.1007/s11306-007-0082-2. - DOI - PMC - PubMed

MeSH terms

LinkOut - more resources