Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Mar 5;91(5):3302-3310.
doi: 10.1021/acs.analchem.8b04310. Epub 2019 Feb 13.

mzTab-M: A Data Standard for Sharing Quantitative Results in Mass Spectrometry Metabolomics

Affiliations

mzTab-M: A Data Standard for Sharing Quantitative Results in Mass Spectrometry Metabolomics

Nils Hoffmann et al. Anal Chem. .

Abstract

Mass spectrometry (MS) is one of the primary techniques used for large-scale analysis of small molecules in metabolomics studies. To date, there has been little data format standardization in this field, as different software packages export results in different formats represented in XML or plain text, making data sharing, database deposition, and reanalysis highly challenging. Working within the consortia of the Metabolomics Standards Initiative, Proteomics Standards Initiative, and the Metabolomics Society, we have created mzTab-M to act as a common output format from analytical approaches using MS on small molecules. The format has been developed over several years, with input from a wide range of stakeholders. mzTab-M is a simple tab-separated text format, but importantly, the structure is highly standardized through the design of a detailed specification document, tightly coupled to validation software, and a mandatory controlled vocabulary of terms to populate it. The format is able to represent final quantification values from analyses, as well as the evidence trail in terms of features measured directly from MS (e.g., LC-MS, GC-MS, DIMS, etc.) and different types of approaches used to identify molecules. mzTab-M allows for ambiguity in the identification of molecules to be communicated clearly to readers of the files (both people and software). There are several implementations of the format available, and we anticipate widespread adoption in the field.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Overall structure of an mzTab-M file. (A) Metadata about the experiment, describing experimental design (study variables and assays), links to other files, etc.. (B) The small molecule (SML) table, capturing “final” results table: i.e., overall calculated quantification value (and identity where known) of a metabolite. (C) Quantification value in each (aligned) MS run for MS1 features: e.g., mapped to individual adducts or charge states of a molecule. (D) Evidence supporting identification (with ambiguity if needed) for molecules, using CV terms for scores/statistics where available.
Figure 2
Figure 2
Simple experimental designs in mzTab-M can be represented using a combination of the elements study_variable (SV), assay, ms_run, and sample. Quantitative values can be reported in files for SVs and assays. (A) SV is intended to capture different groups of replicates, which might have resulted from different levels of a given variable: e.g. control versus treated (represented as 2 SVs) and n time points over a treatment course (as n SVs). (B) assay captures a measurement made about a molecule (small molecule/lipid) where multiple assays within the same SV are taken to be replicates of some kind (biological or technical). (C) ms_run captures a single run on an MS instrument. (D) Samples are optional in mzTab, since the quantitative software may often be unaware of the biological samples that have been analyzed. If that information is available, references from assay to the same (technical, upper half) or different (biological, bottom half) samples are used to describe the type of replication performed.
Figure 3
Figure 3
(A) The summary level (SML) reports the final assumed identifications, allowing for ambiguity by including “|” separated results in the relevant columns. (B) The feature level (SMF) does not explicitly report identifications but references down to the SME level. Ambiguity is propagated via referencing multiple SME rows with different identification results. (C) One SME row represents a single possible identification from some input evidence. Multiple identifications from the same input data share the same value for evidence_input_id. Ambiguity can be captured by different rows for the same input data.

References

    1. Wilkinson M. D.; Dumontier M.; Aalbersberg I. J.; Appleton G.; Axton M.; Baak A.; Blomberg N.; Boiten J.-W.; da Silva Santos L. B.; Bourne P. E.; Bouwman J.; Brookes A. J.; Clark T.; Crosas M.; Dillo I.; Dumon O.; Edmunds S.; Evelo C. T.; Finkers R.; Gonzalez-Beltran A.; Gray A. J. G.; Groth P.; Goble C.; Grethe J. S.; Heringa J.; ’t Hoen P. A. C.; Hooft R.; Kuhn T.; Kok R.; Kok J.; Lusher S. J.; Martone M. E.; Mons A.; Packer A. L.; Persson B.; Rocca-Serra P.; Roos M.; van Schaik R.; Sansone S.-A.; Schultes E.; Sengstag T.; Slater T.; Strawn G.; Swertz M. A.; Thompson M.; van der Lei J.; van Mulligen E.; Velterop J.; Waagmeester A.; Wittenburg P.; Wolstencroft K.; Zhao J.; Mons B. Sci. Data 2016, 3, 160018.10.1038/sdata.2016.18. - DOI - PMC - PubMed
    1. Holcapek M.; Liebisch G.; Ekroos K. Anal. Chem. 2018, 90, 4249–4257. 10.1021/acs.analchem.7b05395. - DOI - PubMed
    1. Blazenovic I.; Kind T.; Ji J.; Fiehn O. Metabolites 2018, 8, 31.10.3390/metabo8020031. - DOI - PMC - PubMed
    1. Bingol K.; Bruschweiler-Li L.; Li D.; Zhang B.; Xie M.; Bruschweiler R. Bioanalysis 2016, 8, 557–573. 10.4155/bio-2015-0004. - DOI - PMC - PubMed
    1. Guijas C.; Montenegro-Burke J. R.; Domingo-Almenara X.; Palermo A.; Warth B.; Hermann G.; Koellensperger G.; Huan T.; Uritboonthai W.; Aisporna A. E.; Wolan D. W.; Spilker M. E.; Benton H. P.; Siuzdak G. Anal. Chem. 2018, 90, 3156–3164. 10.1021/acs.analchem.7b04424. - DOI - PMC - PubMed

Publication types