Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Jun 23;12(7):584.
doi: 10.3390/metabo12070584.

A Current Encyclopedia of Bioinformatics Tools, Data Formats and Resources for Mass Spectrometry Lipidomics

Affiliations
Review

A Current Encyclopedia of Bioinformatics Tools, Data Formats and Resources for Mass Spectrometry Lipidomics

Nils Hoffmann et al. Metabolites. .

Abstract

Mass spectrometry is a widely used technology to identify and quantify biomolecules such as lipids, metabolites and proteins necessary for biomedical research. In this study, we catalogued freely available software tools, libraries, databases, repositories and resources that support lipidomics data analysis and determined the scope of currently used analytical technologies. Because of the tremendous importance of data interoperability, we assessed the support of standardized data formats in mass spectrometric (MS)-based lipidomics workflows. We included tools in our comparison that support targeted as well as untargeted analysis using direct infusion/shotgun (DI-MS), liquid chromatography-mass spectrometry, ion mobility or MS imaging approaches on MS1 and potentially higher MS levels. As a result, we determined that the Human Proteome Organization-Proteomics Standards Initiative standard data formats, mzML and mzTab-M, are already supported by a substantial number of recent software tools. We further discuss how mzTab-M can serve as a bridge between data acquisition and lipid bioinformatics tools for interpretation, capturing their output and transmitting rich annotated data for downstream processing. However, we identified several challenges of currently available tools and standards. Potential areas for improvement were: adaptation of common nomenclature and standardized reporting to enable high throughput lipidomics and improve its data handling. Finally, we suggest specific areas where tools and repositories need to improve to become FAIRer.

Keywords: FAIR; bioinformatics; data format; database; lipidomics; mass spectrometry; standardization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Figures

Figure 1
Figure 1
Data flow for different lipidomics software tools. From top to bottom, the supported ‘data format paths’ of selected lipidomics software tools that support at least one PSI standard data format are highlighted with a solid white outline. Those with planned or upcoming support are highlighted with a dashed white outline. The various software tools are represented by blue rectangles with the tool name in white. Data formats are represented by gray rectangles. On the right, the theoretically possible data flow between the PSI standard data formats, mzML and mzTab-M, is depicted (orange background) via supporting software and repositories/databases (white rectangle outline). In this figure, ‘Vendor Format’ stands for various proprietary, vendor-specific raw data formats (i.e., Thermo Fisher .raw, Agilent .d, ABSciex .wiff and Waters .raw), which can be converted to mzML (among other formats) using msConvert. (1) ‘csv/xlsx’ represents non-standardized output formats such as Microsoft XLSX, comma/tab separated text file formats and HTML. (2) ‘mgf/msp/ms2‘ are text-based formats that encode mass spectral data but generally do not have a strongly defined metadata schema. (3) ‘mzXML/mzData’ represent the legacy raw data and peak list formats mzXML and mzData. (4) ISA-Tab (used by MetaboLights) and mwTab (used by Metabolomics Workbench) are text-based, tabular data formats based on a defined metadata model, which simplifies validation and tooling. (5) Only some tools support quantification. See Table 1 for details.

References

    1. Züllig T., Trötzmüller M., Köfeler H.C. Lipidomics from Sample Preparation to Data Analysis: A Primer. Anal. Bioanal. Chem. 2020;412:2191–2209. doi: 10.1007/s00216-019-02241-y. - DOI - PMC - PubMed
    1. Paglia G., Smith A.J., Astarita G. Ion Mobility Mass Spectrometry in the Omics Era: Challenges and Opportunities for Metabolomics and Lipidomics. Mass Spectrom. Rev. 2021 doi: 10.1002/mas.21686. - DOI - PubMed
    1. But Is the Code (Re)Usable? Nat. Comput. Sci. 2021;1:449. doi: 10.1038/s43588-021-00109-9. - DOI - PubMed
    1. Rampler E., Abiead Y.E., Schoeny H., Rusz M., Hildebrand F., Fitz V., Koellensperger G. Recurrent Topics in Mass Spectrometry-Based Metabolomics and Lipidomics—Standardization, Coverage, and Throughput. Anal. Chem. 2021;93:519–545. doi: 10.1021/acs.analchem.0c04698. - DOI - PMC - PubMed
    1. Köfeler H.C., Ahrends R., Baker E.S., Ekroos K., Han X., Hoffmann N., Holčapek M., Wenk M.R., Liebisch G. Recommendations for Good Practice in MS-Based Lipidomics. J. Lipid Res. 2021;62:100138. doi: 10.1016/j.jlr.2021.100138. - DOI - PMC - PubMed