Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 30;16(10):104.
doi: 10.1007/s11306-020-01726-7.

MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation

Affiliations

MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation

Ziling Fan et al. Metabolomics. .

Abstract

Introduction: Metabolite annotation is a critical and challenging step in mass spectrometry-based metabolomic profiling. In a typical untargeted MS/MS-based metabolomic study, experimental MS/MS spectra are matched against those in spectral libraries for metabolite annotation. Yet, existing spectral libraries comprise merely a marginal percentage of known compounds.

Objective: The objective is to develop a method that helps rank putative metabolite IDs for analytes whose reference MS/MS spectra are not present in spectral libraries.

Methods: We introduce MetFID, which uses an artificial neural network (ANN) trained for predicting molecular fingerprints based on experimental MS/MS data. To narrow the search space, MetFID retrieves candidates from metabolite databases using molecular formula or m/z value of the precursor ions of the analytes. The candidate whose fingerprint is most analogous to the predicted fingerprint is used for metabolite annotation. A comprehensive evaluation was performed by training MetFID using MS/MS spectra from the MoNA repository and NIST library and by testing with structure-disjoint MS/MS spectra from the NIST library, the CASMI 2016 dataset, and in-house MS/MS data from a cancer biomarker discovery study.

Results: We observed that training separate models for distinct ranges of collision energies enhanced model performance compared to a single model that covers a wide range of collision energies. Using MetaboQuest to retrieve candidates, MetFID prioritized the correct putative ID in the first place rank for about 50% of the testing cases. Through the independent testing dataset, we demonstrated that MetFID has the potential to improve the accuracy of ranking putative metabolite IDs by more than 5% compared to other tools such as ChemDistiller, CSI:FingerID, and MetFrag.

Conclusion: MetFID offers a promising opportunity to enhance the accuracy of metabolite annotation by using ANN for molecular fingerprint prediction.

Keywords: Artificial neural network; Metabolite identification; Metabolomics; Molecular fingerprint.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest The authors declare no conflict of interest.

Figures

Fig. 1
Fig. 1
Workflow of MetFID
Fig. 2
Fig. 2
Steps involved in processing spectra prior to training an ANN
Fig. 3
Fig. 3
Architecture of the ANN
Fig. 4
Fig. 4
Examples of MS/MS for Cycloheptylamine acquired by different machine types and collision energy. HCD higher-energy C-trap dissociation, QqQ triple quadrupole
Fig. 5
Fig. 5
Examples of clusters of MS/MS spectra for two compounds. Each dot represents an MS/MS spectrum and its collision energy is labeled (filled circle = ESI QqQ; filled down triangel = HCD; filled diamond = Ion Trap)

Similar articles

Cited by

References

    1. Brouard C, Shen H, Duhrkop K, d’Alche-Buc F, Bocker S, & Rousu J (2016). Fast metabolite identification with Input Output Kernel Regression. Bioinformatics, 32, i28–i36. - PMC - PubMed
    1. Duhrkop K, Fleischauer M, Ludwig M, Aksenov AA, Melnik AV, Meusel M, et al. (2019). SIRIUS 4: A rapid tool for turning tandem mass spectra into metabolite structure information. Nature Methods, 16, 299–302. - PubMed
    1. Duhrkop K, Shen H, Meusel M, Rousu J, & Bocker S (2015). Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proceedings of the National Academy of Sciences of the United States of America, 112, 12580–12585. - PMC - PubMed
    1. Fan Z, Ghaffari K, Alley A, & Ressom HW (2019). Metabolite identification using artificial neural network. In Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), San Diego, CA, November 18–21, 2019 (pp. 244–248).
    1. Ferrarini A, Di Poto C, He S, Tu C, Varghese RS, Kara Balla A, et al. (2019). Metabolomic analysis of liver tissues for characterization of hepatocellular Carcinoma. Journal of Proteome Research, 18, 3067–3076. - PMC - PubMed

Publication types

LinkOut - more resources