MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation
- PMID: 32997169
- PMCID: PMC9547616
- DOI: 10.1007/s11306-020-01726-7
MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation
Abstract
Introduction: Metabolite annotation is a critical and challenging step in mass spectrometry-based metabolomic profiling. In a typical untargeted MS/MS-based metabolomic study, experimental MS/MS spectra are matched against those in spectral libraries for metabolite annotation. Yet, existing spectral libraries comprise merely a marginal percentage of known compounds.
Objective: The objective is to develop a method that helps rank putative metabolite IDs for analytes whose reference MS/MS spectra are not present in spectral libraries.
Methods: We introduce MetFID, which uses an artificial neural network (ANN) trained for predicting molecular fingerprints based on experimental MS/MS data. To narrow the search space, MetFID retrieves candidates from metabolite databases using molecular formula or m/z value of the precursor ions of the analytes. The candidate whose fingerprint is most analogous to the predicted fingerprint is used for metabolite annotation. A comprehensive evaluation was performed by training MetFID using MS/MS spectra from the MoNA repository and NIST library and by testing with structure-disjoint MS/MS spectra from the NIST library, the CASMI 2016 dataset, and in-house MS/MS data from a cancer biomarker discovery study.
Results: We observed that training separate models for distinct ranges of collision energies enhanced model performance compared to a single model that covers a wide range of collision energies. Using MetaboQuest to retrieve candidates, MetFID prioritized the correct putative ID in the first place rank for about 50% of the testing cases. Through the independent testing dataset, we demonstrated that MetFID has the potential to improve the accuracy of ranking putative metabolite IDs by more than 5% compared to other tools such as ChemDistiller, CSI:FingerID, and MetFrag.
Conclusion: MetFID offers a promising opportunity to enhance the accuracy of metabolite annotation by using ANN for molecular fingerprint prediction.
Keywords: Artificial neural network; Metabolite identification; Metabolomics; Molecular fingerprint.
Conflict of interest statement
Figures





Similar articles
-
Deep Learning Based Metabolite Annotation.Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007. Annu Int Conf IEEE Eng Med Biol Soc. 2023. PMID: 38082953 Free PMC article.
-
Convolutional Neural Network-Based Compound Fingerprint Prediction for Metabolite Annotation.Metabolites. 2022 Jun 29;12(7):605. doi: 10.3390/metabo12070605. Metabolites. 2022. PMID: 35888729 Free PMC article.
-
Deep Learning-Based Molecular Fingerprint Prediction for Metabolite Annotation.Metabolites. 2025 Feb 14;15(2):132. doi: 10.3390/metabo15020132. Metabolites. 2025. PMID: 39997757 Free PMC article.
-
De Novo Molecular Formula Annotation and Structure Elucidation Using SIRIUS 4.Methods Mol Biol. 2020;2104:185-207. doi: 10.1007/978-1-0716-0239-3_11. Methods Mol Biol. 2020. PMID: 31953819 Review.
-
Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics.Metabolites. 2018 May 10;8(2):31. doi: 10.3390/metabo8020031. Metabolites. 2018. PMID: 29748461 Free PMC article. Review.
Cited by
-
An approach of molecular-fingerprint prediction implementing a GAT.RSC Adv. 2025 Apr 22;15(16):12757-12764. doi: 10.1039/d5ra00973a. eCollection 2025 Apr 16. RSC Adv. 2025. PMID: 40264881 Free PMC article.
-
Deep Learning Based Metabolite Annotation.Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007. Annu Int Conf IEEE Eng Med Biol Soc. 2023. PMID: 38082953 Free PMC article.
-
Natural Products Dereplication: Databases and Analytical Methods.Prog Chem Org Nat Prod. 2024;124:1-56. doi: 10.1007/978-3-031-59567-7_1. Prog Chem Org Nat Prod. 2024. PMID: 39101983 Review.
-
Reproducible MS/MS library cleaning pipeline in matchms.J Cheminform. 2024 Jul 29;16(1):88. doi: 10.1186/s13321-024-00878-1. J Cheminform. 2024. PMID: 39075613 Free PMC article.
-
A Comparative Study of Network-Based Machine Learning Approaches for Binary Classification in Metabolomics.Metabolites. 2025 Mar 3;15(3):174. doi: 10.3390/metabo15030174. Metabolites. 2025. PMID: 40137139 Free PMC article.
References
-
- Duhrkop K, Fleischauer M, Ludwig M, Aksenov AA, Melnik AV, Meusel M, et al. (2019). SIRIUS 4: A rapid tool for turning tandem mass spectra into metabolite structure information. Nature Methods, 16, 299–302. - PubMed
-
- Fan Z, Ghaffari K, Alley A, & Ressom HW (2019). Metabolite identification using artificial neural network. In Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), San Diego, CA, November 18–21, 2019 (pp. 244–248).
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources