Multi-Task Neural Networks and Molecular Fingerprints to Enhance Compound Identification from LC-MS/MS Data
- PMID: 36144564
- PMCID: PMC9502453
- DOI: 10.3390/molecules27185827
Abstract
Mass spectrometry (MS) is widely used to identify chemical compounds by matching an experimentally acquired mass spectrum against a database of reference spectra. However, this approach is limited by the coverage of existing databases: a compound that is not present in the database cannot be identified. Among the computational approaches for mining metabolite structures from MS data, one option is to predict molecular fingerprints from the mass spectra by means of chemometric strategies and then use those fingerprints to screen compound libraries. This can be done by calibrating multi-task artificial neural networks on large datasets, with mass spectra as inputs and molecular fingerprints as outputs. In this study, we prepared a large LC-MS/MS dataset from an online open repository. These data were used to train and evaluate deep-learning-based approaches that predict molecular fingerprints and retrieve the structure of unknown compounds from their LC-MS/MS spectra. We evaluated the effects of data sparseness and the impact of different data curation and dimensionality reduction strategies on output accuracy. Moreover, we carried out extensive diagnostics to assess the advantages and drawbacks of the models as a function of the explored chemical space.
Keywords: LC-MS/MS; chemometrics; classification; fingerprints; multi-task; neural networks; similarity matching.
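The two-step idea described in the abstract (predict a fingerprint from a spectrum, then screen a compound library by similarity) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the tiny untrained network, the 0.5 threshold, the choice of Tanimoto similarity, and the toy library are all assumptions made for illustration only.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def predict_fingerprint(spectrum, w_hidden, w_out):
    """Multi-task forward pass (hypothetical, untrained weights).

    One shared ReLU hidden layer feeds one sigmoid output per
    fingerprint bit, so each bit is treated as its own binary task.
    """
    hidden = [max(0.0, sum(w * x for w, x in zip(row, spectrum)))
              for row in w_hidden]
    return [sigmoid(sum(w * h for w, h in zip(row, hidden)))
            for row in w_out]


def tanimoto(a, b):
    """Tanimoto similarity between two binary fingerprints."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


def rank_candidates(predicted_probs, library, threshold=0.5):
    """Binarize predicted bit probabilities and rank library entries."""
    bits = [1 if p >= threshold else 0 for p in predicted_probs]
    scored = [(name, tanimoto(bits, fp)) for name, fp in library.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy usage: a 3-bin spectrum, 2 hidden units, 4 fingerprint bits.
spectrum = [0.2, 0.0, 1.0]                      # made-up binned intensities
w_hidden = [[0.5, -0.3, 0.8], [-0.6, 0.1, 0.4]]  # made-up weights
w_out = [[1.0, -1.0], [0.5, 0.5], [-2.0, 0.3], [0.1, -0.7]]
probs = predict_fingerprint(spectrum, w_hidden, w_out)

library = {"candidate_A": [1, 1, 0, 0], "candidate_B": [0, 0, 1, 1]}
ranking = rank_candidates(probs, library)
```

In a trained model the weights would be fitted jointly across all fingerprint bits, typically with a summed binary cross-entropy loss; the ranking step stays the same regardless of how the probabilities were produced.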
Conflict of interest statement
The authors declare no conflict of interest.
Similar articles
- Deep Learning Based Metabolite Annotation. Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007. PMID: 38082953. Free PMC article.
- MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation. Metabolomics. 2020 Sep 30;16(10):104. doi: 10.1007/s11306-020-01726-7. PMID: 32997169. Free PMC article.
- A classification of liquid chromatography mass spectrometry techniques for evaluation of chemical composition and quality control of traditional medicines. J Chromatogr A. 2020 Jan 4;1609:460501. doi: 10.1016/j.chroma.2019.460501. Epub 2019 Aug 30. PMID: 31515074. Review.
- LC-MS/MS Software for Screening Unknown Erectile Dysfunction Drugs and Analogues: Artificial Neural Network Classification, Peak-Count Scoring, Simple Similarity Search, and Hybrid Similarity Search Algorithms. Anal Chem. 2019 Jul 16;91(14):9119-9128. doi: 10.1021/acs.analchem.9b01643. Epub 2019 Jul 1. PMID: 31260264.
- Role of liquid chromatography-high-resolution mass spectrometry (LC-HR/MS) in clinical toxicology. Clin Toxicol (Phila). 2012 Sep;50(8):733-42. doi: 10.3109/15563650.2012.713108. Epub 2012 Aug 13. PMID: 22888997. Review.
Cited by
- Mixture of experts for multitask learning in cardiotoxicity assessment. J Cheminform. 2025 Aug 29;17(1):135. doi: 10.1186/s13321-025-01072-7. PMID: 40883848. Free PMC article.
- Exploring Synergistic Inhibition of Inflammatory and Antioxidant Potential: Integrated In Silico and In Vitro Analyses of Garcinia mangostana, Curcuma comosa, and Acanthus ebracteatus. Adv Pharmacol Pharm Sci. 2024 Sep 18;2024:8584015. doi: 10.1155/2024/8584015. eCollection 2024. PMID: 39328582. Free PMC article.
- Deep Learning Based Metabolite Annotation. Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007. PMID: 38082953. Free PMC article.