Toward an Integrated Machine Learning Model of a Proteomics Experiment
- PMID: 36744821
- PMCID: PMC9990124
- DOI: 10.1021/acs.jproteome.2c00711
Toward an Integrated Machine Learning Model of a Proteomics Experiment
Abstract
In recent years machine learning has made extensive progress in modeling many aspects of mass spectrometry data. We brought together proteomics data generators, repository managers, and machine learning experts in a workshop with the goals to evaluate and explore machine learning applications for realistic modeling of data from multidimensional mass spectrometry-based proteomics analysis of any sample or organism. Following this sample-to-data roadmap helped identify knowledge gaps and define needs. Being able to generate bespoke and realistic synthetic data has legitimate and important uses in system suitability, method development, and algorithm benchmarking, while also posing critical ethical questions. The interdisciplinary nature of the workshop informed discussions of what is currently possible and future opportunities and challenges. In the following perspective we summarize these discussions in the hope of conveying our excitement about the potential of machine learning in proteomics and to inspire future research.
Keywords: artificial intelligence; deep learning; enzymatic digestion; ion mobility; liquid chromatography; machine learning; research integrity; synthetic data; tandem mass spectrometry.
Conflict of interest statement
The authors declare the following competing financial interest(s): S.G. and T.S. are co-founders, shareholders and employees of MSAID GmbH, a company that develops software for proteomics. M.W. is founder and shareholder of MSAID GmbH and OmicScouts GmbH, with no operational role in both companies.
Figures
References
-
- Gessulat S.; Schmidt T.; Zolg D. P.; Samaras P.; Schnatbaum K.; Zerweck J.; Knaute T.; Rechenberger J.; Delanghe B.; Huhmer A.; Reimer U.; Ehrlich H.-C.; Aiche S.; Kuster B.; Wilhelm M. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 2019, 16 (6), 509–518. 10.1038/s41592-019-0426-7. - DOI - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources