Neural network based integration of assays to assess pathogenic potential
- PMID: 37055450
- PMCID: PMC10102301
- DOI: 10.1038/s41598-023-32950-5
Neural network based integration of assays to assess pathogenic potential
Abstract
Limited data significantly hinders our capability of biothreat assessment of novel bacterial strains. Integration of data from additional sources that can provide context about the strain can address this challenge. Datasets from different sources, however, are generated with a specific objective and which makes integration challenging. Here, we developed a deep learning-based approach called the neural network embedding model (NNEM) that integrates data from conventional assays designed to classify species with new assays that interrogate hallmarks of pathogenicity for biothreat assessment. We used a dataset of metabolic characteristics from a de-identified set of known bacterial strains that the Special Bacteriology Reference Laboratory (SBRL) of the Centers for Disease Control and Prevention (CDC) has curated for use in species identification. The NNEM transformed results from SBRL assays into vectors to supplement unrelated pathogenicity assays from de-identified microbes. The enrichment resulted in a significant improvement in accuracy of 9% for biothreat. Importantly, the dataset used in our analysis is large, but noisy. Therefore, the performance of our system is expected to improve as additional types of pathogenicity assays are developed and deployed. The proposed NNEM strategy thus provides a generalizable framework for enrichment of datasets with previously collected assays indicative of species.
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures






Similar articles
-
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25. Artif Intell Med. 2019. PMID: 31521253
-
Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation.J Med Internet Res. 2020 Oct 23;22(10):e19810. doi: 10.2196/19810. J Med Internet Res. 2020. PMID: 33095174 Free PMC article.
-
Deep semi-supervised learning via dynamic anchor graph embedding in latent space.Neural Netw. 2022 Feb;146:350-360. doi: 10.1016/j.neunet.2021.11.026. Epub 2021 Dec 1. Neural Netw. 2022. PMID: 34929418
-
Sequence-based bacterial small RNAs prediction using ensemble learning strategies.BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):503. doi: 10.1186/s12859-018-2535-1. BMC Bioinformatics. 2018. PMID: 30577759 Free PMC article.
-
Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7. Methods Mol Biol. 2021. PMID: 32804365 Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources