Using machine learning to detect coronaviruses potentially infectious to humans
- PMID: 37291260
- PMCID: PMC10248971
- DOI: 10.1038/s41598-023-35861-7
Abstract
Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their binding annotation to their host receptor. The proposed method produces a human-Binding Potential (h-BiP) score that distinguishes, with high accuracy, the binding potential among coronaviruses. Three viruses, previously unknown to bind human receptors, were identified: Bat coronavirus BtCoV/133/2005 and Pipistrellus abramus bat coronavirus HKU5-related (both MERS-related viruses), and Rhinolophus affinis coronavirus isolate LYRa3 (a SARS-related virus). We further analyze the binding properties of BtCoV/133/2005 and LYRa3 using molecular dynamics. To test whether this model can be used for surveillance of novel coronaviruses, we re-trained the model on a set that excludes SARS-CoV-2 and all viral sequences released after SARS-CoV-2 was published. The re-trained model predicts that SARS-CoV-2 binds a human receptor, indicating that machine learning methods are an excellent tool for the prediction of host expansion events.
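The abstract does not say how spike sequences are encoded or how the network is structured, so the following is only a toy sketch of the general idea of mapping a protein sequence to a score between 0 and 1. It assumes an overlapping k-mer encoding and replaces the neural network with a single logistic unit; the function names, the value of `k`, and the weights are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def kmer_counts(seq: str, k: int = 3) -> Counter:
    """Count overlapping k-mers (amino-acid 'words') in a protein sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def binding_score(seq: str, weights: dict, bias: float = 0.0, k: int = 3) -> float:
    """Toy stand-in for a learned model: a logistic unit over k-mer frequencies.

    `weights` maps k-mers to hypothetical coefficients; in a real model they
    would be learned from sequences annotated with receptor binding.
    """
    counts = kmer_counts(seq, k)
    total = sum(counts.values()) or 1  # avoid division by zero for short input
    z = bias + sum(weights.get(kmer, 0.0) * n / total for kmer, n in counts.items())
    return 1.0 / (1.0 + math.exp(-z))  # squash to the open interval (0, 1)


# Hypothetical weights, for illustration only.
toy_weights = {"MFV": 2.0, "VFL": 1.0}
score = binding_score("MFVFL", toy_weights)
```

A score above a chosen threshold (0.5 would be a natural default for a logistic output) would then be read as predicted binding; the threshold used for h-BiP is not stated in the abstract.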
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- A mouse model for Betacoronavirus subgroup 2c using a bat coronavirus strain HKU5 variant. mBio. 2014 Mar 25;5(2):e00047-14. doi: 10.1128/mBio.00047-14. PMID: 24667706. Free PMC article.
- Replication of MERS and SARS coronaviruses in bat cells offers insights to their ancestral origins. Emerg Microbes Infect. 2018 Dec 10;7(1):209. doi: 10.1038/s41426-018-0208-9. PMID: 30531999. Free PMC article.
- Intestinal Tropism of a Betacoronavirus (Merbecovirus) in Nathusius's Pipistrelle Bat (Pipistrellus nathusii), Its Natural Host. J Virol. 2023 Mar 30;97(3):e0009923. doi: 10.1128/jvi.00099-23. Epub 2023 Mar 1. PMID: 36856426. Free PMC article.
- [Source of the COVID-19 pandemic: ecology and genetics of coronaviruses (Betacoronavirus: Coronaviridae) SARS-CoV, SARS-CoV-2 (subgenus Sarbecovirus), and MERS-CoV (subgenus Merbecovirus).] Vopr Virusol. 2020;65(2):62-70. doi: 10.36233/0507-4088-2020-65-2-62-70. PMID: 32515561. Review. Russian.
- Bat origin of human coronaviruses. Virol J. 2015 Dec 22;12:221. doi: 10.1186/s12985-015-0422-1. PMID: 26689940. Free PMC article. Review.
Cited by
- ORF1ab codon frequency model predicts host-pathogen relationship in orthocoronavirinae. Front Bioinform. 2025 Mar 18;5:1562668. doi: 10.3389/fbinf.2025.1562668. eCollection 2025. PMID: 40170904. Free PMC article.