Viruses. 2025 Apr 14;17(4):567.

doi: 10.3390/v17040567.

Evaluation of Different Machine Learning Approaches to Predict Antigenic Distance Among Newcastle Disease Virus (NDV) Strains

Giovanni Franzo¹, Alice Fusaro², Chantal J Snoeck³, Aleksandar Dodovski⁴, Steven Van Borm⁵, Mieke Steensels⁵, Vasiliki Christodoulou⁶, Iuliana Onita⁷, Raluca Burlacu⁷, Azucena Sánchez Sánchez⁸, Ilya A Chvala⁹, Mia Kim Torchetti¹⁰, Ismaila Shittu¹¹, Mayowa Olabode¹¹, Ambra Pastori², Alessia Schivo², Angela Salomoni², Silvia Maniero², Ilaria Zambon², Francesco Bonfante², Isabella Monne², Mattia Cecchinato¹, Alessio Bortolami²

Affiliations

¹ Department of Animal Medicine, Production and Health (MAPS), Padua University, 35020 Legnaro, Italy.
² Division of Comparative Biomedical Sciences (DSBIO), Istituto Zooprofilattico Sperimentale delle Venezie, Viale dell'Università 10, 35020 Legnaro, Italy.
³ Clinical and Applied Virology Group, Department of Infection and Immunity, Luxembourg Institute of Health, 29, Rue Henri Koch, Esch-sur-Alzette, L-4354 Luxembourg, Luxembourg.
⁴ Faculty of Veterinary Medicine-Skopje, Ss. Cyril and Methodius University in Skopje, Lazar Pop Trajkov 5-7, 1000 Skopje, North Macedonia.
⁵ Avian Virology and Immunology, Sciensano, Rue Groeselenberg 99, 1180 Ukkel, Belgium.
⁶ Section Veterinary Services (1417), Laboratory for Animal Health Virology, 79, Athalassa Avenue, Aglantzia, Nicosia 2109, Cyprus.
⁷ Institute For Diagnosis and Animal Health, 63, Dr. Staicovici Str., Sector 5, 050557 Bucharest, Romania.
⁸ Laboratorio Central de Veterinaria (LCV), Ministry of Agriculture, Fisheries and Food, Ctra. M-106, Km 1, 4 Algete, 28110 Madrid, Spain.
⁹ National Reference Laboratory for Avian Influenza and Newcastle Disease, Federal Centre for Animal Health (FGBI "ARRIAH"), Vladimir 600901, Russia.
¹⁰ National Veterinary Services Laboratories, U.S. Department of Agriculture, Ames, IA 50011, USA.
¹¹ National Veterinary Research Institute, Vom 93010, Nigeria.

PMID: 40285009
PMCID: PMC12031050
DOI: 10.3390/v17040567

Giovanni Franzo et al. Viruses. 2025.

Abstract

Newcastle disease virus (NDV) continues to present a significant challenge for vaccination due to its rapid evolution and the emergence of new variants. Although molecular and sequence data are now quickly and inexpensively produced, genetic distance rarely serves as a good proxy for cross-protection, while experimental studies to assess antigenic differences are time-consuming and resource-intensive. In response to these challenges, this study explores and compares several machine learning (ML) methods to predict the antigenic distance between NDV strains as determined by hemagglutination-inhibition (HI) assays. By analyzing F and HN gene sequences alongside corresponding amino acid features, we developed predictive models to estimate antigenic distances. Among the models evaluated, the random forest (RF) approach outperformed traditional linear models, achieving an R² of 0.723, compared with only 0.051 for linear models based on genetic distance alone. This significant improvement demonstrates the usefulness of applying flexible ML approaches as a rapid and reliable tool for vaccine selection, minimizing the need for labor-intensive experimental trials. Moreover, the flexibility of this ML framework holds promise for application to other infectious diseases in both animals and humans, particularly in scenarios where rapid response and ethical constraints limit conventional experimental approaches.

Keywords: NDV; antigenic cartography; cross-protection; hemagglutination inhibition; machine learning; sequencing.
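
To make the baseline in the abstract concrete, the following is a minimal sketch of a linear model that predicts antigenic distance from genetic distance alone and is scored with R². It is not the authors' pipeline: the abstract does not say how genetic distance was computed, so the p-distance (proportion of differing aligned sites) is an assumption, the sequences and antigenic distances are invented toy values, and the R² is computed in-sample rather than through cross-validation.

```python
from itertools import combinations
from statistics import mean

# Hypothetical aligned amino acid fragments and pairwise antigenic distances
# (in twofold-dilution units). All values are invented for illustration.
seqs = {"A": "MKTLLN", "B": "MKTLLD", "C": "MRTLVD", "D": "QRSLVD"}
antigenic = {("A", "B"): 2.0, ("A", "C"): 2.5, ("A", "D"): 3.0,
             ("B", "C"): 0.5, ("B", "D"): 1.5, ("C", "D"): 0.5}


def p_distance(a, b):
    """Proportion of aligned sites that differ (assumed genetic distance)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    my = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - my) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Build one (genetic distance, antigenic distance) observation per strain pair.
pairs = list(combinations(sorted(seqs), 2))
x = [p_distance(seqs[a], seqs[b]) for a, b in pairs]
y = [antigenic[pair] for pair in pairs]

intercept, slope = fit_line(x, y)
r2 = r_squared(y, [intercept + slope * xi for xi in x])
print(f"R^2 of the genetic-distance baseline: {r2:.3f}")
```

A single overall distance treats every substitution as equally important, which is one plausible reason such a baseline can explain little of the antigenic variation; a model that uses individual sequence positions and amino acid features, such as the random forest in the study, is not bound by that restriction.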

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

**Figure 1**
Antigenic map of NDVs based on HI data. Names of antigens (depicted as dots) and sera (depicted as squares) were excluded from the map to improve readability. Colors have been assigned to each genotype (nomenclature according to Dimitrov et al. [18]) to visualize antigenic relatedness between genotypes. The vertical and horizontal axes both represent antigenic distance, and, because only the relative positions of antigens and antisera can be determined, the orientation of the map within these axes is free. The spacing between grid lines is 1 unit of antigenic distance, corresponding to a twofold dilution of antiserum in the HI assay.

**Figure 2**
Antigenic map of NDVs based on MN data. Dots represent antigens and squares represent sera of individual immunized birds. The same color has been used for viruses and homologous antisera; superposition of two sera is represented by a darker color of the square. The spacing between grid lines is 1 unit of antigenic distance, corresponding to a twofold dilution of antiserum in the MN assay.

**Figure 3**
Boxplots of performance metrics obtained through cross-validation for the different methods, based on the F dataset (**left**). The solid, hollow dots represent the median value. Differences in performance parameters between method pairs (**right**). The average difference and the confidence interval, corrected for multiple comparisons and indicative of statistical significance, are reported.

**Figure 4**
Boxplots of performance metrics obtained through cross-validation for the different methods, based on the HN dataset (**left**). The solid, hollow dots represent the median value. Differences in performance parameters between method pairs (**right**). The average difference and the confidence interval, corrected for multiple comparisons and indicative of statistical significance, are reported.

**Figure 5**
Boxplots of performance metrics obtained through cross-validation for the different methods, based on the Merged dataset (**left**). The solid, hollow dots represent the median value. Differences in performance parameters between method pairs (**right**). The average difference and the confidence interval, corrected for multiple comparisons and indicative of statistical significance, are reported.
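
The grid unit used in Figures 1 and 2 (one unit of antigenic distance per twofold dilution of antiserum) can be illustrated with a short sketch. It follows a convention common in antigenic cartography, in which the distance for a serum-antigen pair is the log2 ratio between the highest titer observed for that serum and the titer against the antigen. Whether the authors used exactly this definition is not stated in this excerpt, and the function name, serum and antigen labels, and titers below are hypothetical.

```python
import math


def table_distances(titers):
    """Convert titers {serum: {antigen: titer}} into antigenic distances.

    Assumed convention: distance = log2(max titer for the serum / titer),
    so each twofold drop in titer adds one unit of antigenic distance.
    """
    distances = {}
    for serum, row in titers.items():
        best = max(row.values())  # highest titer recorded for this serum
        distances[serum] = {ag: math.log2(best / t) for ag, t in row.items()}
    return distances


# Invented HI titers for one antiserum raised against hypothetical strain X.
hi_titers = {"anti-X": {"X": 1280, "Y": 320, "Z": 40}}

for serum, row in table_distances(hi_titers).items():
    for antigen, d in row.items():
        print(f"{serum} vs {antigen}: {d:.1f} antigenic units")
```

In this toy table, strain Y is two twofold dilutions below the homologous titer (2 units) and strain Z is five dilutions below it (5 units), which is the scale on which the maps' grid lines are spaced.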
