Future Sequon Finder - A novel approach for predicting future N-linked glycosylation sequon locations on viral surface proteins
- PMID: 40811715
- DOI: 10.1371/journal.pone.0328174
Future Sequon Finder - A novel approach for predicting future N-linked glycosylation sequon locations on viral surface proteins
Abstract
Influenza viruses are known to evade host immune responses by shielding vulnerable surface protein epitopes via N-linked glycosylation. A program titled Future Sequon Finder was developed to predict the locations in which glycan binding sites are most likely to emerge in future influenza hemagglutinin proteins. The predictive modeling approach considers how closely sites in currently circulating strains resemble glycosylation sequons at the nucleic acid level, the surface accessibility of those sites, and the mutation frequency of amino acids at those sites that would need to change to form a glycosylation sequon. The efficacy of this model is tested using historic human H1N1 and H3N2 influenza strains along with swine H1N1 strains. Through this analysis, it is revealed that glycosylation addition events in influenza hemagglutinin proteins are typically the result of single nucleotide mutation events. It is also demonstrated that site-specific mutation frequency and surface accessibility are powerful predictors of which sites will become glycosylated in human influenza viruses when considered with the genetic composition of the sites in question. Having been designed to incorporate these factors, the program successfully predicted almost every historic sequon addition event (28/30 in human IFVs, 14/15 in swine IFVs). For human strains, it also ranked the correct near-sequons highly among falsely predicted sequons based on site-specific mutation frequency. After demonstrating the model's power with historical data, the program was used to predict future HA glycosylation sequon locations based on currently circulating human influenza viruses.
Copyright: © 2025 Bryan, Zand. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
LinkOut - more resources
Full Text Sources