Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 14;20(8):e0328174.
doi: 10.1371/journal.pone.0328174. eCollection 2025.

Future Sequon Finder - A novel approach for predicting future N-linked glycosylation sequon locations on viral surface proteins

Affiliations
Free article

Future Sequon Finder - A novel approach for predicting future N-linked glycosylation sequon locations on viral surface proteins

Shane P Bryan et al. PLoS One. .
Free article

Abstract

Influenza viruses are known to evade host immune responses by shielding vulnerable surface protein epitopes via N-linked glycosylation. A program titled Future Sequon Finder was developed to predict the locations in which glycan binding sites are most likely to emerge in future influenza hemagglutinin proteins. The predictive modeling approach considers how closely sites in currently circulating strains resemble glycosylation sequons at the nucleic acid level, the surface accessibility of those sites, and the mutation frequency of amino acids at those sites that would need to change to form a glycosylation sequon. The efficacy of this model is tested using historic human H1N1 and H3N2 influenza strains along with swine H1N1 strains. Through this analysis, it is revealed that glycosylation addition events in influenza hemagglutinin proteins are typically the result of single nucleotide mutation events. It is also demonstrated that site-specific mutation frequency and surface accessibility are powerful predictors of which sites will become glycosylated in human influenza viruses when considered with the genetic composition of the sites in question. Having been designed to incorporate these factors, the program successfully predicted almost every historic sequon addition event (28/30 in human IFVs, 14/15 in swine IFVs). For human strains, it also ranked the correct near-sequons highly among falsely predicted sequons based on site-specific mutation frequency. After demonstrating the model's power with historical data, the program was used to predict future HA glycosylation sequon locations based on currently circulating human influenza viruses.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

LinkOut - more resources