Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep:315:108217.
doi: 10.1016/j.mbs.2019.108217. Epub 2019 Jun 17.

Conservation region finding for influenza A viruses by machine learning methods of N-linked glycosylation sites and B-cell epitopes

Affiliations

Conservation region finding for influenza A viruses by machine learning methods of N-linked glycosylation sites and B-cell epitopes

Jone-Han Liu et al. Math Biosci. 2019 Sep.

Abstract

Influenza type A, a serious infectious disease of the human respiratory tract, poses an enormous threat to human health worldwide. It leads to high mortality rates in poultry, pigs, and humans. The primary target identity regions for the human immune system are hemagglutinin (HA) and neuraminidase (NA), two surface proteins of the influenza A virus. Research and development of vaccines is highly complex because the influenza A virus evolves rapidly. This study focused on three genetic features of viral surface proteins: ribonucleic acid (RNA) sequence conservation, linear B-cell epitopes, and N-linked glycosylation. On the basis of these three properties, we analyzed 12,832 HA and 9487 NA protein sequences, which we retrieved from the influenza virus database. We classified the viral surface protein sequences into the 18 HA and 11 NA subtypes that have been identified thus far. Using available analytic tools, we searched for the representative strain of each virus subtype. Furthermore, using machine learning methods, we looked for conservation regions with sequences showing linear B-cell epitopes and N-linked glycosylation. Compared to the prediction of the Immune Epitope Database (IEDB) antibody neutralization response (i.e., screening of antibody sequence regions), in this study, the virus sequence coverage was large and accurate and contained N-linked glycosylation sites. The results of this study proved that we can use the machine learning-based prediction method to solve the problem of vaccine invalidation that occurred during the rapid evolution of the influenza A virus and also as a prevaccine assessment. In addition, the screening fragments can be used as a universal influenza vaccine design reference in the future.

Keywords: Hemagglutinin; Linear B-cell epitope; Machine learning; N-linked glycosylation; Neuraminidase.

PubMed Disclaimer

Publication types

Substances

LinkOut - more resources