Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 12;12(9):1019.
doi: 10.3390/v12091019.

Convolutional Neural Network Based Approach to in Silico Non-Anticipating Prediction of Antigenic Distance for Influenza Virus

Affiliations

Convolutional Neural Network Based Approach to in Silico Non-Anticipating Prediction of Antigenic Distance for Influenza Virus

Majid Forghani et al. Viruses. .

Abstract

Evaluation of the antigenic similarity degree between the strains of the influenza virus is highly important for vaccine production. The conventional method used to measure such a degree is related to performing the immunological assays of hemagglutinin inhibition. Namely, the antigenic distance between two strains is calculated on the basis of HI assays. Usually, such distances are visualized by using some kind of antigenic cartography method. The known drawback of the HI assay is that it is rather time-consuming and expensive. In this paper, we propose a novel approach for antigenic distance approximation based on deep learning in the feature spaces induced by hemagglutinin protein sequences and Convolutional Neural Networks (CNNs). To apply a CNN to compare the protein sequences, we utilize the encoding based on the physical and chemical characteristics of amino acids. By varying (hyper)parameters of the CNN architecture design, we find the most robust network. Further, we provide insight into the relationship between approximated antigenic distance and antigenicity by evaluating the network on the HI assay database for the H1N1 subtype. The results indicate that the best-trained network gives a high-precision approximation for the ground-truth antigenic distances, and can be used as a good exploratory tool in practical tasks.

Keywords: antigenic distance; convolutional neural network; evolution; influenza; vaccine.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
General diagram of the proposed research.
Figure 2
Figure 2
Each HI assay entry includes identifiers of the test and reference viruses, date of experiment, and the measured titer.
Figure 3
Figure 3
An example of the AAindex1 entry representing the hydrophobicity index. The values assigned to amino acids are highlighted in pink.
Figure 4
Figure 4
Variance ratios explained by the first 11 factors obtained with application the PCA to AAindex1 database. Total explained variance is about 91%.
Figure 5
Figure 5
The network input tensor represents the HA1 amino acid sequence of test and reference viruses encoded by 11 synthetic indices from AAindex1.
Figure 6
Figure 6
The layers used in the examined networks.
Figure 7
Figure 7
Common architecture of the tested networks, M1-M32.
Figure 8
Figure 8
Architecture of SqueezeNet used as a baseline network.
Figure 9
Figure 9
Experiment Tall, where all models were trained on the unrestricted prehistory.
Figure 10
Figure 10
Experiment T5: the models were trained on a five-year prehistory.
Figure 11
Figure 11
Experiment T4: the models were trained on a four-year prehistory.
Figure 12
Figure 12
Experiment T3: the models were trained on a three-year prehistory.
Figure 13
Figure 13
The results of 10-fold cross-validation for models M23 and SqueezeNet.
Figure 14
Figure 14
Titer: linear regression.
Figure 15
Figure 15
Virus: linear regression.

References

    1. Agor J.K., Özaltın O.Y. Models for predicting the evolution of influenza to inform vaccine strain selection. Hum. Vaccines Immunother. 2018;14:678–683. doi: 10.1080/21645515.2017.1423152. - DOI - PMC - PubMed
    1. Neher R.A., Bedford T. Nextflu: Real-time tracking of seasonal influenza virus evolution in humans. Bioinformatics. 2015;31:3546–3548. doi: 10.1093/bioinformatics/btv381. - DOI - PMC - PubMed
    1. Yang H., Carney P.J., Chang J.C., Guo Z., Villanueva J.M., Stevens J. Structure and receptor-binding preferences of recombinant human A (H3N2) virus hemagglutinins. Virology. 2015;477:18–31. doi: 10.1016/j.virol.2014.12.024. - DOI - PMC - PubMed
    1. Drake J.W., Holland J.J. Mutation rates among RNA viruses. Proc. Natl. Acad. Sci. USA. 1999;96:13910–13913. doi: 10.1073/pnas.96.24.13910. - DOI - PMC - PubMed
    1. Barnett J.L., Yang J., Cai Z., Zhang T., Wan X.F. AntigenMap 3D: An online antigenic cartography resource. Bioinformatics. 2012;28:1292–1293. doi: 10.1093/bioinformatics/bts105. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources