Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Nov 7;8(64):1644-53.
doi: 10.1098/rsif.2011.0105. Epub 2011 May 4.

Quantifying selection and diversity in viruses by entropy methods, with application to the haemagglutinin of H3N2 influenza

Affiliations

Quantifying selection and diversity in viruses by entropy methods, with application to the haemagglutinin of H3N2 influenza

Keyao Pan et al. J R Soc Interface. .

Abstract

Many viruses evolve rapidly. For example, haemagglutinin (HA) of the H3N2 influenza A virus evolves to escape antibody binding. This evolution of the H3N2 virus means that people who have previously been exposed to an influenza strain may be infected by a newly emerged virus. In this paper, we use Shannon entropy and relative entropy to measure the diversity and selection pressure by an antibody in each amino acid site of H3 HA between the 1992-1993 season and the 2009-2010 season. Shannon entropy and relative entropy are two independent state variables that we use to characterize H3N2 evolution. The entropy method estimates future H3N2 evolution and migration using currently available H3 HA sequences. First, we show that the rate of evolution increases with the virus diversity in the current season. The Shannon entropy of the sequence in the current season predicts relative entropy between sequences in the current season and those in the next season. Second, a global migration pattern of H3N2 is assembled by comparing the relative entropy flows of sequences sampled in China, Japan, the USA and Europe. We verify this entropy method by describing two aspects of historical H3N2 evolution. First, we identify 54 amino acid sites in HA that have evolved in the past to evade the immune system. Second, the entropy method shows that epitopes A and B on the top of HA evolve most vigorously to escape antibody binding. Our work provides a novel entropy-based method to predict and quantify future H3N2 evolution and to describe the evolutionary history of H3N2.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The tertiary structure of the HA1 domain of H3 HA (PDB code: 1HGF). The surface of HA1 facing outward is the exposed surface when the HA trimer is formed. The other two HA1 domains (not shown) in the HA trimer are located at the back of the structure displayed here. The solid balls represent five epitopes. Colour code: blue, epitope A; red, epitope B; cyan, epitope C; yellow, epitope D; green, epitope E.
Figure 2.
Figure 2.
Mean and standard error of relative entropy Si +1,j in each bin of Shannon entropy. Shannon entropy and relative entropy in each of the 329 positions and in each of the 17 seasons between 1992–1993 (i = 0) and 2008–2009 (i = 16) fall into one of the eight bins. The first bin with Shannon entropy less than 0.1 is discarded. Bins with larger Shannon entropy Di,j also have larger relative entropy Si +1,j . Shannon entropy Di,j and relative entropy Si +1,j in iteration i = 51–100 of the neutral evolution model (crosses) are used to calculate mean and standard error of relative entropy in each bin of Shannon entropy distribution in the same way. No increasing trend is found. Error bar is one standard error. Open squares represent H3 data.
Figure 3.
Figure 3.
Average Shannon entropy 〈Di versus average relative entropy 〈Si +1 for each season between 1992–1993 (i = 0) and 2008–2009 (i = 16). For each season i, a set of amino acid positions j with Shannon entropy Di,j greater than 0.1 are chosen. For all the j in this set of positions, 〈Di is the average of the Shannon entropy Di,j values and 〈Si +1 is the average of relative entropy Si +1,j values. Horizontal and vertical error bars are the standard errors of Shannon entropy and relative entropy, respectively. The solid line, 〈Si +1 = 1.82〈Di−0.23, is a least squares fit of 〈Di to 〈Si +1 (i = 0,2, …,16). A strong correlation with R2 = 0.50 exists between 〈Di and 〈Si +1 excluding the point (0.22,1.38) with Ni = 1, which has a large standard error of the relative entropy Si +1,j. Using the same method, 〈Di and 〈Si +1 are calculated from a neutral evolution model, i = 51–100, and plotted. No visible correlation exists between 〈Di and 〈Si +1 from the neutral evolution model (crosses). Open squares represent H3 data.
Figure 4.
Figure 4.
(a) Average selection in each position quantified by relative entropy during the past 17 seasons from 1993–1994 to 2009–2010, calculated by formula image. The colours represent positions in epitopes A to E and positions outside the epitopes, as in figure 1. (b) Number of seasons for each position when the relative entropy was greater than the threshold Sithres, i.e. the position was under selection. (c) Average diversity in each position quantified by Shannon entropy in the seasons from 1993–1994 to 2009–2010, calculated by formula image. (d) Distribution of the average selection in each position displayed in (a). (e) Distribution of the numbers of seasons under selection displayed in (b). (f) Distribution of the average diversity in each position shown in (c).

Similar articles

Cited by

References

    1. Li Y., Carroll D. S., Gardner S. N., Walsh M. C., Vitalis E. A., Damon I. K. 2007. On the origin of smallpox: correlating variola phylogenics with historical smallpox records. Proc. Natl Acad. Sci. USA 104, 15 787–15 79210.1073/pnas.0609268104 (doi:10.1073/pnas.0609268104) - DOI - DOI - PMC - PubMed
    1. World Health Organization Media Centre influenza fact sheet 211. 2009 See http://www.who.int/mediacentre/factsheets/fs211/en/index.html .
    1. Wiley D. C., Wilson I. A., Skehel J. J. 1981. Structural identification of the antibody-binding sites of Hong Kong influenza haemagglutinin and their involvement in antigenic variation. Nature 289, 373–37810.1038/289373a0 (doi:10.1038/289373a0) - DOI - DOI - PubMed
    1. Nobusawa E., Sato K. 2006. Comparison of the mutation rates of human influenza A and B viruses. J. Virol. 80, 3675–367810.1128/JVI.80.7.3675-3678.2006 (doi:10.1128/JVI.80.7.3675-3678.2006) - DOI - DOI - PMC - PubMed
    1. Parvin J. D., Moscona A., Pan W. T., Leider J. M., Palese P. 1986. Measurement of the mutation rates of animal viruses: influenza A virus and poliovirus type 1. J. Virol. 59, 377–383 - PMC - PubMed

Publication types

LinkOut - more resources