Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 27;9(1):vead037.
doi: 10.1093/ve/vead037. eCollection 2023.

Mutual information networks reveal evolutionary relationships within the influenza A virus polymerase

Affiliations

Mutual information networks reveal evolutionary relationships within the influenza A virus polymerase

Sarah Arcos et al. Virus Evol. .

Abstract

The influenza A virus (IAV) RNA polymerase is an essential driver of IAV evolution. Mutations that the polymerase introduces into viral genome segments during replication are the ultimate source of genetic variation, including within the three subunits of the IAV polymerase (polymerase basic protein 2, polymerase basic protein 1, and polymerase acidic protein). Evolutionary analysis of the IAV polymerase is complicated, because changes in mutation rate, replication speed, and drug resistance involve epistatic interactions among its subunits. In order to study the evolution of the human seasonal H3N2 polymerase since the 1968 pandemic, we identified pairwise evolutionary relationships among ∼7000 H3N2 polymerase sequences using mutual information (MI), which measures the information gained about the identity of one residue when a second residue is known. To account for uneven sampling of viral sequences over time, we developed a weighted MI (wMI) metric and demonstrate that wMI outperforms raw MI through simulations using a well-sampled severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) dataset. We then constructed wMI networks of the H3N2 polymerase to extend the inherently pairwise wMI statistic to encompass relationships among larger groups of residues. We included hemagglutinin (HA) in the wMI network to distinguish between functional wMI relationships within the polymerase and those potentially due to hitch-hiking on antigenic changes in HA. The wMI networks reveal coevolutionary relationships among residues with roles in replication and encapsidation. Inclusion of HA highlighted polymerase-only subgraphs containing residues with roles in the enzymatic functions of the polymerase and host adaptability. This work provides insight into the factors that drive and constrain the rapid evolution of influenza viruses.

Keywords: Influenza Avirus; RNA-dependent RNA polymerase; information theory; virus evolution.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Uneven sampling of H3N2 polymerase sequences over time influences Shannon entropy and MI. (A) The distribution of complete H3N2 polymerase sequences on GISAID per year between 1968 and 2015. (B) Upper panels, Sliding window analysis of MI for residue pairs PA-350/PB1-469 and PB2-590/PB1-709. Sliding windows were constructed with a width of 5 years and a slide length of 1 year. Lower panels, Plots of the frequency of amino acids for each residue over the period 1968–2015.
Figure 2.
Figure 2.
Re-weighting of amino acid frequencies improves MI estimates for unevenly sampled data. (A) The distribution of SARS-CoV-2 Spike RBD sequences generated by our laboratory per month between 1 May 2021 and 30 April 2021 from Washtenaw County, MI. The line shows the number of confirmed Coronavirus Disease-19 (COVID-19) cases in Washtenaw County, MI over the same time period. (B) Representative distribution of sampled Spike RBD sequences used to simulate the uneven sampling of H3N2 polymerase sequences (see Fig. 1A). (C) The distribution of Spearman correlation coefficients between the MI from the original Spike RBD dataset and the unweighted, equal-weighted, or incidence-wMI of 100 sampled datasets.
Figure 3.
Figure 3.
Coevolving residues with PB2-627. (A) Residues that coevolve with PB2-627 shown highlighted on the replicating–encapsidating dimer conformation of the influenza C polymerase (Protein Data Bank (PDB) ID 6XZR) (Carrique et al. 2020). (B) Domain organization of the IAV polymerase with coevolving residues indicated.
Figure 4.
Figure 4.
wMI network of the H3N2 polymerase (PB2, PB1, and PA). Nodes represent residues, and edges represent the normalized wMI between residues. An edge threshold was set at the normalized wMI score (58) that minimizes relative maximum subgraph size (see Fig. S3). The network visualization was created using the associationSubgraphs package for R (Strayer et al. 2023).
Figure 5.
Figure 5.
Features of the residues in Subgraphs 1 and 2 from the wMI network of the H3N2 polymerase. (A) Amino acid frequencies from 1968 to 2015 for the residues within Subgraph 1 (left) or 2 (right). Location of the residues in Subgraphs 1 (B) and 2 (C) plotted on the replicating–encapsidating dimer conformation of the influenza C polymerase (PDB ID: 6XZR) (Carrique et al. 2020).
Figure 6.
Figure 6.
wMI network of the H3N2 polymerase (PB2, PB1, and PA) and HA. Nodes represent residues, and edges represent the normalized wMI between residues. Residue nodes are colored as in Fig. 4. HA residues that are located in antigenic regions A–E are shown in bold. Residue −6 (signal peptide sequence, (SS)) is in the cleaved N-terminal signal sequence of HA. An edge threshold was set at the normalized wMI score (40.506) that minimizes relative maximum subgraph size (see Fig. S5). The network visualization was created using the associationSubgraphs package for R (Strayer et al. 2023).
Figure 7.
Figure 7.
Features of the residues in Subgraphs 4, 8, and 10 from the wMI network of the H3N2 polymerase and HA. (A) Amino acid frequencies from 1968 to 2015 for the residues within Subgraph 4 (top left), 8 (bottom left), or 10 (right). Location of the residues in Subgraph 4 (B), 8 (C), or 10 (D) plotted on the post-cap-snatching conformation of the H3N2 polymerase (PDB ID: 6RR7) (Fan et al. 2019).

Update of

Similar articles

Cited by

References

    1. Ackerman S. H., Tillier E. R., and Gatti D. L. (2012) ‘Accurate Simulation and Detection of Coevolution Signals in Multiple Sequence Alignments’, PloS One, 7: e47108. - PMC - PubMed
    1. Bhatt S., Holmes E. C., and Pybus O. G. (2011) ‘The Genomic Rate of Molecular Adaptation of the Human Influenza A Virus’, Molecular Biology and Evolution, 28: 2443–51. - PMC - PubMed
    1. Bloom J. D., Gong L. I., and Baltimore D. (2010) ‘Permissive Secondary Mutations Enable the Evolution of Influenza Oseltamivir Resistance’, Science, 328: 1272–5. - PMC - PubMed
    1. Carrique L. et al. (2020) ‘Host ANP32A Mediates the Assembly of the Influenza Virus Replicase’, Nature, 587: 638–43. - PMC - PubMed
    1. Chang W. et al. (2022), Shiny: Web Applicatoin Framework for R. <https://shiny.rstudio.com/> accessed 1 Jan 2021.