Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jul 25:11:220.
doi: 10.1186/1471-2148-11-220.

Strength and tempo of selection revealed in viral gene genealogies

Affiliations

Strength and tempo of selection revealed in viral gene genealogies

Trevor Bedford et al. BMC Evol Biol. .

Abstract

Background: RNA viruses evolve extremely quickly, allowing them to rapidly adapt to new environmental conditions. Viral pathogens, such as influenza virus, exploit this capacity for evolutionary change to persist within the human population despite substantial immune pressure. Understanding the process of adaptation in these viral systems is essential to our efforts to combat infectious disease.

Results: Through analysis of simulated populations and sequence data from influenza A (H3N2) and measles virus, we show how phylogenetic and population genetic techniques can be used to assess the strength and temporal pattern of adaptive evolution. The action of natural selection affects the shape of the genealogical tree connecting members of an evolving population, causing deviations from the neutral expectation. The magnitude and distribution of these deviations lends insight into the historical pattern of evolution and adaptation in the viral population. We quantify the degree of ongoing adaptation in influenza and measles virus through comparison of census population size and effective population size inferred from genealogical patterns, finding a 60-fold greater deviation in influenza than in measles. We also examine the tempo of adaptation in influenza, finding evidence for both continuous and episodic change.

Conclusions: Our results have important consequences for understanding the epidemiological and evolutionary dynamics of the influenza virus. Additionally, these general techniques may prove useful to assess the strength and pattern of adaptive evolution in a variety of evolving systems. They are especially powerful when assessing selection in fast-evolving populations, where temporal patterns become highly visible.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Genealogical trees derived from hemagglutinin sequences of influenza A (H3N2) and measles virus. The influenza tree, sampled between 1968 and 2008, appears long and spindly with a distinct lack of deep branches. It only takes a few years for contemporaneous strains to find a common ancestor. The measles tree, sampled between 1979 and 2009, looks very different, harboring many deep branches. It takes approximately 50 years for contemporaneous measles strains to find a common ancestor.
Figure 2
Figure 2
Genealogical trees derived from simulated sequences at varying population sizes. In each case, sequence evolution was simulated using a neutral Wright-Fisher model, and approximately 200 samples were taken over the course of 10,000 generations (sampled at a Poisson rate of 0.02 per generation). In each frame, the x-axis is shown as intervals of 5000 generations, so that the temporal scale increases with N. As population size N increases, the time it takes for two lineages to coalesce increases proportionally, so that larger populations show a deeper most recent common ancestor. Population dynamics result in rapid coalescence when there are many contemporaneous lineages and slower coalescence when there are few, a detail especially apparent at larger population sizes.
Figure 3
Figure 3
Genealogical tree and skyline plots of coalescent statistics for sequences simulated under selective neutrality. There were 10,000 individuals in the simulation. The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis of the simulated data. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of generations.
Figure 4
Figure 4
Genealogical tree and skyline plots of coalescent statistics for sequences simulated under purifying selection. There were 10,000 individuals in the simulation, and a mutation rate μ of 10-5 per site per individual per generation with each mutation having a 99% probability of being deleterious with a selective disadvantage of 0.01. The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis of the simulated data. Colors move from red to purple as fitness decreases. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of generations.
Figure 5
Figure 5
Genealogical tree and skyline plots of coalescent statistics for sequences simulated under constant positive selection. There were 10,000 individuals in the simulation, and an advantageous mutation rate μa of 0.002 per individual per generation with each mutation harboring a selective advantage s of 0.01. The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis of the simulated data. Colors move from purple to red as fitness increases. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of generations.
Figure 6
Figure 6
Genealogical tree and skyline plots of coalescent statistics for sequences simulated under episodic positive selection. There were 10,000 individuals in the simulation, and an advantageous mutation rate μa of 8 × 10-7 per individual per generation with each mutation harboring a selective advantage s of 0.1. The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis of the simulated data. Colors move from purple to red as fitness increases. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of generations.
Figure 7
Figure 7
Relationship between population size N and genetic diversity at varying intensities of selection. Nucleotide diversity represents mean pairwise proportion of sequence differences between random individuals in the population. Diversity estimates are taken from 105 generations across 3 replicate simulations. Neutral dynamics are shown as a dashed line. Selective dynamics are shown as solid lines. In each case, 1% of mutations are selectively advantageous, with a selective advantage s of 0.001, 0.01 or 0.1.
Figure 8
Figure 8
Genealogical tree and skyline plots of coalescent statistics for 1270 world-wide samples of HA sequences of influenza A (H3N2). The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis. Vaccine strains are shown on the tree as colored points. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of years.
Figure 9
Figure 9
Genealogical tree and skyline plots of coalescent statistics for 624 China and Southeast Asia samples of HA sequences of influenza A (H3N2). The tree shown represents the maximum posterior tree sampled over the course of the MCMC analysis. Vaccine strains are shown on the tree as colored points and included regardless of geographic origin. For coalescent statistics, TMRCA, diversity and Neτ, solid lines represent mean values and gray outlines represent 95% credible intervals across MCMC replicates. TMRCA, diversity and Neτ are measured in units of years.

Similar articles

Cited by

References

    1. Duffy S, Shackelton LA, Holmes EC. Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet. 2008;9(4):267–76. - PubMed
    1. Cox N, Subbarao K. Global epidemiology of influenza: past and present. Annu Rev Med. 2000;51:407–421. doi: 10.1146/annurev.med.51.1.407. - DOI - PubMed
    1. Hay AJ, Gregory V, Douglas AR, Lin YP. The evolution of human influenza viruses. Phil Trans R Soc Lond B. 2001;356(1416):1861–1870. - PMC - PubMed
    1. Fitch WM, Bush RM, Bender CA, Cox NJ. Long term trends in the evolution of H(3) HA1 human influenza type A. Proc Natl Acad Sci USA. 1997;94(15):7712–8. doi: 10.1073/pnas.94.15.7712. - DOI - PMC - PubMed
    1. Bush R, Fitch W, Bender C, Cox N. Positive selection on the H3 hemagglutinin gene of human influenza virus A. Mol Biol Evol. 1999;16(11):1457–1465. - PubMed

Publication types

LinkOut - more resources