Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Feb 4;368(1614):20120208.
doi: 10.1098/rstb.2012.0208. Print 2013 Mar 19.

Modelling tree shape and structure in viral phylodynamics

Affiliations

Modelling tree shape and structure in viral phylodynamics

Simon D W Frost et al. Philos Trans R Soc Lond B Biol Sci. .

Abstract

Epidemiological models have highlighted the importance of population structure in the transmission dynamics of infectious diseases. Using HIV-1 as an example of a model evolutionary system, we consider how population structure affects the shape and the structure of a viral phylogeny in the absence of strong selection at the population level. For structured populations, the number of lineages as a function of time is insufficient to describe the shape of the phylogeny. We develop deterministic approximations for the dynamics of tips of the phylogeny over evolutionary time, the number of 'cherries', tips that share a direct common ancestor, and Sackin's index, a commonly used measure of phylogenetic imbalance or asymmetry. We employ cherries both as a measure of asymmetry of the tree as well as a measure of the association between sequences from different groups. We consider heterogeneity in infectiousness associated with different stages of HIV infection, and in contact rates between groups of individuals. In the absence of selection, we find that population structure may have relatively little impact on the overall asymmetry of a tree, especially when only a small fraction of infected individuals is sampled, but may have marked effects on how sequences from different subpopulations cluster and co-cluster.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic illustrating cherries and the calculation of Sackin's statistic for a symmetric (a) and an asymmetric (b) six-taxon cladogram.
Figure 2.
Figure 2.
(a) Dynamics of the number of infected individuals, I (black line), and ν = Iτ/2 (dashed line) over time in weeks based on equations (3.1)–(3.2), as well as estimates of ‘scaled effective population size’ obtained from applying a Bayesian skyride (grey) to simulated data generated from a forwards-time stochastic version of the model, with 100 replicates. (b) Dynamics of the mean generation time, τ. Parameter values and initial conditions are as follows: β = 0.01, c = 1, formula image, formula image, formula image, S(0) = 9999, I(0) = 1, with a simulation time of 40 years. Simulations of the differential equations were performed using the simecol library [29] in R [30], fitting of the skyline plot used the INLA library [31], while the stochastic simulations were performed using SimPy v. 1.9.1 in Python (see [3] for more details). Simulations were conditioned on reaching a quasi-equilibrium state, and registered by aligning the peaks of the simulated number of infected individuals to the peak of infected individuals from the ordinary differential equations. Code to perform simulations is available from http://code.google.com/p/simonfrost.
Figure 3.
Figure 3.
Dynamics of (a) the number of lineages, A, (b) the distribution of tip lengths, (c) the mean cluster size, M, (d) the fraction of sequences clustered, 1−P, (e) the number of cherries, C, and (f) Sackin′s index, K, for the simple model of HIV infection given by equations (3.1)–(3.2). Parameter values, initial conditions, and simulations are as in figure 2.
Figure 4.
Figure 4.
Asymmetry and clustering assuming a range of sampling fractions, from formula image (red) to formula image (violet) in steps of 0.1, for different values of the relative infectiousness of acute infection (a), and the relative contact rate in the differential risk model, assuming either (b) proportionate mixing (ρ = 0) or (c) preferential mixing (ρ = 0.9). Parameter values for the acute/chronic model are as follows: c = 1, formula image, formula image, formula image, formula image, S(0) = 9999, I1(0) = 1, I2(0) = 0. The infectivity parameters βi were constrained such that formula image and formula image, where formula image is the mean infectiousness (with formula image), di the mean duration of stage i and k the fold increase in infectiousness during acute infection. Parameter values for the differential risk model are β = 0.01, c2 = 1, formula image, formula image, formula image, formula image, S1(0) = 999, I1(0) = 1, S2(0) = 9000, I2(0) = 0. The simulation time is 30 years, with weekly timesteps, assuming 52 weeks per year.

References

    1. Pybus OG, Rambaut A. 2009. Evolutionary analysis of the dynamics of viral infectious disease. Nat. Rev. Genet. 10, 540–550 (doi:10.1038/nrg2583) - DOI - PMC - PubMed
    1. Grenfell BT, et al. 2004. Unifying the epidemiological and evolutionary dynamics of pathogens. Science 303, 327–332 (doi:10.1126/science.1090727) - DOI - PubMed
    1. Frost SDW, Volz EM. 2010. Viral phylodynamics and the search for an ‘effective number of infections’. Phil. Trans. R. Soc. B 365, 1879–1890 (doi:10.1098/rstb.2010.0060) - DOI - PMC - PubMed
    1. Hué S, Pillay D, Clewley JP, Pybus OG. 2005. Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc. Natl Acad. Sci. USA 102, 4425–4429 (doi:10.1073/pnas.0407534102) - DOI - PMC - PubMed
    1. Lewis F, Hughes GJ, Rambaut A, Pozniak A, Brown AJL. 2008. Episodic sexual transmission of HIV revealed by molecular phylodynamics. PLoS Med. 5, e50 (doi:10.1371/journal.pmed.0050050) - DOI - PMC - PubMed

Publication types

LinkOut - more resources