Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;8(6):e1002552.
doi: 10.1371/journal.pcbi.1002552. Epub 2012 Jun 28.

Simple epidemiological dynamics explain phylogenetic clustering of HIV from patients with recent infection

Affiliations

Simple epidemiological dynamics explain phylogenetic clustering of HIV from patients with recent infection

Erik M Volz et al. PLoS Comput Biol. 2012.

Abstract

Phylogenies of highly genetically variable viruses such as HIV-1 are potentially informative of epidemiological dynamics. Several studies have demonstrated the presence of clusters of highly related HIV-1 sequences, particularly among recently HIV-infected individuals, which have been used to argue for a high transmission rate during acute infection. Using a large set of HIV-1 subtype B pol sequences collected from men who have sex with men, we demonstrate that virus from recent infections tend to be phylogenetically clustered at a greater rate than virus from patients with chronic infection ('excess clustering') and also tend to cluster with other recent HIV infections rather than chronic, established infections ('excess co-clustering'), consistent with previous reports. To determine the role that a higher infectivity during acute infection may play in excess clustering and co-clustering, we developed a simple model of HIV infection that incorporates an early period of intensified transmission, and explicitly considers the dynamics of phylogenetic clusters alongside the dynamics of acute and chronic infected cases. We explored the potential for clustering statistics to be used for inference of acute stage transmission rates and found that no single statistic explains very much variance in parameters controlling acute stage transmission rates. We demonstrate that high transmission rates during the acute stage is not the main cause of excess clustering of virus from patients with early/acute infection compared to chronic infection, which may simply reflect the shorter time since transmission in acute infection. Higher transmission during acute infection can result in excess co-clustering of sequences, while the extent of clustering observed is most sensitive to the fraction of infections sampled.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A simple gene genealogy that could be generated by the HIV model.
Dark branches with taxa labeled A correspond to stage-1 (early/acute infected hosts). Light branches with taxa labeled C correspond to stage-2 (chronic infections). Event 1 represents the coalescence of two lineages corresponding to early/acute infection. Event 2 represents coalescence of an early and a late infection. Event 3 represents the stage transition of an early infection to a late infection. Event 4 represents the transmission by a late infection which is not ancestral to the sample. Top: Includes an unsampled lineage (dashed). Middle: The unsampled lineage has been pruned from the tree. The point where the lineage is pruned corresponds to event 4. Bottom: The number of lineages as a function of time (NLFT) which correspond to a host with early/acute infection (black) or chronic infection (grey).
Figure 2
Figure 2. Excess clustering and excess co-clustering of virus from patients with early/acute infections.
Left: The mean CD4 cell count (top) and frequency of ambiguous sites (bottom) versus the threshold TMRCA used to form clusters. Middle: The assortativity coefficient, a measure of similarity of co-clustered taxa, versus the treshold TMRCA used to form clusters. Assortativity of CD4 is at top, and frequency of ambiguous sites is bottom. Right: The size of each matrix element is proportional to number of co-clusterings between taxa categorized by CD4 (top, formula image) or quartile of frequency of ambiguous sites (bottom). The color represents the extent to which the count of co-clusterings exceeds the expectation if clusters were forming at random. The color scale (far right) shows strong assortativity within quartiles. The vertical red bar represents the threshold which was used to create clusters and the matrix derived from the set of clusters. This threshold corresponds to the maximum of the assortativity coefficient for the derived matrix.
Figure 3
Figure 3. Two simulated epidemics and the deterministic approximations for the prevalent number of early and late infections and the ancestor functions (the number of lineages over time).
The x-axis gives the time since the beginning of the epidemic, or equivalently, the threshold TMRCA used to calculate the number of lineages over time. Green describes the simulated number of late infections. Blue describes the simulated number of early infections. Dots show the simulated ancestor function for the number of lineages that correspond to late infections. And x's show the simulated ancestor function for lineages in early infection. Dashed lines show the prediction of the deterministic coalescent. The top row shows results for a sample taken at 15 years following the initial infections, and the bottom shows results for a sample at 30 years. The right column shows results for a sample fractions of 20%, and the left column for a census of prevalent infections(100%).
Figure 4
Figure 4. Summary statistics from HIV gene genealogies versus the fraction of transmissions attributable to early/acute infection.
The threshold TMRCA was five years before the most recent sample. Sample size and distribution of samples over time was matched to the Detroit MSM phylogeny.

Similar articles

Cited by

References

    1. Bruisten S, Schouls L. Molecular typing and clustering analysis as a tool for epidemiology of infectious diseases. Modern Infectious Disease Epidemiology: Concepts, Methods, Mathematical Models, and Public Health. 2010:117.
    1. Pybus O, Rambaut A. Evolutionary analysis of the dynamics of viral infectious disease. Nat Rev Genet. 2009;10:540–50. - PMC - PubMed
    1. Grenfell B, Pybus O, Gog J, Wood J, Daly J, et al. Unifying the epidemiological and evolutionary dynamics of pathogens. Science. 2004;303:327. - PubMed
    1. Zhu T, Mo H, Wang N, Nam D, Cao Y, et al. Genotypic and phenotypic characterization of HIV-1 patients with primary infection. Science. 1993;261:1179. - PubMed
    1. Li H, Bar K, Wang S, Decker J, Chen Y, et al. High multiplicity infection by HIV-1 in men who have sex with men. PLoS Pathog. 2010;6:e1000890. - PMC - PubMed

Publication types