Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan 13;6(1):e14531.
doi: 10.1371/journal.pone.0014531.

Taking multiple infections of cells and recombination into account leads to small within-host effective-population-size estimates of HIV-1

Affiliations

Taking multiple infections of cells and recombination into account leads to small within-host effective-population-size estimates of HIV-1

Rajesh Balagam et al. PLoS One. .

Abstract

Whether HIV-1 evolution in infected individuals is dominated by deterministic or stochastic effects remains unclear because current estimates of the effective population size of HIV-1 in vivo, N(e), are widely varying. Models assuming HIV-1 evolution to be neutral estimate N(e)~10²-10⁴, smaller than the inverse mutation rate of HIV-1 (~10⁵), implying the predominance of stochastic forces. In contrast, a model that includes selection estimates N(e)>10⁵, suggesting that deterministic forces would hold sway. The consequent uncertainty in the nature of HIV-1 evolution compromises our ability to describe disease progression and outcomes of therapy. We perform detailed bit-string simulations of viral evolution that consider large genome lengths and incorporate the key evolutionary processes underlying the genomic diversification of HIV-1 in infected individuals, namely, mutation, multiple infections of cells, recombination, selection, and epistatic interactions between multiple loci. Our simulations describe quantitatively the evolution of HIV-1 diversity and divergence in patients. From comparisons of our simulations with patient data, we estimate N(e)~10³-10⁴, implying predominantly stochastic evolution. Interestingly, we find that N(e) and the viral generation time are correlated with the disease progression time, presenting a route to a priori prediction of disease progression in patients. Further, we show that the previous estimate of N(e)>10⁵ reduces as the frequencies of multiple infections of cells and recombination assumed increase. Our simulations with N(e)~10³-10⁴ may be employed to estimate markers of disease progression and outcomes of therapy that depend on the evolution of viral diversity and divergence.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Simulations of viral genomic diversification.
The evolution of (A) viral diversity, dG, (B) divergence, dS, and (C) average fitness, f, with generations predicted by our simulations for different population sizes, C. Each cell is assumed to be infected with M = 3 virions. Error bars represent standard deviations.
Figure 2
Figure 2. Estimation of Ne from comparisons with data from patients.
Sum of squares of the errors (SSE) between data from patients and our predictions of viral diversity, dG, and divergence, dS, for different values of the population size, C, (Fig. 1) and the viral generation time, τ, shown for each of the nine patients. C and τ that yield the lowest SSE provide the best fit to the data. The best-fit value of C yields Ne (Table 1).
Figure 3
Figure 3. Fits of our simulations to data from patients.
Best-fit predictions of our simulations (solid lines) presented with experimental data (symbols) of the evolution of viral diversity, dG, (cyan) and divergence, dS, (purple) for each patient. Each cell is assumed to be infected with M = 3 virions in our simulations. The values of Ne (cells) and τ (days) employed for the predictions are indicated.
Figure 4
Figure 4. Correlations with disease progression.
Correlation of (A) Ne and (B) τ with the disease progression time, or the time from seroconversion for the CD4+ T cell count to fall to 200 cells/µL . Symbols represent data obtained from our simulations with the frequency of multiple infections, M,  = 3 (circles) and drawn from a distribution based on a viral dynamics model (triangles) (see text). Linear fits (lines) to the data yield Pearson correlation coefficients of (A) 0.91 (circles) and 0.74 (triangles) and (B) 0.88 (circles) and 0.75 (triangles). Note that the x-axis in (A) is plotted on a logarithmic scale.
Figure 5
Figure 5. Simulations of viral genomic diversification with a low frequency of multiple infections.
The evolution of (A) viral diversity, dG, (B) divergence, dS, and (C) average fitness, f, with generations predicted by our simulations for different population sizes, C. Each cell is assumed to be infected with M virions drawn from a distribution based on a viral dynamics model (see text). Error bars represent standard deviations.
Figure 6
Figure 6. Estimation of Ne from comparisons with data from patients.
Sum of squares of the errors (SSE) between data from patients and our predictions of viral diversity, dG, and divergence, dS, for different values of the population size, C, (Fig. 5) and the viral generation time, τ, shown for each of the nine patients. C and τ that yield the lowest SSE provide the best fit to the data. The best-fit value of C yields Ne (Table 1).
Figure 7
Figure 7. Fits of our simulations to data from patients.
Best-fit predictions of our simulations (solid lines) presented with experimental data (symbols) of the evolution of viral diversity, dG, (cyan) and divergence, dS, (purple) for each patient. Each cell is assumed to be infected with M virions drawn from a distribution based on a viral dynamics model (see text). The values of Ne (cells) and τ (days) employed for the predictions are indicated.
Figure 8
Figure 8. Simulations of viral genomic diversification with a multiplicative fitness landscape and comparisons with patient data.
The evolution of viral diversity, dG, with generations predicted by our simulations (lines) for different population sizes C = 200 (solid) and 10000 (dashed) with a multiplicative fitness landscape (see text) with s = 0.01 (pink) and 0.001 (cyan). Each cell is assumed to be infected with M virions drawn from a distribution based on a viral dynamics model (see text). Different symbols are data from nine different patients shown also in Figs. 3 and 7.
Figure 9
Figure 9. Estimation of Ne using the linkage disequilibrium test.
The frequency of the least abundant haplotype in a two-locus/two-allele model determined from our simulations (solid symbols) and by Rouzine and Coffin (open symbols) as functions of the population size, C, assuming neutral evolution (purple), evolution with selection (cyan), and evolution with selection and recombination where the number of infections per cell is constant at 3 (blue), or follows a distribution determined from a viral dynamics model (see text) with ki = k 0 (green) or ki = 0.7ki -1 (orange). Error bars represent standard deviations. Values of C at which predictions from simulations match experimental estimates of the least abundant haplotype frequency (black line) yield Ne. 95% confidence limits on the experimental data are also shown (dotted line).
Figure 10
Figure 10. Fitness landscape.
The relative fitness, fi, of genomes as a function of their Hamming distances from the fittest sequence, diFL, obtained from experimental observations (symbols) modified to account for the ratio of synonymous and non-synonymous substitutions (Methods) and predicted (black line) by the equation formula image, with the best-fit parameters f min = 0.24, d 50 L = 30, and n = 3 obtained upon ignoring outliers (open symbols). Multiplicative fitness landscapes, formula image, with s = 0.001 (cyan) and 0.01 (pink) are also shown.

References

    1. Haase AT, Henry K, Zupancic M, Sedgewick G, Faust RA, et al. Quantitative image analysis of HIV-1 infection in lymphoid tissue. Science. 1996;274:985–989. - PubMed
    1. Telenti A, Goldstein DB. Genomics meets HIV-1. Nat Rev Microbiol. 2006;4:865–873. - PMC - PubMed
    1. Nijhuis M, Boucher CAB, Schipper P, Leitner T, Schuurman R, et al. Stochastic processes strongly influence HIV-1 evolution during suboptimal protease-inhibitor therapy. Proc Natl Acad Sci U S A. 1998;95:14441–14446. - PMC - PubMed
    1. Telenti A, Carrington M. Host factors associated with outcome from primary human immunodeficiency virus-1 infection. Curr Opin HIV AIDS. 2008;3:28–35. - PubMed
    1. Gonzalez E, Kulkarni H, Bolivar H, Mangano A, Sanchez R, et al. The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005;307:1434–1440. - PubMed

Publication types