Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jan;10(1):e1003430.
doi: 10.1371/journal.pcbi.1003430. Epub 2014 Jan 9.

Linkage of viral sequences among HIV-infected village residents in Botswana: estimation of linkage rates in the presence of missing data

Affiliations

Linkage of viral sequences among HIV-infected village residents in Botswana: estimation of linkage rates in the presence of missing data

Nicole Bohme Carnegie et al. PLoS Comput Biol. 2014 Jan.

Abstract

Linkage analysis is useful in investigating disease transmission dynamics and the effect of interventions on them, but estimates of probabilities of linkage between infected people from observed data can be biased downward when missingness is informative. We investigate variation in the rates at which subjects' viral genotypes link across groups defined by viral load (low/high) and antiretroviral treatment (ART) status using blood samples from household surveys in the Northeast sector of Mochudi, Botswana. The probability of obtaining a sequence from a sample varies with viral load; samples with low viral load are harder to amplify. Pairwise genetic distances were estimated from aligned nucleotide sequences of HIV-1C env gp120. It is first shown that the probability that randomly selected sequences are linked can be estimated consistently from observed data. This is then used to develop estimates of the probability that a sequence from one group links to at least one sequence from another group under the assumption of independence across pairs. Furthermore, a resampling approach is developed that accounts for the presence of correlation across pairs, with diagnostics for assessing the reliability of the method. Sequences were obtained for 65% of subjects with high viral load (HVL, n = 117), 54% of subjects with low viral load but not on ART (LVL, n = 180), and 45% of subjects on ART (ART, n = 126). The probability of linkage between two individuals is highest if both have HVL, and lowest if one has LVL and the other has LVL or is on ART. Linkage across groups is high for HVL and lower for LVL and ART. Adjustment for missing data increases the group-wise linkage rates by 40-100%, and changes the relative rates between groups. Bias in inferences regarding HIV viral linkage that arise from differential ability to genotype samples can be reduced by appropriate methods for accommodating missing data.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Average relative bias of adjusted (red) and unadjusted (black) estimators of group-wise linkage rates over general time-reversible evolutionary model simulations.
Figure 2
Figure 2. Estimates of individual-to-individual probability of linkage by groups, , for the full Mochudi data.
Figure 3
Figure 3. Estimates of conditional probability of linkage by groups, , for a 70% sample of the full Mochudi data.
Red line represents the “truth” as observed in the full data.
Figure 4
Figure 4. Estimates of conditional probability of linkage by groups, , for the full Mochudi data.
Figure 5
Figure 5. Realized values of for subsamples of 5 to 95% of Mochudi data.
Red line indicates the observed value of formula image for the full data.
Figure 6
Figure 6. Distribution of unadjusted and bootstrap-adjusted estimators of group-wise linkage probabilities, using proportionate subsampling for a 70% sample of Mochudi data.
Figure 7
Figure 7. Distribution of unadjusted and bootstrap-adjusted estimators of group-wise linkage probabilities, using proportionate subsampling for a 30% sample of Mochudi data.
Figure 8
Figure 8. Diagnostic plots of subsample correlations by group, .
Figure 9
Figure 9. Point estimates and 95% confidence intervals for unadjusted and bootstrap-adjusted estimates of groupwise linkage rates in Mochudi.
The distance cutoff defining linkage between sequences ranges from 0.085 to 0.12.

References

    1. Yerly S, Vora S, Rizzardi P, Chave JP, Vernazza PL, et al. (2001) Acute HIV infection: impact on the spread of HIV and transmission of drug resistance. AIDS 15: 2287–2292. - PubMed
    1. Brenner BG, Roger M, Moisi DD, Oliveira M, Hardy I, et al. (2008) Transmission networks of drug resistance acquired in primary/early stage HIV infection. AIDS 22: 2509–2515. - PMC - PubMed
    1. Hughes GJ, Fearnhill E, Dunn D, Lycett SJ, Rambaut A, et al. (2009) Molecular phylodynamics of the heterosexual HIV epidemic in the United Kingdom. PLoS Pathogens 5: e1000590. - PMC - PubMed
    1. Smith DM, May S, Tweeten S, Drumright L, Pacold ME, et al. (2009) A public health model for the molecular surveillance of HIV transmission in San Diego, California. AIDS 23: 225–232. - PMC - PubMed
    1. Aldous JL, Kosakovsky Pond S, Poon A, Jain S, Qin H, et al. (2012) Characterizing HIV trans- mission networks across the United States. Clinical Infectious Diseases 55: 1135–1143. - PMC - PubMed

Publication types

Substances