Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 24;10(8):e0135469.
doi: 10.1371/journal.pone.0135469. eCollection 2015.

Viral Genetic Linkage Analysis in the Presence of Missing Data

Affiliations

Viral Genetic Linkage Analysis in the Presence of Missing Data

Shelley H Liu et al. PLoS One. .

Abstract

Analyses of viral genetic linkage can provide insight into HIV transmission dynamics and the impact of prevention interventions. For example, such analyses have the potential to determine whether recently-infected individuals have acquired viruses circulating within or outside a given community. In addition, they have the potential to identify characteristics of chronically infected individuals that make their viruses likely to cluster with others circulating within a community. Such clustering can be related to the potential of such individuals to contribute to the spread of the virus, either directly through transmission to their partners or indirectly through further spread of HIV from those partners. Assessment of the extent to which individual (incident or prevalent) viruses are clustered within a community will be biased if only a subset of subjects are observed, especially if that subset is not representative of the entire HIV infected population. To address this concern, we develop a multiple imputation framework in which missing sequences are imputed based on a model for the diversification of viral genomes. The imputation method decreases the bias in clustering that arises from informative missingness. Data from a household survey conducted in a village in Botswana are used to illustrate these methods. We demonstrate that the multiple imputation approach reduces bias in the overall proportion of clustering due to the presence of missing observations.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Schematic of the sequence-imputation method.
Fig 2
Fig 2. Decrease in overall proportion of clustering with deletion of sequences.
Clustering at each number of deletions is averaged over 100 different random deletions of the same number of sequences.
Fig 3
Fig 3. Demonstrating the corrective effect of a representative imputation.
Deleting 100 sequences leads to noticeably lower clustering. Imputation substantially improves estimates, and results in slightly larger error bars.
Fig 4
Fig 4. Estimating the true proportion of clustering in the Mochudi population.
Treating the observed Mochudi data (N = 371) as a biased sample from the population, and imputing the imputed-population dataset (N = 504) based on Botswana and Mochudi-specific age and gender breakdowns to create a database with the same demographic structure as Botswana as a whole. Point estimates and error bars for the 0.10 and 0.15 thresholds of clustering.

References

    1. DeGruttola E, Smith D, Little S, Miller V. Developing and Evaluating Comprehensive HIV Infection Control Strategies: Issues and Challenges. Clin Infect Dis. 2010;Suppl 3(50):S102–7. 10.1086/651480 - DOI - PMC - PubMed
    1. Little SJ, Kosakovsky Pond SL, Anderson CM, Young JA, Wertheim JO, Mehta SR, et al. Using HIV networks to inform real time prevention interventions. PLoS One. 2014;9(6):e98443 Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4047027/pdf/pone.0098443.pdf 10.1371/journal.pone.0098443 - DOI - PMC - PubMed
    1. Vrancken B, Rambaut A, Suchard MA, Drummond A, Baele G, Derdelinckx I, et al. The genealogical population dynamics of HIV-1 in a large transmission chain: bridging within and among host evolutionary rates. PLoS Comput Biol. 2014;10(4):e1003505 Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3974631/pdf/pcbi.1003505.pdf 10.1371/journal.pcbi.1003505 - DOI - PMC - PubMed
    1. Volz EM, Ionides E, Romero-Severson EO, Brandt MG, Mokotoff E, Koopman JS. HIV-1 transmission during early infection in men who have sex with men: a phylodynamic analysis. PLoS Med. 2013;10(12):e1001568; discussion e1001568. Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3858227/pdf/pmed.1001568.pdf 10.1371/journal.pmed.1001568 - DOI - PMC - PubMed
    1. Volz EM, Koelle K, Bedford T. Viral phylodynamics. PLoS Comput Biol. 2013;9(3):e1002947 Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3605911/pdf/pcbi.1002947.pdf 10.1371/journal.pcbi.1002947 - DOI - PMC - PubMed

Publication types

LinkOut - more resources