Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Mar 14:2024.03.11.584522.
doi: 10.1101/2024.03.11.584522.

AUTO-TUNE: SELECTING THE DISTANCE THRESHOLD FOR INFERRING HIV TRANSMISSION CLUSTERS

Affiliations

AUTO-TUNE: SELECTING THE DISTANCE THRESHOLD FOR INFERRING HIV TRANSMISSION CLUSTERS

Steven Weaver et al. bioRxiv. .

Update in

Abstract

Molecular surveillance of viral pathogens and inference of transmission networks from genomic data play an increasingly important role in public health efforts, especially for HIV-1. For many methods, the genetic distance threshold used to connect sequences in the transmission network is a key parameter informing the properties of inferred networks. Using a distance threshold that is too high can result in a network with many spurious links, making it difficult to interpret. Conversely, a distance threshold that is too low can result in a network with too few links, which may not capture key insights into clusters of public health concern. Published research using the HIV-TRACE software package frequently uses the default threshold of 0.015 substitutions/site for HIV pol gene sequences, but in many cases, investigators heuristically select other threshold parameters to better capture the underlying dynamics of the epidemic they are studying. Here, we present a general heuristic scoring approach for tuning a distance threshold adaptively, which seeks to prevent the formation of giant clusters. We prioritize the ratio of the sizes of the largest and the second largest cluster, maximizing the number of clusters present in the network. We apply our scoring heuristic to outbreaks with different characteristics, such as regional or temporal variability, and demonstrate the utility of using the scoring mechanism's suggested distance threshold to identify clusters exhibiting risk factors that would have otherwise been more difficult to identify. For example, while we found that a 0.015 substitutions/site distance threshold is typical for US-like epidemics, recent outbreaks like the CRF07_BC subtype among men who have sex with men (MSM) in China have been found to have a lower optimal threshold of 0.005 to better capture the transition from injected drug use (IDU) to MSM as the primary risk factor. Alternatively, in communities surrounding Lake Victoria in Uganda, where there has been sustained hetero-sexual transmission for many years, we found that a larger distance threshold is necessary to capture a more risk factor-diverse population with sparse sampling over a longer period of time. Such identification may allow for more informed intervention action by respective public health officials.

Keywords: HIV, network; molecular epidemiology; surveillance; transmission cluster.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Statement The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Similar articles

References

    1. Abidi S. H., Aibekova L., Davlidova S., Amangeldiyeva A., Foley B., and Ali S. (2021). Origin and evolution of HIV-1 subtype A6. PLoS One, 16(12):e0260604. doi: 10.1371/journal.pone.0260604. - DOI - PMC - PubMed
    1. Bartlett S. R., Wertheim J. O., Bull R. A., Matthews G. V., Lamoury F. M., Scheffler K., Hellard M., Maher L., Dore G. J., Lloyd A. R., et al. (2017). A molecular transmission network of recent hepatitis c infection in people with and without hiv: Implications for targeted treatment strategies. Journal of viral hepatitis, 24(5):404–411. - PMC - PubMed
    1. Bbosa N., Ssemwanga D., and Kaleebu P. (2020). Short Communication: Choosing the Right Program for the Identification of HIV-1 Transmission Networks from Nucleotide Sequences Sampled from Different Populations. AIDS research and human retroviruses, 36(11):948–951. doi: 10.1089/AID.2020.0033. - DOI - PMC - PubMed
    1. Billings E., Kijak G. H., Sanders-Buell E., Ndembi N., O’Sullivan A. M., Adebajo S., Kokogho A., Milazzo M., Lombardi K., Baral S., Nowak R., Ramadhani H., Gramzinski R., Robb M. L., Michael N. L., Charurat M. E., Ake J., Crowell T. A., Tovanabutra S., and MHRP Viral Sequencing Core and the TRUST/RV368 Study Group. (2019). New subtype b containing hiv-1 circulating recombinant of sub-saharan africa origin in nigerian men who have sex with men. J Acquir Immune Defic Syndr, 81(5):578–584. doi: 10.1097/QAI.0000000000002076. - DOI - PMC - PubMed
    1. Brenner B. G., Ibanescu R.-I., Osman N., Cuadra-Foy E., Oliveira M., Chaillon A., Stephens D., Hardy I., Routy J.-P., Thomas R., Baril J.-G., Leblanc R., Tremblay C., Roger M., and The Montreal Primary Hiv Infection Phi Cohort Study Group, n. (2021). The Role of Phylogenetics in Unravelling Patterns of HIV Transmission towards Epidemic Control: The Quebec Experience (2002–2020). Viruses, 13(8):1643. doi: 10.3390/v13081643. - DOI - PMC - PubMed

Publication types