Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 27;18(10):e1010559.
doi: 10.1371/journal.pcbi.1010559. eCollection 2022 Oct.

Unsupervised machine learning predicts future sexual behaviour and sexually transmitted infections among HIV-positive men who have sex with men

Affiliations

Unsupervised machine learning predicts future sexual behaviour and sexually transmitted infections among HIV-positive men who have sex with men

Sara Andresen et al. PLoS Comput Biol. .

Abstract

Machine learning is increasingly introduced into medical fields, yet there is limited evidence for its benefit over more commonly used statistical methods in epidemiological studies. We introduce an unsupervised machine learning framework for longitudinal features and evaluate it using sexual behaviour data from the last 20 years from over 3'700 participants in the Swiss HIV Cohort Study (SHCS). We use hierarchical clustering to find subgroups of men who have sex with men in the SHCS with similar sexual behaviour up to May 2017, and apply regression to test whether these clusters enhance predictions of sexual behaviour or sexually transmitted diseases (STIs) after May 2017 beyond what can be predicted with conventional parameters. We find that behavioural clusters enhance model performance according to likelihood ratio test, Akaike information criterion and area under the receiver operator characteristic curve for all outcomes studied, and according to Bayesian information criterion for five out of ten outcomes, with particularly good performance for predicting future sexual behaviour and recurrent STIs. We thus assess a methodology that can be used as an alternative means for creating exposure categories from longitudinal data in epidemiological models, and can contribute to the understanding of time-varying risk factors.

PubMed Disclaimer

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: DLB reports honoraria and travel grants outside of the submitted work from Gilead, ViiV and Merck. HFG, outside of this study, reports grants from Swiss HIV Cohort Study, grants from Swiss National Science Foundation, during the conduct of the study; grants from Swiss HIV Cohort Study, grants from Swiss National Science Foundation, grants from NIH, grants from Gilead unrestricted research grant, personal fees from Advisor/consultant for Merck, ViiV healthcare and Gilead sciences and member of DSMB for Merck, grants from Yvonne Jacob Foundation. KEAD’s institution has received research funding unrelated to this publication from Gilead and sponsorship to specialist meetings from MSD. AR reports fees for sitting on advisory boards from Merck Sharp & Dohme and Gilead Sciences; travel grants from Gilead Sciences, Pfizer, and AbbVie; and a research grant from Gilead Sciences, outside of the submitted work. All fees were paid to AR’s institution and not to AR personally.

Figures

Fig 1
Fig 1. Overview of clustering and prediction method.
nsCAI = condomless anal intercourse with non-steady partners.
Fig 2
Fig 2. Trends in nsCAI proportion in behavioural clusters.
The clusters contain the following proportions of the study population (from 0 through 4): 49%, 11%, 17%, 10% and 13%. Cluster 0 consists of participants who never reported nsCAI during the observation period (in this case mid-2001 until the cut-off date). The dashed vertical line on the 1st of May 2017 represents the cut-off date between the observation period (which corresponds to the period used to infer the clusters) and the outcome period. The shaded area shows the time in which nsCAI, nurse/physician-reported STIs and laboratory-confirmed syphilis were recorded as outcomes. nsCAI = condomless anal intercourse with non-steady partners.
Fig 3
Fig 3. Bar plots for Bayesian information criterion values.
Regression models with different combinations of predictor variables to predict nurse/physician-reported STIs (left), laboratory-confirmed syphilis (center) and nsCAI (right) after cut-off. A smaller BIC represents a better prediction. Numbers to the right of the bars represent the p value of the likelihood ratio test between the two models in question. nsCAI = condomless anal intercourse with non-steady partners. BIC = Bayesian information criterion. pLRT = p value of likelihood ratio test comparing the respective models with and without clusters.

References

    1. Roth JA, Radevski G, Marzolini C, Rauch A, Günthard HF, Kouyos RD, et al. Cohort-Derived Machine Learning Models for Individual Prediction of Chronic Kidney Disease in People Living With Human Immunodeficiency Virus: A Prospective Multicenter Cohort Study. J Infect Dis. 2020;(Xx Xxxx):1–11. - PMC - PubMed
    1. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. doi: 10.1038/s41591-018-0300-7 - DOI - PubMed
    1. Mohammed H, Blomquist P, Ogaz D, Duffell S, Furegato M, Checchi M, et al. 100 Years of STIs in the UK: A review of national surveillance data. Sex Transm Infect. 2018;94(8):553–8. doi: 10.1136/sextrans-2017-053273 - DOI - PubMed
    1. Spiteri G, Unemo M, Mårdh O, Amato-Gauci AJ. The resurgence of syphilis in high-income countries in the 2000s: A focus on Europe. Epidemiol Infect. 2019;147. doi: 10.1017/S0950268819000281 - DOI - PMC - PubMed
    1. Pradat P, Huleux T, Raffi F, Delobel P, Valantin MA, Poizot-Martin I, et al. Incidence of new hepatitis C virus infection is still increasing in French MSM living with HIV. Aids. 2018;32(8):1077–82. doi: 10.1097/QAD.0000000000001789 - DOI - PubMed

Publication types