Machine Learning-Based HIV Risk Estimation Using Incidence Rate Ratios
- PMID: 36304038
- PMCID: PMC9580760
- DOI: 10.3389/frph.2021.756405
Abstract
HIV/AIDS is an ongoing global pandemic, with an estimated 39 million people infected worldwide. Early detection is anticipated to help improve outcomes and prevent further infections. Point-of-care diagnostics make HIV/AIDS diagnoses available both earlier and to a broader population. Widespread and automated HIV risk estimation can offer objective guidance. This supports providers in making an informed decision when considering patients with high HIV risk for HIV testing or pre-exposure prophylaxis (PrEP). We propose a novel machine learning method that allows providers to use the data from a patient's previous stays at the clinic to estimate their HIV risk. All features available in the clinical data are considered, making the set of features objective and independent of expert opinions. The proposed method builds on association rules that are derived from the data. The incidence rate ratio (IRR) is determined for each rule. Given a new patient, the mean IRR of all applicable rules is used to estimate their HIV risk. The method was tested and validated on the publicly available clinical database MIMIC-IV, which consists of around 525,000 hospital stays that included a stay at the intensive care unit or emergency department. We evaluated the method using the area under the receiver operating characteristic curve (AUC). The best performance with an AUC of 0.88 was achieved with a model consisting of 53 rules. A threshold value of 0.66 leads to a sensitivity of 98% and a specificity of 53%. The rules were grouped into drug abuse, psychological illnesses (e.g., PTSD), previously known associations (e.g., pulmonary diseases), and new associations (e.g., certain diagnostic procedures). In conclusion, we propose a novel HIV risk estimation method that builds on existing clinical data. It incorporates a wide range of features, leading to a model that is independent of expert opinions. It supports providers in making informed decisions in the point-of-care diagnostics process by estimating a patient's HIV risk.
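The scoring step described in the abstract (one IRR per association rule, then the mean IRR of all rules that apply to a patient) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The names `Rule`, `incidence_rate_ratio`, and `risk_score`, the placeholder features, and the toy data are all hypothetical. The sketch assumes that every stay contributes one unit of observation time, so an incidence rate reduces to cases divided by stays, and that a patient with no applicable rule receives a neutral IRR of 1.0. The raw mean IRR computed here is not necessarily on the same scale as the threshold of 0.66 reported above.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: a set of features plus its precomputed IRR."""
    antecedent: frozenset
    irr: float


def incidence_rate_ratio(stays, antecedent):
    """IRR of HIV among stays matching the antecedent vs. all other stays.

    `stays` is a list of (features, hiv_positive) pairs. Assumption: each
    stay counts as one unit of observation time, so rate = cases / stays.
    """
    exposed = [pos for feats, pos in stays if antecedent <= feats]
    unexposed = [pos for feats, pos in stays if not antecedent <= feats]
    if not exposed or not unexposed:
        raise ValueError("both groups must contain at least one stay")
    rate_exposed = sum(exposed) / len(exposed)
    rate_unexposed = sum(unexposed) / len(unexposed)
    if rate_unexposed == 0:
        raise ValueError("IRR is undefined when the unexposed rate is zero")
    return rate_exposed / rate_unexposed


def risk_score(features, rules, default=1.0):
    """Mean IRR of all rules whose antecedent is contained in `features`.

    Assumption: with no applicable rule, fall back to a neutral IRR of 1.0.
    """
    applicable = [r.irr for r in rules if r.antecedent <= features]
    return mean(applicable) if applicable else default


# Toy data with placeholder feature names (not taken from MIMIC-IV).
stays = [
    ({"feature_a", "feature_b"}, True),
    ({"feature_a"}, True),
    ({"feature_a"}, False),
    ({"feature_a", "feature_c"}, False),
    ({"feature_b"}, False),
    ({"feature_c"}, True),
    ({"feature_c"}, False),
    ({"feature_c"}, False),
]

antecedents = [frozenset({"feature_a"}), frozenset({"feature_c"})]
rules = [Rule(a, incidence_rate_ratio(stays, a)) for a in antecedents]

for patient in ({"feature_a", "feature_c"}, {"feature_a"}, {"feature_b"}):
    print(sorted(patient), risk_score(patient, rules))
```

In this toy data, the rule on `feature_a` has an IRR above 1 and the rule on `feature_c` an IRR below 1, so a patient matching both receives a score between the two. Classifying patients would then require choosing a threshold on the score, which trades sensitivity against specificity as the abstract reports.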
Keywords: HIV; artificial intelligence; association rules; bias; clinical data; incidence rate ratio; machine learning; risk estimation.
Copyright © 2021 Haas, Maier and Rothgang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.