Development of a machine learning prediction model for loss to follow-up in HIV care using routine electronic medical records in a low-resource setting
- PMID: 40389908
- PMCID: PMC12090508
- DOI: 10.1186/s12911-025-03030-7
Development of a machine learning prediction model for loss to follow-up in HIV care using routine electronic medical records in a low-resource setting
Abstract
Background: Despite the global commitment to ending AIDS by 2030, the loss of follow-up (LTFU) in HIV care remains a significant challenge. To address this issue, a data-driven clinical decision tool is crucial for identifying patients at greater risk of LTFU and facilitating personalized and proactive interventions. This study aimed to develop a prediction model to assess the future risk of LTFU in HIV care in Ethiopia.
Methods: The study used a retrospective design in which machine learning (ML) methods were applied to the electronic medical records (EMRs) data of adult HIV-positive individuals who were newly enrolled in antiretroviral therapy between July 2019 and April 2024. The data were collected across eight randomly selected high-volume healthcare facilities. Six supervised ML classifiers-J48 decision tree, random forest, K-nearest neighbors, support vector machine, logistic regression, and naïve Bayes-were utilized for training via Weka 3.8.6 software. The performance of each algorithm was evaluated through a 10-fold cross-validation approach. Algorithm performance was compared via the corrected resampled t test (p < 0.05), and decision curve analysis (DCA) was used to assess the model's clinical utility.
Results: A total of 3,720 individuals' EMR data were analyzed, with 2,575 (69.2%) classified as not LTFU and 1,145 (30.8%) classified as LTFU. On the basis of the ML feature selection process, six strong predictors of LTFU were identified: differentiated service delivery model, adherence, tuberculosis preventive therapy, follow-up period, nutritional status, and address information. The random forest algorithm showed superior performance, with an accuracy of 84.2%, a sensitivity of 82.4%, a specificity of 85.7%, a precision of 83.7%, an F1 score of 83.1%, and an area under the curve of 89.5%. The model demonstrated greater clinical utility, offering greater net benefit than both the 'intervention for all' approach and the 'intervention for none' approach, particularly at threshold probabilities of 10% and above.
Conclusions: This study developed a machine learning-based predictive model for assessing the future risk of LTFU in HIV care within low-resource settings. Notably, the model built via the random forest algorithm exhibited high accuracy and strong discriminative performance, highlighting its positive net benefit for clinical applications. Furthermore, ongoing external validation across diverse populations is important to ensure the model's reliability and generalizability.
Keywords: HIV; Loss to follow-up; Low-resource setting; Machine learning; Model development; Prediction tool.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: This study was approved by the Ethical Review Committee of the College of Health Sciences, Addis Ababa University (Approval No. 061/23/SPH, September 20, 2023). The ethics committee waived the requirement for individual informed consent, as the study used deidentified secondary data. All the data were treated with strict confidentiality and used solely for the purposes of this research. The study was conducted in accordance with the ethical principles of the Declaration of Helsinki. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.
Figures





Similar articles
-
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0. BMC Public Health. 2024. PMID: 38943093 Free PMC article.
-
Development and validation of a risk prediction model for lost to follow-up among adults on active antiretroviral therapy in Ethiopia: a retrospective follow-up study.BMC Infect Dis. 2022 Sep 7;22(1):727. doi: 10.1186/s12879-022-07691-x. BMC Infect Dis. 2022. PMID: 36071386 Free PMC article.
-
Incidence of loss to follow-up and its predictors among HIV-infected under-five children after initiation of antiretroviral therapy in West Amhara Comprehensive Specialized Referral Hospitals, Northwest Ethiopia: a multicenter retrospective follow-up study.BMC Pediatr. 2024 Sep 28;24(1):615. doi: 10.1186/s12887-024-05086-2. BMC Pediatr. 2024. PMID: 39342164 Free PMC article.
-
Rate and predictors of loss to follow-up in HIV care in a low-resource setting: analyzing critical risk periods.BMC Infect Dis. 2024 Oct 18;24(1):1176. doi: 10.1186/s12879-024-10089-6. BMC Infect Dis. 2024. PMID: 39425041 Free PMC article.
-
A data-driven scoring framework for personalized employee health check-ups: Integrating historical laboratory trends and evidence-based prevalence.Int J Med Inform. 2025 Oct;202:105974. doi: 10.1016/j.ijmedinf.2025.105974. Epub 2025 May 18. Int J Med Inform. 2025. PMID: 40403478
References
-
- UNAIDS epidemiological estimates. 2023. [Internet]. Available from: https://www.unaids.org/en/resources/documents/2024/global-aids-update-2024
-
- UNAIDS. Political declaration on HIV and AIDS: Ending inequalities and getting on track to end AIDS by 2030. [Internet]. Available from: https://www.unaids.org/en/resources/documents/2021/2021_political-declar...
-
- PEPFAR Ethiopia (PEPFAR-E). Ethiopia Country Operational Plan COP2020/FY2021 Strategic Direction Summary March 23, 2020.
-
- Standard Operating. Procedures (SOP) for comprehensive HIV/AIDS prevention, treatment, care and support services. Oromia National Regional State Health Bureau; Sept 2018.
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous