Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 19;25(1):192.
doi: 10.1186/s12911-025-03030-7.

Development of a machine learning prediction model for loss to follow-up in HIV care using routine electronic medical records in a low-resource setting

Affiliations

Development of a machine learning prediction model for loss to follow-up in HIV care using routine electronic medical records in a low-resource setting

Tamrat Endebu et al. BMC Med Inform Decis Mak. .

Abstract

Background: Despite the global commitment to ending AIDS by 2030, the loss of follow-up (LTFU) in HIV care remains a significant challenge. To address this issue, a data-driven clinical decision tool is crucial for identifying patients at greater risk of LTFU and facilitating personalized and proactive interventions. This study aimed to develop a prediction model to assess the future risk of LTFU in HIV care in Ethiopia.

Methods: The study used a retrospective design in which machine learning (ML) methods were applied to the electronic medical records (EMRs) data of adult HIV-positive individuals who were newly enrolled in antiretroviral therapy between July 2019 and April 2024. The data were collected across eight randomly selected high-volume healthcare facilities. Six supervised ML classifiers-J48 decision tree, random forest, K-nearest neighbors, support vector machine, logistic regression, and naïve Bayes-were utilized for training via Weka 3.8.6 software. The performance of each algorithm was evaluated through a 10-fold cross-validation approach. Algorithm performance was compared via the corrected resampled t test (p < 0.05), and decision curve analysis (DCA) was used to assess the model's clinical utility.

Results: A total of 3,720 individuals' EMR data were analyzed, with 2,575 (69.2%) classified as not LTFU and 1,145 (30.8%) classified as LTFU. On the basis of the ML feature selection process, six strong predictors of LTFU were identified: differentiated service delivery model, adherence, tuberculosis preventive therapy, follow-up period, nutritional status, and address information. The random forest algorithm showed superior performance, with an accuracy of 84.2%, a sensitivity of 82.4%, a specificity of 85.7%, a precision of 83.7%, an F1 score of 83.1%, and an area under the curve of 89.5%. The model demonstrated greater clinical utility, offering greater net benefit than both the 'intervention for all' approach and the 'intervention for none' approach, particularly at threshold probabilities of 10% and above.

Conclusions: This study developed a machine learning-based predictive model for assessing the future risk of LTFU in HIV care within low-resource settings. Notably, the model built via the random forest algorithm exhibited high accuracy and strong discriminative performance, highlighting its positive net benefit for clinical applications. Furthermore, ongoing external validation across diverse populations is important to ensure the model's reliability and generalizability.

Keywords: HIV; Loss to follow-up; Low-resource setting; Machine learning; Model development; Prediction tool.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study was approved by the Ethical Review Committee of the College of Health Sciences, Addis Ababa University (Approval No. 061/23/SPH, September 20, 2023). The ethics committee waived the requirement for individual informed consent, as the study used deidentified secondary data. All the data were treated with strict confidentiality and used solely for the purposes of this research. The study was conducted in accordance with the ethical principles of the Declaration of Helsinki. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Information gain (IG) of features for predicting loss to follow-up in HIV care. Abbreviations: DSD = Differentiated Service Delivery, TPT = TB Prevention Therapy, WHO = World Health Organization
Fig. 2
Fig. 2
Class distribution after applying class balancing techniques to the target feature, addressing the original imbalanced data. Abbreviations: LTFU = Loss to Follow-Up, SMOTE = Synthetic Minority Oversampling Technique
Fig. 3
Fig. 3
Comparison of different ML algorithms after applying SMOTE to balance the data. Abbreviations: J48 is a decision tree algorithm based on the C4.5 algorithm. K-NN = k-nearest neighbors, LR = logistic regression, RF = random forest, ROC = receiver operating characteristic curve, SVM = support vector machine
Fig. 4
Fig. 4
ROC curve and AUC of the random forest algorithm for predicting LTFU in HIV Care, Ethiopia
Fig. 5
Fig. 5
Decision curve analysis (DCA) assessing the clinical utility of the model for predicting LTFU in HIV care, Ethiopia

Similar articles

References

    1. UNAIDS epidemiological estimates. 2023. [Internet]. Available from: https://www.unaids.org/en/resources/documents/2024/global-aids-update-2024
    1. UNAIDS. Political declaration on HIV and AIDS: Ending inequalities and getting on track to end AIDS by 2030. [Internet]. Available from: https://www.unaids.org/en/resources/documents/2021/2021_political-declar...
    1. PEPFAR Ethiopia (PEPFAR-E). Ethiopia Country Operational Plan COP2020/FY2021 Strategic Direction Summary March 23, 2020.
    1. Standard Operating. Procedures (SOP) for comprehensive HIV/AIDS prevention, treatment, care and support services. Oromia National Regional State Health Bureau; Sept 2018.
    1. Mberi MN, Kuonza LR, Dube NM, Nattey C, Manda S, Summers R. Deter_minants of loss to follow-up in patients on antiretroviral treatment, South Africa, 2004–2012: a cohort study. BMC Health Serv Res. 2015;15(1):1–11. - PMC - PubMed

LinkOut - more resources