Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 27;15(1):8270.
doi: 10.1038/s41467-024-52618-6.

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Affiliations

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Jenny Yang et al. Nat Commun. .

Abstract

The integration of artificial intelligence (AI) into healthcare systems within low-middle income countries (LMICs) has emerged as a central focus for various initiatives aiming to improve healthcare access and delivery quality. In contrast to high-income countries (HICs), which often possess the resources and infrastructure to adopt innovative healthcare technologies, LMICs confront resource limitations such as insufficient funding, outdated infrastructure, limited digital data, and a shortage of technical expertise. Consequently, many algorithms initially trained on data from non-LMIC settings are now being employed in LMIC contexts. However, the effectiveness of these systems in LMICs can be compromised when the unique local contexts and requirements are not adequately considered. In this study, we evaluate the feasibility of utilizing models developed in the United Kingdom (a HIC) within hospitals in Vietnam (a LMIC). Consequently, we present and discuss practical methodologies aimed at improving model performance, emphasizing the critical importance of tailoring solutions to the distinct healthcare systems found in LMICs. Our findings emphasize the necessity for collaborative initiatives and solutions that are sensitive to the local context in order to effectively tackle the healthcare challenges that are unique to these regions.

PubMed Disclaimer

Conflict of interest statement

All authors declare no competing interests.

Figures

Fig. 1
Fig. 1. t-SNE plot of UK and Vietnam datasets with reduced feature set.
Plot includes all positive COVID-19 samples in UK and Vietnam datasets, including the matched/reduced set of features.
Fig. 2
Fig. 2. t-SNE plot of UK and Vietnam datasets with comprehensive feature set.
Plot includes all positive COVID-19 samples in UK and Vietnam datasets, including the comprehensive set of features, which were generated using the GATS technique.
Fig. 3
Fig. 3. COVID-19 diagnosis performance across logistic regression, XGBoost, and neural network models trained on the UK data.
Results are presented as AUROC for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file.
Fig. 4
Fig. 4. COVID-19 diagnosis AUROC performance at HTD and NHTD using neural network models which were ready-made (the UK-based models) and models which were fine-tuned using transfer learning.
Models trained and tested locally at HTD and NHTD are represented by the horizontal purple and yellow dotted lines, respectively. Results are presented for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file. TL transfer learning.
Fig. 5
Fig. 5. COVID-19 diagnosis AUPRC performance at HTD and NHTD using neural network models which were ready-made (the UK-based models) and models which were fine-tuned using transfer learning.
Models trained and tested locally at HTD and NHTD are represented by the horizontal purple and yellow dotted lines, respectively. Results are presented for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file. TL Transfer Learning.

References

    1. Labrique, A. B. et al. Best practices in scaling digital health in low and middle income countries. Glob. Health14, 1–8 (2018). - PMC - PubMed
    1. Yang, J. et al. Mitigating machine learning bias between high income and low-middle income countries for enhanced model fairness and generalizability. Sci. Rep.14, 13318 (2024). - PMC - PubMed
    1. Wang, D. et al. “Brilliant AI doctor” in rural clinics: challenges in AI-powered clinical decision support system deployment. In Proc. CHI Conference on Human Factors in Computing Systems 1–18 (2021).
    1. Alami, H. et al. Artificial intelligence in health care: laying the foundation for responsible, sustainable, and inclusive innovation in low-and middle-income countries. Glob. Health16, 1–6 (2020). - PMC - PubMed
    1. Ciecierski-Holmes, T., Singh, R., Axt, M., Brenner, S. & Barteit, S. Artificial intelligence for strengthening healthcare systems in low-and middle-income countries: a systematic scoping review. npj Digit. Med.5, 162 (2022). - PMC - PubMed

Publication types

LinkOut - more resources