. 2024 Sep 27;15(1):8270.

doi: 10.1038/s41467-024-52618-6.

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Jenny Yang¹, Nguyen Thanh Dung², Pham Ngoc Thach³, Nguyen Thanh Phong², Vu Dinh Phu³, Khiem Dong Phu³, Lam Minh Yen⁴, Doan Bui Xuan Thy⁴, Andrew A S Soltan^{5

6

7}, Louise Thwaites^#^{4

8}, David A Clifton^#^{5

9}

Affiliations

¹ Department Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK. jenny.yang@eng.ox.ac.uk.
² Hospital for Tropical Diseases, Ho Chi Minh, Vietnam.
³ National Hospital for Tropical Diseases, Hanoi, Vietnam.
⁴ Oxford University Clinical Research Unit, Ho Chi Minh, Vietnam.
⁵ Department Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK.
⁶ Oxford Cancer & Haematology Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
⁷ Department of Oncology, University of Oxford, Oxford, UK.
⁸ Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK.
⁹ Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China.

^# Contributed equally.

PMID: 39333515
PMCID: PMC11436917
DOI: 10.1038/s41467-024-52618-6

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Jenny Yang et al. Nat Commun. 2024.

. 2024 Sep 27;15(1):8270.

doi: 10.1038/s41467-024-52618-6.

Authors

Affiliations

¹ Department Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK. jenny.yang@eng.ox.ac.uk.
² Hospital for Tropical Diseases, Ho Chi Minh, Vietnam.
³ National Hospital for Tropical Diseases, Hanoi, Vietnam.
⁴ Oxford University Clinical Research Unit, Ho Chi Minh, Vietnam.
⁵ Department Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK.
⁶ Oxford Cancer & Haematology Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
⁷ Department of Oncology, University of Oxford, Oxford, UK.
⁸ Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK.
⁹ Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China.

^# Contributed equally.

PMID: 39333515
PMCID: PMC11436917
DOI: 10.1038/s41467-024-52618-6

Abstract

The integration of artificial intelligence (AI) into healthcare systems within low-middle income countries (LMICs) has emerged as a central focus for various initiatives aiming to improve healthcare access and delivery quality. In contrast to high-income countries (HICs), which often possess the resources and infrastructure to adopt innovative healthcare technologies, LMICs confront resource limitations such as insufficient funding, outdated infrastructure, limited digital data, and a shortage of technical expertise. Consequently, many algorithms initially trained on data from non-LMIC settings are now being employed in LMIC contexts. However, the effectiveness of these systems in LMICs can be compromised when the unique local contexts and requirements are not adequately considered. In this study, we evaluate the feasibility of utilizing models developed in the United Kingdom (a HIC) within hospitals in Vietnam (a LMIC). Consequently, we present and discuss practical methodologies aimed at improving model performance, emphasizing the critical importance of tailoring solutions to the distinct healthcare systems found in LMICs. Our findings emphasize the necessity for collaborative initiatives and solutions that are sensitive to the local context in order to effectively tackle the healthcare challenges that are unique to these regions.

PubMed Disclaimer

Conflict of interest statement

All authors declare no competing interests.

Figures

**Fig. 1. t-SNE plot of UK and Vietnam datasets with reduced feature set.**
Plot includes all positive COVID-19 samples in UK and Vietnam datasets, including the matched/reduced set of features.

**Fig. 2. t-SNE plot of UK and Vietnam datasets with comprehensive feature set.**
Plot includes all positive COVID-19 samples in UK and Vietnam datasets, including the comprehensive set of features, which were generated using the GATS technique.

**Fig. 3. COVID-19 diagnosis performance across logistic regression, XGBoost, and neural network models trained on the UK data.**
Results are presented as AUROC for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file.

**Fig. 4. COVID-19 diagnosis AUROC performance at HTD and NHTD using neural network models which were ready-made (the UK-based models) and models which were fine-tuned using transfer learning.**
Models trained and tested locally at HTD and NHTD are represented by the horizontal purple and yellow dotted lines, respectively. Results are presented for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file. TL transfer learning.

**Fig. 5. COVID-19 diagnosis AUPRC performance at HTD and NHTD using neural network models which were ready-made (the UK-based models) and models which were fine-tuned using transfer learning.**
Models trained and tested locally at HTD and NHTD are represented by the horizontal purple and yellow dotted lines, respectively. Results are presented for the reduced feature set and the comprehensive feature set (GATS-filled), with * representing the comprehensive dataset. Error bars are shown as 95% confidence intervals (CIs), which are computed using 1000 bootstrapped samples drawn from each test set. Source data are provided as a Source Data file. TL Transfer Learning.

See this image and copyright information in PMC

References

1. Labrique, A. B. et al. Best practices in scaling digital health in low and middle income countries. Glob. Health14, 1–8 (2018). - DOI - PMC - PubMed
1. Yang, J. et al. Mitigating machine learning bias between high income and low-middle income countries for enhanced model fairness and generalizability. Sci. Rep.14, 13318 (2024). - DOI - PMC - PubMed
1. Wang, D. et al. “Brilliant AI doctor” in rural clinics: challenges in AI-powered clinical decision support system deployment. In Proc. CHI Conference on Human Factors in Computing Systems 1–18 (2021).
1. Alami, H. et al. Artificial intelligence in health care: laying the foundation for responsible, sustainable, and inclusive innovation in low-and middle-income countries. Glob. Health16, 1–6 (2020). - DOI - PMC - PubMed
1. Ciecierski-Holmes, T., Singh, R., Axt, M., Brenner, S. & Barteit, S. Artificial intelligence for strengthening healthcare systems in low-and middle-income countries: a systematic scoping review. npj Digit. Med.5, 162 (2022). - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

955681/EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
- PubMed Central
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Affiliations

Generalizability assessment of AI models across hospitals in a low-middle and high income country

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical