Comparative Study

. 2025 Feb 10:13:e63881.

doi: 10.2196/63881.

InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis

Yesim Selcuk¹, Eunhui Kim^{1

2}, Insung Ahn^{1

3}

Affiliations

¹ Department of Applied AI, KISTI School, University of Science and Technology, Daejeon, Republic of Korea.
² AI Data Research Team, Department of Data-Centric Problem Solving Research, Division of S&T Digital Convergence, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.
³ Department of Data-Centric Problem Solving Research, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.

PMID: 39928922
PMCID: PMC11851044
DOI: 10.2196/63881

Comparative Study

InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis

Yesim Selcuk et al. JMIR Med Inform. 2025.

. 2025 Feb 10:13:e63881.

doi: 10.2196/63881.

Authors

Yesim Selcuk¹, Eunhui Kim^{1

2}, Insung Ahn^{1

3}

Affiliations

¹ Department of Applied AI, KISTI School, University of Science and Technology, Daejeon, Republic of Korea.
² AI Data Research Team, Department of Data-Centric Problem Solving Research, Division of S&T Digital Convergence, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.
³ Department of Data-Centric Problem Solving Research, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.

PMID: 39928922
PMCID: PMC11851044
DOI: 10.2196/63881

Abstract

Background: Infectious diseases have consistently been a significant concern in public health, requiring proactive measures to safeguard societal well-being. In this regard, regular monitoring activities play a crucial role in mitigating the adverse effects of diseases on society. To monitor disease trends, various organizations, such as the World Health Organization (WHO) and the European Centre for Disease Prevention and Control (ECDC), collect diverse surveillance data and make them publicly accessible. However, these platforms primarily present surveillance data in English, which creates language barriers for non-English-speaking individuals and global public health efforts to accurately observe disease trends. This challenge is particularly noticeable in regions such as the Middle East, where specific infectious diseases, such as Middle East respiratory syndrome coronavirus (MERS-CoV), have seen a dramatic increase. For such regions, it is essential to develop tools that can overcome language barriers and reach more individuals to alleviate the negative impacts of these diseases.

Objective: This study aims to address these issues; therefore, we propose InfectA-Chat, a cutting-edge large language model (LLM) specifically designed for the Arabic language but also incorporating English for question and answer (Q&A) tasks. InfectA-Chat leverages its deep understanding of the language to provide users with information on the latest trends in infectious diseases based on their queries.

Methods: This comprehensive study was achieved by instruction tuning the AceGPT-7B and AceGPT-7B-Chat models on a Q&A task, using a dataset of 55,400 Arabic and English domain-specific instruction-following data. The performance of these fine-tuned models was evaluated using 2770 domain-specific Arabic and English instruction-following data, using the GPT-4 evaluation method. A comparative analysis was then performed against Arabic LLMs and state-of-the-art models, including AceGPT-13B-Chat, Jais-13B-Chat, Gemini, GPT-3.5, and GPT-4. Furthermore, to ensure the model had access to the latest information on infectious diseases by regularly updating the data without additional fine-tuning, we used the retrieval-augmented generation (RAG) method.

Results: InfectA-Chat demonstrated good performance in answering questions about infectious diseases by the GPT-4 evaluation method. Our comparative analysis revealed that it outperforms the AceGPT-7B-Chat and InfectA-Chat (based on AceGPT-7B) models by a margin of 43.52%. It also surpassed other Arabic LLMs such as AceGPT-13B-Chat and Jais-13B-Chat by 48.61%. Among the state-of-the-art models, InfectA-Chat achieved a leading performance of 23.78%, competing closely with the GPT-4 model. Furthermore, the RAG method in InfectA-Chat significantly improved document retrieval accuracy. Notably, RAG retrieved more accurate documents based on queries when the top-k parameter value was increased.

Conclusions: Our findings highlight the shortcomings of general Arabic LLMs in providing up-to-date information about infectious diseases. With this study, we aim to empower individuals and public health efforts by offering a bilingual Q&A system for infectious disease monitoring.

Keywords: AceGPT; Arabic large language models; infectious disease monitoring; large language model; multilingual large language model; public health.

©Yesim Selcuk, Eunhui Kim, Insung Ahn. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 10.02.2025.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
Modeling of InfectA-Chat. RAG: retrieval-augmented generation.

**Figure 2**
The overall pipeline of data collection and preprocessing. Q&A: question and answer.

**Figure 3**
The overall pipeline of supervised fine-tuning (instruction tuning).

**Figure 4**
The overall pipeline of Retrieval-Augmented Generation.

**Figure 5**
Training progress of AceGPT-7B–based InfectA-Chat (training loss, validation loss, and learning rate). SFT: supervised fine-tuning.

**Figure 6**
Training progress of AceGPT-7B-Chat–based InfectA-Chat (training loss, validation loss, and learning rate). SFT: supervised fine-tuning.

**Figure 7**
Qualitative performance comparison for AceGPT-7B-Chat, AceGPT-7B–based InfectA-Chat (ours), and AceGPT-7B-Chat–based InfectA-Chat (ours) by GPT-4 evaluation.

**Figure 8**
Qualitative performance comparison for AceGPT-13B-Chat, Jais-13B-Chat, and AceGPT-7B-Chat–based InfectA-Chat (ours) by GPT-4 evaluation.

**Figure 9**
Qualitative performance comparison for InfectA-Chat (ours), GPT-4, GPT-3.5, and Gemini.

**Figure 10**
Qualitative performance results for InfectA-Chat on the recent dataset (March 20, 2024, to June 20, 2024) by GPT-4 evaluation.

**Figure 11**
Accuracy of retriever based on top-k parameter in Retrieval-Augmented Generation pipeline.

See this image and copyright information in PMC

References

1. MERS-CoV worldwide overview. European Centre for Disease Prevention and Control. [2024-06-06]. https://www.ecdc.europa.eu/en/middle-east-respiratory-syndrome-coronavirus .
1. Multi-country outbreak of cholera, external situation report #12 - 14 March 2024. World Health Organization. 2024. Mar 14, [2024-06-06]. https://www.who.int/publications/m/item/multi-country-outbreak-of-choler... .
1. What we do. European Centre for Disease Prevention and Control. [2024-05-02]. https://www.ecdc.europa.eu/en/about-ecdc/what-we-do .
1. OpenAI GPT-4 technical report. arXiv. doi: 10.48550/arxiv.2303.08774. Preprint posted online on March 4, 2024. https://arxiv.org/pdf/2303.08774 - DOI
1. Almazrouei E, Alobeidli H, Alshamsi A, Cappelli A, Cojocaru R, Debbah M, Goffinet É, Hesslow D, Launay J, Malartic Q, Mazzotta D, Noune B, Pannier B, Penedo G. The falcon series of open language models. arXiv. doi: 10.48550/arxiv.2311.16867. Preprint posted online on November 28, 2023. https://arxiv.org/abs/2311.16867 - DOI

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- JMIR Publications
- PubMed Central
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis

Affiliations

InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical