InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis
- PMID: 39928922
- PMCID: PMC11851044
- DOI: 10.2196/63881
InfectA-Chat, an Arabic Large Language Model for Infectious Diseases: Comparative Analysis
Abstract
Background: Infectious diseases have consistently been a significant concern in public health, requiring proactive measures to safeguard societal well-being. In this regard, regular monitoring activities play a crucial role in mitigating the adverse effects of diseases on society. To monitor disease trends, various organizations, such as the World Health Organization (WHO) and the European Centre for Disease Prevention and Control (ECDC), collect diverse surveillance data and make them publicly accessible. However, these platforms primarily present surveillance data in English, which creates language barriers for non-English-speaking individuals and global public health efforts to accurately observe disease trends. This challenge is particularly noticeable in regions such as the Middle East, where specific infectious diseases, such as Middle East respiratory syndrome coronavirus (MERS-CoV), have seen a dramatic increase. For such regions, it is essential to develop tools that can overcome language barriers and reach more individuals to alleviate the negative impacts of these diseases.
Objective: This study aims to address these issues; therefore, we propose InfectA-Chat, a cutting-edge large language model (LLM) specifically designed for the Arabic language but also incorporating English for question and answer (Q&A) tasks. InfectA-Chat leverages its deep understanding of the language to provide users with information on the latest trends in infectious diseases based on their queries.
Methods: This comprehensive study was achieved by instruction tuning the AceGPT-7B and AceGPT-7B-Chat models on a Q&A task, using a dataset of 55,400 Arabic and English domain-specific instruction-following data. The performance of these fine-tuned models was evaluated using 2770 domain-specific Arabic and English instruction-following data, using the GPT-4 evaluation method. A comparative analysis was then performed against Arabic LLMs and state-of-the-art models, including AceGPT-13B-Chat, Jais-13B-Chat, Gemini, GPT-3.5, and GPT-4. Furthermore, to ensure the model had access to the latest information on infectious diseases by regularly updating the data without additional fine-tuning, we used the retrieval-augmented generation (RAG) method.
Results: InfectA-Chat demonstrated good performance in answering questions about infectious diseases by the GPT-4 evaluation method. Our comparative analysis revealed that it outperforms the AceGPT-7B-Chat and InfectA-Chat (based on AceGPT-7B) models by a margin of 43.52%. It also surpassed other Arabic LLMs such as AceGPT-13B-Chat and Jais-13B-Chat by 48.61%. Among the state-of-the-art models, InfectA-Chat achieved a leading performance of 23.78%, competing closely with the GPT-4 model. Furthermore, the RAG method in InfectA-Chat significantly improved document retrieval accuracy. Notably, RAG retrieved more accurate documents based on queries when the top-k parameter value was increased.
Conclusions: Our findings highlight the shortcomings of general Arabic LLMs in providing up-to-date information about infectious diseases. With this study, we aim to empower individuals and public health efforts by offering a bilingual Q&A system for infectious disease monitoring.
Keywords: AceGPT; Arabic large language models; infectious disease monitoring; large language model; multilingual large language model; public health.
©Yesim Selcuk, Eunhui Kim, Insung Ahn. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 10.02.2025.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures











References
-
- MERS-CoV worldwide overview. European Centre for Disease Prevention and Control. [2024-06-06]. https://www.ecdc.europa.eu/en/middle-east-respiratory-syndrome-coronavirus .
-
- Multi-country outbreak of cholera, external situation report #12 - 14 March 2024. World Health Organization. 2024. Mar 14, [2024-06-06]. https://www.who.int/publications/m/item/multi-country-outbreak-of-choler... .
-
- What we do. European Centre for Disease Prevention and Control. [2024-05-02]. https://www.ecdc.europa.eu/en/about-ecdc/what-we-do .
-
- OpenAI GPT-4 technical report. arXiv. doi: 10.48550/arxiv.2303.08774. Preprint posted online on March 4, 2024. https://arxiv.org/pdf/2303.08774 - DOI
-
- Almazrouei E, Alobeidli H, Alshamsi A, Cappelli A, Cojocaru R, Debbah M, Goffinet É, Hesslow D, Launay J, Malartic Q, Mazzotta D, Noune B, Pannier B, Penedo G. The falcon series of open language models. arXiv. doi: 10.48550/arxiv.2311.16867. Preprint posted online on November 28, 2023. https://arxiv.org/abs/2311.16867 - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical