Chinese generative AI models (DeepSeek and Qwen) rival ChatGPT-4 in ophthalmology queries with excellent performance in Arabic and English

Malik Sallam^{1

2}, Israa M Alasfoor^{3

4}, Shahad W Khalid^{3

4}, Rand I Al-Mulla^{3

4}, Amwaj Al-Farajat^{3

4}, Maad M Mijwil^{5

6}, Reem Zahrawi⁷, Mohammed Sallam^{8

9

10

11}, Jan Egger^{12

13

14

15}, Ahmad S Al-Adwan¹⁶

Affiliations

¹ Department of Pathology, Microbiology and Forensic Medicine, School of Medicine, The University of Jordan, Amman, Jordan.
² Department of Clinical Laboratories and Forensic Medicine, Jordan University Hospital, Amman, Jordan.
³ Section of Ophthalmology, Department of Special Surgery, School of Medicine, The University of Jordan, Amman, Jordan.
⁴ Section of Ophthalmology, Department of Special Surgery, Jordan University Hospital, Amman, Jordan.
⁵ College of Administration and Economics, Al-Iraqia University, Baghdad, Iraq.
⁶ Department of Computer Techniques Engineering, Baghdad College of Economic Sciences University, Baghdad, Iraq.
⁷ Department of Ophthalmology, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
⁸ Department of Pharmacy, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
⁹ Department of Management, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
¹⁰ Department of Management, School of Business, International American University, Los Angeles, United States.
¹¹ College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates.
¹² Institute for Artificial Intelligence in Medicine (IKIM), Essen University Hospital (AoR), GirardetstraBe, Germany.
¹³ Center for Virtual and Extended Reality in Medicine (ZvRM), Essen University Hospital (AoR), HufelandstraBe, Germany.
¹⁴ Cancer Research Center Cologne Essen (CCCE), University Medicine Essen (AoR), HufelandstraBe, Germany.
¹⁵ University of Duisburg-Essen, Faculty of Computer Science, Schutzenbahn, Germany.
¹⁶ Department of Business Technology, Al-Ahliyya Amman University, Amman, Jordan.

PMID: 40352182
PMCID: PMC12059827
DOI: 10.52225/narra.v5i1.2371

Chinese generative AI models (DeepSeek and Qwen) rival ChatGPT-4 in ophthalmology queries with excellent performance in Arabic and English

Malik Sallam et al. Narra J. 2025 Apr.

. 2025 Apr;5(1):e2371.

doi: 10.52225/narra.v5i1.2371. Epub 2025 Apr 8.

Authors

Affiliations

¹ Department of Pathology, Microbiology and Forensic Medicine, School of Medicine, The University of Jordan, Amman, Jordan.
² Department of Clinical Laboratories and Forensic Medicine, Jordan University Hospital, Amman, Jordan.
³ Section of Ophthalmology, Department of Special Surgery, School of Medicine, The University of Jordan, Amman, Jordan.
⁴ Section of Ophthalmology, Department of Special Surgery, Jordan University Hospital, Amman, Jordan.
⁵ College of Administration and Economics, Al-Iraqia University, Baghdad, Iraq.
⁶ Department of Computer Techniques Engineering, Baghdad College of Economic Sciences University, Baghdad, Iraq.
⁷ Department of Ophthalmology, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
⁸ Department of Pharmacy, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
⁹ Department of Management, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, United Arab Emirates.
¹⁰ Department of Management, School of Business, International American University, Los Angeles, United States.
¹¹ College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates.
¹² Institute for Artificial Intelligence in Medicine (IKIM), Essen University Hospital (AoR), GirardetstraBe, Germany.
¹³ Center for Virtual and Extended Reality in Medicine (ZvRM), Essen University Hospital (AoR), HufelandstraBe, Germany.
¹⁴ Cancer Research Center Cologne Essen (CCCE), University Medicine Essen (AoR), HufelandstraBe, Germany.
¹⁵ University of Duisburg-Essen, Faculty of Computer Science, Schutzenbahn, Germany.
¹⁶ Department of Business Technology, Al-Ahliyya Amman University, Amman, Jordan.

PMID: 40352182
PMCID: PMC12059827
DOI: 10.52225/narra.v5i1.2371

Abstract

The rapid evolution of generative artificial intelligence (genAI) has ushered in a new era of digital medical consultations, with patients turning to AI-driven tools for guidance. The emergence of Chinese-developed genAI models such as DeepSeek-R1 and Qwen-2.5 presented a challenge to the dominance of OpenAI's ChatGPT. The aim of this study was to benchmark the performance of Chinese genAI models against ChatGPT-40 and to assess disparities in performance across English and Arabic. Following the METRICS checklist for genAI evaluation, Qwen-2.5, DeepSeek-R1, and ChatGPT-40 were assessed for completeness, accuracy, and relevance using the CLEAR tool in common patient ophthalmology queries. In English, Qwen-2.5 demonstrated the highest overall performance (CLEAR score: 4.43 ± 0.28), outperforming both DeepSeek-R1 (4.3 ± 0.43) and ChatGPT-40 (4.14 ± 0.41), with p = 0.002. A similar hierarchy emerged in Arabic, with Qwen-2.5 again leading (4.40 ± 0.29), followed by DeepSeek-R1 (4.20 ± 0.49) and ChatGPT-40 (4.14 ± 0.41), with p = 0.007. Each tested genAI model exhibited near-identical performance across the two languages, with ChatGPT-40 demonstrating the most balanced linguistic capabilities (p = 0.957), while Qwen-2.5 and DeepSeek-R1 showed a marginal superiority for English. An in-depth examination of genAI performance across key CLEAR components revealed that Qwen-2.5 consistently excelled in content completeness, factual accuracy, and relevance in both English and Arabic, setting a new benchmark for genAI in medical inquiries. Despite minor linguistic disparities, all three models exhibited robust multilingual capabilities, challenging the long-held assumption that genAI is inherently biased toward English. These findings highlight the evolving nature of AI-driven medical assistance, with Chinese genAI models being able to rival or even surpass ChatGPT-40 in ophthalmology-related queries.

Keywords: DeepSeek; LLM; OpenAI; Qwen; eye disease.

PubMed Disclaimer

Conflict of interest statement

All the authors declare that there are no conflicts of interest.

Figures

**Figure 1.**
Comparison of generative AI (genAI) model performance in English and Arabic using CLEAR overall scores. The p-values were calculated using Kruskal-Wallis test.

**Figure 2.**
Comparison of generative AI (genAI) model performance across ophthalmology query topics. p-values were calculated using Kruskal-Wallis test. Post-hoc analysis results using Mann- Whitney U tests are indicated by the horizontal lines between genAI models, with significant results indicated by asterisk, while statistically insignificant results are indicated by ns.

See this image and copyright information in PMC

References

1. The British Broadcasting Corporation (BBC) . AI named word of the year by Collins Dictionary. Available from: https://www.bbc.com/news/entertainment-arts-67271252. Accessed: 27 February 2025.
1. Mbizo T, Oosterwyk G, Tsibolane P, et al. Cautious optimism: The influence of generative AI tools in software development projects. In: Gerber A, editor. South African computer science and information systems research trends. Cham: Springer Nature Switzerland; 2024.
1. Yusuf A, Pervin N, Roman-Gonzalez M. Generative AI and the future of higher education: A threat to academic integrity or reformation? Evidence from multicultural perspectives. Int J Educ Technol High Educ 2024;21(1):21.
1. Cohen J, Lee G, Greenbaum L, et al. The generative world order: AI, geopolitics, and power. Goldman Sachs 2023. Available from: https://www.goldmansachs.com/insights/articles/the-generative-world-orde... power. Accessed: 27 February 2025.
1. Sallam M. ChatGPT utility in healthcare education, research, and practice: Systematic review on the promising perspectives and valid concerns. Healthcare 2023;11(6):887. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Supplementary concepts

Actions

LinkOut - more resources

Full Text Sources
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Chinese generative AI models (DeepSeek and Qwen) rival ChatGPT-4 in ophthalmology queries with excellent performance in Arabic and English

Affiliations

Chinese generative AI models (DeepSeek and Qwen) rival ChatGPT-4 in ophthalmology queries with excellent performance in Arabic and English

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

Supplementary concepts

LinkOut - more resources

Full Text Sources