ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge
- PMID: 37492832
- PMCID: PMC10364849
- DOI: 10.7759/cureus.40895
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge
Abstract
Objective The primary aim of this research was to address the limitations observed in the medical knowledge of prevalent large language models (LLMs) such as ChatGPT, by creating a specialized language model with enhanced accuracy in medical advice. Methods We achieved this by adapting and refining the large language model meta-AI (LLaMA) using a large dataset of 100,000 patient-doctor dialogues sourced from a widely used online medical consultation platform. These conversations were cleaned and anonymized to respect privacy concerns. In addition to the model refinement, we incorporated a self-directed information retrieval mechanism, allowing the model to access and utilize real-time information from online sources like Wikipedia and data from curated offline medical databases. Results The fine-tuning of the model with real-world patient-doctor interactions significantly improved the model's ability to understand patient needs and provide informed advice. By equipping the model with self-directed information retrieval from reliable online and offline sources, we observed substantial improvements in the accuracy of its responses. Conclusion Our proposed ChatDoctor, represents a significant advancement in medical LLMs, demonstrating a significant improvement in understanding patient inquiries and providing accurate advice. Given the high stakes and low error tolerance in the medical field, such enhancements in providing accurate and reliable information are not only beneficial but essential.
Keywords: ai chatbot; chat gpt; gpt; large language model; llama.
Copyright © 2023, Li et al.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures













Similar articles
-
Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study.J Med Internet Res. 2023 Oct 30;25:e49324. doi: 10.2196/49324. J Med Internet Res. 2023. PMID: 37902826 Free PMC article.
-
A Reliable and Accessible Caregiving Language Model (CaLM) to Support Tools for Caregivers: Development and Evaluation Study.JMIR Form Res. 2024 Jul 31;8:e54633. doi: 10.2196/54633. JMIR Form Res. 2024. PMID: 39083337 Free PMC article.
-
EyeGPT for Patient Inquiries and Medical Education: Development and Validation of an Ophthalmology Large Language Model.J Med Internet Res. 2024 Dec 11;26:e60063. doi: 10.2196/60063. J Med Internet Res. 2024. PMID: 39661433 Free PMC article.
-
Large language models in psychiatry: Opportunities and challenges.Psychiatry Res. 2024 Sep;339:116026. doi: 10.1016/j.psychres.2024.116026. Epub 2024 Jun 11. Psychiatry Res. 2024. PMID: 38909412 Review.
-
Large Language Models in Ophthalmology: Potential and Pitfalls.Semin Ophthalmol. 2024 May;39(4):289-293. doi: 10.1080/08820538.2023.2300808. Epub 2024 Jan 5. Semin Ophthalmol. 2024. PMID: 38179986 Review.
Cited by
-
The Potential Breakthroughs with ChatGPT in Parasitology.Iran J Parasitol. 2023 Apr-Jun;18(2):275-278. doi: 10.18502/ijpa.v18i2.13197. Iran J Parasitol. 2023. PMID: 37583636 Free PMC article. No abstract available.
-
Medical language model specialized in extracting cardiac knowledge.Sci Rep. 2024 Nov 23;14(1):29059. doi: 10.1038/s41598-024-80165-z. Sci Rep. 2024. PMID: 39580531 Free PMC article.
-
[Large Language Models: A Comprehensive Guide for Radiologists].J Korean Soc Radiol. 2024 Sep;85(5):861-882. doi: 10.3348/jksr.2024.0080. Epub 2024 Sep 27. J Korean Soc Radiol. 2024. PMID: 39416308 Free PMC article. Review. Korean.
-
Natural Language Processing for Digital Health in the Era of Large Language Models.Yearb Med Inform. 2024 Aug;33(1):229-240. doi: 10.1055/s-0044-1800750. Epub 2025 Apr 8. Yearb Med Inform. 2024. PMID: 40199310 Free PMC article. Review.
-
Vision-language models for medical report generation and visual question answering: a review.Front Artif Intell. 2024 Nov 19;7:1430984. doi: 10.3389/frai.2024.1430984. eCollection 2024. Front Artif Intell. 2024. PMID: 39628839 Free PMC article. Review.
References
-
- Long Ouyang, Jeff Wu, Xu Jiang, et al. Training language models to follow instructions with human feedback. arXiv preprint. [ Apr; 2023 ]. 2022. http://arXiv:2203.02155 p. 0.http://arXiv:2203.02155
-
- Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi. Self-instruct: aligning language model with self generated instructions. arXiv preprint. [ Dec; 2022 ]. 2022. http://arXiv:2212.10560 p. 0.http://arXiv:2212.10560
-
- How does chatgpt perform on the united states medical licensing examination? the implications of large language models for medical education and knowledge assessment. Aidan Gilson, Conrad W Safranek, Thomas Huang, et al. https://mededu.jmir.org/2023/1/e45312/ JMIR Med Educ. 2023;9:45312–42023. - PMC - PubMed
-
- Means: a medical question-answering system combining NLP techniques and semantic web technologies. Abacha AB, Zweigenbaum P. https://www.sciencedirect.com/science/article/pii/S0306457315000515 Inf Process Manag. 2015;51:570–594.
-
- Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, et al. Stanford alpaca: an instruction-following llama model. [ Apr; 2023 ]. 2023. https://github.com/tatsu-lab/stanford_alpaca https://github.com/tatsu-lab/stanford_alpaca
Grants and funding
LinkOut - more resources
Full Text Sources