ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning
- PMID: 39842860
- PMCID: PMC11796095
- DOI: 10.1093/bioinformatics/btaf031
ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning
Abstract
Motivation: LLMs like GPT-4, despite their advancements, often produce hallucinations and struggle with integrating external knowledge effectively. While Retrieval-Augmented Generation (RAG) attempts to address this by incorporating external information, it faces significant challenges such as context length limitations and imprecise vector similarity search. ESCARGOT aims to overcome these issues by combining LLMs with a dynamic Graph of Thoughts and biomedical knowledge graphs, improving output reliability, and reducing hallucinations.
Result: ESCARGOT significantly outperforms industry-standard RAG methods, particularly in open-ended questions that demand high precision. ESCARGOT also offers greater transparency in its reasoning process, allowing for the vetting of both code and knowledge requests, in contrast to the black-box nature of LLM-only or RAG-based approaches.
Availability and implementation: ESCARGOT is available as a pip package and on GitHub at: https://github.com/EpistasisLab/ESCARGOT.
© The Author(s) 2025. Published by Oxford University Press.
Figures
Similar articles
-
KRAGEN: a knowledge graph-enhanced RAG framework for biomedical problem solving using large language models.Bioinformatics. 2024 Jun 3;40(6):btae353. doi: 10.1093/bioinformatics/btae353. Bioinformatics. 2024. PMID: 38830083 Free PMC article.
-
Improving Dietary Supplement Information Retrieval: Development of a Retrieval-Augmented Generation System With Large Language Models.J Med Internet Res. 2025 Mar 19;27:e67677. doi: 10.2196/67677. J Med Internet Res. 2025. PMID: 40106799 Free PMC article.
-
Use of Retrieval-Augmented Large Language Model for COVID-19 Fact-Checking: Development and Usability Study.J Med Internet Res. 2025 Apr 30;27:e66098. doi: 10.2196/66098. J Med Internet Res. 2025. PMID: 40306628 Free PMC article.
-
RAGing ahead in rheumatology: new language model architectures to tame artificial intelligence.Ther Adv Musculoskelet Dis. 2025 Apr 21;17:1759720X251331529. doi: 10.1177/1759720X251331529. eCollection 2025. Ther Adv Musculoskelet Dis. 2025. PMID: 40292012 Free PMC article. Review.
-
Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications.Medicina (Kaunas). 2024 Mar 8;60(3):445. doi: 10.3390/medicina60030445. Medicina (Kaunas). 2024. PMID: 38541171 Free PMC article. Review.
Cited by
-
Fine-tuning LLM hyperparameters to align semantic and physiological contexts of aging-related pathways.Mol Divers. 2025 Jun 6. doi: 10.1007/s11030-025-11226-2. Online ahead of print. Mol Divers. 2025. PMID: 40481378
-
Drug repurposing for Alzheimer's disease using a graph-of-thoughts based large language model to infer drug-disease relationships in a comprehensive knowledge graph.BioData Min. 2025 Aug 5;18(1):51. doi: 10.1186/s13040-025-00466-5. BioData Min. 2025. PMID: 40764997 Free PMC article.
-
Automatic biomarker discovery and enrichment with BRAD.Bioinformatics. 2025 May 6;41(5):btaf159. doi: 10.1093/bioinformatics/btaf159. Bioinformatics. 2025. PMID: 40323323 Free PMC article.
References
-
- Abujabal A, Roy RS, Yahya M et al. Never-ending learning for open-domain question answering over knowledge bases. In: Proceedings of the 2018 World Wide Web Conference (WWW'18), International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, pp. 1053–62, 2018.
-
- Besta M, Blach N, Kubicek A et al. Graph of Thoughts: Solving Elaborate Problems with Large Language Models. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38, pp. 17682–90, Association for the Advancement of Artificial Intelligence (AAAI), 2024. 10.1609/aaai.v38i16.29720 - DOI
-
- Chen B, Zhang Z, Langrené N et al. Unleashing the potential of prompt engineering in large language models: a comprehensive review. arXiv, arXiv:2310.14735, 2023, preprint: not peer reviewed.
-
- Dilocker E, van Luijt B, Voorbach B et al. Weaviate. https://github.com/weaviate/weaviate
-
- Hong S, Zheng X, Chen J et al. Metagpt: meta programming for multi-agent collaborative framework. arXiv, arXiv:2308.00352, 2023, preprint: not peer reviewed.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources