ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning
- PMID: 39842860
- PMCID: PMC11796095
- DOI: 10.1093/bioinformatics/btaf031
ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning
Abstract
Motivation: LLMs like GPT-4, despite their advancements, often produce hallucinations and struggle with integrating external knowledge effectively. While Retrieval-Augmented Generation (RAG) attempts to address this by incorporating external information, it faces significant challenges such as context length limitations and imprecise vector similarity search. ESCARGOT aims to overcome these issues by combining LLMs with a dynamic Graph of Thoughts and biomedical knowledge graphs, improving output reliability, and reducing hallucinations.
Result: ESCARGOT significantly outperforms industry-standard RAG methods, particularly in open-ended questions that demand high precision. ESCARGOT also offers greater transparency in its reasoning process, allowing for the vetting of both code and knowledge requests, in contrast to the black-box nature of LLM-only or RAG-based approaches.
Availability and implementation: ESCARGOT is available as a pip package and on GitHub at: https://github.com/EpistasisLab/ESCARGOT.
© The Author(s) 2025. Published by Oxford University Press.
Figures
References
-
- Abujabal A, Roy RS, Yahya M et al. Never-ending learning for open-domain question answering over knowledge bases. In: Proceedings of the 2018 World Wide Web Conference (WWW'18), International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, pp. 1053–62, 2018.
-
- Besta M, Blach N, Kubicek A et al. Graph of Thoughts: Solving Elaborate Problems with Large Language Models. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38, pp. 17682–90, Association for the Advancement of Artificial Intelligence (AAAI), 2024. 10.1609/aaai.v38i16.29720 - DOI
-
- Chen B, Zhang Z, Langrené N et al. Unleashing the potential of prompt engineering in large language models: a comprehensive review. arXiv, arXiv:2310.14735, 2023, preprint: not peer reviewed.
-
- Dilocker E, van Luijt B, Voorbach B et al. Weaviate. https://github.com/weaviate/weaviate
-
- Hong S, Zheng X, Chen J et al. Metagpt: meta programming for multi-agent collaborative framework. arXiv, arXiv:2308.00352, 2023, preprint: not peer reviewed.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
