Bridging artificial intelligence and biological sciences: a comprehensive review of large language models in bioinformatics
- PMID: 40708223
- PMCID: PMC12289552
- DOI: 10.1093/bib/bbaf357
Bridging artificial intelligence and biological sciences: a comprehensive review of large language models in bioinformatics
Abstract
Large language models (LLMs), representing a breakthrough advancement in artificial intelligence, have demonstrated substantial application value and development potential in bioinformatics research, particularly showing significant progress in the processing and analysis of complex biological data. This comprehensive review systematically examines the development and applications of LLMs in bioinformatics, with particular emphasis on their advancements in protein and nucleic acid structure prediction, omics analysis, drug design and screening, and biomedical literature mining. This work highlights the distinctive capabilities of LLMs in end-to-end learning and knowledge transfer paradigms. Additionally, this paper thoroughly discusses the major challenges confronting LLMs in current applications, including key issues such as model interpretability and data bias. Furthermore, this review comprehensively explores the potential of LLMs in cross-modal learning and interdisciplinary development. In conclusion, this paper aims to systematically summarize the current research status of LLMs in bioinformatics, objectively evaluate their advantages and limitations, and provide insights and recommendations for future research directions, thereby positioning LLMs as essential tools in bioinformatics research and fostering innovative developments in the biomedical field.
Keywords: LLMs; artificial intelligence; bioinformatics; large language models.
© The Author(s) 2025. Published by Oxford University Press.
Figures



Similar articles
-
Stench of Errors or the Shine of Potential: The Challenge of (Ir)Responsible Use of ChatGPT in Speech-Language Pathology.Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70088. doi: 10.1111/1460-6984.70088. Int J Lang Commun Disord. 2025. PMID: 40627744 Review.
-
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.Pharmacoecon Open. 2025 Jul;9(4):501-517. doi: 10.1007/s41669-025-00580-4. Epub 2025 Apr 29. Pharmacoecon Open. 2025. PMID: 40301283 Free PMC article.
-
Large language models in perioperative medicine-applications and future prospects: a narrative review.Can J Anaesth. 2025 Jun;72(6):1000-1014. doi: 10.1007/s12630-025-02980-w. Epub 2025 Jun 9. Can J Anaesth. 2025. PMID: 40490617 Free PMC article. Review.
-
Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.J Med Internet Res. 2024 Nov 7;26:e22769. doi: 10.2196/22769. J Med Internet Res. 2024. PMID: 39509695 Free PMC article.
-
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916. J Med Internet Res. 2025. PMID: 40644686 Free PMC article. Review.
References
-
- Zhang J, Li H, Tao W. et al. GseaVis: an R package for enhanced visualization of gene set enrichment analysis in biomedicine. Med Research 1:131–5. 10.1002/mdr2.70000. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources