A generalist vision-language foundation model for diverse biomedical tasks
- PMID: 39112796
- DOI: 10.1038/s41591-024-03185-2
A generalist vision-language foundation model for diverse biomedical tasks
Abstract
Traditional biomedical artificial intelligence (AI) models, designed for specific tasks or modalities, often exhibit limited flexibility in real-world deployment and struggle to utilize holistic information. Generalist AI holds the potential to address these limitations due to its versatility in interpreting different data types and generating tailored outputs for diverse needs. However, existing biomedical generalist AI solutions are typically heavyweight and closed source to researchers, practitioners and patients. Here, we describe BiomedGPT, the first open-source and lightweight vision-language foundation model, designed as a generalist capable of performing various biomedical tasks. BiomedGPT achieved state-of-the-art results in 16 out of 25 experiments while maintaining a computing-friendly model scale. We also conducted human evaluations to assess the capabilities of BiomedGPT in radiology visual question answering, report generation and summarization. BiomedGPT exhibits robust prediction ability with a low error rate of 3.8% in question answering, satisfactory performance with an error rate of 8.3% in writing complex radiology reports, and competitive summarization ability with a nearly equivalent preference score to human experts. Our method demonstrates that effective training with diverse data can lead to more practical biomedical AI for improving diagnosis and workflow efficiency.
© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
References
MeSH terms
Grants and funding
- R03AG078625/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- R01LM11934/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- R01HL159183/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- R21EY034179/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- POSE: Phase II-2346158/National Science Foundation (NSF)
- CRII-2246067/National Science Foundation (NSF)
- IIS-1909879/National Science Foundation (NSF)
- R01 HG012748/HG/NHGRI NIH HHS/United States
- S00010293/Lehigh University
- FRGS00011497/Lehigh University
- NCS-2319451, MRI-2215789 and IIS-1909879/National Science Foundation (NSF)
- RF1AG057892/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- R01EB19403/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- S001250/Lehigh University
- MRI-2215789/National Science Foundation (NSF)
- NCS-2319451/National Science Foundation (NSF)
- R01EB19403 and R01LM11934/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- R01 EB019403/EB/NIBIB NIH HHS/United States
LinkOut - more resources
Full Text Sources