Foundation models for generalist medical artificial intelligence

Michael Moor^#¹, Oishi Banerjee^#², Zahra Shakeri Hossein Abad³, Harlan M Krumholz⁴, Jure Leskovec¹, Eric J Topol⁵, Pranav Rajpurkar⁶

Affiliations

¹ Department of Computer Science, Stanford University, Stanford, CA, USA.
² Department of Biomedical Informatics, Harvard University, Cambridge, MA, USA.
³ Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada.
⁴ Yale University School of Medicine, Center for Outcomes Research and Evaluation, Yale New Haven Hospital, New Haven, CT, USA.
⁵ Scripps Research Translational Institute, La Jolla, CA, USA. etopol@scripps.edu.
⁶ Department of Biomedical Informatics, Harvard University, Cambridge, MA, USA. pranav_rajpurkar@hms.harvard.edu.

^# Contributed equally.

PMID: 37045921
DOI: 10.1038/s41586-023-05881-4

Review

Nature. 2023 Apr;616(7956):259-265. Epub 2023 Apr 12.

Abstract

The exceptionally rapid development of highly flexible, reusable artificial intelligence (AI) models is likely to usher in newfound capabilities in medicine. We propose a new paradigm for medical AI, which we refer to as generalist medical AI (GMAI). GMAI models will be capable of carrying out a diverse set of tasks using very little or no task-specific labelled data. Built through self-supervision on large, diverse datasets, GMAI will flexibly interpret different combinations of medical modalities, including data from imaging, electronic health records, laboratory results, genomics, graphs or medical text. Models will in turn produce expressive outputs such as free-text explanations, spoken recommendations or image annotations that demonstrate advanced medical reasoning abilities. Here we identify a set of high-impact potential applications for GMAI and lay out specific technical capabilities and training datasets necessary to enable them. We expect that GMAI-enabled applications will challenge current strategies for regulating and validating AI devices for medicine and will shift practices associated with the collection of large medical datasets.
