Zero-shot evaluation reveals limitations of single-cell foundation models
- PMID: 40251685
- PMCID: PMC12007350
- DOI: 10.1186/s13059-025-03574-x
Zero-shot evaluation reveals limitations of single-cell foundation models
Abstract
Foundation models such as scGPT and Geneformer have not been rigorously evaluated in a setting where they are used without any further training (i.e., zero-shot). Understanding the performance of models in zero-shot settings is critical to applications that exclude the ability to fine-tune, such as discovery settings where labels are unknown. Our evaluation of the zero-shot performance of Geneformer and scGPT suggests that, in some cases, these models may face reliability challenges and could be outperformed by simpler methods. Our findings underscore the importance of zero-shot evaluations in development and deployment of foundation models in single-cell research.
Keywords: Foundation models; Machine learning; Single-cell.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethical approval and consent to participate: Not applicable. Competing interests: A.X.L., L.C., and A.P.A. are employees of and hold equity in Microsoft.
Figures
References
-
- Bommasani R, Hudson DA, Adeli E, Altman R, Arora S, von Arx S, et al. On the opportunities and risks of foundation models. arXiv. 2022. ArXiv:2108.07258 [cs]. 10.48550/arXiv.2108.07258.
-
- Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language models are few-shot learners. arXiv. 2020. ArXiv:2005.14165 [cs]. 10.48550/arXiv.2005.14165.
-
- Ramesh A, Pavlov M, Goh G, Gray S, Voss C, Radford A, et al. Zero-shot text-to-image generation. arXiv. 2021. ArXiv:2102.12092 [cs]. 10.48550/arXiv.2102.12092.
-
-
Program CSCB, Abdulla S, Aevermann B, Assis P, Badajoz S, Bell SM, et al. CZ CELL
GENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data. bioRxiv. 2023. Pages: 2023.10.30.563174 Section: New Results. 10.1101/2023.10.30.563174.
-
Program CSCB, Abdulla S, Aevermann B, Assis P, Badajoz S, Bell SM, et al. CZ CELL
-
- Yang F, Wang W, Wang F, Fang Y, Tang D, Huang J, et al. scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data. Nat Mach Intell. 2022;4(10):852–66. Number: 10 Publisher: Nature Publishing Group. 10.1038/s42256-022-00534-z.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
