This is a preprint.
Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis
- PMID: 37131626
- PMCID: PMC10153208
- DOI: 10.1101/2023.04.16.537094
Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis
Update in
-
Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis.Nat Methods. 2024 Aug;21(8):1462-1465. doi: 10.1038/s41592-024-02235-4. Epub 2024 Mar 25. Nat Methods. 2024. PMID: 38528186 Free PMC article.
Abstract
Cell type annotation is an essential step in single-cell RNA-seq analysis. However, it is a time-consuming process that often requires expertise in collecting canonical marker genes and manually annotating cell types. Automated cell type annotation methods typically require the acquisition of high-quality reference datasets and the development of additional pipelines. We assessed the performance of GPT-4, a highly potent large language model, for cell type annotation, and demonstrated that it can automatically and accurately annotate cell types by utilizing marker gene information generated from standard single-cell RNA-seq analysis pipelines. Evaluated across hundreds of tissue types and cell types, GPT-4 generates cell type annotations exhibiting strong concordance with manual annotations and has the potential to considerably reduce the effort and expertise needed in cell type annotation. We also developed GPTCelltype, an open-source R software package to facilitate cell type annotation by GPT-4.
Conflict of interest statement
Competing Interests All authors declare no competing interests.
Figures
References
-
- Tang F. et al. mrna-seq whole-transcriptome analysis of a single cell. Nat. methods 6, 377–382 (2009). - PubMed
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources