This is a preprint.
Reference-free and cost-effective automated cell type annotation with GPT-4 in single-cell RNA-seq analysis
- PMID: 37205379
- PMCID: PMC10187429
- DOI: 10.21203/rs.3.rs-2824971/v1
Reference-free and cost-effective automated cell type annotation with GPT-4 in single-cell RNA-seq analysis
Update in
-
Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis.Nat Methods. 2024 Aug;21(8):1462-1465. doi: 10.1038/s41592-024-02235-4. Epub 2024 Mar 25. Nat Methods. 2024. PMID: 38528186 Free PMC article.
Abstract
Cell type annotation is an essential step in single-cell RNA-seq analysis. However, it is a time-consuming process that often requires expertise in collecting canonical marker genes and manually annotating cell types. Automated cell type annotation methods typically require the acquisition of high-quality reference datasets and the development of additional pipelines. We demonstrate that GPT-4, a highly potent large language model, can automatically and accurately annotate cell types by utilizing marker gene information generated from standard single-cell RNA-seq analysis pipelines. Evaluated across hundreds of tissue types and cell types, GPT-4 generates cell type annotations exhibiting strong concordance with manual annotations, and has the potential to considerably reduce the effort and expertise needed in cell type annotation.
Figures
References
-
- Tang F. et al. mrna-seq whole-transcriptome analysis of a single cell. Nat. methods 6, 377–382 (2009). - PubMed
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
