This is a preprint.
scATAnno: Automated Cell Type Annotation for single-cell ATAC Sequencing Data
- PMID: 37333088
- PMCID: PMC10274707
- DOI: 10.1101/2023.06.01.543296
scATAnno: Automated Cell Type Annotation for single-cell ATAC Sequencing Data
Abstract
Recent advances in single-cell epigenomic techniques have created a growing demand for scATAC-seq analysis. One key analysis task is to determine cell type identity based on the epigenetic data. We introduce scATAnno, a python package designed to automatically annotate scATAC-seq data using large-scale scATAC-seq reference atlases. This workflow generates the reference atlases from publicly available datasets enabling accurate cell type annotation by integrating query data with reference atlases, without the use of scRNA-seq data. To enhance annotation accuracy, we have incorporated KNN-based and weighted distance-based uncertainty scores to effectively detect cell populations within the query data that are distinct from all cell types in the reference data. We compare and benchmark scATAnno against 7 other published approaches for cell annotation and show superior performance in multiple data sets and metrics. We showcase the utility of scATAnno across multiple datasets, including peripheral blood mononuclear cell (PBMC), Triple Negative Breast Cancer (TNBC), and basal cell carcinoma (BCC), and demonstrate that scATAnno accurately annotates cell types across conditions. Overall, scATAnno is a useful tool for scATAC-seq reference building and cell type annotation in scATAC-seq data and can aid in the interpretation of new scATAC-seq datasets in complex biological systems.
Similar articles
-
Benchmarking automated cell type annotation tools for single-cell ATAC-seq data.Front Genet. 2022 Dec 13;13:1063233. doi: 10.3389/fgene.2022.1063233. eCollection 2022. Front Genet. 2022. PMID: 36583014 Free PMC article.
-
AtacAnnoR: a reference-based annotation tool for single cell ATAC-seq data.Brief Bioinform. 2023 Sep 20;24(5):bbad268. doi: 10.1093/bib/bbad268. Brief Bioinform. 2023. PMID: 37497729
-
scATAcat: cell-type annotation for scATAC-seq data.NAR Genom Bioinform. 2024 Oct 8;6(4):lqae135. doi: 10.1093/nargab/lqae135. eCollection 2024 Sep. NAR Genom Bioinform. 2024. PMID: 39380946 Free PMC article.
-
Fundamental and practical approaches for single-cell ATAC-seq analysis.aBIOTECH. 2022 Sep 27;3(3):212-223. doi: 10.1007/s42994-022-00082-5. eCollection 2022 Sep. aBIOTECH. 2022. PMID: 36313930 Free PMC article. Review.
-
Single-cell ATAC sequencing analysis: From data preprocessing to hypothesis generation.Comput Struct Biotechnol J. 2020 Jun 12;18:1429-1439. doi: 10.1016/j.csbj.2020.06.012. eCollection 2020. Comput Struct Biotechnol J. 2020. PMID: 32637041 Free PMC article. Review.
Publication types
LinkOut - more resources
Full Text Sources